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A detailed study is presented of the expected performance of the 
ATLAS detector. The reconstruction of tracks, leptons, photons, 
missing energy and jets is investigated, together with the performance 
of ^-tagging and the trigger. The physics potential for a variety of 
interesting physics processes, within the Standard Model and beyond, is 
examined. The study comprises a series of notes based on simulations 
of the detector and physics processes, with particular emphasis given to 
the data expected from the first years of operation of the LHC at CERN. 



Display of a high-px H — > ZZ* — > ee^/i decay (m# = 130 GeV), after full simulation and reconstruction in the 
ATLAS detector. The four leptons and the recoiling jet with Ej =135 GeV are clearly visible. Hits in the Inner 
Detector are shown in green for the four reconstructed leptons, both for the precision tracker (pixel and silicon 
micro-strip detectors) at the inner radii and for the transition radiation tracker at the outer radii. The other tracks 
reconstructed with pj > 0.5 GeV in the Inner Detector are shown in blue. The two electrons are depicted as 
reconstructed tracks in yellow and their energy deposits in each layer of the electromagnetic LAr calorimeter 
are shown in red. The two muons are shown as combined reconstructed tracks in orange, with the hit strips in 
the resistive-plate chambers and the hit drift tubes in the monitored drift-tube chambers visible as white lines 
in the barrel muon stations. The energy deposits from the muons in the barrel tile calorimeter can also be seen 
in purple. 
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Preface 

The Large Hadron Collider (LHC) at CERN promises a major step forward in the understanding of 
the fundamental nature of matter. The ATLAS experiment is a general-purpose detector for the LHC, 
whose design was guided by the need to accommodate the wide spectrum of possible physics signatures. 
The major remit of the ATLAS experiment is the exploration of the TeV mass scale where ground- 
breaking discoveries are expected. In the focus are the investigation of the electroweak symmetry break- 
ing and linked to this the search for the Higgs boson as well as the search for Physics beyond the Standard 
Model. 

In this report a detailed examination of the expected performance of the ATLAS detector is provided, 
with a major aim being to investigate the experimental sensitivity to a wide range of measurements and 
potential observations of new physical processes. An earlier summary of the expected capabilities of 
ATLAS was compiled in 1999 [1]. A survey of physics capabilities of the CMS detector was published 
in [2]. 

The design of the ATLAS detector has now been finalised, and its construction and installation have 
been completed [3]. An extensive test-beam programme was undertaken. Furthermore, the simulation 
and reconstruction software code and frameworks have been completely rewritten. Revisions incorpo- 
rated reflect improved detector modelling as well as major technical changes to the software technology. 
Greatly improved understanding of calibration and alignment techniques, and their practical impact on 
performance, is now in place. 

The studies reported here are based on full simulations of the ATLAS detector response. A variety 
of event generators were employed. The simulation and reconstruction of these large event samples thus 
provided an important operational test of the new ATLAS software system. In addition, the processing 
was distributed world-wide over the ATLAS Grid facilities and hence provided an important test of 
the ATLAS computing system - this is the origin of the expression "CSC studies" ("computing system 
commissioning"), which is occasionally referred to in these volumes. 

The work reported does generally assume that the detector is fully operational, and in this sense 
represents an idealised detector: establishing the best performance of the ATLAS detector with LHC 
proton-proton collisions is a challenging task for the future. The results summarised here therefore 
represent the best estimate of ATLAS capabilities before real operational experience of the full detector 
with beam. Unless otherwise stated, simulations also do not include the effect of additional interactions 
in the same or other bunch-crossings, and the effect of neutron background is neglected. Thus simulations 
correspond to the low-luminosity performance of the ATLAS detector. 

This report is broadly divided into two parts: firstly the performance for identification of physics 
objects is examined in detail, followed by a detailed assessment of the performance of the trigger sys- 
tem. This part is subdivided into chapters surveying the capabilities for charged particle tracking, each of 
electron/photon, muon and tau identification, jet and missing transverse energy reconstruction, b-tagging 
algorithms and performance, and finally the trigger system performance. In each chapter of the report, 
there is a further subdivision into shorter notes describing different aspects studied. The second major 
subdivision of the report addresses physics measurement capabilities, and new physics search sensitiv- 
ities. Individual chapters in this part discuss ATLAS physics capabilities in Standard Model QCD and 
electroweak processes, in the top quark sector, in b-physics, in searches for Higgs bosons, supersymme- 
try searches, and finally searches for other new particles predicted in more exotic models. 

References 

[1] ATLAS Collaboration, CERN-LHCC/99-014, CERN-LHCC/99-015 (1999). 
[2] CMS Collaboration, G. L. Bayatian et al., J. Phys. G34 (2007) 995. 
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Cross-Sections, Monte Carlo Simulations and Systematic 
Uncertainties 

Abstract 

The studies presented in this volume share several common features, including 
use of the same event samples for Standard Model processes, and the same de- 
tector description and simulation framework for all samples. Common cross- 
section assumptions were made. These assumptions, and the Monte Carlo 
generator programs employed, are listed. Information is also given on the dif- 
ferent detector configurations and geometries simulated, and on the consistent 
treatment of systematic uncertainties. 

1 Introduction 

The studies presented in this volume have many shared features starting from a common simulation 
framework of the ATLAS detector, and the same detector description. They are based on a full simulation 
of the ATLAS detector using the GEANT4 [1] program and the event samples produced were shared by 
the various analysis groups. For the simulation of the physics events standard event generators for high 
energy proton-proton collisions were used and interfaced to the ATLAS simulation framework. 

In many searches for new particles at the LHC Standard Model processes represent important back- 
grounds and the signal significance depends on the precise knowledge of these backgrounds. As dis- 
cussed in several studies presented in this book, methods were investigated on how to determine these 
cross-sections from the data themselves. However, this will not be always possible, and reliable theo- 
retical predictions must be used to estimate these backgrounds. In addition, the Standard Model cross- 
sections are relevant for the estimate of signal rates and consequent measurement precision for Standard 
Model parameters, or for tests of the Standard Model. All studies presented in this book made common 
assumptions on the cross-sections for Standard Model processes. 

In this introductory note features common to the simulations used in these studies are discussed. 
After reviewing the cross-section assumptions and models used for different processes, the Monte Carlo 
generator programs employed are summarised. Information is given, next, on the different detector 
configurations and geometries simulated. Finally, a common treatment is described of systematic uncer- 
tainties which affect many analyses. 

2 Cross-Sections of Physical Processes 

A consistent set of cross-sections for Standard Model processes was used in all studies reported in this 
volume. Over recent years considerable progress has been made in the calculation of higher-order QCD 
corrections (often expressed as "K-factors") for many physics processes at the LHC. Wherever these 
corrections are known for both the signal and the dominant background processes, they were included 
in the analyses. In case the K-factors are not known for the dominant background processes, the studies 
have consistently refrained from using K-factors and resorted to Born-level predictions for both signal 
and backgrounds. In this note we detail the values of cross-sections that were used in the different studies 
reported in this volume: we do not discuss the uncertainties, such as those from missing higher order 
corrections. 

For the simulation of physics processes both leading order (LO) and next-to-leading order (NLO) 
Monte Carlo programs were used. For the simulation of several processes tree-level matrix element 
calculations with parton shower matching were adopted. Unless otherwise stated, all tree-level Monte 
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Carlo calculations were normalized to the NLO cross-section calculation. In the case of parton shower 
matching several final state parton multiplicities were simulated for predefined shower matching cuts 
and the sum of the exclusive cross-sections was normalized to the result of the higher-order calculation. 
By applying this procedure, it is expected that the shapes of inclusive distributions are reasonably well 
described. However, large uncertainties are expected in the absolute cross-section predictions in extreme 
phase-space regions such as, for example, final states with high jet multiplicities. 

Table 1: Leading order (LO) and higher order (N)NLO cross-sections for some important Standard 
Model production processes for pp collisions at a centre-of-mass energy of 14 TeV. In the calculation 
of all cross-sections the CTEQ6L and CTEQ6M structure function parametrizations have been used. 
For inclusive W and Z production, the cross-section quoted includes the branching ratio into one lepton 
generation. 



Process 




Comments 
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Order in 


a (nb) 










pert, theory 




Total inelastic pp 




PYTHIA [2] 




79- 10 6 


Non Single Diffractive 




PYTHIA [2] 




65-10 6 
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> 25 GeV 


PYTHIA [2] 


LO 


367- 10 3 








NLOJET++ [3,4] 


NLO 


477-10 


r-jet 




Pj > 25 GeV 


PYTHIA [2] 


LO 


180 


bb -> li + X 




Pj > 6 GeV 


PYTHIA [2] 


LO 


6.1 • 10 3 


bb^jXjX+X 




^ l/M2 > 6/4 GeV 


PYTHIA [2] 


LO 


110 


it 








NLO 


0.794 








Ref. [5] 


NLO+NLL 


0.833 


Single top 




t-channel 


AcerMC [6] 


LO 


0.251 


production 






Ref. [7-9] 


NLO 


0.246 






s -channel 


AcerMC [6] 


LO 


0.007 








Ref. [7] 


NLO 


0.011 






Wt 


AcerMC [6] 


LO 


0.058 








Ref. [10-12] 


NLO 


0.066 








FEWZ [13] 


LO 


16.8 








FEWZ [13] 


NLO 


20.7 








FEWZ [13] 


nnlo 


20.5 


Z^tt 




mee > 60 GeV 


FEWZ [13] 


LO 


1.66 








FEWZ [13] 


NLO 


2.03 








FEWZ [13] 


NNLO 


2.02 


WW 




m wM > 20 GeV,/?*' > 10 GeV 


MCFM [14] 


LO 


0.072 






MCFM [14] 


NLO 


0.112 


wz 




"VM/zW > 20 GeV ' Pt /Z > 10 GeV 


MCFM [14] 


LO 


0.032 






MCFM [14] 


NLO 


0.056 


zz 




m zW > 12 GeV 


MCFM [14] 


LO 


0.0165 






MCFM [14] 


NLO 


0.0221 


n im,qg^ 


li) 


80<m rr <150 GeV 


RESBOS [15] 


NLO 


0.0209 


(88^ 


n) 


80<m rr <150 GeV 


RESBOS [15] 


NLO 


0.0080 



The cross-sections for the most relevant Standard Model production processes are summarised in 
Table 1 . The cross-sections for new physics signal processes are presented in the respective sub-chapters 
of this book. For the calculation of the leading order cross-sections the CTEQ6L [16, 17] set of struc- 
ture function parametrizations was used. Processes available at (N)NLO were calculated by using the 
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CTEQ6M [16, 17] parametrizations. The following comments concern the various cross-sections: 

• The total pp cross-section at a centre-of-mass energy of 14 TeV is predicted by PYTHIA [2] to be 
102 mb. This is split into elastic (23 mb) and inelastic (79 mb) parts. The total inelastic pp cross- 
section includes contributions from single and double-diffractive scattering which are estimated 
to be 14 and 10 mb, respectively. The non-single diffractive cross-section, which is usually also 
denoted as the minimum bias cross-section, is given by (Jnsd = Oinei. — Csd = 65 mb. 

• Multijet production via QCD processes is the dominant high-/? T process at the LHC and is an im- 
portant background in many physics studies. Even if next-to-leading order corrections are partially 
known, the remaining uncertainties from missing higher-order corrections remain large. We there- 
fore used leading-order estimates in most physics studies and large errors were assigned to cover 
the uncertainty. 

• The pair production of ^-quarks provides a copious source of leptons at the LHC. The single and 
dimuon cross-sections from bb production were calculated with pj thresholds as expected at the 
trigger level. A leading order PYTHIA calculation has been used in the present studies. Even if 
the higher-order corrections are known [18], large uncertainties remain. 

• For the tt production cross-section several calculations beyond leading order exist. In the studies 
presented in this volume the NLO calculation including a next-to-leading log (NLL) resummation 
[5] was used. The cross-sections for the three relevant sub-processes for single-top production 
were calculated at NLO. 

• The inclusive production cross-sections of W and Z bosons are known at next-to-next-to-leading 
order (NNLO) and these values were used in the studies. The residual uncertainties from variations 
of the renormalization and factorization scales are estimated to be at the level of a few percent [13]. 
In many cases the production of W and Z bosons with jets constitutes an important background to 
searches. Exclusive W/Z + jet cross sections have in general been calculated with leading order 
Monte Carlos, such as PYTHIA, or the parton shower matched Monte Carlos ALPGEN or Sherpa. 
These calculations were normalized to the inclusive NNLO cross-sections. Only in case of the 
Wbb and Zbb production were exclusive NLO cross-sections calculated, to which the tree-level 
Monte Carlo generator results were normalized. The results of these calculations for a few rele- 
vant phase space regions are: 



Process 


Comments 


Reference 


Order in 
pert, theory 


CJ(pb) 


Wbb 


/4 > 10 GeV, |tj 6 | < 2.5, AR bb >0.7 


ALPGEN [19] 


LO 


68.7 




m ( * ] > 30 GeV, m bb > 9.24 GeV 


MCFM [14] 


NLO 


176.9 


Zbb 


p\ > 10 GeV, \ t] b \< 2.5, AR bb >0.7 


AcerMC [6] 


LO 


60.7 




m { * ] > 30 GeV, m bb > 9.24 GeV 


MCFM [14] 


NLO 


86.4 


Zbb 


p\ > 5 GeV, \rj b \< 2.5, AR bb >0.7 


AcerMC [6] 


LO 


27.9 




m { * ] > 60 GeV, m bb > 9.24 GeV 


MCFM [14] 


NLO 


44.8 



• The cross-sections for diboson production are available at NLO. In addition to the ^-initiated 
processes, the gg box-diagram contributions are sizeable, and both have been taken into account 
in the analyses. For ZZ production the gg box contributions were estimated to be at the level of 
30% [20] and the NLO result was scaled accordingly. A re-evaluation of this contribution using 
the program of Ref. [21] yielded a contribution of 23.8%. For the 77 production process the box 
contribution was calculated using the RESBOS Monte Carlo program [15]. 
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3 Monte Carlo Simulation 

The samples of fully simulated events were made using a variety of Monte Carlo generators. Interfaces 
in the ATLAS software framework provided mechanisms to feed the particle-level events generated into 
the ATLAS simulation software packages. The production of these events was a major effort: a plethora 
of physics processes were simulated, and over 1300 different data sets were produced. Unless otherwise 
stated, samples were produced simulating only one proton-proton interaction: the effect of additional 
interactions was neglected. 

The principal general-purpose Monte Carlo generators employed were PYTHIA, HERWIG, Sherpa, 
AcerMC, ALPGEN, MadGraph/MadEvent and MC@NLO. In addition to these, further generators were 
used for specific processes: Charybdis, CompHEP, TopReX and WINHAC. The versions of the gen- 
erators used are summarised in Table 2. Parton-level Monte Carlo generators used either PYTHIA or 
HERWIG/JIMMYfor hadronisation and underlying event modelling. HERWIG hadronisation was com- 
plemented by an underlying event simulation from the JIMMY program [22] (versions 4.2 and 4.31). 
The underlying event model parameters were tuned, for PYTHIA and HERWIG/JIMMY, to published 
data from Tevatron and other experiments, as described in Ref. [23] and references therein. For Sherpa, 
the default parton shower and underlying event modelling was used. Examples of the specific processes 
generated with each program are given in the Appendix. 

Table 2: Monte Carlo event generators used for the production of event samples for the studies reported 
here. The fourth column shows, for the parton-level event generators, which software was used for the 
hadronisation and underlying event (UE) simulation. 



Generator 


Versions 


Reference 


Hadronisation+UE 


PYTHIA 


6.323-6.411 


[2] 




HERWIG 


6.508-6.510 


[24] 


JIMMY for UE 


Sherpa 


1.008-1.011 


[25] 




AcerMC 


3.1-3.4 


[26] 


PYTHIA,HERWIG 


ALPGEN 


2.05-2.13 


[19] 


HERWIG/JIMMY 


MadGraph/MadEvent 


3.X-4.15 


[27] 


PYTHIA 


MC@NLO 


3.1-3.3 


[28] 


HERWIG/JIMMY 


Charybdis 


1.001-1.003 


[29] 


HERWIG/JIMMY 


CompHEP 




[30] 


PYTHIA 


TopReX 


4.11 


[31] 


PYTHIA 


WINHAC 


1.21 


[32] 


PYTHIA 



The decay of t leptons was normally not treated by the main Monte Carlo generators themselves, but 
rather via the TAUOLA package [33], version 2.7. The radiation of photons from charged leptons was 
also treated specially, using the PHOTOS QED radiation package, version 2.15 [34]. These two pack- 
ages were used for a range of processes and generators: this required implementation of new interfaces 
for HERWIG and Sherpa. When simulating specific b-hadron decays for B-physics analyses [35], the 
EvtGen [36] dedicated b-hadron decay package was used in combination with PYTHIA. 

The Monte Carlo tools in ATLAS are taken, where available, from the LHC Computing Grid GENSER 
(generator services) sub-project [37]. These are modified with custom ATLAS software patches when 
needed. For most Monte Carlo programs more than one version was employed during the long series 
of simulations: changes in version were motivated by physical model, or technical improvements to the 
package. Common particle mass definitions were also used where relevant (for example, the top mass 
was taken to be 175 GeV, unless otherwise stated). The Monte Carlo tools are then either wrapped in- 
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side the ATLAS Athena environment [38], or interfaced via the Les Houches accord event format [39], 
depending on the implementation simplicity. The latter interfaces were used for the Sherpa, AcerMC, 
ALPGEN, MadGraph/MadEvent, MC@NLO and CompHEP event generation. These interfaces rely on 
widespread use of the HepMC C++-based event record format [40] : several improvements were made 
during the series of event production processings. 

LHAPDF, the Les Houches accord PDF interface library [41], was used throughout, and was linked 
to all Monte Carlo event generators to provide the PDF set values. The PDF sets [16] used were CTEQ6L 
for leading order (LO) Monte Carlo event generators, and CTEQ6M for the next-to-leading order (NLO) 
Monte Carlo event generator MC@NLO. 

4 Detector Description 

One important aspect of the Computing Commissioning Challenge was the test of the alignment and 
calibration procedures with an imperfect, i.e. more realistic, description of the ATLAS detector. In 
particular, misalignments were introduced for the inner detector and additional material was added in the 
inner detector and in front of the calorimeters. In addition, distorted magnetic field configurations were 
introduced, where the symmetry axis of the field did not coincide with the beam axis. 

The goal was to establish and validate the alignment and calibration procedures and to determine the 
known distortions. This has a strong physics motivation: for example, a knowledge of the energy scale 
of the electromagnetic calorimeter with a precision of 0.02%, as required for a precise measurement of 
the W mass, requires knowledge of the total radiation length of the material in the inner detector with a 
precision at the level of 1%. 

Two different geometries were used in the simulations. In a so-called as-built geometry realistic 
alignment shifts and distortions of the magnetic field were introduced. In the distorted geometry addi- 
tional material was added. The calibration samples were simulated and calibration constants determined 
with the as-built geometry. All physics samples were, however, simulated with the distorted geometry 
and the calibrations constants as determined from the as-built geometry were applied. 



As-built geometry The as-built geometry includes misalignments of the main subdetectors (pixel de- 
tector, silicon microstrip tracker (SCT), and transition radiation tracker (TRT)) of the inner detector. The 
misalignments were introduced as independent translations and rotations at three levels: (i) of the main 
subdetector parts (pixel detector, SCT barrel, two SCT endcaps, TRT barrel and two TRT endcaps), (ii) 
of major detector sub-units, like pixel and SCT barrel layers, pixel and SCT endcap disks and TRT barrel 
modules and (iii) of individual silicon detector modules. The sizes of displacements were chosen to lie 
within the expected build tolerances. The actual displacements were assigned randomly in most cases. 

The shifts described in the following were applied for the levels (i) and (ii) in the global ATLAS co- 
ordinate system, defined as a right-handed system with the x-axis pointing to the centre of the LHC ring, 
the y-axis in the vertical direction and the z-axis along the beam direction. The level (iii) misalignments 
refer to the local coordinate system of individual detector modules. 

At level (i), the whole subdetector parts were displaced in the three spatial coordinates at the level of 
1-2 mm followed by rotations around the three axes at the level of 10-50 mrad. 

The alignments of the endcap detector sub-units include additional in-plane (x-y) displacements and 
rotations around the z-axis. They were generated randomly from uniform distributions centred around 
zero with a width of ±150 }im and ±1 mrad, respectively. For the TRT barrel modules the translations 
are generated randomly from uniform distributions around zero, with widths of ±200 fim, ±100 jJ,m and 
±300 /xm respectively for modules of the three TRT layers. In addition, a systematic radial shift of +1.0, 
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-0.5 and +1.5 mm is applied for all modules of the respective layers. No rotations nor displacements 
were applied to the TRT endcap modules. 

For the individual pixel and SCT detector modules individual position displacements were applied 
randomly from uniform distributions with widths of 30-50 }Xm for pixel and 100-150 }Xm for SCT mod- 
ules, followed by rotations around the three axes, also randomly chosen from uniform distributions with 
widths of ± 1 mrad. 



Distorted geometry The distorted geometry is based on the as-built geometry with additional material 
added in different locations of the inner detector and in front of the electromagnetic calorimeter. Material 
corresponding to an increase of 1-3% of a radiation length was added just behind the first pixel layer, 
and just behind the second SCT layer, and in the endcaps adjacent to one of the endcap pixel disks and 
adjacent to two of the endcap SCT disks. This amount of additional material is considered to be much 
larger than the uncertainty on the knowledge of the exact amount of the material. Within the active 
tracking volume the material in regions of service routing was increased by 1-5% of a radiation lenght. 
For services outside the active tracking volume the material was increased by up to 15% of a radiation 
length. These increases are also expected to be larger than the uncertainties. It should be noted that for 
the inner detector the extra material was only added in one half of the azimuthal angle (0 < (j) < n) to 
allow for a straightforward study of the difference in calibration and performance with single particles. 

Additional material was also added in a 0 -asymmetric way in front of the calorimeter. In the region 
77 >0 additional material corresponding to 8-11% Xq were added in front of the barrel cryostat, 5% Xq 
between the barrel presampler and strip layers (in n/2 < 0 < 3n/2), and 7-11% Xo behind the cryostat. 
In the region r\ <0, additional material corresponding to 5% Xo was added between the barrel presampler 
and the strip layer in the region —n/2<ty<n/2. The density of material in the gap between the barrel 
and the endcap cryostat was increased by 70%. Again, this is considered to be conservative and larger 
than the uncertainties on the precise knowledge of the material distribution in this region of the detector. 



Applications in performance of physics studies Several performance studies were carried out using 
the as-built and distorted geometries in simulation and the impact is documented elsewhere in this vol- 
ume. Among the important studies is the impact of the misalignments on the b-tagging performance 
or on the reconstructed resolution of the Z resonance in muon final states. In addition the impact on 
the mass resolution and reconstruction efficiencies was studied for H —>■ yy, H —>■ ZZ — > 41 and Z — > ee 
samples. 

5 Treatment of systematic uncertainties 

The results of the physics and performance studies are affected by systematic uncertainties, some of 
which are common to many studies. To allow a uniform treatment of these uncertainties across the 
various analyses, the following effects and prescriptions were applied. 

There are detector-related uncertainties, such as those on particle identification efficiencies, on back- 
ground rejections, and on the precise knowledge of energy scales and resolution functions. These un- 
certainties can be largely constrained and determined from the data themselves. However, this can only 
be done with a finite, and integrated luminosity-dependent, accuracy. In the present studies, rough esti- 
mates of these uncertainties were used, considering three canonical integrated luminosity values: 0.1, 1 
and 10 fb _1 . Detector-related systematic uncertainties were applied to signal and background samples 
by varying the energy scale, resolution, or efficiency or rejections. 
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In addition, uncertainties come from the approximations made in Monte Carlo generators, modelling 
and from the theoretical calculation of cross-sections. Unless stated otherwise when discussing individ- 
ual analyses, the following assumptions were applied, for the various systematic uncertainties. 

5.1 Uncertainties on the detector performance 

Electrons and photons For electrons and photons, uncertainties on the identification efficiency of 
1.0%, 0.5% and 0.2% were assumed for the three values of integrated luminosity, 0.1, 1 and 10 fb _1 , 
respectively. These values can be determined from data by applying the so called tag-and-probe meth- 
ods [42, 43] to known resonance decays, like Z — > ee. The uncertainty on the energy scale was assumed 
to be 1 % (0. 1 %) for integrated luminosities below (above) 1 fb~ 1 . The electron and photon resolutions 
were estimated to be known with precisions of 20%, 10% and 5% at the three values of integrated lumi- 
nosity. The electron fake rates were assumed to have overall uncertainties of 50%, 20% and 10% at the 
three integrated luminosity values. All uncertainties were assumed to be independent of pj and tj. 

Muons Uncertainties on the identification efficiency of 1%, 0.3% and 0.1% were used for muons with 
Pt < 100 GeV for the three integrated luminosity values. As for electrons, it should be noted that these 
numbers are expected to be conservative, since the statistical precision that can be obtained from studies 
of Z — > jiji decays amounts to 0.2% for an integrated luminosity of 0.1 fb _1 . For higher muon momenta 
the efficiencies must be estimated using extrapolations based on Monte Carlo and therefore larger values 
were assumed: for muons with a pj of 1 TeV, for example, the uncertainties were assumed to be 5%, 3% 
and 1%, respectively. 

The muon energy scale was assumed to be known with precisions of 1%, 0.3% and 0.1% for the 
three integrated luminosity values. Furthermore, uncertainties of 12%, 4% and 1% were assumed on the 
muon momentum resolution below 100 GeV, whereas a value of 100% was used for muons with p T of 
1 TeV. All these uncertainties were considered to be independent of r\ . 

Jets and Missing Ej Unless otherwise stated, the overall uncertainty on the jet energy scale was as- 
sumed to ±5% over the pseudorapidity region \ r\\ <3.2 and ±10% for jets in the forward calorimeters, 

3.2 < |tj| <4.9. This scale uncertainty is applied, independently of jet p T , for both light-quark jets 
and jets from ^-quarks. In addition, unless stated otherwise, an uncertainty of 10% on the jet energy 
resolution was considered. 

The missing transverse energy, £"™ ss , is calculated by summing high-^x objects like leptons and 
jets, in addition a component from unclustered energy is added. Part of the uncertainty in £™ ss is thus 
correlated with the jet and lepton energy scale uncertainties, but also a wrong calibration of unclustered 
energy can affect £"™ ss . 

After identified objects were rescaled or smeared, the Ej lss was re-calculated with the corrected 
energies. In most of the studies also the low pj part of the unclustered energy was modified. In this 
procedure, the momenta of the leptons and jets with pj > 20 GeV were subtracted first from the £"™ ss , 
a 10% uncertainty on the remainder was applied, and then the effects of the leptons and jets were added 
back in. 

Heavy-flavour tagging For the ^-tagging efficiency a 5% relative uncertainty was assumed, indepen- 
dently of luminosity. This is considered to be a conservative estimate for integrated luminosities of 
1 fb _1 or higher. For the mistag rate of light and c-jets an integrated-luminosity independent uncertainty 
of 10% was assumed. It is expected that the mistag rates can be measured with this precsion or better 
from data sets exceeding an integrated luminosity of 0.1 fb _1 . The efficiency and mistag variations were 
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implemented in analyses by randomly rejecting 5% of the jets tagged as b-jets or by randomly changing 
the tag status of light and c-jets. 

5.2 Uncertainties on cross-sections and Monte Carlo modelling 

Several theoretical uncertainties affect the predicted cross-sections. The details and the size of the uncer- 
tainty depend on the signal and background processes considered and no general numbers can be quoted. 
They are therefore usually addressed in the respective studies presented in this volume. The main effects 
can be classified as follows: 

• The theoretical calculations are affected by missing unknown higher-order corrections. These 
uncertainties are usually estimated by varying the renormalization and factorization scales within 
factors of two around the nominal scale chosen. 

• Despite the normalization of tree-level Monte Carlo programs - with or without parton shower 
matching - to the (N)NLO cross sections, large uncertainties remain, in particular for exclusive 
final states in specific phase space regions after the application of cuts. These uncertainties have 
been estimated either by varying parton-shower matching cuts or by comparisons with different 
Monte Carlo event generators. 

• Uncertainties in the parton distribution functions result in uncertainties on the calculated cross- 
sections which are typically of the order of 10%. These uncertainties have either been addressed 
by varying the eigenvalues of the CTEQ parametrization parameters [17] within the suggested 
values or by comparing the CTEQ and MRST2001 [44] parametrizations. 

Appendix 

In the following, additional technical information is given on some of the Monte Carlo event generators 
employed, together with example processes. 

PYTHIA 

The PYTHIA Monte Carlo event generator [2] was employed for the event simulation of many samples. 
The new implementation of parton showering, commonly known as pj -ordered showering, was used, 
as was the new underlying event model where the phase-space is interleaved/shared between initial-state 
radiation (ISR) and the underlying event. In addition to the standard processes implemented in PYTHIA, 
two extensions were implemented containing a chiral lagrangian model [45], and an R-hadron model. 

HERWIG and JIMMY 

HERWIG [24] was used, for example, for simulation of SUSY signal processes [46] . The pre-generated 
input tables for these processes were provided by ISAJET and ISAWIG [47]. 

Sherpa 

The Sherpa Monte Carlo event generator [25], was used for several processes, most notably for the 
production of electroweak bosons in association with jets: these profited from the implemented CKKW 
parton-showering and matrix-element matching technique. Some representative processes for which 
Sherpa was used are: afforZ produced in association with up to four light jets; Higgs boson production 
via vector boson fusion; and associated production of bbA. 
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AcerMC 

Some processes for which AcerMC [26] was used were: Zbb production; Ztt production; it production; 
single top processes; ttbb production; and tttt production. The AcerMC program was used both with 
PYTHIA and HERWIG hadronisation, to allow tests of systematic uncertainties related to parton shower 
modelling. 

A procedure was developed for combining samples with it production modelled with MC@NLO 
with samples from the AcerMC ttbb process. There is an overlap of the two samples since the extra 
gluon in the NLO it calculation can split into a bb pair during parton showering. For studies where this 
channel was relevant [48], events with additional bb pairs in the MC@NLO samples were rejected, since 
the matrix-element ttbb generation is expected to describe such events better in the region of the phase 
space selected by the analysis (especially for relatively large opening angle between the two quarks of 
the bb pair). The corresponding number of events (10% of the total) was also removed from the high 
jet-multiplicity it sample for normalization purposes. 

ALPGEN 

The ALPGEN Monte Carlo event generator [19] was used for several processes, most notably for the 
production of electroweak bosons in association with jets, in order to profit from the implemented MLM 
parton-showering and matrix-element matching technique. Some processes for which ALPGEN was 
used were: W or Z production in association with up to five light jets; it production with up to three 
additional light jets; bb or cc production with up to three additional light jets; electroweak boson pair 
production in association with up to three jets Higgs production via vector boson fusion; and photon pair 
production in association with up to three jets. 

MadGraph/MadEvent 

The MadGraph/MadEvent Monte Carlo event generator [27] was used for a selection of processes, for 
example for exclusive final states involving multiple electroweak bosons and associated light jets, as 
well as some Standard Model Higgs boson production channels. Although MadGraph/MadEvent pro- 
cesses in the 4.X versions can be combined with a native version of parton-showering and matrix-element 
matching technique, this functionality was not used here. Some representative processes for which Mad- 
Graph/MadEvent was used are: W or Z production in association with four light partons; WW, WZ or 
ZZ pair production in association with two light partons; electroweak boson production in association 
with two photons; and photon pair production in association with two additional partons. 

MC@NLO 

The MC@NLO event generator [28] is one of the few Monte Carlo tools incorporating full NLO QCD 
corrections to a selected set of processes in a consistent way. It was used to simulate a number of 
processes, including: inclusive W or Z production; it production; electroweak boson pair production; 
and Higgs boson production and decay, for the and yy Higgs boson decay modes. 

Charybdis 

The Charybdis Monte Carlo event generator [29] is a special-purpose program implementing production 
and decay of microscopic black holes in models with TeV-scale gravity. 
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CompHEP 

The CompHEP Monte Carlo event generator [30] was used for a small set of processes: excited electron 
production, Z' — > e + e~y, and the production of E6 heavy iso-singlet D quarks decaying to Z or W pairs, 
or to a ZH pair in association with additional quarks. 

TopReX 

The TopReX Monte Carlo event generator [31] was used for top pair or single top production involving 
flavour-changing neutral current (FCNC) couplings in top quark decays, explicitly: tt production where 
one top quark decays conventionally (to bW), and the other to either qy or qZ; and single top production 
and decay to either qy or qZ. TopReX was interfaced directly with PYTHIA for parton showering, 
hadronisation and the underlying event: a point to note is that TopReX single top generation is intimately 
interfaced with the PYTHIA old (virtuality-ordered) parton showering model and thus cannot be used 
with the new PYTHIA pj -ordered showering. 

WINHAC 

WINHAC [32] is a Monte Carlo event generator dedicated to the hadro-production of single W bosons 
decaying into leptons. Comparisons done within ATLAS have shown that the WINHAC predictions 
match well the predictions of PHOTOS for radiative corrections to W boson leptonic decays. PHOTOS 
was used throughout this work. 
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Abstract 

The ATLAS inner detector will see of the order of 1000 charged particle tracks 
for every beam crossing at the design luminosity of the CERN Large Hadron 
Collider (LHC). This paper summarizes the design of the detector and outlines 
the reconstruction software. The expected performance for reconstructing sin- 
gle particles is presented, along with an indication of the vertexing capabilities. 
The effect of the detector material on electrons and photons is discussed along 
with methods for improving their reconstruction. The studies presented focus 
on the performance expected for the initial running at the start-up of the LHC. 



1 Introduction 

In ATLAS, at the LHC design luminosity of 10 34 cm _2 s _1 , approximately 1000 particles will emerge 
from the collision point every 25 ns within |tj| < 2.5, creating a very large track density in the detec- 
tor. To achieve the momentum and vertex resolution requirements imposed by the benchmark physics 
processes, high-precision measurements will be made in the inner detector (ID), shown in Fig. 1. Pixel 
and silicon microstrip (SCT) trackers, used in conjunction with the straw tubes of the transition radiation 
tracker (TRT), will make high-granularity measurements. The original performance specifications were 
set out in 1994 and are detailed in [1] - the focus being on challenging physics channels such as the 
measurement of leptons from the decays of heavy gauge bosons and the tagging of Z?-quark jets. 



6.2m 




Barrel semiconductor tracker 
'detectors 
Barrel transition radiation tracker 
End-cap transition radiation tracker 
End-cap semiconductor tracker 



Figure 1: Cut-away view of the ATLAS inner detector. 

The ID surrounds the LHC beam-pipe which is inside a radius of 36 mm. The layout of the detector 
is illustrated in Fig. 2 and detailed in [2]. Its basic parameters are summarised in Table 1. The ID is 
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immersed in a 2 T magnetic field generated by the central solenoid, which extends over a length of 5.3 m 
with a diameter of 2.5 m. 



Solenoid coil 




Figure 2: Plan view of a quarter-section of the ATLAS inner detector showing each of the major elements 
with its active dimensions. 



Item 


Radial extension (mm) 


Length (mm) 


Pixel 


Overall envelope 


45.5 < R < 242 


0< \z\ 


<3092 


3 cylindrical layers 


Sensitive barrel 


50.5 <R< 122.5 


0<\z\ 


< 400.5 


2x3 disks 


Sensitive end-cap 


88.8 <R< 149.6 


495 < 


z\ < 650 


SCT 


Overall envelope 


255 < R < 549 (barrel) 


0< \ z \ 


<805 






251 < R < 610 (end-cap) 


810 < 


z\ < 2797 


4 cylindrical layers 


Sensitive barrel 


299</?<514 


0<|z| 


<749 


2x9 disks 


Sensitive end-cap 


275 < R < 560 


839 < 


z\ < 2735 


TRT 


Overall envelope 


554 < R < 1082 (barrel) 


0< \z\ 


<780 






617 <R< 1106 (end-cap) 


827 < 


z\ < 2744 


73 straw planes 


Sensitive barrel 


563 < R < 1066 


o<kl 


<712 


160 straw planes 


Sensitive end-cap 


644 < R < 1004 


848 < 


z| < 2710 



Table 1 : Main parameters of the inner detector. 



The precision tracking detectors (pixels and SCT) cover the region \t}\ < 2.5. In the barrel region, 
they are arranged on concentric cylinders around the beam axis while in the end-cap regions, they are 
located on disks perpendicular to the beam axis. The highest granularity is achieved around the vertex 
region using silicon pixel sensors. All pixel modules are identical and the minimum pixel size on a 
sensor is 50 x 400 /Am 2 . The pixel layers are segmented in R — 0 and z with typically three pixel layers 
crossed by each track. The first layer, called the "vertexing layer", is at a radius of 51 mm. The intrinsic 
accuracies in the barrel are 10 /im (R — Q) and 1 15 /im (z) and in the disks are 10 fim (R — 0) and 115 /im 
(/?). The pixel detector has approximately 80.4 million readout channels. 

For the SCT, eight strip layers (four space points) are crossed by each track. In the barrel region, this 
detector uses small-angle (40 mrad) stereo strips to measure both coordinates, with one set of strips in 
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each layer parallel to the beam direction, measuring R — (j). Each side of a detector module consists of 
two 6.4 cm long, daisy-chained sensors with a strip pitch of 80 ;Um. In the end-cap region, the detectors 
have a set of strips running radially and a set of stereo strips at an angle of 40 mrad. The mean pitch 
of the strips is also approximately 80 ;Um. The intrinsic accuracies per module in the barrel are 17 }xm 
(R — <j>) and 580 fim (z) and in the disks are 17 jj,m (R — <p) and 580 ;Um (R). The total number of readout 
channels in the SCT is approximately 6.3 million. 

A large number of hits (typically 30 per track, with a maximum of 36, see Fig. 34) is provided by the 
4 mm diameter straw tubes of the TRT, which enables track- following up to \r} \ = 2.0. The TRT only 
provides R — 0 information, for which it has an intrinsic accuracy of 130 jJ,m per straw. In the barrel 
region, the straws are parallel to the beam axis and are 144 cm long, with their wires divided into two 
halves, approximately at r\ = 0. In the end-cap region, the 37 cm long straws are arranged radially in 
wheels. The total number of TRT readout channels is approximately 351,000. 



Item 


Intrinsic accuracy 
(Mm) 


Alignment tolerances 
(Urn) 


Radial (R) 


Axial (z) 


Azimuth (R-0) 


Pixel 










Layer-0 


lO(/?-0) 115 (z) 


10 


20 


7 


Layer- 1 and -2 


10 115 (z) 


20 


20 


7 


Disks 


10 (R-(j>) 115 (R) 


20 


100 


7 


SCT 










Barrel 


17 580 (z) 1 


100 


50 


12 


Disks 


17 (£-0)580 (R) 1 


50 


200 


12 


TRT 


130 






30 2 


1. Arises from the 40 mrad stereo angle between back-to-back sensors on the SCT modules with axial (barrel) 

or radial (end-cap) alignment of one side of the structure. The result is pitch-dependent for end-cap SCT modules. 

2. The quoted alignment accuracy is related to the TRT drift-time accuracy. 



Table 2: Intrinsic measurement accuracies and mechanical alignment tolerances for the inner detector 
sub-systems, as defined by the performance requirements of the ATLAS experiment. The numbers in the 
table correspond to the single-module accuracy for the pixels, to the effective single-module accuracy 
for the SCT and to the drift-time accuracy of a single straw for the TRT. 

The combination of precision trackers at small radii with the TRT at a larger radius gives very robust 
pattern recognition and high precision in both R — 0 and z coordinates. The straw hits at the outer radius 
contribute significantly to the momentum measurement, since the lower precision per point compared to 
the silicon is compensated by the large number of measurements and longer measured track length. 

The inner detector system provides tracking measurements in a range matched by the precision mea- 
surements of the electromagnetic calorimeter [2]. The electron identification capabilities are enhanced by 
the detection of transition-radiation photons in the xenon-based gas mixture of the straw tubes. The semi- 
conductor trackers also allow impact parameter measurements and vertex reconstruction ("vertexing") 
for heavy-flavour and T-lepton tagging. The secondary vertex measurement performance is enhanced by 
the innermost layer of pixels, at a radius of about 5 cm. 

Charged particle tracks with transverse momentum pj > 0.5 GeV and |tj| < 2.5 are reconstructed 
and measured in the inner detector and the solenoid field. However, the efficiency at low momentum is 
limited because of the large material effect in the inner detector (see Fig. 3). The intrinsic measurement 
performance expected for each of the inner detector sub-systems is summarised in Table 2. This per- 
formance has been studied extensively over the years [1], both before and after irradiation of production 
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modules, and also, more recently, during the combined test beam (CTB) runs in 2004 [2,3] and in a series 
of cosmic-ray tests in 2006 [2,4]. The results have been used to update and validate the modelling of 
the detector response in the Monte-Carlo simulation. This paper describes the expected performance of 
the inner detector in terms of tracking, vertexing and particle identification. The alignment of the inner 
detector is described elsewhere ( [2] and the references therein). 




Figure 3: Material distribution (Xo, A) at the exit of the ID envelope, including the services and thermal 
enclosures. The distribution is shown as a function of | T] | and averaged over (j) . The breakdown indicates 
the contributions of external services and of individual sub-detectors, including services in their active 
volume. 



2 Track reconstruction 

The inner detector track reconstruction software [5] follows a modular and flexible software design, 
which includes features covering the requirements of both the inner detector and muon spectrometer [2] 
reconstruction. These features comprise a common event data model [6] and detector description [7], 
which allow for standardised interfaces to all reconstruction tools, such as track extrapolation, track fit- 
ting including material corrections and vertex fitting. The extrapolation package combines propagation 
tools with an accurate and optimised description of the active and passive material of the full detector [8] 
to allow for material corrections in the reconstruction process. The suite of track-fitting tools includes 
global-^ 2 and Kalman-filter techniques, and also more specialised fitters such as dynamic noise adjust- 
ment (DNA) [9], Gaussian-sum filters (GSF) [10] and deterministic annealing filters [11]. Optimisation 
of these tools continues and their performance will need to be evaluated on real data. The tools intended 
to cope with electron bremsstrahlung (DNA and GSF - see Section 5.1) will be run after the track re- 
construction, as part of the electron-photon identification. Other common tracking tools are provided, 
including those to apply calibration corrections at later stages of the pattern recognition, to correct for 
module deformations or to resolve hit-association ambiguities. 

Track reconstruction in the inner detector is logically sub-divided into three stages: 

1. A pre-processing stage, in which the raw data from the pixel and SCT detectors are converted 
into clusters and the TRT raw timing information is translated into calibrated drift circles. The 
SCT clusters are transformed into space-points, using a combination of the cluster information 
from opposite sides of a SCT module. 

2. A track-finding stage, in which different tracking strategies [5, 12], optimised to cover different 
applications, are implemented. (The results of studies of the various algorithms are reported else- 
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where [13].) The default tracking exploits the high granularity of the pixel and SCT detectors to 
find prompt tracks originating from the vicinity of the interaction region. First, track seeds are 
formed from a combination of space-points in the three pixel layers and the first SCT layer. These 
seeds are then extended throughout the SCT to form track candidates. Next, these candidates are 
fitted, "outlier" clusters are removed, ambiguities in the cluster-to-track association are resolved, 
and fake tracks are rejected. This is achieved by applying quality cuts. For example, a cut is made 
on the number of associated clusters, with explicit limits set on the number of clusters shared be- 
tween several tracks and the number of holes per track (a hole is defined as a silicon sensor crossed 
by a track without generating any associated cluster). The selected tracks are then extended into 
the TRT to associate drift-circle information in a road around the extrapolation and to resolve the 
left-right ambiguities. Finally, the extended tracks are refitted with the full information of all three 
detectors. The quality of the refitted tracks is compared to the silicon-only track candidates and 
hits on track extensions resulting in bad fits are labelled as outliers (they are kept as part of the 
track but are not included in the fit). 

A complementary track-finding strategy, called back-tracking, searches for unused track segments 
in the TRT. Such segments are extended into the SCT and pixel detectors to improve the tracking 
efficiency for secondary tracks from conversions or decays of long-lived particles. 

3. A post-processing stage, in which a dedicated vertex finder is used to reconstruct primary ver- 
tices. This is followed by algorithms dedicated to the reconstruction of photon conversions and of 
secondary vertices. 

3 Tracking performance 

3.1 Introduction to performance studies 

The expected performance of the tracking system for reconstructing single particles and particles in 
jets is determined using a precise modelling of the individual detector response (including electronic 
noise and inefficiencies), geometry and passive material in the simulation. In this paper, a consistent 
set of selection cuts for reconstructed tracks has been used. Generally, only prompt particles (those 
originating from the primary vertex) with pj > 1 GeV and |tj| < 2.5 are considered. Standard quality 
cuts require reconstructed tracks to have at least seven precision hits (pixels and SCT). In addition, 
the transverse and longitudinal impact parameters at the perigee must fulfil respectively \do\ < 2 mm 
and \zo — z v \ x sin$ < 10 mm, where z v is the position of the primary vertex along the beam and 
6 is the polar angle of the track. Stricter selection cuts, called ^-tagging cuts, are defined by: at least 
two hits in the pixels, one of which should be in the vertexing layer, as well as \do\ < 1 mm and 
\zo — z v \ x sin(9 < 1.5 mm. A reconstructed track is matched to a Monte-Carlo particle if at least 80% 
of its hits were created by that particle. The efficiency is defined as the fraction of particles which are 
matched to reconstructed tracks passing the quality cuts, and the fake rate is defined as the fraction of 
reconstructed tracks passing the quality cuts which are not matched to a particle. 

3.2 Track parameter resolutions 

The resolution of a track parameter X can be expressed as a function of pr as: 

o x {pt) = OxH{1(Bpx/pt) (1) 

where Ox(°°) is the asymptotic resolution expected at infinite momentum, p x is a constant representing 
the value of p T for which the intrinsic and multiple-scattering terms in the equation are equal for the 
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parameter X under consideration and © denotes addition in quadrature. This expression is approximate, 
working well at high pj (where the resolution is dominated by the intrinsic detector resolution) and at 
low p T (where the resolution is dominated by multiple scattering). Ox (°°) and px are implicitly functions 
of the pseudorapidity. Figures 4, 5 and 6 show the momentum resolution for isolated muons and the trans- 
verse and longitudinal impact parameter resolutions for isolated pions 1 , all without a beam constraint and 
assuming the effects of misalignment, miscalibration and pile-up to be negligible. The resolutions are 
taken as the RMS evaluated over a range which includes 99.7% of the data (corresponding to ±3(7 for a 
Gaussian distribution). The TRT measurements are included in the track fits for tracks with \r} \ < 2.0, 
beyond which there are no further TRT measurements. Table 3 shows the values of ax(°°) and p x for 
tracks in two r\ -regions, corresponding to the barrel and end-caps. The use of the beam-spot constraint 
in the track fit improves the momentum resolution for high-momentum tracks by about 5%. The impact 
parameter resolutions are quoted only for tracks with a hit in the vertexing layer (this requirement has a 
very high efficiency, as illustrated in Fig. 14 by the small difference between the standard quality and the 
b-tagging quality tracks). Figure 7 shows the comparison of the impact parameter resolutions for pions 
and muons. The muon distributions are very close to Gaussian, while those for the pions are slightly 
broader and have small tails, in addition. The tails are even larger for electrons, and this is discussed in 
Section 5. 



Track parameter 


0.25 < |tj 


<0.50 


1.50 < |tj 


< 1.75 


o x (°°) 


Px (GeV) 


<jxH 


Px (GeV) 


Inverse transverse momentum (q/pr) 


0.34 TeV" 1 


44 


0.41 TeV" 1 


80 


Azimuthal angle (0) 


70 Airad 


39 


92 jurad 


49 


Polar angle (cot(9) 


0.7 xl0~ 3 


5.0 


1.2xl0- 3 


10 


Transverse impact parameter (do) 


10 ^m 


14 


12 ^im 


20 


Longitudinal impact parameter (z 0 x sin 6) 


91 jum 


2.3 


71 ^m 


3.7 



Table 3: Expected track-parameter resolutions (RMS) at infinite transverse momentum, Ox(°°), and 
transverse momentum, px, at which the multiple- scattering contribution equals that from the detector 
resolution (see Eq. (1)). The momentum and angular resolutions are shown for muons, whereas the 
impact-parameter resolutions are shown for pions (see text). The values are shown for two r\ -regions, 
one in the barrel inner detector where the amount of material is close to its minimum and one in the 
end-cap where the amount of material is close to its maximum. Isolated, single particles are used with 
perfect alignment and calibration in order to indicate the optimal performance. 

The consequences of the pseudorapidity variation of the track parameter resolutions can be seen 
from the reconstructed J/y —>■ jiji masses in the barrel and end-caps. This is shown in Fig. 8 where both 
muons are either in the barrel or the end-caps. 

The determination of the lepton charge at high p T is particularly important for measuring charge 
asymmetries arising from the decays of possible heavy gauge bosons (W' and Z'). Typically, such mea- 
surements require that the charge of the particle be determined to better than 3 a 2 . Whereas the charge 
of high-energy muons will be measured precisely in the muon system, the charge of high-energy elec- 
trons can only be measured by the inner detector. Figure 9 shows the reconstructed values of q/pr for 
negatively charged isolated muons and electrons with pj = 0.5 TeV and pj = 2 TeV. The peaks of the 
distributions are at negative values, reflecting the negative charges of the simulated particles. It can be 
seen that the shape of the muon distributions is unchanged in going from 0.5 to 2 TeV - at high momen- 

1 Muons suffer less from interactions and hence provide the best reference; impact parameter determination is important for 
vertexing, and this is more commonly required for hadrons, for example when fr-tagging. 

2 The charge of a particle is considered well measured if it is at least 3a from 0 in the variable q/p. 
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Figure 4: Relative transverse momentum resolution (left) as a function of 1 77 1 for muons with p T = 1, 5 
and 100 GeV. Transverse momentum, at which the multiple-scattering contribution equals the intrinsic 
resolution (corresponding to px in Eq. (1)), as a function of |t]| (right). 
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Figure 5: Transverse impact parameter, do, resolution (left) as a function of 1 77 1 for pions with pr = 1,5 
and 100 GeV. Transverse momentum, at which the multiple-scattering contribution equals the intrinsic 
resolution (corresponding to px in Eq. (1)), as a function of 1 77 1 (right). 
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Figure 6: Modified longitudinal impact parameter, zo x sin(9, resolution (left) as a function of \r}\ for 
pions with pr = 1,5 and 100 GeV. Transverse momentum, at which the multiple- scattering contribution 
equals the intrinsic resolution (corresponding to px in Eq. (1)), as a function of |t]| (right). 




Figure 7: Resolution of the transverse impact parameter, do (left) and the modified longitudinal impact 
parameter, zo x sin(9 (right) for 5 GeV muons and pions with \t] \ < 0.5 - corresponding to the first two 
bins of the previous two figures. 
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Figure 8: Probability for the reconstructed invariant mass of muon pairs from J/yr — > jiji. decays in 
events with prompt J / \ff production. Distributions are shown for both muons with |tj| < 0.8 (left) and 
|tj| > 1.5 (right). 

turn, the resolution of q/pr is independent of the true momentum of the muon and determined by the 
intrinsic resolution of the detector. 

For electrons, things are more complicated. As well as the intrinsic resolution, there are competing 
effects from bremsstrahlung (which lowers the track momentum and makes the charge easier to measure) 
and the conversion of bremsstrahlung photons (leading to pattern-recognition problems and degraded 
charge determination). At 0.5 TeV, the effects of the conversions are significant, causing the electrons to 
be measured worse than the corresponding muons. However, at 2 TeV, the intrinsic resolution dominates 
the electron charge misidentification, and this is partially compensated for by the bremsstrahlung. The 
fractions of muons and electrons for which the sign of the charge is incorrectly determined are shown in 
Fig. 10. For these plots, perfect alignment has been assumed; any misalignment will degrade the charge 
sign determination. 

3.3 Track reconstruction efficiency 

Figures 11, 12 and 13 show the efficiencies for reconstructing isolated muons, pions and electrons. In 
addition to multiple-scattering, pions are affected by hadronic interactions in the inner detector material, 
while electrons are subject to even larger reconstruction inefficiencies which arise from the effects of 
bremsstrahlung. As a result, the efficiency curves as a function of | r\ | for pions and electrons reflect the 
shape of the amount of material in the inner detector (see Fig. 3). As expected, the efficiency becomes 
larger and more uniform as a function of \r\ | at higher energies. 

Previous studies [1] have shown that the reconstruction efficiency is little affected by the "pile-up" of 
additional minimum bias events at high luminosity (10 34 cm^V). A more challenging environment is 
found in the core of an energetic jet. Figure 14 shows the track reconstruction efficiency for prompt pions 
(produced before the vertexing layer) and the fake rate for tracks in jets in tt events as a function of |tj|. 
For these events, the mean jet pj is 55 GeV, and the mean pj of the accepted tracks which they contain 
is 4 GeV. The loss of efficiency at | r\ | = 0 with the ^-tagging criteria arises from inefficiencies in the pixel 
vertexing layer, which are assumed here to be 1%; this improves at higher |tj|, owing to the presence 
of larger clusters when the track incidence angle decreases. Beyond \r\ \ ~ 1, the tracking performance 
deteriorates, mostly because of increased material. As shown in Fig. 15, the fake rate increases near the 



24 



Tracking - The Expected Performance of the Inner Detector 





Figure 9: Reconstructed inverse transverse momentum multiplied by the charge for high-energy muons 
(jU~) (left) and electrons (e~) (right) for p T = 0.5 TeV (top) and p T = 2 TeV (bottom) and integrated 
over a flat distribution in r\ with \ r\ \ < 2.5. Those tracks which have been incorrectly reconstructed with 
a positive charge are indicated by the shaded regions. At 2 TeV, the fraction of electrons (muons) whose 
charge has been misidentified is 12.8% (13.7%). 
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Figure 10: Charge misidentification probability for high-energy muons and electrons as a function of p T 
for particles with | tj | < 2.5 (left) and as a function of | tj | for p T = 2 TeV (right). 
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Figure 11: Track reconstruction efficiencies as a function of |tj| for muons (left) and pions (right) 
with p T = 1, 5 and 100 GeV. 




Figure 12: Track reconstruction efficiencies as 
a function of |tj| for electrons with pj = 1, 5 
and 100 GeV. 
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Figure 13: Track reconstruction efficiencies as a 
function of \t]\ for muons, pions and electrons 
with pr = 5 GeV. The inefficiencies for pions and 
electrons reflect the shape of the amount of mate- 
rial in the inner detector as a function of | T7 1 . 
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Figure 14: Track reconstruction efficiencies and 
fake rates as a function of \r}\, for charged pions 
in jets in it events and for different quality cuts (as 
described in Section 3.1). "Reconstruction" refers 
to the basic reconstruction before additional qual- 
ity cuts. 



Figure 15: Track reconstruction efficiencies and 
fake rates as a function of the distance AR (de- 
fined as AR = ^Ari 2 + Atp 2 ) of the track to the 
jet axis, using the standard quality cuts and inte- 
grated over 1 7] | < 2.5, for charged pions in jets in 
it events. 



core of the jet, where the track density is the highest and induces pattern-recognition problems. This 
effect increases as the jet pj increases. Using alternative algorithms, a few percent efficiency can be 
gained at the cost of doubling the fake rate in the jet core. 

The reconstruction described in Section 2 is aimed at tracks with pj > 0.5 GeV. Multiplicity studies 
in minimum bias events will be among the first analyses undertaken by ATLAS. In these events, the 
peak of the track pj spectrum is around 0.3 GeV. The reconstruction of these low-momentum tracks 
will be difficult because of the high curvature of the tracks, increased multiple scattering, and at very 
low momentum, reduced numbers of hits, since the tracks may fail to reach the outer layers of the 
inner detector. To complement the track-finding strategy described in Section 2, an additional strategy 
is employed in which hitherto unused pixel and SCT hits are used. To further aid the reconstruction, 
the algorithm for the space-point track seeding is modified to use looser internal cuts and the cut on the 
number of precision hits is reduced to at least five hits. Tracks are accepted with p T > 0.1 GeV, and in 
some cases, inefficiencies for pj > 0.5 GeV are recovered. The resulting track reconstruction efficiency 
is shown in Fig. 16. The distribution of candidate fake tracks is shown in Fig. 17. 



4 Vertexing performance 
4.1 Primary vertices 

Vertexing tools constitute important components of the higher-level tracking algorithms. The residuals of 
the primary vertex reconstruction are shown in Fig. 18, as obtained without using any beam constraint, 
for it events and H —> yy events with m# = 120 GeV. The results shown here for H —> yy events are 
based on tracks reconstructed from the underlying event and do not make use of the measurement of the 
photon direction in the electromagnetic calorimeter. The primary vertex in it events has always a rather 
large multiplicity and includes a number of high-pr tracks, resulting in a narrower and more Gaussian 
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Figure 16: Track reconstruction efficiencies as a function of pj for | tj | < 2.5 and pj > 0. 1 GeV (left) and 
as a function of \r\ | for two different pj ranges (right) in minimum bias events (non-diffractive inelastic 
events). 
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Figure 17: Rate of candidate fake tracks as a function of p r for |tj| < 2.5 and p T > 0.1 GeV (left) and 
as a function of |t]| (right) in minimum bias events (non-diffractive inelastic events). The rate of such 
tracks is a function of the amount of material, indicating that a large fraction of them are secondaries for 
which the Monte-Carlo truth information is not kept 
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distribution than for H — > yy events. Table 4 shows the resolutions of the primary vertex reconstruction 
in these tt and H —> 77 events, without and with a beam constraint in the transverse plane, as well as the 
efficiencies to reconstruct and select correctly (within ±300 jUm) these primary vertices in the presence 
of pile-up at a luminosity of 10 33 cm~ 2 s _1 . 



Event type 


x-y resolution 


z resolution 


Reconstruction 


Selection 




(Mm) 


(Aim) 


efficiency (%) 


efficiency (%) 


(no BC) 


18 


41 


100 


99 


ff (BC) 


11 


40 


100 


99 


H^-yy (no BC) 


36 


72 


96 


79 


yy (BC) 


14 


66 


96 


79 



Table 4: Primary vertex resolutions (RMS), without and with a beam constraint (BC) in the transverse 
plane, for tt events and H — >• 77 events with m H = 120 GeV in the absence of pile-up. Also shown, in the 
presence of pile-up at a luminosity of 10 33 cm~ 2 s _1 , are the efficiencies to reconstruct and then select 
the hard- scattering vertex within ±300 Aim of the true vertex position in z. The hard- scattering vertex is 
selected as the primary vertex with the largest Zpj, summed over all its constituent tracks. 
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Figure 18: Primary vertex residual along x, in the transverse plane (left), and along z, parallel to the 
beam (right), for events containing top-quark pairs and H — >• 77 decays with m H = 120 GeV. The results 
are shown without pile-up and without any beam constraint. 



4.2 Secondary vertices 

The resolution for the reconstruction of the radial position of secondary vertices for J/y —> fifi decays 
in events containing B-hadron decays (mean pj of 15 GeV for the J/y) is shown in Fig. 19. While 
there are some tails in the resolution distributions (left-hand plot), these are small. The corresponding 
distributions for three -prong hadronic T-decays in Z — >• TT events (mean p T of 36 GeV for the T-lepton) 
are shown in Fig. 20. Because there are three charged tracks in close proximity, the reconstruction of 
these decays is more challenging: the vertex resolutions are Gaussian in the central region, but have long 
tails as can be seen from the points showing 95% coverage in right-hand plot. 
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Figure 19: Resolution for the reconstruction of the radial position of the secondary vertex for J/y — ► fiji 
decays in events containing B-hadron decays for tracks with 1 77 1 around 0 (left) and as a function of the 
pseudorapidity of the J/\ff (right). The J /iff have an average transverse momentum of 15 GeV. 




Figure 20: Resolution for the reconstruction of the radial position of the secondary vertex for three- 
prong hadronic T-decays in Z —> %x events for tracks with 1 77 1 around 0 (left) and as a function of 
the pseudorapidity of the % (right). In the right-hand plot, the circles with bars correspond to Gaussian 
fits, as illustrated in the left-hand plot; the points showing 68.3% (95%) coverage show the width of the 
integrated distribution containing 68.3% (95%) of the measurements (corresponding to la (2a) for a 
Gaussian distribution). The T-leptons have an average transverse momentum of 36 GeV. 
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Finally, Fig. 21 shows the resolution as a function of decay radius for the reconstruction of the radial 
position of secondary vertices for K® decays (mean pj of 6 GeV) in events containing B-hadron decays. 
The resolution in each radial slice is determined from a Gaussian fit to the core of the distribution. It can 
be seen that there are significant tails: just before the barrel layers, the resolution for decays in the barrel 
region is good, giving rise to the core; while that from the end-caps is variable, depending on the actual 
position of the decay, giving rise to a broader distribution. The tails can be reduced and the resolutions 
improved somewhat by tighter cuts on track quality and the reconstructed invariant mass, if desirable. 
The effect of crossing the three successive pixel layers is clearly visible as well as the degraded resolution 
for decays beyond the last pixel layer. Figure 22 shows the resolution as a function of decay radius for the 
reconstruction of the invariant mass of the charged-pion pair for the same — >• % + %~ decays. Figure 23 
shows the efficiency to reconstruct the K® decays. The reconstruction requires 3D information provided 
by the silicon detectors, and hence the efficiency falls to zero once the decay is beyond the penultimate 
SCT layers. 

5 Particle identification, reconstruction of electrons and photon conver- 
sions 

The reconstruction of electrons and of photon conversions is a particular challenge for the inner detector. 
The fraction of energy lost by electrons traversing the inner detector is shown in Fig. 24. In the energy 
range over which the inner detector will measure electrons, the fraction has little dependence on the 
actual electron energy. Electrons lose on average between 20 to 50% of their energy (depending on \r}\) 
by the time they have left the SCT, as illustrated in Fig. 25. The probability for photons to convert is 
fairly independent of their energies for pj > 1 GeV. A histogram of the location of photon conversions 
in \r\ | < 0.8 is shown in Fig. 26 - the radial structure of the detector is clearly visible. Between 10 to 50% 
of photons have converted into an electron-positron pair before leaving the SCT, as illustrated in Fig. 27. 

The TRT plays a central role in electron identification, cross-checking and complementing the elec- 
tromagnetic calorimeter, especially at energies below 25 GeV [2]. In addition, the TRT contributes to the 
reconstruction and identification of electron track segments from photon conversions down to 1 GeV and 
of electrons which have radiated a large fraction of their energy in the silicon layers. 

5.1 Electron reconstruction 

In the absence of bremsstrahlung, the distribution Ptrue/Pream should be Gaussian; but in the presence 
of bremsstrahlung, this is far from true, as can be seen for the end-cap in Fig. 28 (left-hand plot). By 
fitting electron tracks in such a way as to allow for bremsstrahlung, it is possible to improve the recon- 
structed track parameters, as shown in Figs. 28 and 29 for two examples of bremsstrahlung recovery 
algorithms. These algorithms rely exclusively on the inner detector information and therefore provide 
significant improvements only for electron energies below ~ 25 GeV. The dynamic noise adjustment 
(DNA) method extrapolates track segments to the next silicon detector layer. If there is a significant % 2 
contribution, compatible with a hard bremsstrahlung, the energy loss is estimated and an additional noise 
term is included in the Kalman filter [9]. The Gaussian-sum filter (GSF) is a non-linear generalisation 
of the Kalman filter, which takes into account non-Gaussian noise by modelling it as a weighted sum of 
Gaussian components and therefore acts as a weighted sum of Kalman filters operating in parallel [10]. 
With real data, to improve the fitted track parameters for electrons without deteriorating the fits for non- 
electrons, it is necessary to assess whether a track is likely to correspond to an electron or not. This 
can be done to some extent by the algorithms themselves by looking at the fits; additional information 
can be obtained from the transition radiation in the TRT (see 5.2) and the electromagnetic calorimeter. 
Ultimately, since information is lost during the bremsstrahlung, there is an unavoidable degradation of 
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Figure 21: Resolution for the reconstructed radial position of the secondary vertex for K® — > n + K~ 
decays in events containing B-hadron decays in various radial intervals (upper) and as a function of the 
Kf decay radius (lower). The resolutions are best for decays just in front of the detector layers. The 
barrel pixel layers are at: 51, 89 and 123 mm; the first two SCT layers are at 299 and 371 mm. 
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Figure 22: Resolution for the reconstruction of the invariant mass of the charged-pion pair for —> 
K + K~ decays in events containing B-hadron decays in various radial intervals (upper) and as a function 
of the decay radius (lower). 
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Figure 23: Efficiency to reconstruct charged-pion pairs for K® —>■ n + n decays in events containing B- 
hadron decays as a function of the decay radius (left) and as a function of the |tj| of the K® (right). 




Figure 24: Probability distribution as a func- 
tion of the fraction of energy lost by electrons 
with pt = 10 GeV and 25 GeV (integrated over 
a flat distribution in Tj with |t]| < 2.5) traversing 
the complete inner detector. 




Figure 25: Fraction of energy lost on average by 
electrons with pj = 25 GeV as a function of | r\ \ , 
when exiting the pixel, the SCT and the inner de- 
tector tracking volumes. For |rj| > 2.2, there is 
no TRT material, hence the SCT and TRT lines 
merge. 



34 



Tracking - The Expected Performance of the Inner Detector 




Radius (mm) 



Figure 26: Radial position of photon conversions 
in the barrel region (|tj| < 0.8) deduced from 
Monte-Carlo truth information (arbitrary normali- 
sation). 




Radius (mm) 



Figure 27: Probability for a photon to have con- 
verted as a function of radius for different values 
of \t]\, shown for photons with pj > 1 GeV in 
minimum bias events. 




Figure 28: Probability distributions for the ratio of the true to reconstructed momentum (left) and its 
reciprocal (right) for electrons with pj = 25 GeV and \ r\\ > 1.5. The results are shown as probabilities 
per bin for the default Kalman fitter and for two bremsstrahlung recovery algorithms (see text). 

the electron measurement. The algorithms serve to reduce the bias of the track fits caused by the in- 
creased track curvature. Only by adding additional information, such at the position of the cluster in the 
electromagnetic calorimeter [2], is it possible to make a real improvement on the measured momentum. 

By allowing for changes in the curvature of the track, the bremsstrahlung recovery algorithms "fol- 
low" the tracks better and correctly associate more of the hits, leading to improvements in the recon- 
struction efficiencies, as can be seen in Fig. 30. GSF has 2-3% greater efficiency than the default recon- 
struction, since it does not flag hits as outliers, hence a track is less likely to fail the quality cuts on the 
numbers of hits. 

Figure 31 shows the improvements from bremsstrahlung recovery for the reconstructed J/yr — > 
ee mass. Integrating over the complete pseudorapidity acceptance of the ID, and without using any 
bremsstrahlung recovery, only 42% of events are reconstructed within ±500 MeV of the nominal J/yr 
mass, whereas with the use of the bremsstrahlung recovery, this fraction increases to 53% and 56% for 
DNA and GSF respectively, and the bias of the peak position is reduced. In the inner detector alone, the 
J /iff signal in the end-caps is more or less completely lost because of the effects of the increased material 
compared to that in the barrel. The poor performance in the end-caps arises from the significant fraction 
of energy lost by electrons (O(30)% by the time they have left the pixels) as well as the change in track 
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Figure 29: Probability distributions for the ratio of the true to reconstructed momentum for electrons 
with p T = 25 GeV and |tj| < 0.8 (left) and p T = 10 GeV and \rj\ > 1.5 (right). The results are shown as 
probabilities per bin for the default Kalman fitter and for two bremsstrahlung recovery algorithms (see 
text). 
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Figure 30: Efficiencies to reconstruct electrons as a function of | tj | for electrons with pj - 25 GeV (left) 
and pj = 10 GeV (right). The results are shown for the default Kalman fitter and for two bremsstrahlung 
recovery algorithms (see text). 
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Figure 31: Probability for the reconstructed invariant mass of electron pairs from J j\\f —> ee decays 
in events with B° d — ► J /y(ee)K®. Distributions are shown for both electrons with \r\\ < 0.8 (left) and 
1 77 1 > 1.5 (right). The results are shown for the default Kalman fitter and for two bremsstrahlung recovery 
algorithms (see text). The true J/\\r mass is shown by the vertical line. 

direction. These distributions should be contrasted with those for the muonic decays of the J/xff in Fig. 8. 

To conclude, the material of the inner detector causes a significant amount of bremsstrahlung for 
electrons, biasing their fitted parameters. This can be partially compensated within the inner detector 
using the so-called bremsstrahlung recovery procedures, DNA and GSF These algorithms should be 
applied to tracks in a way so as to improve electrons and not degrade pions or muons. DNA runs in a 
time comparable with other simple fitters, while GSF, albeit producing better results, is a factor of twenty 
slower than DNA. Exactly how these algorithms are used will depend on individual physics analyses. 

5.2 Electron identification 

While the end-cap TRT (discrete radiator foils) is relatively easy to simulate, the barrel TRT (matrix of 
fibres) is harder and the best indication of the expected performance comes from the test beam (CTB), 
where a complete barrel TRT module was tested. Using pion, electron and muon samples in the en- 
ergy range between 2 and 350 GeV, the barrel TRT response has been measured in the CTB in terms 
of the high-threshold hit probability, as shown in Fig. 32. The measured performance has been used 
to parametrise the response in the TRT barrel. The transition-radiation X-rays contribute significantly 
to the high-threshold hits for electron energies above 2 GeV and saturation sets in for electron energies 
above 10 GeV. Figure 33 shows the resulting pion identification efficiency (probability of pions being 
misidentified as electrons) for an electron efficiency of 90%, achieved by performing a likelihood eval- 
uation based on the high-threshold probability for electrons and pions for each straw. Figure 33 also 
shows the effect of including time-over-threshold information, which improves the pion rejection by 
about a factor of two when combined with the high-threshold hit information. At low energies, the pion 
rejection (the inverse of the pion efficiency plotted in Fig. 33) improves with energy as the electrons 
emit more transition radiation. The performance is optimal at energies of ~ 5 GeV, and pion-rejection 
factors above 50 are achieved in the energy range of 2-20 GeV. At very high energies, the pions become 
relativistic and therefore produce more 5-rays and eventually emit transition radiation, which explains 
why the rejection slowly decreases for energies above 10 GeV. 

The electron-identification performance expected for the TRT in ATLAS, including the time-over- 
threshold information, is shown as a function of |t]| in Fig. 35 in the form of the pion identification 
efficiency expected for an electron efficiency of 90% or 95%. The shape observed is closely correlated to 
the number of TRT straws crossed by the track (see Fig. 34), which decreases from approximately 35 to a 
minimum of 20 in the transition region between the barrel and end-cap TRT, 0.8 < \t}\ < 1.1, and which 
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Figure 32: Average probability of a high-threshold 
hit in the barrel TRT as a function of the Lorentz 
/-factor for electrons (open squares), muons (full 
triangles) and pions (open circles) in the energy 
range 2-350 GeV, as measured in the combined 
test-beam (CTB). 
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Figure 33: Pion efficiency shown as a function 
of the pion energy for 90% electron efficiency, 
using high-threshold hits (open circles), time- 
over-threshold (open triangles) and their combina- 
tion (full squares), as measured in the combined 
test-beam. 
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Figure 34: Number of hits on a track as a function 
of | tj | for a track crossing the TRT. 



Figure 35: Pion efficiency expected from simula- 
tion as a function of 1 7] | for an efficiency of 90% 
or 95% for electrons with p T = 25 GeV. 



also decreases rapidly at the edge of the TRT fiducial acceptance for |tj| > 1.8. Because of its more 
efficient and regular foil radiator, the performance in the end-cap TRT is better than in the barrel TRT 
where it consists of radiating fibres [2]. 

5.3 Conversion reconstruction 

Figure 36 shows the efficiency for reconstructing conversions of photons with pj = 20 GeV and | X] | < 2. 1 
as a function of the conversion radius and pseudorapidity, using the standard tracking algorithm com- 
bined with the back-tracking algorithm described in Section 2. At radii above 50 cm, the efficiency for 
reconstructing single tracks drops and that for reconstructing the pair drops even faster because the two 
tracks are merged. If both tracks from the photon conversion are reconstructed successfully, vertexing 
tools can be used to reconstruct the photon conversion with high efficiency up to radii of 50 cm. The over- 
all conversion-identification efficiency can be greatly increased at large radii by flagging single tracks as 
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photon conversions under certain conditions. (The identification is distinct from the reconstruction, since 
with a single electron, the photon conversion cannot be reconstructed.) Only tracks which have no hits 
in the vertexing layer, which are not associated to any fitted primary or secondary vertex, and which pass 
a loose electron identification cut requiring more than 9% high-threshold hits on the TRT segment of the 
track are retained. The resulting overall efficiency for identifying photon conversions is almost uniform 
over all radii below 80 cm, as shown in Fig. 37. 




Figure 36: Efficiency to reconstruct conversions of photons with pj = 20 GeV and \ r\ \ < 2. 1 , as a function 
of the conversion radius (left) and pseudorapidity (right). Shown are the efficiencies to reconstruct single 
tracks from conversions, the pair of tracks from the conversion and the conversion vertex. 




Figure 37: Efficiency to identify conversions of photons with pj - 20 GeV and \r\\ < 2.1, as a function 
of the conversion radius (left) and pseudorapidity (right). The overall efficiency is a combination of the 
efficiency to reconstruct the conversion vertex, as shown also in Fig. 36, and of that to identify single- 
track conversions (see text). 



6 Conclusions 

This paper documents the expected performance for the ATLAS inner detector, focusing on the low- 
luminosity running at the start-up of the LHC. Most of the performance specifications set out in [1] have 
been met - it is only at larger values of | r\ | , where there are significant amounts of material, that the 
track-finding efficiencies are less than the targets. 
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The reconstruction of muons, electrons and pions has been studied in detail as a function of transverse 
momentum and pseudorapidity. For high-pr muons in the barrel region, the resolution for l/pr is 
expected to be 0.34 TeV -1 and the resolution for the transverse impact parameter 10 ;Um. The charge of 
muons and electrons will be measured in the inner detector over the complete acceptance up to 1 TeV 
with misidentification probabilities on average of no more than a few percent. In the barrel region, muons 
with pr > 1 GeV can be identified with efficiencies in excess of 98%. For high-pr muons, this rises 
to > 99.5% across the whole acceptance. Electrons and pions suffer from material effects; for tracks 
around 5 GeV, they are reconstructed with efficiencies between 70 and 95%. The inner detector is able 
to reconstruct pions down to 0.2 GeV with efficiencies around 50%. Fake rates are low; even in the core 
of moderate-energy jets (O(50) GeV Et), rates are less than 1%. 

Algorithms have been developed to reconstruct primary and secondary vertices, as well as K® (and 
other V°s) decays and conversions. In the case of tt events, primary vertices can be identified with 99% 
efficiency in the presence of low-luminosity pile-up. decays can be reconstructed up to a radius of 
400 mm, while conversions can be identified by reconstructing pairs of tracks or tagging single electrons 
in the TRT with 80% efficiency all the way up to a radius of 800 mm. 

Electrons suffer from bremsstrahlung caused by the significant material in the inner detector. Algo- 
rithms have been developed to improve the reconstruction of electrons, reducing the bias on the measured 
momentum. While reasonable electron reconstruction is possible in the inner detector barrel, it is quite 
difficult in the end-caps because of the increased amount of bremsstrahlung - here, the use of the elec- 
tromagnetic calorimeter will be essential. Electrons can be identified by their transition radiation in the 
TRT. For an electron efficiency of 90% at pj = 25 GeV, the pion misidentification probability is of the 
order of a few percent over most of the acceptance, and the pion rejection will be optimal around 5 GeV. 

After many years of preparing the ATLAS inner detector software and having tested it on simulated 
and test-beam data, we are ready to reconstruct and analyse data from collisions. We now look forward 
to the first data from the LHC. 
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Abstract 

This note describes the calibration of electromagnetic clusters, as implemented 
in current releases of the ATLAS reconstruction program. A series of correc- 
tions are applied to calibrate both the energy and position measurements; these 
corrections are derived from Monte-Carlo simulations and validated using test- 
beam data. The possibility of obtaining inter-calibration energy corrections 
from Z — > ee data is also discussed. 

1 Introduction 

In order to realise the full physics potential of the LHC, the ATLAS electromagnetic calorimeter must 
be able to identify efficiently electrons and photons within a large energy range (5 GeV to 5 TeV), and to 
measure their energies with a linearity better than 0.5%. The W boson mass measurement, not considered 
here, will require better precision. 

The procedure to measure the energy of an incident electron or photon in the ATLAS electromag- 
netic (EM) calorimeter has been described in Ref. [1]. Each step of the energy reconstruction has been 
validated by a series of beam tests over many years, both using only the calorimeter [2, 3] and also 
combined with representative components from all detector sub-systems. This has allowed considerable 
refinement of the calorimeter simulation. This simulation is then used to model the behaviour of the full 
detector. 

One of the key ingredients for the description of the detector performance is the amount and position 
of the upstream material. The understanding of the ATLAS detector geometry has also made progress 
over the years; an overview of the present knowledge of the detector and its expected performance can be 
found in [4]. The amount of material in front of the calorimeter for the as-built detector is significantly 
larger than was initially estimated; this leads to larger energy losses for electrons and to a larger fraction 
of photons converting (see Figs. 1 and 2). 

The standard ATLAS coordinate system is used: the beam direction defines the z-axis, and the x-y 
plane is transverse to the beam direction. The azimuthal angle 0 is measured around the beam axis and 
the polar angle 6 is the angle from the beam axis. The pseudorapidity is defined as t] = — ln(tan(0/2)). 

1.1 Electron and photon candidates 

The "sliding window" algorithm [5] is used to find and reconstruct electromagnetic clusters. This forms 
rectangular clusters with a fixed size, positioned so as to maximise the amount of energy within the clus- 
ter. An alternate algorithm is available which forms clusters based on connecting neighbouring cells until 
the cell energy falls below a threshold; this is not used by the default electron and photon reconstruction. 
The optimal cluster size depends on the particle type being reconstructed and the calorimeter region: 
electrons need larger clusters than photons due to their larger interaction probability in the upstream ma- 
terial and also due to the fact that they bend in the magnetic field, radiating soft photons along a range 
in (j). Several collections of clusters are therefore built by the reconstruction software, corresponding to 
different window sizes. These clusters are the starting point of the calibration and selection of electron 
and photon candidates. 

One of the recent improvements in the calibration procedure is that electron and photon candidates 
are treated separately. For each of the reconstructed clusters, the reconstruction tries to find a matching 
track within a At] x A(j) window of 0.05 x 0.10 with momentum p compatible with the cluster energy E 
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Figure 1: Average energy loss vs. |tj| for E = 
100 GeV electrons before the presampler/s trips 
(crosses/open circles), and reconstructed energies 
before/after (solid/open boxes) corrections. 



Figure 2: Fraction of photons converting at a ra- 
dius of below 80 cm (115 cm) in open (full) cir- 
cles, as a function of |tj | [4]. 



(Ejp < 10 [6,7]). If one is found, the reconstruction checks for presence of an associated conversion. 
An electron candidate is created if a matched track is found and no conversion is flagged. Otherwise, the 
candidate is classified as a photon. 

This early classification allows applying different corrections to electron and photon candidates. 
It is the starting point of a more refined identification based largely on shower shapes, described in 
companion notes [6, 7]. Four levels of electron quality are defined (loose, medium, tight, and tight 
without isolation). The available photon selection corresponds to the tight electron selection (excluding 
tracking requirements). The medium and tight selections are used in some parts of the calibration analysis 
described in this note. But the corrections derived are then applied to all electron and photon candidates. 



1.2 Calorimeter granularity 

The electromagnetic calorimeter (Fig. 3) was designed to be projective in tj , and covers the pseudorapid- 
ity range |tj| < 3.2. Precision measurements are however restricted to \r} \ < 2.5; regions forward of this 
are outside of the scope of this note. The calorimeter is installed in three cryostats: one containing the 
barrel part (|tj| < 1.475), and two which each contain the two parts of the end-cap (1.375 < |tj| < 3.2). 
Its accordion structure provides complete 0 symmetry without azimuthal cracks. The total thickness 
of the calorimeter is greater than 22 radiation lengths (X 0 ) in the barrel and greater than 24X 0 in the 
end-caps. It is segmented in depth into three longitudinal sections called layers, numbered from 1 to 3 
outwards from the beam axis. These layers are often called "front" (or "strips"), "middle," and "back." 
The tj granularity of the calorimeter for the front and middle layers is shown in Table 1. The (j> size of 
cells is 0.025 in layer 2 and 0.1 in layer 1. Layer 3 has a granularity of Atj x A<p = 0.050 x 0.025. For 
| tj | < 1 . 8, a presampler detector is used to correct for the energy lost by electrons and photons upstream 
of the calorimeter. All these regions must be treated separately in deriving the individual corrections. 

The effect of the choice of cluster size on electron and photon energy reconstruction has been studied 
in Refs. [1] and [8]. These results are still the baseline of the present software. For electrons, the energy 
in the barrel electromagnetic calorimeter is collected over an area corresponding to 3 x 7 cells in the 
middle layer, i.e. Atj x A0 = 0.075 x 0.175. For unconverted photons, the area is limited to 3 x 5 cells in 
the middle layer, whereas converted photons are treated like electrons. The cluster width in tj increases 
with increasing | tj | ; therefore, an area of 5 x 5 cells in the middle layer is used for both electrons and 
photons in the end-cap calorimeter. 
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Figure 3: Sketch of the accordion structure of the 
EM calorimeter [8]. 





| V) | range 


Cell T\ 








Layer 1 


Layer 2 


Barrel 


0-1.4 


0.025/8 


0.025 




1.4-1.475 


0.025 


0.075 


end-cap 


1.375-1.425 


0.05 


0.05 




1.425-1.5 


0.025 


0.025 




1.5-1.8 


0.025/8 


0.025 




1.8-2.0 


0.025/6 


0.025 




2.0-2.4 


0.025/4 


0.025 




2.4-2.5 


0.025 


0.025 



Table 1: Calorimeter Vj granularity in layers 1 
and 2. 



1.3 Geometries and data sets 

The present knowledge of the detector geometry, resulting from the detector survey, is described in [4] 
(Sec. 9). But even before the final survey, it was known that the inner detector services located in the 
crack region would be wider than originally expected, and that the end-cap electromagnetic calorimeter 
would be shifted by about 4 cm, compared to the nominal (and pointing) geometry described in Ref. [1]. 
This is taken into account in the simulation, and is treated as a misalignment in the cell calibration 
procedure described below. 

High statistics samples of single electrons and photons, processed with the full detector simulation 
based on GEANT 4.7 [9], were used to derive and study the corrections. Two detector geometries are 
available. The first is the "ideal geometry," which contains the best knowledge of the dead material, 
but which has no misalignments except for the 4 cm shift of the end-caps. The data sets based on 
this geometry are used to derive the corrections and for most of the performance studies. The second 
available geometry is a distorted one, in which extra material is added between the tracking detectors and 
the calorimeters, and in which misalignments are introduced. For example, the amount of material in the 
inner detector has increased in some regions by up to 7% of a radiation length for positive 0, and the 
density of material in the gap between the barrel and end-cap cryostats has increased by a factor of 1.7. 
The distorted data-sets using this geometry are used to estimate systematic uncertainties and to check the 
sensitivity of the methods to additional material. In addition to these single-particle data sets, Z — > ee 
decays are also available. 

The standard calorimeter reconstruction for simulated data includes the effects of possible cell-level 
miscalibrations by smearing the measured energy of each cell (by about 0.7%), therefore increasing the 
constant term of the energy resolution. (The fractional energy resolution is conventionally parametrised 
as a(E)/E = a/E @b/ ' \J~E~ ©c, where a is the noise term, b is the sampling term and c is the constant 
term.) Unless otherwise stated, the results in this note do not include this additional smearing, and 
therefore correspond to assuming a perfect cell-level calibration. 

1.4 Energy and position reconstruction 

The calibration of the LAr calorimeter is factorised into a channel-by-channel calibration of the electron- 
ics readout and an overall energy scale determination. 

The first step, often called "electronics calibration", converts the raw signal extracted from each cell 
(in ADC counts) into a deposited energy. The method used for this step, which is beyond the scope of this 
note, was described in Ref. [1]. It was refined and validated when final barrel and end-cap modules were 
studied in test beams [2, 3]. In the past two years, the experience gained and the algorithm developed 
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were integrated into the standard ATLAS calibration software [10]. 

The second step deals with clusters. The energies deposited in the cells of each individual layer of 
a cluster are summed, and an energy-weighted cluster position is calculated for each layer. There are 
several important effects which must then be understood: 

• Due to the accordion geometry, the amount of absorber material crossed by incident particles varies 
as a function of 0. This produces a 0 modulation of the reconstructed energy. 

• The shower is not fully contained in the X] window chosen for clusters, and the cells have a finite 
granularity. This introduces a modulation in the energy and a bias in the measured position ("S- 
shape") which depend on the particle impact point within a cell. 

• A perfectly projective particle, coming from the origin of the coordinate system, intersects the cal- 
orimeter at the same r\ position in all layers. The luminous region, however, extends significantly 
in z; a particle from a vertex away from the origin intersects the calorimeter at slightly different r\ 
positions in each layer. Properly combining these r\ measurements requires an accurate parametri- 
sation of the shower depth within each layer. 

An early study of these corrections, using both simulation and test beam data, can be found in [11]. 
The present prediction of these effects and their dependencies on the impact point and energy of the 
incident particle are described in detail in this note. 

The measured energy and position of EM clusters are corrected as described below (see Fig. 4). 
The required scale of the correction is illustrated by the upper points in Fig. 1 , which shows the recon- 
structed energies of E = 100 GeV electrons before and after calibration. It is about 10% over most of the 
calorimeter, but is larger in the transition region between cryostats. 



Figure 4: Cluster correction steps. 



• To start with, the energies in the cluster cells are summed, and an energy weighted (tj , 0 ) position is 
calculated for each calorimeter layer. Before applying the cluster corrections, the energy resolution 
has a constant term of about 0.65% (quoted for photons at \ r\ \ = 0.3). 

• As the first step, corrections are applied to the cluster position, measured in each layer. These are 
described in Sec. 2. The position measurements from the first two layers are then combined to de- 
fine the shower impact point in the calorimeter, which can then be used for energy reconstruction. 

• The next step is to combine the energies deposited in each layer. Two separate procedures have 
been developed to do this which are described in Sees. 3.1 and 5. In the first one, per-layer energy 
coefficients, called longitudinal weights, are adjusted to optimise at the same time the energy 
resolution and the linearity of the response. In the second one, the simulation is used to correct for 
different types of energy loss one by one, by correlating each of them with measured observables. 
The corrections are calculated separately for electrons and photons, and determined as a function 
of |tj |. This reduces the local constant term to about 0.61%. 

• The third step, described in Sec. 3.2, uses the shower impact point to correct the total energy for 
modulations in r\ and 0. This reduces the local constant term to about 0.43%. 
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In spite of the skill and care put into the detector construction, calibration, and operation, some local 
or "medium range" inhomogeneities in the calorimeter response have to be expected: localised high- 
voltage or temperature effects or unexpected additional dead material must be detected and corrected for 
using data. It is planned to use Z —> ee decays to measure and correct for such effects and to help fix the 
absolute energy scale. The method developed and the precision expected are described in Sec. 6. 

2 Cluster position measurement 

The position of a cluster is measured in r\ and 0. The positions are first calculated independently for each 
calorimeter layer as the energy-weighted barycenters of all cluster cells in the layer. (The barrel and end- 
cap are also treated separately at this stage.) Secondly, the individual layer measurements are corrected 
for known systematic biases. Finally, the position measurements from layers 1 and 2 are combined to 
produce the overall cluster position. The position corrections are derived using single-particle electron 
and photon data samples. Each sample is mono-energetic, and the available samples span the range 
5-1000 GeV. 

The r\ positions that are calculated at this stage are "detector"-T], corresponding to the angle that 
would be made by a particle originating from the origin of the detector coordinate system. In order 
to properly compare the calculated detector-T7 positions with the r\ of a generated incident particle, 
which will in general have its production vertex offset in z from the detector origin, one must assume 
a depth for each calorimeter layer. Here, "depth" refers to the radial distance from the beam axis for 
the barrel calorimeter, and to the distance from the x - y plane passing through the origin for the end- 
cap calorimeter. The depths used are those which optimise the 77 -position resolution; they are shown in 
Fig. 5. 




(a) Barrel. (b) End-cap. 

Figure 5: Calorimeter depths versus \r\ \ for layers 1 and 2 and for 100 GeV photons. The points show 
the derived optimal depths, and the curves are piecewise polynomial fits to the points. For layer 2 of the 
barrel, a single curve yielded an adequate fit across \r\ \ = 0.8; this may be revisited in future versions. 
From 100 GeV photons. 



2.1 r\ position correction (S-shape) 

The cluster r\ position is first calculated in each layer as the energy-weighted barycenter of the cluster 
cells in that layer. (In layer 1 , only the three strips around the cluster center are used, regardless of the 
specified cluster size.) Due to the finite granularity of the readout cells, these measurements are biased 
towards the centers of the cells. For examples, see Fig. 6. This figure plots the difference in r\ between 
the incident particle and the reconstructed cluster (At] = r\ imtt — T] reco ) as a function of v, the relative r\ 
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offset of the cluster within the cell, which varies from — 1/2. . . 1/2 across the cell. (The sign of AT7 is 
inverted for negative r\, and in plots it is usually shown as a fraction of the cell X] width.) The general 
functional form shown in this figure is often referred to as "S-shape". 



(a) Layer 1, barrel. (b) Layer 1, end-cap. 




(c) Layer 2, barrel. (d) Layer 2, end-cap. 

Figure 6: At] versus v before and after correction for different regions and for 100 GeV electrons. Note 
the small systematic offset in the end-cap due to a change in the end-cap geometry since the corrections 
were derived. For comparison, the "vl2" points show results reconstructed using the same geometry as 
that used to derive the corrections. 

Figure 6 shows the correction averaged over an |t]| range. The actual correction, however, varies 
continuously over r\ , due to changes in the detector geometry (the corrections change to a much greater 
extent near discontinuities in the calorimeter). For example, the calorimeter cells are not perfectly pro- 
jective (as the inner and outer cell faces are parallel to the beam-line, rather than being perpendicular to a 
line from the detector origin); this induces a bias away from the center of the calorimeter. The correction 
will also depend on the cluster energy, as that affects the average shower depth. 

To derive the correction, the calorimeter is divided in r\ into regions based on where the behaviour 
of the correction changes discontinuously. Within each region, an empirical function is constructed to 
describe the correction, and an unbinned fit is performed to simulated data for a particular cluster size, 
type, and energy. 

The function used for the empirical fit is of the form 

f(v) = Atari- 1 Bv + Cv + D\v\+E, (1) 

where — 1/2 < v < 1/2 across a cell (for the actual fit, the parameters are redefined to reduce correla- 
tions). To turn this into a function of t], the fit parameters are written as polynomials (usually of second 
or third degree) in 1 r\ | : 

A = J>W> (2) 
and similarly for the other parameters. The fit parameters are then the coefficients a,-, etc. 
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One feature to note about this function is that, in general, /(— 1/2) / /(1/2), so that it will be 
discontinuous crossing a cell boundary. For layer 1 , this is usually acceptable, since reconstructed cluster 
positions cluster well away from the cell boundary (Fig. 7(a)). However, in layer 2, the distribution of 
reconstructed cluster positions remains populated across the cluster boundary (Fig. 7(b)). Therefore, for 
layer 2, the function is modified so that /(— 1/2) = f{\/2). 





ATLAS 

0.2 ' 0.21 ' 0.22 0.23 024 b.25' 0.26 0.27 0.28' 0.29 0.3 



(a) Barrel layer 1 (b) Barrel layer 2 

Figure 7: Arj versus \r\ \ in layers 1 and 2 of the barrel, along with the empirical fit function. 



In some cases, there is still a significant periodic residual after fitting to this form; in such cases, an 
additional general trigonometric term is added to the fit: 

/(v)=/(v) + acos(/3jrv+y). (3) 



Finally, a few regions near the calorimeter edges do not exhibit the S -shape form; a general polynomial 
is used as the empirical function there. 

The correction is evaluated separately for each cluster size and type (electrons, photons). The differ- 
ence in the correction between electrons and photons is a few percent, and there is about a 10% difference 
between 5x5 and 3 xN clusters. 

The correction also depends on energy; over the range 25-1000 GeV, the required correction varies 
by ~ 20%. To apply the correction for a given cluster, the correction is first tabulated for each of the 
energies for which simulated data samples were available. The final correction is then found by doing 
a cubic polynomial interpolation within this table. Note a subtlety here: the energies at which the cor- 
rections are tabulated are the true cluster energies. However, when the correction is applied, only the 
reconstructed cluster energy is known. Since the position corrections are done before the energy correc- 
tions, the reconstructed cluster energy will be systematically lower than the true energy. If this were used 
for the interpolation, this would bias the position measurements. So, for the purpose of this interpolation, 
a crude energy correction is performed by scaling the reconstructed cluster energy by the ratio of the true 
to reconstructed energy observed in a 100 GeV sample, parametrised as a function of \rj\. This energy 
correction is used only for the energy interpolation of the position corrections. 

Plots of Atj before and after corrections for several regions are shown in Fig. 6. Note that since the 
present corrections were derived, the simulated detector geometry was changed slightly in the end-cap, 
in order to match more closely the as-built detector. This results in a small systematic offset of 0(1O~ 4 ) 
in these regions. 

The r\ position resolution for photons versus | tj | is shown for the two main calorimeter layers (strips 
and middle) in Fig. 8. The resolution is fairly uniform as function of \r} \ and is 2.5-3.5 x 10~ 4 for the 
strips (which have a size of 0.003 in tj in the barrel electromagnetic calorimeter) and 5-6 x 10~ 4 for the 
middle-layer cells (which have a size of 0.025 in tj). The regions with worse resolution correspond to the 
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barrel/end-cap transition region and, for the strips, to the region with \r}\ > 2, where the strip granularity 
of the end-cap calorimeter becomes progressively much coarser. 




Figure 8: Expected 77 position resolution versus 
I r\ I for E = 100 GeV photons for the two main lay- 
ers of the barrel and end-cap EM calorimeters [4]. 




Figure 9: Profile plot of A<p versus | v\ | before (tri- 
angles) and after (circles) correction. For 100 GeV 
electrons. 



2.2 (f) position correction 

The measurement of the cluster 0 position must also be corrected. These corrections are applied only in 
calorimeter layer 2 (the 0 granularity is best in this layer). As opposed to the 77 direction, the accordion 
geometry results in more energy sharing between cells in the 0 direction, which washes out the S-shape 
in this direction. There is, however, a small bias in the 0 measurement which depends on the average 
shower depth with respect to the accordion structure (and thus on \r}\). A profile plot of A(j) = 0tme — <p ieco 
before the correction is shown in Fig. 9. (The sign of the offset is flipped for 77 < 0, as the two halves 
of the calorimeter are identical under a rotation.) The discontinuity at \r}\ = 0.8, where the absorber 
thickness and the middle layer depth change, is clearly visible. 




Figure 10: Expected 0 position resolution as a 
function of | X] | for electrons and photons with an 
energy of 100 GeV. 



Figure 1 1 : Resolution of r\ position measurement 
from layers 1 and 2 combined for 100 GeV pho- 
tons. 



The correction derived here is symmetric in (j) . In the real detector, the absorbers sag slightly due to 
gravity, causing a (j) -dependent modulation in the 0 offset with a maximum value of about 0.5 mrad [8]. 
This has not been included in the present simulations, and it is therefore not taken into account in this 
correction. Studies have shown, however, that the extra smearing of the position measurement from this 
effect has a negligible contribution to the widths of the invariant mass distributions of e + e~ pairs. (These 
studies were performed by generating decays of massive particles using a toy Monte Carlo, smearing the 
decay products with energy and angular resolutions roughly appropriate to ATLAS, and comparing the 
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widths of the resulting invariant mass distributions before and after shifting the 0 positions by Acos0. 
No significant broadening was observed for A < 50 mrad.) The contribution of this effect to the constant 
term of the energy resolution has not been studied quantitatively, but should also be small. 

To produce a correction, the data are binned in 77. The result for one sample is shown in Fig. 9. This 
function is interpolated in 77; it is then also interpolated in energy as for the 77 position correction. 

The 0 position resolution versus \r] \ is shown for calorimeter layer 2 in Fig. 10. Electron clusters, 
which get smeared in the 0 direction as they radiate while propagating through the magnetic field, have 
a worse 0 position resolution than do photon clusters. A discontinuous step is seen in the resolution 
at 1 77 1 =0.8, where the absorber thickness changes, and the resolution is worst in the transition region 
between the cryostats. 

2.3 Position measurement combination 

The individual layer 77 and (j) measurements are combined to produce the overall 77 and (j) for a cluster. 
For 0, only layer 2 is used, so the combination is trivial except in the overlap region, where the energy- 
weighted average of the barrel and end-cap 0 measurements is used. For 77, both layer 1 and layer 2 are 
averaged. However, layer 1 is weighted three times as much as layer 2 to roughly take into account the 
better resolution in layer 1 . This prescription, which does not use the actual position resolutions and does 
not account for correlations, is known to be suboptimal and will be improved in future software versions. 

Note that the 77 combination implicitly assumes that the incoming particle is projective. If its produc- 
tion vertex is shifted from the origin, then the combined 77 will be biased. This is illustrated in Fig. 11, 
which shows the resolution of the combined cluster 77 measurement. Here, the measured cluster 77 is 
compared to the 77 position of the calorimeter intersected by the true particle track at a depth correspond- 
ing to the cluster barycenter. This is shown both for all clusters and for clusters with the z position of the 
production vertex within 5 mm of the detector center. 

2.4 Shower direction 

At high luminosity, the inner detector cannot accurately determine the interaction vertex due to the large 
number of additional interactions. This is an issue for the reconstruction of a H — > 77 signal. For this 
analysis, achieving the best possible resolution on the invariant mass of the photon pair is crucial for 
separating the signal peak from the continuum background. If the z-position of the interaction vertex 
is unknown, then there will be a large uncertainty in the polar angle of the photons and thus in the 
pair invariant mass. We can, however, recover information about the incidence angle of the photons by 
comparing the impact points that are reconstructed in the first and second layers of the EM calorimeter. 
To do this, we need to know the photon 77 position and the shower barycenter in each of the two layers 
(Fig. 5). We can then draw a straight line between these two (77, depth) points; extending this line to the 
beam axis gives an estimate of the position of the interaction vertex. 

Here, this method is applied to single photons with energies compatible with photons from H —>■ yy 
decays. For m# = 120 GeV, these photons are predominantly in the range 50 — 100 GeV. Figure 12 
shows the resolutions of the photon angle and the interaction vertex measurements as functions of 1 77 1 . 
Figure 13 shows the same resolution as a function of the photon energy, for |tj| < 0.5. 

3 Cluster energy measurement 

Most of the energy of an electromagnetically interacting particle is deposited in the sensitive volume of 
the calorimeter, including the lead absorbers and the liquid-argon gaps. A small fraction is deposited in 
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(a) Angular resolution. (b) Vertex resolution. 

Figure 12: Angular and vertex resolution as functions of \rj | (Gaussian fits), multiplied by \[E. 
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(a) Angular resolution. (b) Vertex resolution. 

Figure 13: Angular and vertex resolution as functions of E (Gaussian fits), for \r\ \ < 0.5. 



non-instrumented material in the inner detector, the cryostats, the solenoid, and the cables between the 
presampler and the first EM calorimeter layer. Energy also escapes from the back of the calorimeter. 

The cluster energy is calculated as a linearly weighted sum of the energy in each of the three calorim- 
eter layers plus the presampler. The factors applied to the four energies are called longitudinal weights 
and their purpose is to correct for the energy losses, providing optimum linearity and resolution. 

The ATLAS longitudinal weighting method was first described in Ref. [8]. However, recent ATLAS 
test beam analyses [2,3, 12] provided simple extensions of the technique. They also allowed validating 
this method with real data. 

The first section below describes the weighting correction that is performed in current versions of 
the reconstruction, called the 4-weight method. This is followed by a description of the corrections for 
r\- and (j) -dependent modulations in the energy. A more advanced energy-dependent calibration scheme, 
called the calibration hit method, is described separately in Section 5. 

3.1 4-weight method 

The weighting method described in this section is is a modification of that described in Ref. [8] and is 
currently the default in ATLAS reconstruction. The weights used are functions only of | r\ | ; no energy 
dependencies are used. The method could be readily extended to include 0- and energy-dependent 
weights in order to minimise residual non-linearities. The reconstructed energy is given by 

£ reco = A(B + W ps E ps + E X +E 2 + W 3 E 3 ) , (4) 

where E ps and £1. 3 are the cluster energies in the presampler and the three layers of the calorimeter 
(including sampling fractions). The offset term B corrects for upstream energy losses for which the 
corresponding electron has not reached the presampler (PS). In the limiting case of no energy in the 
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PS, this offset corresponds to the energy an electron loses before it undergoes a hard bremsstrahlung 
for which the resulting photon passes through the PS without converting (i.e., no energy recorded in the 
PS). The parameters A, B, W ps , and W3 are calculated by a % 2 minimisation of (i?true — E mco ) 2 / o{E tivie ) 2 
using Monte Carlo single particle samples, where o(E tme ) is a parametrisation of the expected energy 
resolution. This minimisation is done for separate \r]\ bins, defined by the AT7 = 0.025 granularity of 
the second layer of the calorimeter. Equal-sized samples with energies between 10 and 200 GeV are 
combined for the fits (the linearity of low energy points could be improved by using more events at those 
energies.) The fits are done separately for each cluster size and particle type (electron and photon). 

A special parametrisation is applied in the gap region between the barrel and end-cap calorimeters 
(1.447 < 1 77 1 < 1.55), within which the parametrisation of Eq. (4) is not adequate. Moreover, this region 
is instrumented with scintillator tiles that can be used to recover some of the energy lost in the gap. The 
parametrisation used in the crack is 

£ reco = A(B + E h + E e + W scint £ scin t) , (5) 

where E/, and E e are the energies the cluster deposits in the barrel and end-cap calorimeters, respectively, 
^scint is the scintillator energy, and W sc i nt the weight applied to it. This parametrisation is found to perform 
significantly better than that used in [1]. 

The longitudinal weights in Eq. (4) were extracted for electrons and photons and are shown as a 
function of \r\ | in Fig. 14. In Fig. 14(a) one can see that the overall scale A for electrons (solid) is larger 
than that for photons. The reason is due to the fact that photons travel on average 9/1Xq before they 
start losing energy. This effect is close to 1 % in the middle of the barrel and increases with the increase 
of upstream material. The offset term B is shown in Fig. 14(b); photons have a very small offset, as 
expected. (Future versions of the correction will use larger statistics to reduce the scatter observed in the 
fit results.) The PS weight W ps shown in Fig. 14(c) is the usual factor applied to preshower/presampler 
energy responses to correct for upstream losses. Finally, in Fig. 14(d), W3 is a weight applied to the last 
calorimeter layer to correct for energy leakage behind the calorimeter. 

Detailed studies have revealed that the physical meaning attributed to these weights is only approx- 
imate. For example, the weights compensate for losses after the PS via the minimisation procedure. In 
addition, the weights have a non-negligible energy dependence. However, this energy dependence does 
not result in large non-linearities because the weights adjust their values to compensate. These effects 
are more evident at low energies E < 15 GeV, and with large amounts of upstream material. A more 
rigorous treatment of the longitudinal weighting is presented in Sec. 5. 

The performance of this method is shown in Sec. 4 

3.2 Cluster energy modulation corrections 

As the 0 impact position of a particle shifts across the accordion structure of the absorbers, the amount of 
passive absorber material it encounters and thus the ratio R = E reco /E Xrae varies slightly, with a periodicity 
equal to that of the absorber spacing. This effect is small, with a maximum value of about a half -percent. 
Further, at lower energies, the 0 position resolution becomes comparable to the absorber spacing; this 
contributes to washing out the effect at these energies. The reconstructed energy is corrected for this. 

To derive the correction, the calorimeter is binned in | r\ \ . The binning used is not uniform, but is 
chosen so as to segregate regions of the calorimeter with non-uniform R. Within each \r\\ bin, R is 
plotted versus the 0 offset of the cluster relative to the absorber. These plots are divided into 0 bins, each 
bin is fit to a Gaussian, and the means of the fits are plotted. The resulting plot is then normalised to 
unity and fit to a two-term Fourier series: 

f((j>) = l+A[acos(N(j) + C) + {l-a)cos(2N(j)+D)], (6) 
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(a) Overall scale A. (b) Offset B. 




(c) Longitudinal weight W ps . (d) Longitudinal weight W3 . 

Figure 14: Fitted longitudinal weights for electrons (solid) and photons (open) as functions of \r\\. 



for fit parameters A, a, C, and D. Parameter a is restricted to the range 0-1. N is the total number of 
absorbers in 2k (1024 in the barrel and 768 in the end-capend-cap). An example of such a fit is shown 
in Fig. 15. 

Fits are performed separately for each energy, cluster size, and particle type. To apply the correction, 
it is calculated for each r\ and energy bin. It is then interpolated both in r\ and in energy. This correction 
reduces the constant term in the energy resolution (for photons at \r\ \ = 0.3) from 0.61% to 0.50%. 

Energy modulations are also observed along the 77 direction. The energy of a cluster is defined as 
the energy within a rectangular window of fixed size in 77 x (j). The window can only shift by an integral 
number of cells; however, the impact point of a particle may be anywhere within a cell. Thus, on average, 
a larger fraction of the cluster energy will be contained in the window when the particle hits at the center 
of a cell than if it hits near an edge. The size of this effect is a few tenths of a percent, and is larger 
for smaller cluster sizes. The modulation can be fit well with a quadratic; see Fig. 16. Note that this 
modulation is very small, < 0.1%, in the 0 direction, due to increased energy sharing between the cells; 
this modulation is not presently corrected. (A larger modulation was seen in the test beam [13], which 
used 3x3 clusters.) 

The plots to fit are prepared in a similar manner as for the 0 modulations, except that the x-axis 
is taken to be the 77 offset within a cell. The plots from all bins where the detector is mostly uniform 
are then combined into a single plot; that is, the 1 77 1 ranges 0.05-0.75, 0.85-1.30, and 1.70-2.50. The 
resulting plot is then scaled so as to average to unity and fit to a quadratic. The correction is performed 
separately for each energy, cluster size, and particle type. The final correction is then determined by 
interpolating in energy. An example fit is shown in Fig. 16. Applying this correction further reduces the 
constant term to 0.43%. A major contribution to the remaining constant term is from the 0-dependency 
of the inner detector material distribution. (The present weighting correction is averaged over 0 .) 
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Figure 15: Energy modulation in 0 for 200 GeV 
3x7 electrons with 0.2 < \r)\ < 0.4, along with 
the modulation fit. 

4 Energy calibration performance 




Figure 16: Energy modulation in Tj for 200 GeV 
3x7 electrons, along with the modulation fit [4]. 



This section shows the performance of the calibration chain used in the current version of the ATLAS 
reconstruction software used for all of the electron and photon reconstruction and identification studies 
reported here and elsewhere. 

4.1 Single electrons and photons 

In Fig. 17, the energy response, plotted as the difference between measured and true energy divided by 
the true energy, is shown for electrons with an energy of 100 GeV for two illustrative r\ -positions in 
the barrel electromagnetic calorimeter. The central value of the energy is reconstructed with excellent 
precision (~ 3 x 10~ 4 ) if one assumes perfect knowledge of the material in front of the calorimeter. Both 
the Gaussian core and the non-Gaussian component of the tail of the energy distribution are significantly 
worse at the point with larger | tj | due to the larger amount of material in front of the calorimeter. The 
resolution and non-Gaussian tails are better for photons than for electrons, but are somewhat worse for 
all photons than for photons that do not convert before leaving the volume of the inner detector. 

The linearity (relative difference between the fitted mean energy and the true energy) and resolution 
are shown in Fig. 18 for electrons and photons. The expected performance is very similar for electrons 
and photons, with a somewhat larger degradation at larger values of 1 17 1 in the case of electrons, as 
expected from the impact of upstream material. For electrons, the linearity is shown for | r\ \ = 0.3 (barrel) 
and 1 77 1 =2.0 (end-cap). The deterioration of the performance seen in the end-cap is attributed to the 
absence of a presampler (|t]| > 1.8) and the relatively limited statistics of the simulated samples. The 
resolution shown in Fig. 18(b) is given for three \r\\ points: \r\\ = 0.3 (inner barrel), |tj| = 1.1 (outer 
barrel), and |t]| = 2.0 (end-cap). The resolution drop at larger \r\ \ is attributed to the significant increase 
of upstream material in front of the calorimeter with respect to the small | r\ | region. The extra material 
causes increased early showering upstream of the calorimeter, which affects the lateral shower shape in 
the calorimeter. Since Eq. (4) absorbs the corrections for lateral losses into the overall scale constant A, 
an increase in lateral-loss fluctuations will result in a deterioration of the resolution. The fits in Fig. 18(b) 
give a sampling term of (10.17 ±0.33)% at small 1 77 1 , and (14.5 ± 1.0)% in the end-cap. 

In Fig. 19, the energy resolution for electrons and photons is shown as a function of |tj |. The photon 
resolution is better than the electron resolution in regions with more material in front of the calorimeter. 
The extracted constant term of the resolution is shown for photons in Fig. 20 after the weight and mod- 
ulation corrections. This figure also shows the constant term observed when the standard simulation of 
cell-level miscalibrations is enabled in the reconstruction program. In Fig. 21, the linearity and resolution 
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(a) Electrons, = 0.325. (b) Electrons, |r/| = 1.075. 




(c) All photons, 1 77 1 = 1.075. (d) Unconverted photons, |t)| = 1.075. 



Figure 17: Difference between measured and true energy normalised to true energy at E = 100 GeV. 
as a function of \r\ | is shown for a range of energies for single photons. 

4.2 Mass resolution obtained in H — >• 4 e and H — >• 77 final states 

Figure 22 shows the reconstructed distribution, after calibration, of the invariant mass of the electrons 
in H — >• 4e decays, with m H = 130 GeV. (Loose electron selection applied, as defined in [6].) A global 
constant term of 0.7% has been included in the electromagnetic calorimeter resolution for the two plots in 
this subsection. The central value of the reconstructed invariant mass is correct to ~ 1 GeV, correspond- 
ing to a precision of 0.7%, and the expected Gaussian resolution is ~ 1.5%. The non-Gaussian tails in the 
distribution amount to 20% of events lying further than 2a away from the peak. They are mostly due to 
bremsstrahlung, particularly in the innermost layers of the inner detector, but also to radiative decays and 
to electrons poorly measured in the barrel/end-cap transition region of the electromagnetic calorimeter. 

Figure 23 shows the reconstructed photon pair invariant mass for H — ► 77 decays with m H = 120 GeV 
(tight photon selection applied and barrel/end-cap transition region excluded). The photon directions are 
derived from a combination of the direction measurement in the electromagnetic calorimeter described 
above (see Section 2.4) with the primary vertex information from the inner detector. The central value of 
the reconstructed invariant mass is correct to ~ 0.2 GeV, corresponding to a precision of 0.2%, and the 
expected resolution is ~ 1.2%. Most of the non-Gaussian tails at low values of the reconstructed photon 
pair mass are seen to be due to photons which convert in the inner detector. The shift in the means comes 
from the fact that the corrections to-date do not distinguish between converted and unconverted photons. 

4.3 Study of systematic effects using H — >• 4 e 

The energy linearity for electrons in H —> 4 e is shown in Fig. 24(a) for samples based on the ideal (full 
triangles) and distorted (circles) geometries. The departure from linearity for the distorted geometry is 
attributed to the presence of extra material in front of the calorimeter. The corresponding resolution is 
shown in Fig. 24(b) for the distorted geometry. 
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Figure 18: Energy linearity (left) and resolution (right) for electrons (top) and photons (bottom). 




f 1*. 



Figure 19: Energy resolution for electrons and 
photons as a function of | rj \ . 



Figure 20: Extracted constant term of the energy 
resolution for photons, as a function of | r\ \ , after 
weight and modulation corrections. Also shown 
with cell-level miscalibrations enabled. 



The uniformity in 0 and r\ observed in this sample is shown in Fig. 25. The non-uniformities seen 
at higher \r\ \ and at positive 0 are due to simulated extra material in these regions. In the 0 -uniformity 
plot (Fig. 25(a)) a residual modulation is observed. This is most likely due to an artefact in the simulation. 
The longitudinal weights used in the reconstruction depend only on r\, and are averaged over 0. Adding 
a dependency on (j) as well would make the energy scale along (j) more uniform and also improve the 
mass resolution of Z — ► ee. 



5 Energy correction using calibration hits 

This section describes an alternate method for calculating the total energy from the energies in the in- 
dividual calorimeter layers and the presampler. It is a development of ideas introduced in [14, 15] to 



58 



Electrons and Photons - Calibration and Performance of the . 












5 


it— J J 



(a) Energy linearity. (b) Energy resolution. 

Figure 21: Energy linearity and resolution for photons (5x5 clusters). 




Figure 22: M{eeee) from Higgs boson decays 
with mu = 130 GeV (energy from calorimeter 
only, with no Z boson mass constraint). 



Figure 23: M(yy) from Higgs boson decays with 
m H = 120 GeV. The shaded plot corresponds to at 
least one photon converting at r < 80 cm. 



analyse test beam data and is described in some detail in [16]. Special simulations are used in which the 
energy deposited by a particle is recorded in all detector materials, not just the active ones. Through these 
simulations, the energy depositions in the inactive material can be correlated with the measured quanti- 
ties. For example, the energy lost in the material in front of the calorimeter (inner detector, cryostat, etc.) 
can be estimated from the energy deposited in the presampler. The result is a method which provides a 
modular way to reconstruct the energies of electrons and photons by decoupling all the different correc- 
tions. This approach eases comparisons between electrons and photons, and might be particularly useful 
in the initial stages of the experiment. 

The cluster energy is decomposed into three pieces, which will be treated separately below: 



E — Seal + £front + ^backj 



(7) 



where E ca \ is the energy deposited in the electromagnetic calorimeter, Ef mni is the energy deposited in 
the presampler and in the inactive material in front of the calorimeter, and £"back is the energy that leaks 
out the rear of the EM calorimeter. 

This analysis uses simulated single-particle, mono-energetic electron and photon samples, with en- 
ergies ranging from 25 to 500 GeV. 
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-0.01 r 
-0.01 5^- 



(a) LincrgN linearity. 



(b) Energy resolution. 



Figure 24: Electron linearity and resolution in H — ► 4e for the ideal (full triangles) and distorted (circles) 
geometries. 
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(a) Energy uniformity in (f> integrated over pj and tj . 



(b) Energy uniformity in rj integrated over pj and (f>. 



Figure 25: Electron energy uniformity in tj and 0, integrated over other kinematic variables, for the 
ideal (full triangles) and distorted (circles) geometries. 
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5.1 Reconstruction of the energy deposited in the calorimeter 



The energy deposited by a particle in the EM calorimeter, E ca \, is estimated as 



£cal=C cal (X,T7)(l+/ out (X,77))£ cl , 



(8) 



where 



• E c \ = Y] = i E{, and £1. 3 are the energies deposited in each of the three calorimeter layers in a given 
cluster. In the following, £" ps will denote the energy deposited in the presampler. The energies £",- 
available at this stage of the reconstruction are the energies deposited in the liquid-argon ionisation 
medium divided by a region-dependent sampling fraction. 

• X is the the longitudinal barycentre or shower depth, defined by 



where E t is as above and X t is the longitudinal depth, expressed in radiation lengths, of compart- 
ment i, computed from the centre of the detector. The X,-, which are computed using a geantino 1 
scan, are functions of 77 . 

• 77 is the cluster barycentre, corrected for the S-shape effect (see Sec. 2.1). 

• /out is the fraction of the energy deposited outside the cluster. 

• C ca \(X, T?) is the calibration factor for the energy in the EM calorimeter. 

The calibration factor C ca i is defined as the average ratio between the true energy deposited in the EM 
calorimeter (both absorbers and ionisation medium) and the reconstructed cluster energy E c \. It is within 
a few percent of unity, and takes into account effects such as the dependence of the sampling fraction 
on Y\ and on the longitudinal profile of the shower. Once the correction factor C ca i is expressed as a 
function of X it is fairly energy independent. The correction factor averaged over all energies is shown 
in Fig. 26(a). Its dependence on X is parametrised with a second order polynomial. The fit is performed 
excluding the bins with less than 0.5% of the total statistics. This criterion is also applied to all the fits 
performed in the following. 

Due to the presence of the magnetic field and bremsstrahlung radiation, the fraction of energy de- 
posited in the calorimeter outside of the cluster is energy dependent. Since only single electrons and 
photons with no noise or underlying event are simulated, this fraction is easily calculated. The profile of 
the out-of-cluster energy is asymmetric with the tail on the high side. However the most probable value, 
obtained with a Gaussian fit around the maximum of the distribution (—2a, +1.5a), is energy indepen- 
dent when plotted as a function of X. The most probable value of the fraction of energy deposited outside 
the cluster averaged over all energies is shown in Fig. 26(b) for electrons and photons and the two | T7 1 
values. Electrons and photons behave similarly in the central region but differently in the forward region. 
This is due to the large difference in the amount of material present in front of the calorimeter (~ 2.5Xq 
at \r\ \ =0.3 and ~ 1X$ at \r\ \ = 1.65) combined with the presence of bremsstrahlung and the magnetic 
field. 



1 A "geantino" is an imaginary non-interacting particle used in the simulation. The properties of the material crossed by the 
particle are recorded. 



X = 



Y]=i EiXj + £p S .Xp 



(9) 
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Longitudinal Barycenter (X o ) Longitudinal Barycenter (X Q ) 



(a) Qai vs. X. (b) Fraction of out-of-cluster energy. 

Figure 26: Correction factor C ca \ and fraction of out-of-cluster energy as a function of the shower depth 
X, averaged over all energies, at two representative \t] \ points. The dashed lines show the results of the 
parametrisation. 



5.2 Energy deposited in front of the calorimeter 

The energy lost in the material in front of the calorimeter (inner detector, cryostat, coil, and material 
between the presampler and strips) is parametrised as a function of the energy lost in the active material 
of the presampler (E ps ): 



E iront = a{E cal ,ri)+ b(E cal , r\ )£ ps + c(£ cal , r\ )£ p 2 s . 



(10) 

An example of this relation is shown in Fig. 27. All coefficients are parametrised in terms of the energy 
deposited by a particle in the calorimeter (£ ca i) and tj. The coefficient c is used only in the end-cap, 
1.55 < |tj| < 1.8, and is set to zero otherwise. Note explicitly that £ front includes the energy deposited in 
the presampler and between the presampler and the strips. An alternate form for £f r0 nb which depends 
on the energy in the first calorimeter layer in addition to £" ps , was also tried. This did not improve the 
resolution, so the simpler parametrisation above is retained. 
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Figure 27: Energy lost in front of the EM calorim- 
eter as a function of the energy measured in the 
presampler at |tj | = 0.3 for electrons of 100 GeV. 
The dashed curve shows the parametrisation de- 
rived for electrons. 



Figure 28: Energy lost in front of the calorime- 
ter as a function of shower depth X, for electrons 
of 100 GeV at | T] | = 1 .9, in a region where the cal- 
orimeter is not instrumented with the presampler. 



In the region 1.8 < \r] \ < 3.2, not instrumented with the presampler, the energy deposited in front of 
the calorimeter is parametrised as a function of X with a second degree polynomial. Figure 28 shows 
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this correlation for electrons and photons of 100 GeV at |tj| = 1.9. The coefficients of this polynomial 
are parametrised in terms of E ca \. 

5.3 Longitudinal leakage correction 

The energy deposited by the showers behind the EM calorimeter is computed as a fraction of the energy 
reconstructed in the calorimeter. This fraction, when parametrised as a function of X, is fairly energy 
independent both for electrons and photons. Averaged over the particle energies, it is parametrised by 

/leak = E back /E cal = f^(ri)X + f\ eak (r 1 )e x . (11) 
Figure 29 shows the leakage and the result of the fit for |tj| = 0.3 and 1.65. 




(a)|ij|=0.3. (b) | ij | = 1.65. 



Figure 29: Fraction of energy deposited behind the calorimeter, averaged over particle energies, as a 
function of the shower depth X. The parametrisation used is superimposed. 



5.4 Results 

The total cluster energy is computed by adding these three contributions. Example distributions of re- 
constructed energies are shown in Fig. 30. Mean values and standard deviations are found from a fit to a 
Crystal-Ball function (a Gaussian with a low-side tail of the form (1 — x)~"). 

The resolution is shown in Fig. 31 as a function of the particle energy for electrons and photons at 
two \r\ | values and in Fig. 32 for various photon energies and all X] values. The sampling term is shown 
in Fig. 33 as a function of \r\ | for electrons and photons. 

For electrons, the sampling term increases from 8.7% at low | tj | to 21 % at | tj | = 1 .55. This worsening 
of the energy resolution is related to the increase of the material in front of the calorimeter. This effect is 
much less relevant for photons, which have a maximum sampling term of 12%. The constant term is in 
general lower than 0.6% and is related to the energy modulation in a cell (see Sec. 3.2), not corrected at 
this stage. The linearity, the ratio between the fitted mean value and the true particle energy, is shown in 
Fig. 34. It is better than 0.5% over the full \r\ \ range and in the energy interval 25-500 GeV. 

The results from the calibration hits correction are comparable in terms of resolution and linearity 
with the longitudinal weights method. However there are a few differences worth mentioning. The coef- 
ficients of the longitudinal weights method are averaged over a range of energies, while the parametrisa- 
tions of the calibration hits method are energy dependent. This means that it should be easier to extend 
the calibrated energy range for the calibration hits method without compromising energy linearity. An- 
other important difference is that while the coefficients of the longitudinal weights method have no direct 
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(c) E = 100 GeV, |tj| =0.3. (d) E = 1 00 GeV, |tj | = 1 .65. 

Figure 30: Total reconstructed energy profiles. 




(a) |rj | =0.3. (b) |t] | = 1.65. 

Figure 31: Resolution versus particle energy. 
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Figure 32: Resolution for various photon energies Figure 33: Sampling term as a function of |tj |. 
as a function of | rj | . 
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(a) Electrons. (b) Photons. 

Figure 34: Linearity for various particle energies as a function of |t] |. 



physical meaning, the parametrisation of the calibration hits method allows isolating the different com- 
ponents of the calibrated cluster energy: that deposited in the calorimeter, inside and outside of the 
cluster, and in front and behind of it. The knowledge of these separate contributions, which depend on 
accurate and detailed simulations of the tracker and the calorimeters, could be particularly useful in the 
early stages of the experiment, for example to disentangle effects such as a miscalibration of the calo- 
rimeter or an imperfect knowledge of the inner detector material. It is also worth noting that the estimate 
of the energy lost in front of the calorimeter is crucial to obtaining a good resolution and linearity; at 
low energies and large rapidities, a large fraction of the energy of an electron is deposited in front of the 
calorimeter. The calculation of missing momentum could also benefit from this separation of effects. 



6 In-situ calibration with Z — > ee events 

6.1 Motivation 

In the EM calorimeter, the construction tolerances and the calibration system ensure that the response is 
locally uniform, with a constant term < 0.5% over regions of size At] x A0 = 0.2 x 0.4. This has been 
shown with test beam data [13]. Electron pairs from Z boson decays can then be used to intercalibrate 
the 384 regions of such size within the acceptance of |rj| < 2.4. These regions must be intercalibrated 
to within 0.5% in order to achieve a desired global constant term of < 0.7%. The basic idea of this 
calibration method is to constrain the di-electron invariant mass distribution to the well-known Z boson 
line shape. A second goal of the calibration is to provide the absolute calorimeter electromagnetic energy 
scale. This must be known to an accuracy of ~ 0.1% in order to achieve the ATLAS physics goals 2 . 

6.2 Description of the method 

Long-range non-uniformities can arise for many reasons, including variations in the liquid argon im- 
purities and temperature, amount of upstream material, mechanical deformations, and high voltage (as 
localised calorimeter defects may necessitate operating a small number of channels below nominal volt- 
age). For a given region i, we parametrise the long-range non-uniformity modifying the measured elec- 
tron energy as Ef co = Ej me ( 1 + a,). Neglecting second-order terms and supposing that the angle between 
the two electrons is perfectly known, the effect on the di-electron invariant mass is: 

M;f eco „ M true (1 + Oft + O/j = M true (1 + ^ (12) 

where j3; ; - = a,- + a ; -. 

2 Except for the W boson mass measurement, which needs a much better knowledge of the energy scale (~ 0.02%). 



65 



Electrons and Photons - Calibration and Performance of the . . . 



The method to extract the a's is fully described in [17] and is done in two steps. First, the j3's 
are determined, then the a's. For a given pair of regions the coefficient and its associated 

uncertainty are determined by minimising the following log-likelihood: 

-lnL tot =£-lnL(M*/( 1 + ^ 



(13) 



where k counts all selected events populating the pair of regions is the di-electron invariant 

mass of event k, and L(M, c%) quantifies the compatibility of an event with the Z boson line shape 
and is described below. Fits with only one event are removed. Once the /3 's are determined from the 
minimisation, the a's can be found from the overdetermined linear system given by j8,y = a,- + OLj. This 
is done using a generalised least squares method, and gives an analytic solution. 

The Z boson line shape is modeled with a relativistic Breit-Wigner distribution [18, 19]: 



BW(M) - 



M 



(14) 



(M 2 -M|) 2 + r|M 4 /M|' 

where Mz and Tz are the mass and the width of the Z boson. They were measured precisely at LEP; 
the values used are, respectively, 91.188 ± 0.002 GeV and 2.495 ±0.002 GeV [20]. In proton-proton 
collisions, the mass spectrum of the Z boson differs from the Breit-Wigner shape of the partonic process 
cross section. The probability that a quark and antiquark in the interacting pp system produce an object 
of mass M falls with increasing mass. In order to take this into account, the Breit-Wigner is multiplied 
by the ad-hoc parametrisation Jz?(M) = l/M^. The parton luminosity parameter j8 is assumed to be a 
constant and is determined by fitting the Z boson mass distribution obtained with events generated with 
PYTHIA version 6.403 [21]. Figure 35(a) shows the Z boson mass distribution fitted with a Breit-Wigner 
with and without the parton luminosity factor. The fitted value of the parameter j8 is 1.59 ±0.10; this 
will be used in the following. Since the photon propagator and the interference term between the photon 
and the Z boson were not taken into account in the previous parametrisation, the parton luminosity term 
also accounts for the effects of these two terms. 





(a) Z line shape 



(b) Residual distribution 



Figure 35: (a) Z boson mass distribution for PYTHIA events fitted with a Breit-Wigner distribution with 
(solid line) and without (dashed line) the parton luminosity factor. # 2 /Mx>f is 1-09 and 3.96, respec- 
tively, (b) Residual distribution fitted with a Gaussian. 



Finally, in order to take into account the finite resolution of the electromagnetic calorimeter, the 
Breit-Wigner multiplied by the parton luminosity term is convoluted with a Gaussian: 

■-u 2 l2oj, 

(15) 



L(M,o M )= / BW(M-m)^(M-m)— du, 
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where c% is the resolution of the measured mass. It is related to the electron energy resolution via 




At |t]| = 0.3, the sampling term of the electron energy resolution is equal to 10.0% and increases with 
increasing \r\\. Technically, the integral is converted to a discrete summation over the convolution pa- 
rameter u which takes values between -5o M and +5o M - 



6.3 Generator-level tests 

The method is first tested on generator-level Z — > ee Monte Carlo events. These were generated using 
PYTHIA 6.403 [21] with M z = 91.19 GeV and T z = 2.495 GeV. Events are required to have at least one 
electron with pj > 10 GeV and |tj| < 2.7 and a di-electron invariant mass M ee > 60 GeV. To simulate 
the detector resolution, generated electron energies are smeared to obtain Oe/E = 10%/^/E/ GeV. 

For each calorimeter region i, a bias a,- is generated from a Gaussian distribution with a mean /ibias 
and width a b i as - These will be called the "injected" a's, a n j- 

For the first tests, jluas is fixed to 0 and Obi as to 2%. The calibration method explained above is 
applied to 50,000 events after selection. The residual distribution (Ofi t — a; n j) is shown in Fig. 35(b). 
The mean value of the residual distribution corresponds to the energy scale, and its width to the energy 
resolution. Thus it can be seen that the fitting method gives unbiased estimators of the injected a's. 

In the case where /Xbi as is different from zero, the mean value of the residual distribution will be 
different from zero. For example, for /^bias = —3%, (am — anj) = 0.1%. This is a consequence of 
neglecting the higher-order terms in the Taylor expansion of Eq. (12). Iterating the procedure twice 
suffices to recover an unbiased estimate of the a's, as shown in Fig. 36(a). 





Number of iterations 



Figure 36: (a) Mean value of the Gaussian fitting the residual distribution as a function of the number of 
iterations for different mean values of the injected a's; (b) Constant term as a function of the number of 
events or as a function of the luminosity. 

Figure 35(b) also shows the resulting uniformity. After the fit, the RMS of the distribution has been 
reduced from 2% to 0.4%. The RMS of the residual distribution is a measure of the expected long- 
range constant term. Figure 36(b) shows the long-range constant term as a function of the number of 
reconstructed Z —> ee decays or of the integrated luminosity assuming an event selection efficiency of 
25%. Therefore, by summing the local constant term of 0.5% with the long-range constant term of 0.4% 
obtained here, a total constant term of about 0.7% could be achieved with ~ 100 pb _1 . These results 
assume perfect knowledge of the material in front of the electromagnetic calorimeter. 
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6.4 Results with distorted geometry 

The previous section showed results based on generator-level Monte Carlo. The results in this section 
use PYTHIA events with full detector simulation and reconstruction, using a geometry with additional 
material in front of the electromagnetic calorimeter. 

The number of events available is 349,450 corresponding to an integrated luminosity of ~ 200 pb _1 . 
Events with at least two reconstructed electrons are kept. The two leading electrons are required to be of 
at least medium quality [6], to have p T > 20 GeV and |tj | < 2.4, and to be of opposite sign. Finally, the 
di-electron invariant mass is required to be within 80 < M ee < 100 GeV. The total selection efficiency is 
21.5%; the efficiency for finding two electron candidates within \t] \ < 2.4 is 50%. 

The calibration method is applied first without injecting any biases (a^j = 0 for all regions). How- 
ever, the presence of the misalignments and extra material means that there will be some biases intrinsic 
to the simulation. These "true" biases can be estimated using generator information: 

«true,-^L > ( 17 ) 

where k counts over the Ni electrons falling in region /, and pj CO ' and p| en ' are the reconstructed 
and true transverse momenta of electron k. The distribution of a aue is shown in Fig. 37(a), as is the 
results of the fit. The low-end tail corresponds to regions located in the gap between the barrel and end- 
cap cryostats (Fig. 38(a)), where the density of material has been increased by a factor of 1.7. There 
is fair agreement between the a's extracted using the data-driven method and those estimated from 
generator information. Figure 37(b) shows the difference between a n t and ow; a Gaussian fitted to 
this distribution has a mean of 0.1% and a width of 0.5%. The distribution of Ofe as a function of T\ 
and 0 is shown in Fig. 38 for the ideal and distorted geometries. The asymmetry between positive and 
negative 0 is due to the effect of the extra material in the inner detector at positive 0 . The difference 
between positive and negative 0 values is about 0.6%. 

The same exercise is also done by introducing, on top of the non-uniformities due to extra material, a 
bias Oinj generated from a Gaussian distribution with a mean /i bias = 0 and width a b i as = 2%. Results are 
shown in Fig. 39. The Gaussian fitted to this distribution also has a mean of 0.1% and a width of 0.5%. 

One can conclude that, using ~ 87,000 reconstructed Z — > ee events (which corresponds to about 
200 pb _1 ), and with an initial spread of 2% from region to region, the long-range constant term should 
not be greater than 0.5%. 3 This should give an overall constant term ~ 0.7%. The bias on the absolute 
energy should be small and of the order of 0.2%. If the exercise is repeated with only 100 pb 1 of data, 
the Gaussian fitted to the residual distribution also has a mean of 0.2%, but the width is larger, leading to 
a long-range constant term of 0.8%. 

7 Estimation of the systematic uncertainty on the energy scale 

The absolute energy scale has been obtained using electrons from Z —> ee decays. It has been determined 
on events simulated with the misaligned geometry while the longitudinal weights were found with the 
ideal geometry. On top of the non-uniformities due to extra material, a bias modeling the calorimeter 
non-uniformities is introduced and is generated from a Gaussian distribution with a mean /Xbi as = 0 and 
width abias = 2%. The resulting bias on the energy scale can be assessed by comparing the fitted a's 
with those from generator information; the bias is equal to 0.2%. This bias is understood and is due to 
the fact that the model of the Z boson line shape doesn't take into account the effects of bremsstrahlung. 
Work is ongoing to improve this issue. 

3 Part of the RMS of the residual distribution is also due to uncertainties on the measurement of OW- 
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(a) Ofl t integrated over 0 as a function of 77. (b) integrated over 77 as a function of <j>, fitted in two 

separate regions. 

Figure 38: Ofi t distributions with Oi n j = 0 and with distorted/ideal (full/open circles) geometry. 
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The background has been neglected but it has been checked that the contribution from QCD events 
where the two jets are misidentified as electrons is small. Thus, it should have a negligible effect on the 
determination on the energy scale. 

Electrons from Z boson decays have a pj spectrum with a maximum value around 45 GeV. Care 
will thus have to be taken to extrapolate the calibration obtained from Z — > ee decays to electron energy 
regions not well populated by these events. Corrections determined with Z boson decays were applied 
to single electron samples with different generated transverse momenta (20, 40, 120, and 500 GeV) 
reconstructed with the misaligned geometry. Figure 40 shows (ow) after correction as a function of pj 
for four | T\ | bins. In principle, (a tms ) should be equal to zero. This is true for the 40 GeV electron sample 
at a level of 0.2% except in the bin (1.4 < |tj| < 2.0) containing the crack region. For central electrons 
(| 7] | < 0.6), the dependence versus pj is smaller than 0.5%. The effect is worse for non-central electrons. 
For instance, at pj = 120 GeV, ow after corrections varies from 1 to 1.6 percent. This non-linearity is 
due to the presence of extra material in front of the calorimeter. 




! 0 100 200 300 400 500 



Figure 40: (ow) after correction as a function of pj for four 77 bins. 

To conclude, at the Z boson energy scale, the estimate of the systematic uncertainty is around 0.2%. 
At other energy scales, the systematic uncertainty is dominated by effects of extra material. For central 
electrons, corrections can be extrapolated over the full pj spectrum to a level of 0.5%. The linearity is 
degraded for non-central electrons at a level of 1 or 2 percent except in the crack region where it is worse. 
These numbers depend on the amount of extra material added to the misaligned geometry compared to 
the ideal geometry and will likely be different with real data. 

The performance presented here corresponds to our current understanding of the determination of 
the absolute energy scale. Improvements are expected to achieve systematic uncertainties smaller than 
0.5%. For instance, including information from the E/p ratio measured for isolated high-/^ electrons 
from W — > ev decays will compliment the direct calibration of the absolute scale with Z —>■ ee events. 
Photon conversions can also help to determine the amount of material in front of the calorimeter. 

Conclusion 

The methods and algorithms described in this note were already mentioned in Ref. [1] many years ago. 
Over the years, they have reached a higher level of stability and maturity, and have been implemented in 
the ATLAS reconstruction software. It is believed that, given the constraints of the ATLAS detector, in 
particular the amount of dead material in front of the calorimeter, the performances described here will 
not evolve much further. 

The real challenge at the beginning of data-taking will be the detection and correction for additional 
inner detector material or calorimeter inhomogeneities which would not have affected the somewhat 
smaller-scale detectors used in the test beam. Discrepancies between data and simulation will have to be 
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understood prior to the use of the methods described above. The in-situ calibration with Z — ► ee events 
described in Section 6 will play an important role, and refinements of the method presented here are 
expected. 
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Abstract 

This note discusses the overall ATLAS detector performance for the recon- 
struction and identification of high-pr electrons over a wide range of trans- 
verse energies, spanning from 10 GeV to 1000 GeV. 

Electrons are reconstructed using information from both the calorimeter and 
the inner detector. The reference offline performance in terms of efficiencies 
for electrons from various sources and of rejections against jets is described. In 
a second part, this note discusses the requirements and prospects for electrons 
as probes for physics within and beyond the Standard Model: Higgs-boson, su- 
persymmetry and exotic scenarios. In the last part, this note outlines prospects 
for electron identification with early data, corresponding to an integrated lumi- 
nosity of 100 pb _1 , focusing on the use of the signal from Z —>■ ee decays for 
a data-driven evaluation of the offline performance. 

1 Introduction 

Excellent particle identification capability is required at the LHC for most physics studies. Several 
channels expected from new physics, for instance some decay modes of the Higgs boson into electrons, 
have small cross-sections and suffer from large (usually QCD) backgrounds. Therefore powerful and 
efficient electron identification is needed to observe such signals. Even for standard processes, the signal- 
to-background ratio is usually less favourable than at past and present hadron colliders. The ratio between 
the rates of isolated electrons and the rate of QCD jets with pj in the range 20-50 GeV is expected to be 
~ 10~ 5 at the LHC, almost two orders of magnitude smaller than at the Tevatron pp collider. Therefore, 
to achieve comparable performances, the electron identification capability of the LHC detectors must be 
almost two orders of magnitude better than what has been achieved so far. 

Physics channels of prime interest at the LHC are expected to produce electrons with pj between 
a few GeV and 5 TeV Good electron identification is therefore needed over a broad energy range. In 
the moderate pj region (20 - 50 GeV), a jet-rejection factor exceeding 10 5 will be needed to extract a 
relatively pure inclusive signal from genuine electrons above the residual background from jets faking 
electrons. The required rejection factor decreases rapidly with increasing pj to ~ 10 3 for jets in the TeV 
region. For multi-lepton final states, such as possible H — > eeee in the mass region 130 < m# < 180 
GeV, a rejection of ~ 3000 per jet should be sufficient to reduce the fake-electron backgrounds to a level 
well below that from real electrons. In this case, however, the electrons have a rather soft pj spectrum 
(as low as 5 GeV), resulting in lower reconstruction and identification efficiencies. 

Since the publication of the ATLAS physics TDR [1], the ATLAS detector description has been 
greatly improved, with, in particular, the introduction of a more realistic material description for the 
inner detector and for the region between the inner detector and the first layer of the electromagnetic 
calorimeter [2] [3]. This has led to some significant changes in the expected performance. The re- 
construction software has also evolved significantly. Each step of the energy reconstruction has been 
validated by a series of beam tests [4] [5] [6] using prototype modules of the liquid argon electromag- 
netic calorimeter, and also more recently, combined with prototype modules of the inner detector. At 
present, two electron reconstruction algorithms have been implemented in the ATLAS offline software, 
both integrated into one single package and a common event data model. 

- The standard one, which is seeded from the electromagnetic (EM) calorimeters, starts from clusters 
reconstructed in the calorimeters and then builds the identification variables based on information 
from the inner detector and the EM calorimeters. 
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- A second algorithm, which is seeded from the inner detector tracks, is optimized for electrons 
with energies as low as a few GeV, and selects good-quality tracks matching a relatively isolated 
deposition of energy in the EM calorimeters. The identification variables are then calculated in the 
same way as for the standard algorithm. 

The standard algorithm is the one used to obtain the results presented in this note, while the track- 
based algorithm is used for low pj and non-isolated electrons and is the subject of another note [7]. 

This note is organised as follows. Section 2 discusses the reconstruction and identification of elec- 
trons in the fiducial range of the ATLAS detector (|t]| < 2.5), whereas section 3 describes the iden- 
tification of electrons in the forward region (2.5 < \r\\ < 4.9). Section 4 describes some important 
performance aspects of electron identification in discovery physics processes. Section 5 discusses the 
strategies for measuring reconstruction and identification efficiencies using a data-driven approach based 
on Z — > ee events. 

2 Calorimeter-seeded reconstruction and identification 

In the standard reconstruction of electrons, a seed electromagnetic tower with transverse energy above ~ 

3 GeV is taken from the EM calorimeter [3] and a matching track is searched for among all reconstructed 
tracks which do not belong to a photon-conversion pair reconstructed in the inner detector. The track, 
after extrapolation to the EM calorimeter, is required to match the cluster within a broad At] x A0 window 
of 0.05x0.10. The ratio, E/p, of the energy of the cluster to the momentum of the track is required to 
be lower than 10. Approximately 93% of true isolated electrons, with Ej > 20 GeV and \t]\ < 2.5, are 
selected as electron candidates. The inefficiency is mainly due to the large amount of material in the inner 
detector and is therefore r\ -dependent. As an example, 4% of electron candidates with pj = 40 GeV 
fail the cut E/p < 10 and most of the losses are in the end-cap region. Various identification techniques 
can be applied to the reconstructed electron candidates, combining calorimeter and track quantities and 
the TRT information to discriminate jets and background electrons from the signal electrons. A simple 
cut-based identification procedure is described below together with its expected performance. This is 
followed by a brief overview of the possibilities offered by more advanced methods, such as a likelihood 
discriminant. 

2.1 Electron-jet studies 

For the purposes of this note, the electron identification efficiency is defined as 
_N^_ 

£ — ^ytruth ' 

where N] A is the number of reconstructed and identified candidates and N l J uth is the number of true 
electrons selected using the appropriate kinematic cuts at the generator level. A geometrical matching 
(within a cone of size AR = 0.2) between the reconstructed cluster and the true electron is required in 
the calculation of Af] d . A classification is applied to define whether a reconstructed electron candidate 
should be considered as signal or background. This classification is based on the type of the Monte Carlo 
particle associated to the reconstructed track, as well as that of its non-electron parent particle. As shown 
in Table 1, candidates are divided into four categories and signal efficiencies are calculated separately 
for isolated and non-isolated electrons. 

For the jet rejection studies, the PYTHIA (version 6.4) [10] event generator has been used to produce 
the large statistics of jet background samples required to assess both the trigger and offline performance 
of the electron reconstruction and identification tools described in this note. Two different samples were 
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generated to cover the Ej -range of interest for single electrons (10-40 GeV). The first one, referred 
to as filtered di-jets, contains all hard-scattering QCD processes with Ej > 15 GeV, e.g. qg —> qg, 
including heavy-flavour production, together with other physics processes of interest, such as prompt- 
photon production and single W/Z production. The second one, referred to as minimum bias, contains 
the same processes without any explicit hard-scattering cut-off. A filter was applied at the generator 
level to simulate the LI trigger requirements [11], with the goal of increasing in an unbiased way the 
probability that the selected jets pass the electron identification cuts after GEANT [12] simulation. The 
summed transverse energy of all stable particles (excluding muons and neutrinos) with |t]| < 2.7 in 
a region A(j) x At] = 0.12 x 0.12 was required to be greater than a chosen Ej -threshold for an event 
to be retained. For the filtered di-jet sample, this Ej -threshold is 17 GeV, while for the minimum-bias 
sample, it is 6 GeV. The filter retains 8.3% of the di-jet events and 5.7% of the minimum-bias events. 
The total number of events available for analysis after filtering, simulation and reconstruction, amounts 
to 8.2 million events for the di-jet sample and to 4.1 million events for the minimum-bias sample. 



Category 


Type of particle 


Type of parent particle 


Isolated 
Non-isolated 
Background electron 
Non-electron 


Electron 
Electron 
Electron 


Z, W, t, T or \i 
J /iff, fo-hadron or ohadron decays 
Photon (conversions), 7r°/r) Dalitz decays, i(/rf/.s-hadron decays 



Table 1: Classification of simulated electron candidates according to their associated parent particle. 
Muons are included as source because of the potential emission of a Bremsstrahlungs photon. 



E T > 17 GeV 


E T > 8 GeV 


Isolated 


Non-isolated 


Background 


Non-isolated 


Background 


W - 75.0% 


fr-hadrons - 38.7% 


y-conv. - 97.8% 


t-hadrons - 39.3% 


y-conv. - 98.4% 


Z - 20.9% 


c-hadrons - 60.6% 


Dalitz decays - 1.8% 


c-hadrons - 59.7% 


Dalitz decays - 1.3% 


t - <0.1% 


J/y - 0.7% 


i(/t//.s-hadrons - 0.4% 


j/w - 1-0% 


«/rf/.s-hadrons - 0.3% 


T - 4.1% 











Table 2: Contribution and origin of isolated, non-isolated, and background electron candidates in the two 
di-jet samples before the identification criteria are applied. 



The jet rejections quoted in this note are normalised with respect to the number of particle jets 
reconstructed using particle four-momenta within a cone size AR = 0.4 and derived from a dedicated 
un-filtered generated sample of di-jets or minimum-bias events. In the di-jet and minimum-bias samples, 
the average numbers per generated event of such particle jets with Ej above 17 and 8 GeV, respectively, 
and in the range \r\\ < 2.47, are 0.74 and 0.31, respectively. 

After reconstruction of electron candidates and before any of the identification cuts are applied, 
the signal is completely dominated by non-isolated electrons from b— and ohadron decays. The ex- 
pected signal-to-background ratios for the filtered di-jet {Ej above 17 GeV) and minimum-bias (Ej 
above 8 GeV) samples are 1:80 and 1:50, respectively. The residual jet background is dominated by 
charged hadrons. Only a small fraction of the background at this stage consists of electrons from pho- 
ton conversions or Dalitz decays, namely 6.4% and 9.4%, respectively. Table 2 summarises the relative 
compositions of the filtered di-jet and minimum-bias samples in terms of the three categories containing 
electrons described in Table 1. 
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Type 


Description 


Variable name 




Acceptance of the detector 


|ij|<2.47 




Hadronic leakage 


Ratio of Ej in the first sampling of the 
hadronic calorimeter to Ej of the EM cluster 




Second layer 


Ratio in 77 of cell energies in 3 x 7 versus 7x7 cells. 


R n 


of EM calorimeter. 


Ratio in <j) of cell energies in 3 x 3 versus 3x7 cells. 
Lateral width of the shower. 


R<l> 


Medium cuts (includes loose cuts ) 


First layer 


Difference between energy associated with 


AE S 


of EM calorimeter. 


the second largest energy deposit 
and energy associated with the minimal value 
between the first and second maxima. 






Second largest energy deposit 
normalised to the cluster energy. 


Rmaxl 




Total shower width. 






Shower width for three strips around maximum strip. 






Fraction of energy outside core of three central strips 


fside 




but within seven strips. 




Track quality 


Number of hits in the pixel detector (at least one). 
Number of hits in the pixels and SCT (at least nine). 
Transverse impact parameter (<1 mm). 




Tight (isol) (includes medium cuts) 


Isolation 


Ratio of transverse energy in a cone AR < 0.2 
1 sc energy. 




Vertexing-layer 


Number of hits in the vertexing-layer (at least one). 




Track matching 


(< 0.005). 

A<j> between the cluster and the track (< 0.02). 






Ratio of the cluster energy 


E/p 




to the track momentum. 




TRT 


Total number of hits in the TRT. 
Ratio of the number of high-threshold 
hits to the total number of hits in the TRT. 






trt 


Same as TRT cuts above, 
but with tighter values corresponding to about 90% 
efficiency for isolated electrons. 





Table 3: Definition of variables used for loose, medium and tight electron identification cuts. The cut 
values are given explicitly only when they are independent of Y\ and pj . For a detailed description of 
the cut variables used for the loose and medium cuts, refer to sections 2.1.1.1 and 2.1.1.2. 



2.1.1 Cut-based method description 

Standard identification of high-/;>7 electrons is based on many cuts which can all be applied indepen- 
dently. These cuts have been optimised in up to seven bins in Tj and up to six bins in pj . Three reference 
sets of cuts have been defined: loose, medium and tight, as summarised in Table 3. This provides flex- 
ibility in analysis, for example to improve the signal efficiency for rare processes which are not subject 
to large backgrounds from fakes. 

2.1.1.1 Loose cuts This set of cuts performs a simple electron identification based only on limited 
information from the calorimeters. Cuts are applied on the hadronic leakage and on shower-shape vari- 
ables, derived from only the middle layer of the EM calorimeter (lateral shower shape and lateral shower 
width ). This set of cuts provides excellent identification efficiency, but low background rejection. 
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2.1.1.2 Medium cuts This set of cuts improves the quality by adding cuts on the strips in the first 
layer of the EM calorimeter and on the tracking variables: 

• Strip-based cuts are effective in the rejection of 71° — > 77 decays. Since the energy-deposit pattern 
from 7T°'s is often found to have two maxima due to 71° — > 77 decay, showers are studied in a 
window At] x A0 = 0. 125 x 0.2 around the cell with the highest Ej to look for a second maximum. 
If more than two maxima are found the second highest maximum is considered. The variables 
used include AE S = E maX 2 — E m i n , the difference between the energy associated with the second 
maximum E maX 2 and the energy reconstructed in the strip with the minimal value, found between 
the first and second maxima, E m i n . Also included are: /? maX 2 = £max2/(l + 9 x 10~ 3 £y), where Ej 
is the transverse energy of the cluster in the EM calorimeter and the constant value 9 is in units 
of GeV -1 ; w sto t, the shower width over the strips covering 2.5 cells of the second layer (20 strips 
in the barrel for instance); w S 3, the shower width over three strips around the one with the maximal 
energy deposit; and F^, the fraction of energy deposited outside the shower core of three central 
strips. 

• The tracking variables include the number of hits in the pixels, the number of silicon hits (pixels 
plus SCT) and the tranverse impact parameter. 

The medium cuts increase the jet rejection by a factor of 3-4 with respect to the loose cuts, while 
reducing the identification efficiency by ~ 10%. 

2.1.1.3 Tight cuts This set of cuts makes use of all the particle-identification tools currently available 
for electrons. In addition to the cuts used in the medium set, cuts are applied on the number of vertexing- 
layer hits (to reject electrons from conversions), on the number of hits in the TRT, on the ratio of high- 
threshold hits to the number of hits in the TRT (to reject the dominant background from charged hadrons), 
on the difference between the cluster and the extrapolated track positions in Vj and 0, and on the ratio 
of cluster energy to track momentum, as shown in Table 3. Two different final selections are available 
within this tight category: they are named tight (isol) and tight (TRT) and are optimised differently for 
isolated and non-isolated electrons. In the case of tight (isol) cuts, an additional energy isolation cut is 
applied to the cluster, using all cell energies within a cone of AR < 0.2 around the electron candidate. 
This set of cuts provides, in general, the highest isolated electron identification and the highest rejection 
against jets. The tight (TRT) cuts do not include the additional explicit energy isolation cut, but instead 
apply tighter cuts on the TRT information to further remove the background from charged hadrons. 

Figures 1 and 2 compare the distributions expected from Z — > ee decays and from the filtered di-jet 
sample for a few examples of the basic discriminating variables described above for electron identifica- 
tion. 

2.1.2 Performance of cut-based electron identification 

The performance of the cut-based electron identification is summarised in Tables 4 and 5. Table 4 shows, 
for each of the background samples, the composition of each of the three categories of electron candi- 
dates containing real electrons, as it evolves from reconstruction (no identification cuts) to loose, medium 
and tight cuts. In the case of non-isolated electrons, there is a strong reduction of the initially dominant 
component from c-hadrons as the identification cuts applied become tighter. In the case of background 
electrons, there is a significant reduction of the contribution from photon conversions when applying 
tight cuts, since the vertexing-layer requirement does not much affect electrons from Dalitz decays and 
u/d /s-hadrom. As shown in Table 5, the signal from prompt electrons is dominated by non-isolated elec- 
trons from heavy flavours, which are usually close in space to hadrons from the jet fragmentation. The 
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Figure 1 : Left: ratio between the transverse energy of the electron candidate and the sum of this trans- 
verse energy and that contained in the first layer of the hadronic calorimeter. The distributions are shown 
for electrons from Z — > ee decays (solid line) and for filtered di-jets (dotted line). Right: difference in r\ 
between cluster and extrapolated track positions for electrons from Z — > ee decays (solid line) and for 
filtered di-jets (dotted line). 




Figure 2: Shower-shape distributions for electrons from Z — > ee decays (solid lines) compared to those 
from filtered di-jets (dotted lines). Shown are the energy ratios R§ (left) and (right) described in Ta- 
ble 3. 

resulting overlap between the electron shower and nearby hadronic showers explains the much lower ef- 
ficiency observed for these electrons than for isolated electrons from Z —> ee decays. These non-isolated 
electrons will nevertheless provide the most abundant initial source of signal electrons and will be used 
for alignment of the electromagnetic calorimeters and the inner detector, for E/p calibrations, and more 
generally to improve the understanding of the material of the inner detector as a radiation/conversion 
source. For tight cuts and an electron Ej of ~ 20 GeV, the isolated electrons from W, Z and top-quark 
decays represent less than 20% of the total prompt electron signal. 

For the lower Ej -threshold of 8 GeV, the expected signal from isolated electrons is negligible. Not 
surprisingly, the tight (TRT) cuts are more efficient to select non-isolated electrons from heavy-flavour 
decay, while the tight (isol) cuts are more efficient at selecting isolated electrons. After tight cuts, the 
signal-to-background ratio is close to 3:1, and depends only weakly on the Ej - threshold in the 10- 
40 GeV Ej -range studied here. The residual background is dominated by charged hadrons, which could 
be further rejected by stronger cuts (TRT and/or isolation). The initial goal of obtaining a rejection of the 
order of 10 5 against jets has been achieved with an overall efficiency of 64% for isolated electrons with 
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Isolated 




E T > 17 GeV 




No cut 




Medium 


Tight (TRT) 


Tight (isol) 


W 


75.0 


75.1 


74.9 


73.9 


73.6 


z 


20.9 


20.9 


21.1 


22.4 


22.9 


T 


4.1 


4.0 


4.0 


3.7 


3.6 


Non-isolated 




E T > 17 GeV 


E T > 8 GeV 




No cut 




Medium 


Tight (TRT) 










Tight (TRT) 




fc-hadrons 


38.7 


57.6 


71.1 


74.2 


79.1 


39.3 


51.2 


55.2 


57.0 


59.5 


c-hadrons 


60.6 


41.4 


27.6 


24.4 


19.6 


59.7 


47.6 


43.2 


41.3 


38.6 


J/v 


0.7 


1.0 


1.3 


1.4 


1.3 


1.0 


1.2 


1.6 


1.7 


1.9 


Background 




E T > 17 GeV 


E T > 8 GeV 




No cut 


Loose 


Medium 


Tight (TRT) 










Tight (TRT) 




y-conv. 


97.8 


97.7 


94.9 


88.0 


88.1 


98.4 


98.1 


94.5 


78.5 


83.0 


Dalitz decays 


1.8 


1.9 


4.0 


8.5 


8.0 


1.3 


1.4 


3.5 


12.5 


12.4 


u/d /s-hadrons 


0.4 


0.4 


1.1 


3.5 


3.9 


0.3 


0.5 


2.0 


9.0 


4.6 



Table 4: Percentage contribution and origin of isolated, non-isolated and background electrons in the 
filtered di-jet and minimum-bias samples. The classification is based on the type of the parent particle of 
the electron. 



Ej ~ 10-40 GeV. The efficiency may be improved with further optimisation of the cuts, as discussed 
below. 

Table 6 shows the efficiencies for prompt electrons and the jet rejections in more detail in the case of 
medium identification cuts, using a fine binning as a function of | tj | . The efficiency for prompt electrons 
is significantly worse in the end-cap region (|t]| > 1.52) with a correspondingly higher background 
rejection. The overlap region region between the barrel and end-cap calorimeters (1.37 < |t]| < 1.52) 
has both worse efficiency and rejection, as expected because of the large amount of passive material in 
front of the EM calorimeter. To improve the electron efficiency in the end-cap region, the EM calorimeter 
cuts in the first layer and the tracking cuts will need to be studied and tuned further. 

2.1.3 Expected differential rates for inclusive electron signal and background 

Figure 3 (left: Ej > 17 GeV and right: Ej > 8 GeV) show the expected differential cross-sections 
for electron candidates as a function of Ej , for an integrated luminosity of 100 pb~' . The different 
histograms correspond to electron candidates before any identification cuts and after the loose, medium, 
tight (TRT) and tight (isol) cuts. As illustrated in Table 5, these differential rates are dominated by the 
jet background except when applying the tight cuts. 

The expected differential cross-sections after tight (TRT) cuts are shown in Fig. 4, where they are 
broken down into their three main components, isolated electrons from W, Z and top-quark decays, non- 
isolated electrons from b, c decay, and the residual jet background. The shapes of the spectra for the 
non-isolated electrons and residual jet background are very similar, whereas the spectrum from isolated 
electrons exhibits the expected behaviour for a sample dominated by electrons from W, Z decay. For an 
integrated luminosity of 100 pb _1 , Fig. 4 (right) shows that one may expect approximately ten million 
reconstructed and identified inclusive electrons from b, c decay with Ej > 10 GeV, while Fig. 4 (left) 
shows that for the same integrated luminosity one may expect 500 000 such electrons with Ej > 20 GeV, 
with a dominant contribution from W, Z decays for Ej > 35 GeV These large data samples expected 
for a modest integrated luminosity are an integral part of the trigger menu strategy for early data, as 
explained in more detail in [11], and will clearly be extremely useful to certify many aspects of the 
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Cuts 


E T > 17 GeV 


E T >8GeV 


Efficiency (%) 


Jet rejection 


Efficiency (%) 


Jet rejection 


Z -> ee 


b.c^e 




Single electrons 
(E T =10 GeV) 


b,c^e 






87.96 ± 0.07 


50.8 ± 0.5 


567 ± 1 


75.8 ± 0.1 


55.8 ± 0.7 


513 ± 2 


Medium 


77.29 ± 0.06 


30.7 ± 0.5 


2184 ± 13 


64.8 ± 0.1 


41.9 ± 0.7 


1288 ± 10 


Tight (TRT.) 


61.66 ± 0.07 


22.5 ± 0.4 


(8.9 ± 0.3)10 4 


46.2 ± 0.1 


29.2 ± 0.6 


(6.5 ± 0.3) 10 4 


Tight (isol.) 


64.22 ± 0.07 


17.3 ± 0.4 


(9.8 ± 0.4) 10 4 


48.5 ± 0.1 


28.0 ± 0.6 


(5.8 ± 0.3) 10 4 




Fraction of surviving candidates (%) 


Fraction of surviving candidates (%) 




Isolated 


Non-isolated 


Jets 


Non-isolated 


Jets 


Medium 


1.1 


7.4 


91.5(5.5 + 86.0) 


9.0 


91.0 (5.0 + 86.0) 


Tight (TRT) 


10.5 


63.3 


26.2 (8.3 + 17.9) 


77.8 


22.2 (7.1+ 15.1) 


Tight (isol) 


13.0 


58.3 


28.6(8.7 + 19.9) 


75.1 


24.9 (6.4+ 18.5) 



Table 5: Expected efficiencies for isolated and non-isolated elections and corresponding jet background 
rejections for the four standard levels of cuts used for electron identification. The results are shown for 
the simulated filtered di-jet and minimum-bias samples, corresponding respectively to Ej -thresholds of 
17 GeV (left) and 8 GeV (right). The three bottom rows show the fractions of all surviving candidates 
which fall into the different categories for the medium cuts and the two sets of tight cuts. The isolated 
electrons are prompt electrons from W, Z and top-quark decay and the non-isolated electrons are from 
b, c decay. The residual jet background is split into its two dominant components, electrons from photon 
conversions and Dalitz decays (first term in brackets) and charged hadrons (second term in brackets). 
The quoted errors are statistical. 



electron identification performance of ATLAS with real data. One example is the understanding of 
material effects and of inter-calibration between inner detector and EM calorimeter using E/p for a 
clean subset of the inclusive electrons with Ej > 10 GeV This sample will be complementary to the 
samples of low-mass electron pairs from J/yr and T decays, discussed in [7]. A second example is the 
certification of the isolated electron identification using a clean sample of W — > ev decays. Clearly, 
with more statistics, the large samples of Z — > ee decays which will be collected will provide the 
opportunity to refine the understanding of the performance to an extremely high level of accuracy, as 
discussed in Section 5. 

2.1.4 Systematic uncertainties on expected performance 

To estimate possible systematic uncertainties related to the cut-based electron identification, two shower 
shape variables have been studied as a function of the amount of material in front of the EM calorimeter. 
Figure 5 illustrates the impact of additional material, the effect of which has not been included in the 
EM cluster corrections which are applied as described in [3], for electrons from H —>■ eeee decays. 
The results are shown in two | r\ | -ranges for the nominal material and for the case of additional material 
accounting in total to ~ 0.1 Xo and ~ 0.2 Xo (Fig. 5). It is evident that in regions with significant 
amounts of material the shower is broader (less energy in the core). These differences reduce the electron 
efficiency; however, the true systematic error on the efficiency due to such effects will depend on how 
well the inner-detector material can be measured using data. 

Figure 6 shows the fraction of energy in the strip layer outside the three core strips and inside the 
seven-strip window for the same 1 77 1 -ranges. The impact of the additional material is also clearly visible. 
The estimated change in the electron efficiencies quoted in Table 5 is expected to be less than 2%. It is 
important to note that the material effects are more pronounced in the strip layer than in the middle layer 
of the calorimeter. Therefore, one should expect larger uncertainties from this source of systematics for 
the medium electron cuts than for the loose electron cuts, which rely only on the middle layer of the 
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E T > 17 GeV 


E T > 8 GeV 




Efficiency (%) 


Jet rejection 


Efficiency (%) 


Jet rejection 




Z -> ee 






Single electrons 
(E T =10 GeV) 


b,c^e 




0.00 - 0.80 


88.2 ± 0.1 


35 ± 1 


3740 ± 50 


79.3 ± 0.2 


51 ± 1 


1960 ± 30 


0.80 - 1.35 


83.5 ± 0.1 


40 ± 1 


1581 ± 20 


70.6 ± 0.2 


52 ± 1 


914 ± 11 


1.35 - 1.50 


71.5 ± 0.4 


41 ± 2 


444 ± 5 


49.6 ± 0.5 


40 ± 3 


342 ± 5 


1.50 - 1.80 


63.8 ± 0.2 


18 ± 1 


2440 ± 40 


41.8 ± 0.4 


24 ± 2 


890 ± 15 


1.80 - 2.00 


62.5 ± 0.2 


12 ± 1 


9800 ± 450 


55.1 ± 0.4 


25 ± 2 


4660 ± 220 


2.00 - 2.35 


65.8 ± 0.2 


16 ± 1 


8400 ± 300 


55.0 ± 0.3 


21 ± 2 


6000 ± 250 


2.35 - 2.47 


67.8 ± 0.3 


14 ± 2 


4050 ± 170 


62.5 ± 0.6 


30 ± 3 


3980 ± 250 


0.00 - 2.47 


77.3 ± 0.06 


31 ± 1 


2184 ± 13 


64.8 ± 0.1 


42 ± 1 


1288 ± 8 



Table 6: Expected efficiencies for isolated and non-isolated electrons and corresponding jet background 
rejections for the medium identification cuts as a function of | r\ \ . The results are shown for the simulated 
filtered di-jet and minimum-bias samples, corresponding respectively to Ej -thresholds of 17 GeV (left) 
and 8 GeV (right). The quoted errors are statistical. 




E T (GeV) ' K (Gov; 



Figure 3: Differential cross-sections as a function of Ej before identification cuts and after loose, 
medium, tight (TRT) and tight-isol cuts, for an integrated luminosity of 100 pb _1 and for the simulated 
filtered di-jet sample with Ej above 17 GeV (left) and the simulated minimum-bias sample with Ej 
above 8 GeV (right). 



calorimeter. 

Another important source of systematics affects the jet rejections quoted in Table 5: this arises from 
the exact pj -spectrum and mixture of quark and gluon jets, and to a certain extent from heavy flavour jets 
present in the background under consideration. The numbers quoted in this note are related to the rather 
low-pr di-jet background which is relevant for the search for early signals from single electrons. Other 
background samples relevant to certain physics studies have been shown to display worse rejections, by 
up to a factor of 3 to 5. This clearly indicates that the fake electron rates will only be better understood 
with real data. 

2.1.5 Multivariate techniques 

In addition to the standard cut-based electron identification described above, several multivariate tech- 
niques have been developed and implemented in the ATLAS software. These include a likelihood dis- 
criminant, a discriminant called H-matrix, a boosted decision tree, and a neural network. Table 7 sum- 
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Figure 4: Differential cross-sections as a function of Ej after tight (TRT) cuts, shown separately 
for the expected components from isolated electrons, non-isolated electrons and residual jet back- 
ground, for an integrated luminosity of 100 pb _1 and for the simulated filtered di-jet sample with Ej 
above 17 GeV (left) and the simulated minimum-bias sample with Ej above 8 GeV (right). 




Figure 5: Energy containment, (Table 3), for 1.12 < |t]| < 1.25 (left) and 1.62 < |t]| < 1.75 (right). 
The symbols correspond to the nominal description and the histogram to the one with additional material. 

marises the gains in efficiency and rejection which may be expected with respect to the cut-based method 
by using the likelihood discriminant method. The gains appear to be artificially large in the case of the 
loose and medium cuts, because these cuts do not make use of all the information available in terms of 
electron identification, since they were designed for robustness and ease of use with initial data. Nev- 
ertheless, they indicate how much the electron efficiency may be improved once all the discriminant 
variables will be understood in the data. 

Figure 7 shows the rejection versus efficiency curve obtained using the likelihood discriminant 
method, compared to the results obtained for the two sets of tight cuts shown in Table 5. The likeli- 
hood discriminant method provides a gain in rejection of about 20-40% with respect to the cut-based 
method for the same efficiency of 61-64%. Alternatively, it provides a gain in efficiency of 5-10% (tight 
and medium cuts) for the same rejection. Multivariate methods of this type will of course only be used 
once the detector performance has been understood using the simpler cut-based electron identification 
criteria. 
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Figure 6: Energy fraction outside a three-strip core, F s id e (Table 3), for 1.12 < |tj| < 1.25 (left) and 
1.62 < 1 77 1 < 1.75 (right). The symbols correspond to the nominal description and the histogram to the 
one with additional material. 



Cuts 


Cut-based method 


Likelihood method 




Rejection Rj 


Efficiency (%) at fixed Rj 


Rejection at fixed e e 




87.97 ±0.05 


567 ±1 


89.11 ±0.05 


2767 ±17 


Medium 


77.29 ±0.06 


2184±7 


88.26 ±0.05 


(3.77±0.08) x 10 4 


Tight (isol) 


64.22 ±0.07 


(9.9 ±0.2) x 10 4 


67.53 ±0.06 


(1.26 ±0.05) x 10 5 


Tight (TRT) 


61. 66 ±0.07 


(8.9 ±0.2) x 10 4 


68.71 ±0.06 


(1.46 ±0.06) x 10 5 



Table 7: For the loose, medium and tight electron identification cuts, expected electron efficiencies for 
a fixed jet rejection and jet rejections for a fixed electron efficiency, as obtained from the likelihood 
discriminant method. The quoted errors are statistical. 



2.2 Isolation studies 

Many physics analyses in ATLAS will be based on final states with isolated leptons from decays of W- or 
Z-bosons. These channels usually have the advantage of small background expectation from processes 
with similar signature, compared to channels with hadronic final states. Nevertheless, they may also 
suffer from jet background processes, namely if leptons from semi-leptonic heavy-quark decays mimic 
the isolated leptons of the signal. Therefore, dedicated tools beyond the lepton identification algorithms 
are needed in order to suppress such sources of background by factors of up to the order of 10 3 . In 
this section, the performance of a projective likelihood estimator for the separation of isolated electrons 
from non-isolated electron backgrounds is described. The four variables chosen as input to this isolation 
likelihood are: 

- transverse energy deposited in a small cone of AR < 0.2 around the electron cluster; 

- transverse energy deposited in a hollow cone of 0.2 < AR < 0.4 around the electron cluster; 

- sum of the squares of the transverse momenta of all additional tracks measured in a cone of AR < 
0.4 around the electron cluster; 

- impact parameter significance of the electron track (with respect to the primary vertex in the trans- 
verse plane). 

Electrons from Z —> ee decays were used as a clean source of isolated electrons. The reconstructed 
electrons from this sample were required to be matched to a Monte Carlo electron from Z-boson decay 
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Figure 7: Jet rejection versus isolated electron efficiency obtained with a likelihood method (full circles) 
compared to the results from the two sets of tight cuts (open triangle and open square). 




Figure 8: Background electron rejections versus signal efficiencies for electrons in Z —>■ ee decays (left) 
and in tt decays (right), for two illustrative bins in |tj | and pj . 

and to pass the medium identification cuts in order to be considered as signal electrons. Background 
electrons were selected from a high-statistics tt sample, filtered for a pair of like-sign Monte Carlo 
electrons, and matched to a Monte Carlo electron from b/c-decay. 

The results of the performance studies of the isolation likelihood are shown in Fig. 8 for two illus- 
trative bins in \rj\ and pj . The best results are achieved for high-pr electrons measured in the barrel 
region of the EM calorimeter. As can be seen in Fig. 8 left, for electrons with only little hadronic activity 
in the final state, such as those from Z — > ee and H — > eeee decays, the isolation likelihood provides a 
background rejection of the order of 10 3 , for signal electron efficiencies of 80% (barrel) and 50% (end- 
caps). The difference observed between barrel and end-caps is mostly due to the rj -dependence of the 
medium identification cuts shown in Table 6. For comparison, the efficiency for the selection of signal 
electrons in tt events is shown in Fig. 8 right: due to the additional hadronic activity in these final states, 
the efficiency decreases by 5-10% for the same background rejection, when compared to that quoted for 
Z — > ee decays. 
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Figure 9: Example of discriminating variables used in the forward region for signal electrons (full circles) 
and the QCD di-jet background (open circles). Shown in the case of the FCal are the fraction of the total 
cluster energy deposited in the cell with maximum energy (left) and the relative lateral moment (right). 

3 Electron identification outside the inner detector acceptance 

Electron identification in the forward region (|tj | > 2.5) will be important in many physics analyses, in- 
cluding electroweak measurements and searches for new phenomena. In contrast to the central electrons, 
forward electron reconstruction can only use information from the calorimeters, since the inner detector 
covers only < 2.5. Such electrons can therefore only be identified cleanly above the background in 
specific topologies, such as Z —> ee or H — »• eeee decays. 

This section describes the performance of a cut-based method used to identify electrons in the for- 
ward region and separate them from the QCD background. The comparison of the performance obtained 
with a likelihood method is also presented. 

Signal electrons are selected from Z — >• ee decays and background electrons from a high-statistics 
sample of QCD di-jet events. Three | r\ | -regions are considered: the first one covers the inner wheel of the 
electromagnetic end-cap, i.e. 2.5 < \r\ \ < 3.2 (the HEC is not used), the second one covers the overlap 
region between the electromagnetic end-cap and the forward calorimeter (FCal), i.e. 3.2 < \ r\ \ < 3.4, and 
the last region covers the FCal acceptance, i.e. 3.4 < |t]| < 4.9. A topological clustering algorithm [13] 
is used in this analysis and only clusters with Ej > 20 GeV are considered. Two examples of the 
discriminating variables used in these studies are shown in Fig. 9, namely the fraction of the total cluster 
energy deposited in the cell with maximum energy and the relative lateral moment. The relative lateral 
moment is defined as Iat2/(lat2 + lat max ), where the lateral moments lat2 and lat max differ in the treatment 
of the two most energetic cells. Other examples include the first moment of the energy density, the 
relative longitudinal moment, defined in the same way as the relative lateral moment only with two 
longitudinal moments, the second moments of the distances of each cell to the shower barycentre and to 
the shower axis, and the distance of the cluster barycentre from the front face of the calorimeter. 

The likelihood discriminant uses the same variables as the cut-based method. Figure 10 shows the 
performance of the cut-based and likelihood discriminant methods for electrons from Z — > ee decay 
with Ej > 20 GeV. For an electron identification efficiency of 80%, both methods achieve the required 
goal of ~ 1% fake rate from the QCD background. This performance is expected to yield, for example, 
a clean Z — > ee sample with one electron already selected in the central region and one electron in the 
forward region [14]: the expected background contribution under the Z-boson peak is estimated to be 
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2.5 < hi < 3.2 



3.4 < \t\\ < 4.9 



Figure 10: Expected rejection against QCD jets versus efficiency for signal electrons from Z — > <?e decay, 
for the cut-based and likelihood discriminant methods in the inner wheel of the electromagnetic end- 
cap (left) and in the FCal (right). The rejection power of the likelihood method is expected to increase 
when additional variables beyond the minimal set shown here are added. 




Figure 1 1 : Electron identification efficiency as a function of Tj (left) and Ej (right) for electrons with 
Ej > 5 GeV from H — ► eeee decays. 



below ~ 1%. 



4 Electrons as probes for physics within and beyond the Standard Model 

4.1 Electrons in Higgs-boson decays 

Electrons from the H — ► eeee decay with ra# < 2mz are an important benchmark for the evaluation of the 
performance of the electron reconstruction and identification [15]. Here, only electrons with |n| < 2.5 
and Ej > 5 GeV are considered. The electron efficiency as a function of | T] | and Ej for loose, medium, 
and tight electron cuts is shown in Fig. 11. The drop in efficiency at low Ej is mainly due to the loss of 
discrimination power of the shower-shape cuts at lower transverse energies. A loss of efficiency is also 
visible in the transition region between the barrel and end-cap calorimeters. The results shown here are in 
quantitative agreement with those obtained for electrons from Z — > ee decay discussed in Section 2.1.2. 
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Figure 12: Electron identification efficiency as a function of Ej (left) and |t]| (right). The full sym- 
bols correspond to electrons in SUSY events and the open ones to single electrons of fixed Ej . The 
efficiencies as a function of 1 77 1 are shown only for electrons with Ej > 17 GeV. 



4.2 Electrons produced in decays of supersymmetric particles 

In many supersymmetry (SUSY) scenarios, the most abundantly produced sparticles are squarks (directly 
or from a gluino decay), which generally decay into a chargino or neutralino and jets. In turn, charginos 
and neutralinos are very likely to decay into leptons. One interesting mode for SUSY searches is the 
tri-lepton signal, in which three isolated leptons are expected in the final state. Such SUSY events would 
feature high-/^ isolated leptons accompanied by a high multiplicity of high-£y jets. Hence, it is crucial 
to efficiently identify electrons in such an environment, while preserving the very high jet rejection 
presented in Section 2. The electron identification efficiency in SUSY events is calculated using the 
SU3 ATLAS point [16]. In this scenario, a large number of charginos and neutralinos are produced and 
numerous leptons are expected in the final state. 



■ Tight cuts 

■ Medium 
1 Loose 



Figure 13: Electron identification efficiency as a function of the distance AR to the closest jet in SUSY 
events, for electrons with Ej > 17 GeV. 



Figure 12 shows the identification efficiency of the loose, medium and tight (isol) cuts as a function 
of Ej and \r\\. The efficiencies shown as a function of Ej are compared with efficiencies for single 
electrons of Ej = 10, 25, 40, 60 and 120 GeV. As expected, single electrons display higher efficiencies 
than those in SUSY events, because of the large hadronic activity in these events. The efficiencies 
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Figure 14: Electron identification efficiency as a function of Ej (left) and \r\ \ (right), for electrons from 
Z' — > e + e~ decays with m z i = 1 TeV. 

obtained for values of Ej below 20 GeV, are significantly below the plateau values at high Ej , for 
which the cuts were initially optimised. 

The efficiencies as a function of \t}\ show the same features as those discussed in Table 6, namely 
the efficiency in the end-cap region is lower than in the barrel, whereas the jet rejection is significantly 
higher. Specific drops in efficiency can be seen for |t]| ~ 1.35, which corresponds to the barrel/end-cap 
transition region, and for |t}| 0.8, which corresponds to the change in the lead thickness between the 
two types of electrodes in the barrel EM calorimeter. 

Figure 13 shows the electron identification efficiency as a function of the distance A R to the closest 
jet in SUSY events. Jets are reconstructed from topological clusters using a AT? = 0.4 cone algorithm. 
For values of AR > 0.4, the efficiencies are compatible with those expected for single electrons, whereas 
for values of AR < 0.4, the efficiencies decrease because of the overlap between the hadronic showers 
from the jet and the electron shower itself. 



Jet Ej -range 


140-280 GeV 


280 -560 GeV 


560- 1120 GeV 




Efficiency 


Rejection 


Efficiency 




Efficiency 






86.6 ±0.2% 


825 ±35 


89.6 ±0.1% 


620 ±25 


9 1.5 ±0.4% 


550 ±20 


Medium cuts 


80.6 ±0.2% 


4000 ±370 


84.6 ±0.1% 


2300 ±170 


86.7 ±0.5% 


1900 ±120 



Table 8: Electron identification efficiencies and QCD di-jet background rejections obtained for loose and 
medium identification cuts, including a calorimeter isolation cut (see text), and for three different jet Ej 
-ranges. The signal electrons are from Z — »• e + e~ decays with m z i = 1 TeV and are required to have Ej 
> 100 GeV. 



4.3 Electrons in exotic events 

High-mass di-electron final states are a promising source of early discovery physics, because of the sim- 
plicity and robustness of very high-/;>7 electron reconstruction, identification and resolution. Very high- 
er electrons refer here to those with transverse momentum ranging from 100 GeV up to several TeV. 
The backgrounds to very high-p^ electron pairs are expected to be small, and, therefore, only loose or 
medium identification cuts are considered here. Isolated electrons are required to satisfy the calorimeter 
isolation cut described in Section 2. 
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Figure 14 shows efficiencies as a function of Ej and |t]| for the loose and medium identification 
cuts, for electrons from Z' — > e + e~ decays with m z i = 1 TeV [17]. From these curves, one can note 
the slow increase in efficiency with Ej before reaching a plateau in the very high-£y region. Overall 
efficiencies of ~ 90% and of ~ 85% can be achieved for loose and medium electron cuts, respectively, 
with a uniform behaviour limited to the barrel region, i.e. \r\\ < 1.5. 

The QCD background rejection was studied as a function of the jet transverse energy, as shown 
in Table 8. Using the medium identification cuts, which correspond to an overall efficiency of ~ 85%, a 
jet rejection factor of several thousand can be achieved for Ej > 100 GeV, which should be sufficient 
to observe the signal in many exotic scenarios. 

5 Electrons from Z — > ee decays in early data 

The experimental uncertainty on the electron identification efficiency is expected to be the source of one 
of the main systematic errors in many measurements, and in particular in cross-section determinations. In 
addition, a reliable monitoring of the electron identification efficiency is important in the commissioning 
phase of the detector and software. The previous sections have shown detailed estimates of the expected 
electron identification efficiency based on simulated samples. This section focuses on the measurement 
of electron reconstruction and identification efficiencies using a data-driven approach based on Z — > ee 
events. 

The tag-and-probe method [18] is used in this analysis. It consists of tagging a clean sample of events 
using one electron, and then measuring the efficiency of interest using the second electron from the Z- 
boson decay. Although more difficult because of trigger-threshold issues and of more severe background 
conditions, the same approach could be applied toJ/y and T resonances, thus covering the lower end of 
the pr spectrum [7]. 

5.1 Tag-and-probe method 

The tag condition typically requires an electron identified with tight cuts. Both electrons are also required 
to be above a pj threshold consistent with the trigger used. The invariant mass of the lepton pair is then 
used to identify the number of tagged events, N\ (containing Z —>■ ee decays), and a sub-sample N2, 
where the second pre-selected electron further passes a given set of identification cuts. The efficiency for 
a given signature is given by the ratio between N2 and Ni . 

To account for background, the lepton-pair invariant mass spectrum is fitted around the Z mass peak 
using a Gaussian distribution convoluted with a Breit-Wigner plus an exponential function. The dominant 
background arises from QCD and is estimated using a procedure explained in [18]; its contribution is 
small in general and its impact on the measurement is therefore very limited. 

The probe electron is checked against the selection as an electron candidate (to which only the pre- 
selection cuts are applied), and as a loose, medium or tight electron. To monitor in detail the efficiency 
dependence, the results are presented in bins of Vj and pr , at the expense of an increased statistical error 
in each bin. 

A quantitative comparison between the efficiency computed with this tag-and-probe method (Etp) 
and the efficiency obtained from the Monte Carlo truth (£mc) is use d to validate the tag-and-probe 
method. 

5.2 Electron reconstruction efficiency 

The reconstruction and identification of electrons is based on seed-clusters in the electromagnetic calorime- 
ter matched to tracks, as explained in Section 2. The tag electron is a reconstructed electron selected using 
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Figure 15: Efficiency of the electron pre-selection as a function of \r\ | (left) and Ej (right) for Z — > ee 
decays, using the tag-and-probe method and the Monte Carlo truth information. 

tight (isol) cuts and also required to pass the trigger EM13i/el5i [11]. The tag electron is also required to 
be outside the barrel/end-cap transition region (1.37 < \r\\ < 1.52). The probe electron is pre-selected 
by identifying a cluster in the opposite hemisphere, such that the azimuthal difference between tag and 
probe electrons is A(j) > 3 /4k. Both tag and probe electrons are required to have Ej >15 GeV. The 
invariant mass of the lepton pair is required to be between 80 and 100 GeV. Figure 15 compares Ejp 
and Emc as a function of | r\ | and Ej . Table 9 summarises the results obtained for this first step in the 
reconstruction and identification of the probe electron. 



Ej —range (GeV) 


15-25 


25-40 


40-70 


r\ —range 


£ TP 


&£tp/mc 


£tp 


A£tp/mc 


£tp 


&£tp/mc 


0-0.80 


96.1 ±0.4 


2.0+0.4 


96.2+0.2 


0.1±0.2 


99.0+0.1 


2.0±0.1 


0.80-1.37 


94.9±0.6 


1.5±0.6 


96.0+0.2 


1.6±0.2 


95.1+0.2 


-0.5±0.2 


1.52-1.80 


89.0±1.2 


3.6±1.2 


88.8+0.6 


1.3±0.6 


91.9+0.6 


1.7±0.6 


1.80-2.40 


83.0±1.0 


0.6±1.0 


83.2+0.6 


0.8+0.6 


84.9+0.6 


1.1+0.6 



Table 9: Efficiency of the electron pre-selection, Ejp, in percent as obtained from the tag-and-probe 
method, for different ranges of electron Ej and \r\\. The errors quoted for Ejp are statistical and cor- 
respond to an integrated luminosity of 100 pb~ ! . Also shown is the difference, Ae tp / mc , between this 
estimate of the pre-selection efficiency and that obtained using the matching to the Monte Carlo electron. 



5.3 Electron identification efficiency. 

In this section, the electron identification efficiency is presented with respect to the reconstructed elec- 
trons discussed in Section 5.2. The QCD background was not considered here, since it is less than a 
few percent below the Z-boson mass peak. The reconstructed probe electron was checked against loose, 
medium and tight selection cuts. Table 10 summarises the results obtained for this second step in the 
reconstruction and identification of the probe electron. Figure 16 shows as a function of T7 and pp the 
comparison between Ejp and Emc, for the medium cuts. The losses at high r\ are due to the material in 
the inner detector, as discussed in Section 2. 



89 



Electrons and Photons - Reconstruction and Identification of Electrons 



• Tag-and-probe 
— Monte Carlo truth 



0 0.2 0.4 0.6 0.8 



1.8 2 2.2 2.4 



• Tag-and-probe 
— Monte Carlo truth 



20 30 40 50 60 70 80 90 



Figure 16: Efficiency of the medium electron identification cuts relative to the pre-selection cuts as a 
function of |tj | (left) and Ej (right) for Z — > ee decays, using the tag-and-probe method and the Monte 
Carlo truth information. 

5.4 Statistical and systematic uncertainties 

A number of uncertainties may affect these tag-and-probe measurements once the accumulated data will 
provide high enough statistics to perform similar measurements to those quoted above: 

• Differences between Ejp and £mc 

The relative difference Ae TP / MC in regions (in pj and |tj|), where the efficiency is flat, is less 
than 0.5%, assuming that the statistical error on Emc is negligible. Ae TP / MC marginally depends 
on the definition of a true electron and the systematic uncertainty related to this is estimated to 
be < 0.1%, when varying the cut on the separation in 7]/(j) space (AR) between the reconstructed 
electron candidate and the true electron. 

• Statistical uncertainty. 

The size of the available Z-boson sample is a source of systematic error. With an integrated lumi- 
nosity of 100 pb _1 , the error is expected to be in the range 1-2% for pj > 25 GeV, and ~ 4% in 
the low-pr bin. 

• Selection criteria 

Another source of systematic error comes from varying the selection criteria. For instance, un- 
certainties introduced by varying the cut on the Z-boson mass or requiring an isolation criterion 
for the probe electron were evaluated. The magnitude of the uncertainty introduced is smaller 
than 0.5% for pj > 40 GeV. At low pj , this uncertainty is estimated to be in the 1-2% range. 

• QCD background contribution 

Adding the expected contribution from the QCD background to the signal does not degrade the 
results, except for 1.52 < \r\ \ < 1.8, a region which is close to the barrel/end-cap transition re- 
gion and also where the efficiency is not uniform. The contribution from the uncertainties on the 
residual QCD background is expected to be negligible. 

6 Conclusion 

Excellent electron identification will clearly play an important role at the LHC, since high-pr leptons 
will be powerful probes for physics within and beyond the Standard Model. Based on this motivation, 
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Loose 

\n\\pr 


15-25 


25-40 


40-70 


Etp 


A£tp/mc 


Etp 


A£tp/mc 


Etp 


^■Ejp/MC 


0-0.8 


95.2 ±2.0 


-4.1 ±2.0 


98.8±0.3 


-0.5 ±0.3 


99.8±0.1 


0.2 ±0.1 


0.8-1.37 


92.3 ±2.1 


-6.9±2.1 


98.9 ±0.3 


-0.7 ±0.3 


99.6 ±0.2 


0.0 ±0.2 


1.52-1.8 


100.0 ±2.8 


1.7±2.8 


99.4 ±0.5 


0.0 ±0.5 


99.6 ±0.5 


0.0±0.5 


1.8-2.4 


98.8±1.6 


0.6±1.7 


98.8±0.5 


0.0 ±0.5 


99.1 ±0.4 


-0.2 ±0.4 



Medium 

\n\\pT 


15-25 


25-40 


40-70 


£tp 


^E T p/Mc 


Etp 


AE T p/MC 


Etp 


^Epp/Mc 


0-0.8 


83.6±2.3 


-4.3 ±2.7 


89.7 ±0.7 


-0.8 ±0.8 


92.6 ±0.5 


-0.2 ±0.6 


0.8-1.37 


75.6±2.8 


-7.5±3.4 


87.6±0.9 


0.7 ±1.0 


90.9 ±0.8 


-0.4 ±0.8 


1.52-1.8 


71.9±4.4 


5.9 ±6.5 


76.9±1.9 


-2.2 ±2.4 


83.6±1.9 


0.7 ±2.3 


1.8-2.4 


78.0±2.7 


6.5 ±3.7 


79.2 ±1.4 


1.7±1.8 


82.5 ±1.4 


-1.0±1.6 




Tight 

\n\\pr 


15-25 


25-40 


40-70 


£tp 


^EjpjMC 


Etp 


^Epp/MC 


Etp 


kEpp/MC 


0-0.8 


68.7 ±2.6 


-5.2±3.5 


73.8 ±1.0 


-1.2±1.3 


77.0 ±0.9 


— 1.5 ±1.1 


0.8-1.4 


61.8±3.0 


-3.1±4.7 


72.9 ±1.2 


0.7 ±1.7 


77.3±1.1 


0.2±1.5 


1.5-1.8 


55.7 ±4.5 


6.8±8.6 


65.9±2.1 


-0.8±3.1 


73.7 ±2.2 


1.2±3.1 


1.8-2.4 


66.2 ±3.0 


8.5±4.9 


66.0 ±1.6 


2.6±2.5 


73.4±1.6 


0.7 ±2.2 



Table 10: Loose, medium and tight electron identification efficiencies relative to the pre-selection effi- 
ciencies for different bins in Ep and | TJ | . The first error is statistical and corresponds to an integrated 
luminosity of 100 pb _1 . The second error is the difference obtained between Epp and Emc- 

various algorithms and tools have been developed to efficiently reconstruct and identify electrons and 
separate them from the huge backgrounds from hadronic jets. 

Presently, two reconstruction algorithms have been implemented in the ATLAS offline software, both 
integrated into one single package and a common event model. The first one relies on calorimeter seeds 
for reconstructing electrons, whereas the second algorithm relies on track-based seeds, is optimised for 
electrons with lower energies, and relies less on isolation. 

The calorimeter based algorithm starts from the reconstructed cluster in the electromagnetic calorime- 
ter, then builds identification variables based on information from the calorimeter and the inner detector. 
The rejection power with respect to QCD jets comes almost entirely from the identification procedure. 
Depending on the electron transverse energy and the analysis requirements, rejection factors of 500 to 
100 000 can be achieved, for efficiencies of 88% to 64%, using a simple cut-based selection. More re- 
fined identification procedures combining calorimeter and track quantities using multivariate techniques 
provide a gain in rejection of about 20 — 40% with respect to the cut-based method, for the same effi- 
ciency of 61 — 64%. Alternatively, they provide a gain of 5 — 10% in efficiency, for the same jet rejection 
(tight and medium cuts). 

Electrons in the forward region can also be identified and separated from the background. A simple 
cut-based method, exploring the energy depositions in the inner wheel of the electromagnetic end-cap 
calorimeter and in the forward calorimeter as well as the shower-shape distributions, shows that ~ 99% 
of the QCD background can be rejected, for an electron identification efficiency of ~ 80%. This per- 
formance should be sufficient to select cleanly, for example, Z —> ee decays with one electron in the 
forward region. 

Studies of the strategies for measuring efficiencies and fake rates in early data show that the tag-and- 
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probe method is a good tool to estimate the electron identification efficiency and to control the reliability 
of the Monte Carlo simulation. With 100 pb _1 , the method is limited by the statistics of the Z sample, 
whereas its systematic uncertainty is of the order of 1 to 2 %. 

The work presented here primarily addresses the description and performance of the offline recon- 
struction and identification of electrons. However, it also gives an overview of the possible path towards 
physics discoveries with electrons in Higgs, SUSY, and exotic scenarios. 
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Abstract 

This note presents the description and performance of photon identification 
methods in ATLAS. The reconstruction of an electromagnetic object begins 
in the calorimeter, and the inner detector information determines whether the 
object is a photon - either converted or unconverted - or an electron. Three pho- 
ton identification methods are presented: a simple cut-based method, a Log- 
likelihood-ratio-based method and a covariance-matrix-based method. The 
shower shape variables based on calorimeter information and track informa- 
tion used in all three methods are described. The efficiencies for single pho- 
tons and for photons from the benchmark H — > 77 signal events, as well as the 
rejection of the background from jet samples, are presented. The performance 
of the cut-based method on high-pr photons from a graviton decay process 
G — > 77 is also discussed. 

1 Introduction 

Isolated photons with large transverse momentum, pj, in the final state are distinguishing signatures for 
many physics analyses envisaged at the LHC. The Higgs particle has been sought over several decades 
in many high-energy experiments, including those currently running at the Tevatron. It is understood 
that if the Standard Model Higgs particle exists, and unitarity is not violated, its mass is within the reach 
of LHC. As described in detail in other parts of this work [1], while the expected cross-section times 
branching ratio of the Higgs particle decaying into the two photon final state is relatively small, given its 
distinct signature, isolated high-/^ photons may play a significant role in discovering the Higgs particle 
in the low mass region. In addition, very high-/^ photons are also signatures of more exotic particles, 
such as the graviton predicted in Ref. [2], which is expected to have mass larger than 500 GeV. These 
photons appear as a single, isolated objects with most of their energy deposit in the electromagnetic 
compartment of the calorimeter. Thus the primary source for background to these photons, namely fake 
photons, result from jets that fluctuate highly electromagnetic which contain a high fraction of photons 
from neutral hadron decays, such as n° — >• 77. 

Since the ATLAS electromagnetic calorimeter [3] is highly segmented with a three-fold granularity 
in depth and with an 17 x <j) granularity in the barrel of 0.003 x 0.1, 0.025 x 0.025, and 0.05 x 0.025, re- 
spectively, in the front, middle and rear compartments assisted by a pre-sampler in front of the calorime- 
ter, photon identification methods in ATLAS should be much more powerful that those used in past 
experiments. The experiment also employs elaborate trigger systems that select electrons and photons 
efficiently, as described in detail in Ref. [4]. 

This paper presents three ATLAS photon identification methods and their performance for single, 
isolated photons as well as for photons from physics processes. 

2 Data samples 

The H —>yy (ra# =120 GeV) process is used as the primary signal benchmark sample for medium pj 
photons and with the pile-up that corresponds to the instantaneous luminosity 10 33 cm _2 s _1 . Rejection 
studies were conducted using a pre-filtered jet sample (described in details in Ref. [5]), containing all 
relevant hard-scattering QCD processes with pj > 15 GeV. A filter is applied at the generator level, 
requiring the summed transverse energy of all stable particles (excluding muons and neutrinos) in a 
region of A(j) x Arj = 0.12x0.12 to be above 17 GeV. A total number of 3 million events were used 
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in rejection studies. Two additional samples with 150 GeV < pj < 280 GeV (Jet5) and 280 < pj < 
400 GeV (Jet6) were also employed for high-^7- photon rejection studies. Finally, an additional 300,000 
event y+jet sample has been used for rejection and fake rate studies. 

In addition to these signal and background samples, the three identification methods described in 
this paper were developed using single photon samples - events with no activity except the photon - 
with full detector simulation in the energy range 10 — 1000 GeV with flat pseudorapidity distributions 
over 1 77 1 < 2.5. For high-/^ photons, graviton samples with masses of 0.5 and 1.0 TeV were employed. 

All the samples used in this note were generated using PYTHIA and its fragmentation scheme and 
were passed through the full detector simulation. Some of the simulations were done with the nominal 
geometry and material distribution ("ideal") and others with additional material added ("distorted"). 

In order to maintain the consistency between different studies, the following requirements and defi- 
nitions are used for efficiencies and rejections. 

• Truth match: the reconstructed photons must lie within a cone of radius A R = y / At] 2 + A^> 2 < 0.2 
of the true photons in the simulation. 

• The reconstructed photons must be within the fiducial volume, pseudorapidity 0 < |tj | < 1.37 or 
1.52 < 1 77 1 < 2.47 to avoid the overlap between the barrel and end-cap calorimeters. 

Using the base samples that satisfy the above requirements, the efficiency is defined as follows: 



where Ny Uth is the number of true photons in the simulation that satisfy all the requirements above with 
the true Ej greater than either 25 GeV or 40 GeV and Ny eco is the number of reconstructed photons that 
satisfy all the requirements with the true E T greater than either 25 GeV or 40 GeV and that pass the 
threshold for one of the three methods. 

Similarly, the rejection from the pre-filtered jet sample is computed as follows: 



where Nj et is the total number of jets reconstructed in the normalisation sample (same generation as the 
reconstructed sample but without the filter requirements) using particle four-momenta from the generator 
hadron level within a cone size AR = 0.4, and A^2(= 400,000) is the number of events used in this 
normalisation sample. The values for Nj et /N2 in the fiducial volume of |t]| < 1.37 or 1.52 < \r)\ < 2.37 
are 0.226 for jets with E T > 25 GeV and 0.042 for jets with E T > 40 GeV. N fakey is the number of 
fake photons in the reconstructed (filtered) sample with the candidates that matched to true photons from 
the hard scatter or from quark bremsstrahlung removed, and N\{= 3,095,900) is the number of events 
analyzed from this sample. Finally, £y-fm er (= 0.082) is the efficiency of the generator level filter applied 
to the jet sample. 

3 Photon identification methods 

As discussed in previous sections, three photon identification methods have been developed and are 
available at present in ATLAS: a simple cut-based identification method, a Log-likelihood-ratio-based 
identification method (LLR) and the covariance-matrix-based identification method (H-matrix). A par- 
tial description of the basic electromagnetic object reconstruction and a detailed presentation of their 
calibration can be found in Ref. [6]. 




(1) 




(2) 
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3.1 Characteristic variables and cut- based photon identification 

In order to separate real photons from fake photons resulting from jets, several discriminating variables 
are defined using the information both from the calorimeters and the inner tracking system. Cuts on these 
variables are developed to maintain high photon efficiency even in the presence of pile-up resulting from 
the overlapping minimum bias events due to high instantaneous luminosity at the LHC. The discriminat- 
ing variables used in this study are the same as in previous studies [7-11]. Calorimeter information is 
used to select events containing a high-£y electromagnetic shower. The fine-grained first compartment 
allows to reject showers from photons from 7T° decays. Track isolation is used to improve the rejection. 
Only electromagnetic clusters with Ej > 20 GeV are used in this study. 

3.1.1 Variables using calorimeter information 

In the electromagnetic calorimeter, photons are narrow objects, well contained in the electromagnetic 
calorimeter, while fake photons induced from jets tend to have a broader profile and can deposit a sub- 
stantial fraction of their energy in the hadronic calorimeter. Hence, longitudinal and transverse shower- 
shape variables can be used to reject jets. 

• Hadronic leakage : The hadronic leakage is defined as the ratio of the transverse energy in the 
first layer of the hadronic calorimeter in a window AT7 x A(j) = 0.24 x 0.24 to the transverse en- 
ergy of the cluster in order to avoid boundary effects that could result from using readout cells. 
Real photons are purely an electromagnetic object, therefore they deposit their energy primarily 
in the electromagnetic compartment of the calorimeter. Fake photons induced from jets contain 
hadrons that would penetrate deeper into the calorimeter depositing sizable energy beyond the 
electromagnetic calorimeter. 

• Variables using the second compartment of the ECAL : Electromagnetic showers deposit most 
of their energy in the second layer of the electromagnetic calorimeter. For this reason several 
variables that measure the shape of the shower are available as follows: 

- The real photons deposit most of their energy in a A77 x A(j) = 3 x 7 window (in units of 
middle cells). The lateral shower-shape variables, R^ and R$, are given by the ratio of the 
energy reconstructed in 3 x 7 middle cells to the energy in 7 x 7 cells and the ratio of the 
energy reconstructed in 3 x 3 cells to the energy in 3 x 7 cells, respectively. Due to the effect 
of the magnetic field increasing the width of the converted photon contributions in the 0 
direction, R$ is less discriminating than R^ . 

- The lateral width in 77 is calculated in a window of 3 x 5 cells using the energy weighted sum 
over all cells. W2 = ^ £(^*^ 2 ) _ j^n^ik) j ; where E c is the energy deposit in each cell, 
and r\ c is the actual 77 position of the cell represented by the center of the cell in 77 direction. 
Therefore, n>2 is given in units of 77 . A correction is applied as a function of the impact point 
within the cell to reduce the bias from the finite cell size. 

• Variables using the first compartment of the ECAL : Cuts applied on the variables in the 
hadronic calorimeter and the second layer of the electromagnetic calorimeter reject jets which 
contain high-energy hadrons and resulting broad showers. Jets containing single or multiple neu- 
tral hadrons such as 77 and 7T°, provide the main contribution which can fake photons. The readout 
of the first layer of the calorimeter uses strips and provides very fine granularity in pseudorapidity. 
Thus, the information from this layer can be used to identify substructures in the showers and dis- 
tinguish isolated photons from the hard scatter and photons from 71° decays efficiently. The lateral 
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shower shape in the strips is exploited for |j] | < 2.35 where the strip granularity is sufficiently fine, 
as long as a 0.5% or larger fraction of the total energy is reconstructed in this layer. 




Figure 1: Distributions of the mean of each calorimetric discriminating variable as a function of the 
pseudorapidity | T] | for true and fake photons (before cuts) with 20 < Ej < 30 GeV. The samples have 
been simulated with the geometry under the realistic alignment scenario and additional material. 



- Since the energy-deposit pattern from 71° 's is often found to have two maxima due to 71° — > 
77 decay, showers are studied in a window At/ x A0 = 0.125 x 0.2 around the cell with 
the highest Ej to look for a second maximum. If more than two maxima are found the 
second highest maximum is considered. The following two variables are constructed using 
the information from the identified second maximum: 

• AE S = E max 2 — E m i n , the difference between the energy associated with the second max- 
imum £max2 and the energy reconstructed in the strip with the minimum value, found in 
between the first and second maxima, E m i n . 

• ^max2 = £"max2/(l + 9 x 10~ 3 £V/GeV), where Ej is the transverse energy of the cluster 
in the electromagnetic calorimeter. The value of the second maximal energy deposit is 
corrected as a function of the transverse energy of the cluster to minimise its sensitivity 
to fluctuations [9, 10]. 

- ^side = [£(±3) — E(±\)] /E(±\), the fraction of the energy deposited outside the shower 
core of three central strips. The variable E(±n) is the energy deposited in ± n strips around 
the strip with the highest energy. 

- w S 3 = Y^L-^f x (*' — imax) 2 /Y,Ei, the shower width over the three strips around the one with 
the maximal energy deposit. The index i is the strip identification number, i max the identi- 
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fication number of the most energetic strip, and is the energy deposit in strip /. w S 3 is 
expressed in units of strip cells and corrected for impact point dependence [9]. 
- w sto t, the shower width over the strips that cover 2.5 cells of the second layer (20 strips in the 
barrel for instance). It is expressed in units of strip cells. 

Figure 1 shows the average values of the calorimeter-based discriminating variables as a function 
of the absolute value of pseudorapidity. Features in the plots can be explained by: upstream material 
thickness which increases with pseudorapidity in the barrel; physical cell-size changes in the end-cap to 
maintain a constant granularity in tj-0; and the change in the granularity of the first layer in the end- 
cap. In particular, the rise of R maX 2 and of AE S for |t]| > 1.5 stems from a combination of effects from 
the variation of the quantity of the upstream material and changes in the strip-cell sizes in the end-cap 
calorimeters. The dip in the hadronic leakage variable near \f] \ = 1.1 corresponds to a smaller coverage 
by the first hadronic layer in this region. 





Figure 2: Normalised distributions of the discriminating variable for | t; | < 0.7 for true and fake photons 
(before cuts) with 20 < Ej < 30 GeV. The samples have been simulated with the geometry under the 
realistic alignment scenario. 



The cut values are tuned separately in six pseudorapidity intervals in |t]| < 2.37 to reflect the pseu- 
dorapidity dependence of these variables. The subdivision is motivated by the varying granularity and 
material in front of the electromagnetic calorimeter. The quantities calculated using the first compart- 
ment can be used only in the regions |tj| < 1.37 and 1.52 < |tj| < 2.37 since there are no strips in the 
crack region or beyond \rj\ > 2.40. In addition, up to eight different bins in transverse energy are also 
used for the cut value adjustment. Figure 2 shows the distributions of the variables in the first X] bin and 
in one energy bin. The dashed vertical lines represent the cut values in this bin. The variables are shown 
for all reconstructed electromagnetic objects before cuts. 
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Figure 3: Effect of pile-up and distorted material on mean values of two shower-shape variables for 
photons from H — ► 77 decays: (left) and energy of the second maximum in the first layer (right). 



Figure 3 shows the impact of pile-up and additional material before the calorimeter on the shower- 
shapes for photons from Higgs decays. The impact of the large amount of additional material in the 
transition region, 1 .5 < | T] | < 1 .8, in the realistic alignment geometry can clearly be seen for two shower- 
shape variables. While pile-up at a luminosity of 10 33 cm _2 s _1 does not change the average shower 
shape significantly as can be seen for the two variables in Figure 3, it is observed that it does increase 
RMS of the distributions. 

At present, the same cuts are applied for converted and unconverted photons. Studies of the y-n° 
separation, however, have shown that if conversions can be identified efficiently, different cuts can be 
applied for converted and unconverted photons [12], which could improve rejection by 10-20% while 
maintaining the same overall photon identification efficiency. 

The cuts have been chosen comparing the photons from H — > 77 decays to fake candidates in inclu- 
sive jet samples. For this optimisation, samples generated with realistic alignment geometry and pile-up 
have been used. Some improvement in the performances should be possible at higher Ej for further 
refinement and optimisation in some of the variables, such as hadronic leakage. The rejection presented 
in this paper has been estimated on a sample statistically independent from the one used to tune the cuts. 




Figure 4: Normalised distribution of the track-isolation variable for events passing the calorimeter selec- 
tion criteria. Left: comparison of true and fake photons. Right: comparison of early conversions (true 
conversion radius less than 40 cm) and late conversions (true conversion radius above 40 cm) for photons 
from H — > 77 decays. 
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3.1.2 Track isolation 

After the calorimeter cuts, the contamination of the inclusive signal from charged hadrons is greatly 
reduced. The remaining background is dominated by low track multiplicity jets containing high-/^ n Q 
mesons. In order to further remove fake photons from these jets, the track-isolation variable is defined 
as the sum of the pj of all tracks with pj above 1 GeV within AR < 0.3, where AR is the tj — 0 distance 
between the track position at the vertex and the cluster centroid. Track p T > 1 GeV is imposed to 
minimise the effect of pile-up and underlying events. 

Since the tracks from photon conversions should not be included in computing this variable, some 
additional selections are applied to tracks within AR < 0. 1 of the cluster centroid. The impact parameter 
with respect to the beam line must be less than 0. 1 mm. The track pj must not exceed 15 GeV to remove 
tracks from very asymmetric conversions, must not be part of a reconstructed conversion vertex and must 
have a hit in the innermost pixel layer. 

The plot on the left in Fig. 4 shows the distribution of the track-isolation variable for true and fake- 
photon candidates, after the calorimeter shower-shape cuts. An additional rejection of factor 1.5 to 2 is 
possible for a relatively small efficiency loss. The plot on the right in this figure shows the track-isolation 
variable for early converted and late converted photons. The difference between the two distributions 
is rather small, showing that the tracks from conversions have been efficiently removed. At present, a 
4 GeV upper cut on the track-isolation variable is applied for this method. 

3.2 Log-likelihood-ratio-based photon identification 

In the Log-likelihood-ratio (LLR)-based method, the distribution of each of the shower-shape variables 
is normalised to unity to obtain the probability density functions (PDF). The shower-shape variables 
are pseudorapidity-dependent, so they are separated in four regions of \r\ \ and three bins in p T for this 
method. The PDF's are obtained using 1.6 million y+jet events which provided slightly over 100,000 
events in each bin. Since the statistics for the PDF computation is somewhat low in some kinematic 
phase-space regions, further improvement can be obtained by using tools to smooth the PDF's to com- 
pensate for the low statistics [13]. Once the PDF's are established, the Log-likelihood-ratio parameter is 
defined as: 

LLR = J £\n(L si /L hi ), (3) 

where L w - and are PDF's of the i th shower-shape variable for the photon and the jet, respectively. 

The shower-shape variables used for the LLR method were the same as those used for the cut-based 
method described previously. Track isolation was also included as a discriminating variable in Equa- 
tion 3. Figure 5 shows the LLR parameter distribution for photons and for jets. The LLR cut can be 
tuned over r\ and p T to obtain an optimal separation between photons and jets. 

3.3 Covariance-matrix-based photon identification 

The shower-shape variables associated with a photon shower in the calorimeter are correlated. The co- 
variance matrix (H-matrix) technique takes advantage of these correlations. The technique was employed 
successfully in the D0 experiment at the Tevatron and was used to identify electrons [14]. 

The ten photon shower-shape variables used in the ATLAS H-matrix method are as follows: 

• Five longitudinal shower-shape variables: fraction of energy deposited in pre-sampler layer; frac- 
tions of energy deposited in sampling layers 1, 2 and 3 separately; and the hadronic leakage, the 
energy leakage into the first layer of the hadronic calorimeter. 
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Figure 5: Expected Log-likelihood ratio (LLR) cut-parameter distributions for photons (solid histogram) 
and for jets (dashed histogram). 
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Figure 6: The distributions of H-matrix % 2 for photons from the H — ► yy sample (solid histogram) and 
for jets from the inclusive jet samples (dashed histogram). 

• Five transverse shower-shape variables: the ratio of the energy in 3 x 3 cells to the energy in 7 x 7 
in the second sampling layer of the electromagnetic calorimeter; w rmS 3, the corrected width in 3 
strips in sampling layer 1; W2, the corrected width in a 3 x 5 window in sampling layer 2; the 
energy outside of the shower core; R$, the ratio of energy ina3x3toa3x7 window around the 
cluster centroid. 

Using the above variables, a covariance matrix, M, is constructed as follows: 

Mij = ^ N n=1 (yt ) -ydiyf -Jj), (4) 

where indices i and j run over the ten variables, N is the total number of photons used in the training 
sample, y n - is the j th variable for the n th photon candidate, and y • is the mean value of yj variable for the 
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control sample electrons/photons. These matrix elements are constructed for each X] bin and parametrised 
for energy dependences. The photon likeness of an object is then measured by the value of the x 2 , defined 
as follows: 

x 1 = 1^%^ -yi)Hijiyf -J]) (5) 

where H = M~ 1 , the inverse of the covariance matrix, and the indices / and j run from 1 to the total 
number of variables (ten) which is the same as the dimension of the matrix, dim. 

The mean value of the x 2 is close to the number of dimensions for a photon shower. The shapes of 
the distributions of the selected shower-shape variables depend on the tj and the energy of the incident 
photon. These effects are taken into account in the construction of the H-matrix using single photon 
samples of energies 10 - 1000 GeV generated flat in |t] | and parametrising each of the covariance terms 
in the matrix M of Eq. 4 as a function of the photon energy. The parametrisation as a function of photon 
energy is obtained in each of the 12 Y\ bins. The discrimination power of the H-matrix between real 
photons and jets is well illustrated in Fig. 6, where the x 2 distribution of the H-matrix for the jet sample 
is contrasted to that obtained from photons from H —* 77 decays. 

Since the H-matrix implementation at this time does not include the same variables as the other two 
methods, its performance is currently not directly comparable. Consequently, the performance is not 
reported here, although the method is decribed for completeness. 




n ~ " E T (GeV) 

Figure 7: Efficiency of the calorimeter cuts as a function of pseudorapidity (left) and transverse en- 
ergy (right) of the photons for the distorted geometry. 



Efficiency 


£ (calorimeter cuts) 


£ (track-isolation cut) 


Nominal geometry no pile-up 


(87.6±0.2)% 


(99.0±0.1)% 


Nominal geometry with pile-up 


(86.6±0.5)% 


(98.0±0.2)% 


Distorted geometry with pile-up 


(83.6±0.2)% 


(98.1±0.1)% 



Table 1: Overall efficiency for photons from H — > 77 decays for three different simulation choices. 



4 Photon identification performance for medium-/?^ photons 

This section describes the performance (efficiencies and rejections) of the cut-based method and the Log- 
likelihood ratio method on medium-pr photons, in particular the photons from H — > 77 decays and the 
jet background samples described in Section 2. 
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4.1 Performance of the cut-based method 

In the performance studies presented in this section, all reconstructed electromagnetic objects, including 
both electron and photon candidates are considered. The efficiency as defined in Section 2 includes both 
the reconstruction efficiency and the efficiency of the identification cuts. 

Figure 7 shows the efficiency of the calorimeter cuts for photons with Ej > 25 GeV from H — > yy 
decay as a function of pseudorapidity (left) and transverse energy (right) for events in the presence of the 
pile-up expected at a luminosity of 10 33 cm~ 2 s~ 1 . The optimisation of the cuts for the H —> yy signal has 
led to an efficiency which is uniform for Ej > 40 GeV, but which decreases substantially below 40 GeV 
because of the much larger fake backgrounds from jets expected at these lower transverse energies. The 
average efficiencies of the calorimeter and track-isolation cuts are summarised in Table 1 . 





All 


Quark jets 


Gluon jets 


■/V(jet)/Af(generated events) 


0.23 


0.056 


0.177 




Before isolation cut 


A r (fake)/A r (filtered events) 


(5.43±0.13).10~ 4 


(3.87±0.11).10 4 


(1.44±0.07).10~ 4 


Rejection 


5070±120 


1770±50 


15000±700 




After isolation cut 


N(fake)/Ar (filtered events) 


(3.38±0.10).10- 4 


(2.47±0.08).10- 4 


(0.78±0.49).10- 4 


Rejection 


8160± 250 


2760±100 


27500±2000 



Table 2: Rejection (Equation 2) measured in the inclusive jet sample for Ej > 25 GeV 





All 


Quark jets 


Gluon jets 


A r (jet)/A r (generated events) 


0.042 


0.011 


0.034 




Before isolation cut 


Af(fake)/Af(filtered events) 


(1.16±0.06).10- 4 


(8.3±0.5).10- 5 


(2.8±0.3).10- 5 


Rejection 


4400±230 


1610±100 


15000±1600 




After isolation cut 


Ar(fake)/W(filtered events) 


(6.4±0.4).10- 5 


(4.6±0.5).10~ 5 


(1.5±0.2).10- 5 


Rejection 


7800±540 


2900±240 


28000±4000 



Table 3: Rejection (Equation 2) measured in the inclusive jet sample for Ej > 40 GeV 



The rejection from the pre-filtered jet sample is computed using Equation 2. The rejection is com- 
puted separately for all jets, for quark-initiated jets and for gluon-initiated jets. The quark or gluon 
initiation is defined using the type of the highest Ej partem from the PYTHIA record inside the cone 
AR = 0.4 around the reconstructed jet object. The rejection values are summarised in Table 2 for the 
three categories of jets. A small fraction (« 1-2%) of jet objects are not classified, so the sum of quarks 
and gluons is slightly smaller than the total. A cut Ej > 25 GeV is applied to both reconstructed photons 
and jets. Table 3 shows the same computation, but for Ej > 40 GeV. 

Figure 8 shows the fake rate, defined as the inverse of the rejection, as a function of pseudorapidity for 
all jets with Ej greater than 25 GeV. There is a slight increase of fake rate as a function of pseudorapidity 
due to the increase in material in front of the calorimeter, which imposes somewhat looser cuts to preserve 
a constant efficiency. Some additional increase near \r\ \ = 1.1 is also visible probably coming from the 
reduced energy in the first layer of the hadronic calorimeter as pointed out previously. This effect, 
however, gives a less than 10% increase in the overall fake rate. 
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Figure 8: Fake-photon rate as a function of pseudorapidity in the filtered jet sample 
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Figure 9: £7 spectra from the inclusive jet sample, for the generated jets (solid squares for full simulation 
and solid triangles for uncorrected jets from parametrised fast simulation) and the fake-photon candidates 
before (inverted solid triangles) and after (open circles) the track-isolation cut. The normalisation is that 
predicted by PYTHIA. 



Figure 9 shows the Ej distribution of the jets and of the fake photon candidates before and after the 
track-isolation cut. This figure also shows that the rejection at 25 GeV is ps 30% lower if the normalisa- 
tion is based on the uncorrected parametrised jets from the fast simulation, as was done in Ref. [1 1]. 

Figure 10 shows the 71° content of the fake-photon candidates at three different cut levels; all recon- 
structed electromagnetic objects, after the cut on the hadronic leakage and the second layer shower-shape 
variables (Had+S2) and after all the cuts (Had+Sl+S2). A fake photon is defined as coming from a 7T° if 
the energy of the leading 71° in the cone of 0.2 around the cluster centroid is more than 80% of the recon- 
structed cluster energy. The figure shows already after the second layer shower-shape cuts, the dominant 
background contribution comes from 71° as expected. After all cuts, the fraction of n° is ps 70% of the 
remaining fake-photon candidates. 

Figure 11 shows the rejection of the cuts on the first layer variables for candidates from single 7T 0 's 
passing the cuts on the hadronic leakage and the second layer shower-shape variables. As expected, the 
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Figure 10: Ej distribution of fake-photon candidates in jets after different level of cuts. The contribution 
from "single" 71° is also shown 




Figure 11: Rejection of the strip-layer cuts against fake photons coming from "single" 71° in the jet 
sample as a function of the transverse energy, for three different pseudorapidity regions. 



rejection power against these isolated 7T°'s decreases with energy, as the opening angle between the two 
photons from 71° decays become smaller. The rejection is also better in the central part in the barrel as 
there is less material than in the higher tj part of the barrel, and also opening angle is larger than in the 
end-cap for the same pj . As a cross-check, Fig. 12 shows the efficiency of the calorimeter cuts for single 
photons and single 71° of Ej = 40 GeV, as a function of pseudorapidity. Again, the rejection is slightly 
higher than 3 in the central part of the barrel calorimeter and is in reasonable agreement with findings 
from previous studies [15]. 

The rejections measured in these studies have to be taken with care as they rely strongly on the 
modelling of the fragmentation tail in PYTHIA and the details of the simulation of the detector response. 
A discussion of the first effect can be found in Ref. [8] from which one would expect an uncertainty of 
50 — 100%, and where the uncertainty is larger for gluon initiated jets. In addition, a recent investigation 
on the differences in fragmentation algorithms in PYTHIA and HERWIG shows appreciable differences 
in 7T° production rates. Some differences in rejection are anticipated if the momentum distributions of 
the 7C°'s from the two fragmentation algorithms differ. 

4.2 Performance of the Log-likelihood-ratio method 

The efficiency for the Log-likelihood-ratio (LLR) method is computed for individual photons from the 
H — > 77 events generated with the nominal geometry. Figure 13 shows the photon efficiency as a function 
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Figure 12: Efficiency of calorimeter cuts versus pseudorapidity for 40 GeV Ej single photons and 7T ( 
(distorted geometry without pile-up). 



of pr (left) and X] (right) for LLR cut values set at 8, 9 and 10. The overall efficiencies for LLR cuts 
at 8, 9 and 10 are summarised in Table 4. Jet rejection (left) and photon identification efficiency (right) 
are shown in Fig. 14 as a function of LLR cut parameter values for three different jet pj ranges which 
correspond to the three mean jet pj values indicated. 





E T > 25 GeV 


E T > 40 GeV 


LLR cut 


LLR>8 


LLR>9 


LLR > 10 


LLR> 8 


LLR>9 


LLR > 10 


Efficiency(%) 


87.6±0.3 


84.3 ±0.2 


80.0 ±0.2 


86.4 ±0.3 


83.2±0.2 


79.0±0.2 


Rej.(y+jet) 


1660 ±170 


2190 ±260 


2930 ±390 


1690 ±140 


2170±210 


2650 ±280 


Rej. (di-jet) 


6820 ±440 


8930 ±650 


12430 ±1070 


6780 ±1000 


7800 ±1230 


11550 ±2220 



Table 4: Overall photon efficiencies and jet rejections for different Log-likelihood ratio (LLR) cut values. 



Figure 13 shows the pj -dependence of the photon efficiency. A looser cut on low-pj- photons seems 
to be beneficial in order to retain a flat photon efficiency as a function of pj. Furthermore, it might also 
be useful to parametrise the LLR cut values as a function of photon pj for further optimisation. The 
jet rejection is also /^-dependent as shown in the plot on the left in Fig. 14. A harder cut on LLR for 
varying jet pj can help to keep the rejection constant as a function of pj. 

The rejection for jets from y+jet and di-jet samples are shown in the fourth and fifth rows in Table 4. 
The cuts on the photon and jet pj are 25 GeV and 40 GeV, respectively. The rejection against jets from 
the di-jet samples is significantly higher than that from the y+jet samples. This is largely due to the fact 
that the jets in y+jet events are dominated by quark-initiated jets while those in di-jet events are enriched 
with gluon-initiated jets. 

5 Photon identification performance for high-pj photons 

Searches for particles of very high mass decaying to photons, such as the Randall-Sundrum graviton, G, 
decaying via G — ► yy [2], require excellent detector and particle identification performance in a kine- 
matic region very different from the benchmark H — > yy process. The /^-dependent effect caused by 



106 



Electrons and Photons - Reconstruction and Identification of Photons 




Figure 13: Photon efficiency as a function of pj and r\ for different Log-likelihood ratio (LLR) cuts. The 
photons are from H — > yy decays simulated with the nominal geometry. 




LLR LLR 



Figure 14: Jet rejection (left) and photon efficiency (right) as a function of Log-likelihood ratio (LLR) 
cut-parameter values. 

differences in kinematics can complicate high-mass graviton searches because they modify the shape of 
background distributions as a function of the two-photon invariant mass, m rr . 

The performance of the cut-based identification method for high-pr photons has been investigated. 
Studies of the shower characteristics of the photons in the H -^yy and G -^yy {mo = 500 GeV) processes 
found only minor differences in most of the shower-shape variables. In the absence of the track-isolation 
cut, the photon efficiency as a function of pj in both the barrel and end-cap calorimeters is approximately 
constant above pj = 50 GeV. The barrel and end-cap photon electromagnetic reconstruction efficiencies, 
before applying any identification or isolation cuts, are found to be within 10% of one another for photons 
from graviton decays. After applying photon identification but no isolation requirements, the efficiencies 
are 0.829 ± 0.004 in the barrel and 0.639 ±0.010 in the end-cap calorimeters for p\ > 100 GeV and 
m G > 500 GeV. 

An isolation variable based on the calorimeter energy in a cone of size AR = 0.45 around the cluster 
centroid was studied. The cut on the calorimeter isolation was observed to produce roughly constant 
efficiency as a function of pj. A linearly pj -dependent selection cut was determined for barrel and 
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Figure 15: Photon efficiency in the 500 GeV graviton sample as a function of pj for barrel (left) and 
end-cap (right) calorimeters. 
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Figure 16: Fake-photon rejection as a function of 777 of the reconstructed photon object for high-pj- 
binned di-jet samples in the barrel (left) and end-cap (right) calorimeters. 



end-cap photons independently. The efficiencies of these pj -dependent cuts for barrel and end-cap 
calorimeters are shown in Fig. 15 for photons from 500 GeV graviton decays. As can be seen in the 
figures, these pj -dependent isolation cuts show about a 0.1% reduction in efficiency for photons over 
the entire pj -range. 

The Jet5 and Jet6 high-/^ jet samples discussed in Section 2 were used for rejection studies. Fig- 
ure 16 shows the pj dependence of jet rejection with and without the calorimeter energy isolation cuts. It 
can be seen that while the efficiency loss is small, employing the isolation cut increases rejection across 
the full pj range. In particular, the region below pj = 500 GeV shows a factor 5 — 10 increase in re- 
jection. Table 5 provides the measured rejections in the barrel and end-cap calorimeters using these two 
di-jet samples. 

6 Comparison of the photon identification methods 

Figure 17 shows the rejection and efficiency curves for two of the three currently available photon iden- 
tification methods - the cut-based method and the Log Likelihood Ratio method - for y+jet generated 
in specific photon momentum bins and the benchmark H — ► 77 samples. Similarly, Fig. 1 8 shows the 
rejection and efficiency curves for these methods for di-jet and H — > 77 samples. Tables 6 and 7 provide 
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numerical comparisons of fake-photon rejections for the two methods, for similar photon identification 
efficiencies and for the y+jet and di-jet samples, respectively. 

The y+jet events, with jets dominated by quark-initiated jets, are the source of the largest background 
to the H — ► yy process. It is apparent from Figs. 17 and 18 that the methods demonstrate significantly 
reduced rejections for jets from the y+jet samples than for those from di-jet samples whose jets are pre- 
dominantly from gluons. As discussed in previous sections, this difference in rejection can be attributed 
to the fragmentation differences between the quark and gluon-initiated jets. 

Finally, Figs. 17 and 18 also illustrate that, for equal efficiencies, the Log-likelihood ratio method 
and the cut-based method perform comparably in rejecting jets. 




65 70 75 80 85 90 65 70 75 80 85 90 

Photon efficiency (%) Photon efficiency (%) 



Figure 17: Jet rejection vs photon efficiency for binned y+jet and H —>■ yy benchmark samples for 
pl,p J T et > 25 GeV(left) and p y ,p J T et > 40 GeV(right). 



f A Cut-t 

10 4 r T 



ATLAS ATLAS 

" E T > 25 GeV " E T > 40 GeV 



Photon efficiency (%) 



Figure 18: Jet rejection vs photon efficiency of the two methods for filtered di-jet and H —> yy benchmark 
samples for p y T , p J T et > 25 GeV (left) and p y T , pf > 40 GeV (right). 



7 Conclusions 

This note presents the three photon identification methods developed in ATLAS, the cut-based method, 
the Log-likelihood ratio (LLR)-based method and the covariance-matrix-based method (H-matrix). The 
efficiencies and fake-photon rejections of the first two methods have been measured using fully simulated 
H — ► yy (niH = 120 GeV), y+jet and filtered electromagnetic di-jet samples. The cut-based and LLR 
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methods show similar rejection factors at equal efficiencies. The strength of the continuous methods 
such as the LLR and H-matrix is the ability to vary the cuts on LLR or x 2 values to optimise for specific 
physics analyses. The performance of the cut-based method for very high-^7- photons from Randall- 
Sundrum graviton samples has also been studied and, while the cut selection was optimised at low- 
Pt compared to the signal in the graviton sample, the efficiency remains high. While the currently 
available photon identification methods perform very well in rejecting background, with high efficiency 
in retaining photons, it is of critical importance to study the performance of the methods with beam- 
collision data. 



Region 


Rejection (xlO 3 ) 


Rejection (xlO 3 ) 


Barrel 
End-cap 
Total 


1.54 ±0.05 
0.64 ±0.03 
1.25 ±0.03 


3.85 ±0.03 
1.14 ±0.08 
2.78 ±0.08 



Table 5: Jet rejections obtained using two binned di-jet samples, with the cut-based photon identification 
without (left) and with (right) the track-isolation cut 





E T > 25 GeV 


E T > 40 GeV 




LLR 


Cut-based 


LLR 


Cut-based 


Efficiency (%) 


84.3 ±0.2 


84.5 ±0.2 


87.1 ±0.2 


86.3 ±0.2 


Rejection 


2190 ±250 


1940 ±230 


2170±210 


2030 ±190 



Table 6: Comparison of jet rejection (y+jet sample) versus photon efficiency for the cut-based and LLR 
methods. 





E T > 25 GeV 


E T > 40 GeV 




LLR 


Cut-based 


LLR 


Cut-based 


Efficiency(%) 


84.3 ±0.2 


84.6 ±0.2 


85.5 ±0.2 


86.3 ±0.2 


Rejection 


8930 ±650 


8240 ±270 


9170 ±1570 


9240 ±710 



Table 7: Comparison of jet rejection (di-jet sample) versus photon efficiency with the cut-based method 
and the Log-likelihood (LLR) method. 
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Abstract 

The reconstruction of photon conversions in the ATLAS detector is important 
for improving both the efficiency and the accuracy of the detection of particle 
decays with photon final states, including H — > yy. In this note, the perfor- 
mance of the reconstruction of photon conversions for simulated events of dif- 
ferent types is described, using both standard inside-out tracking and the more 
recently implemented outside-in tracking. 

1 Introduction 

Reconstruction of photon conversions in the ATLAS detector is important for a variety of physics mea- 
surements involving electromagnetic decay products. In particular, the efficiency of detection of particles 
with high-mass di-photon final states, such as the Higgs boson or a heavy graviton, is greatly enhanced 
by efficient conversion reconstruction. Conversion reconstruction will also be used for detector-related 
studies: mapping the locations of the conversion vertices provides a precise localisation of the material 
in the ATLAS inner detector. 

As photons may convert at any point in the tracker in the presence of material, the ability to recon- 
struct conversions will depend strongly on the type of tracking algorithm used. Due to the structure of 
the ATLAS tracker, photons which convert within 300 mm of the beam axis may be reconstructed with a 
high efficiency with standard (inside-out) Si-seeded tracking, while photons which convert further from 
the beam pipe may only be reconstructed using (outside-in) tracks, which begin with TRT seeds with or 
without associated Si hits. Track reconstruction will be discussed in Section 2, while the reconstruction 
of conversion vertices will be discussed in Section 3, and the overall reconstruction of conversions will 
be discussed in Section 4. Applications of photon conversion reconstruction in the case of neutral pion 
decays and low-p T photons as well as the application of conversion reconstruction to the case of high-p r 
physics measurements (such as H — > yy), will be found in Section 5. A summary and concluding remarks 
are found in Section 6. 

1.1 Theory 

The ATLAS detector is designed to measure, among other things, the energies and momenta of photons 
produced in high-energy proton-proton collisions. The photons which are relevant to physics measure- 
ments will have energies in excess of 1 GeV. These photons must pass through the ATLAS tracker 
before depositing their energy in the Liquid Argon Calorimeter. At photon energies above 1 GeV, the 
interaction of the photons with the tracker will be completely dominated by <? + <?~pair production in the 
presence of material, otherwise known as photon conversion. All other interactions between the photons 
and the tracker material, such as Compton or Rayleigh scattering, will have cross-sections which are 
orders of magnitude below that for the photon conversion, and may thus be safely ignored. The leading- 
order Feynman diagrams for photon conversions in the presence of material are shown in Figure 1 . The 
presence of the material is required in order for the conversion to satisfy both energy and momentum 
conservation. 

The cross-section for the conversion of photons in the presence of material is both well understood 
theoretically and thoroughly measured. Work on calculating this cross-section began almost immediately 
after the discovery of the positron by Anderson in 1932 [1]. Bethe and Heitler first gave a relativistic 
treatment of photon conversion in 1934 [2] in which the screening of the nuclear Coulomb field was 
taken into account. A detailed review of the theory regarding photon conversion and the calculation of 
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the conversion cross-section for a variety of materials was given by Tsai in 1974 [3]. A more modern 
treatment of the topic of conversions, including corrections to the Bethe-Heitler formula for photon 
energies above 5 TeV was given by Klein in 2006 [4]. 

For photon energies used in this study (1 GeV and above) the cross-section for the conversion process 
is almost completely independent of the energy of the incident photon, and may be given by the following 
equation [3]: 

In this expression A is the atomic mass of the target given in g/mol, and Na = 6.022 x 10 23 is Avo- 
gadro's number. Xo is known as the radiation length of the material through which the photon passes, 
which for elements heavier than helium may be approximated from the atomic mass A and the atomic 
number Z by the following relation [5] : 

716.4gcm-*A 
Z(Z + l)ln(287\/Z) 

This radiation length is defined such that it is 7/9 of the mean free path for photon conversion. Plots 
showing the total radiation length traversed by photons in the tracker before reaching the calorimeter 
may be found in the next section. 

The differential cross-section for photon conversions of energies of 1 GeV and above in terms of the 
quantity x = (E dectron /E photon ) is [4]: 

^*»-*»- 

This cross-section is symmetric in x and 1 — x, the electron and positron energies, and it implies that 
the momentum of the photon is not simply shared equally between the electron and the positron. Some 
fraction of the photon conversions will be highly asymmetric, and either the electron or the positron 
may be produced with a very low energy. If this energy falls below the threshold required to produce 
a reconstructable track in the ATLAS tracker, then the converted photon will be seen to have only one 
track, and will be difficult to distinguish from a single electron or positron. This problem is more serious 
at lower photon energies, as the proportion of conversions which are asymmetric enough to cause the loss 
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Figure 2: Material in the inner detector as a function of | r\ \ . 

of one of the two tracks increases as the photon energy decreases. The difficulties involved in identifying 
these highly asymmetric single-track conversions will be discussed in a later section. 

1.2 Experimental setup 

In this section a very brief description of the ATLAS tracker and electromagnetic calorimeter is included. 
These are the two sub-systems necessary for the studies relevant to this note. A detailed description of 
the ATLAS detector can be found in the ATLAS detector paper [6] and references therein. 

The ATLAS tracker consists of several co-axial layers immersed in a 2T solenoidal magnetic field. In 
the so-called barrel region, the innermost of these is a pixel detector consisting of three highly segmented 
cylindrical layers surrounded by four stereo-pair silicon microstrip (SCT) layers. In addition to the cylin- 
drical layers forming the barrel, both the pixel and the SCT also have end-caps consisting of disk shaped 
segments used for tracking particles with large pseudorapidities (\r\ \ >1.5). The outermost portion of 
the tracker consists of the Transition Radiation Tracker (TRT), which is comprised of many layers of 
gaseous straw tube elements interleaved with transition radiation material. The TRT is divided into a 
barrel detector, covering the small pseudorapidity region \ r\ \ < 1, and two end-cap detectors covering the 
large pseudorapidity region 1< |tj | <2.1. The lack of TRT detector elements at higher pseudorapidities 
is the reason for all the results presented in this note having a cut-off at |j] | = 2.1. 

The amount of material in the tracker given in radiation lengths as a function of pseudorapidity can 
be seen in Fig. 2 [6]. As mentioned earlier, the probability of a photon converting in any given layer 
is proportional to the amount of material in that layer. Overall, as many as 60 % of the photons will 
convert into an electron-positron pair before reaching the face of the calorimeter [6]. This number varies 
greatly with pseudorapidity as can be seen in Fig. 3 [6], for the case of photons with pj > 1 GeV in 
minimum-bias events. The probability is lowest in the most central region \r\ \ <0.5, where the amount 
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of tracker material is at its minimum. A plot showing the true position of photon conversions in the 
ATLAS tracker, as obtained from a sample of 500,000 simulated minimum-bias events, can be seen in 
Fig. 4 [6]; the three pixel layers and disks as well as the four barrel SCT layers and their corresponding 
end-cap layers can be clearly seen. 




Radius (mm) 



Figure 3: Probability of a photon to have converted as a function of radius for different values of pseu- 
dorapidity. 

Finally, the energies of the electrons resulting from photon conversions are measured in the electro- 
magnetic calorimeter segments. These are lead-liquid argon detectors with accordion-shaped absorbers 
and electrodes. Their fine-grained lateral and longitudinal structure, ensures high reconstructed energy 
resolution for photons with Et>2 — 3 GeV, as described in reference [7]. Although the daughter electron 
tracks and the vertices resulting from the converted photons are reconstructed without any calorimetric 
information, the latter plays a crucial role later in the reconstruction and particle identification process. 

2 Track reconstruction 

The current track reconstruction process consists of two main sequences, the primary inside-out track re- 
construction for charged particle tracks originating from the interaction region and a consecutive outside- 
in track reconstruction for tracks originating later inside the tracker. Both methods reconstruct tracks that 
have both silicon (Si) and transition radiation tracker (TRT) hits and place these tracks in two distinct 
track collections. A third track category contains those tracks that have only TRT hits and no Si hits; 
these TRT-only tracks are placed in their own distinct track collection. All three track collections are 
then examined to remove ambiguities and double counting and are finally merged into a global track col- 
lection to be used later during the vertex-reconstruction phase. For a track to be reconstructed by any of 
these methods, a minimum transverse momentum pj > 0.5 GeV is required throughout. In the following 
section, brief descriptions of the various tracking algorithms are provided. More detailed descriptions, 
in particular of the inside-out tracking, can be found in reference [8]. 
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Figure 4: Location of the inner detector material as obtained from the true positions of simulated photon 
conversions in minimum-bias events. 

2.1 Inside-out track reconstruction 

After the reconstruction of space points inside the pixel and SCT sub-detectors, candidate tracks (seeds) 
are then formed using three space-point combinations. These seeds are subject to some constraints, such 
as the curvature, to limit the number of possible combinations. Seeds which pass these constraints then 
become the starting points for reconstructing tracks. Once a seed has been formed a geometric tool is 
then invoked in order to provide a list of Si-detector elements that should be searched for additional 
hits. A combinatorial Kalman-fitter/smoothing formalism is then used to add successive hits to the track. 
The track information is updated after every step in the search and extraneous outlier hits are efficiently 
eliminated through their large contribution to the % 2 of the track fit. Not all space-point seeds coming 
from Si hits result in a track; the rate at which seeds give rise to a fully reconstructed track is on the order 
of 10% in a typical tt physics event. 

A large fraction of the reconstructed track candidates either share hits, are incomplete, or may be 
fakes resulting from random combinations of hits. It is therefore necessary to evaluate the tracks based 
on a number of quality criteria and score them accordingly, with the score providing an indication of 
the likelihood of a specific track to describe a real particle trajectory. Tracks with the highest score are 
refitted and used as the quality reference for all the remaining tracks. Shared hits are removed in this 
stage and the remaining part of the track is evaluated again and refitted. Track candidates with too many 
shared hits are then discarded as well as any other track candidate that fails to comply with any of the 
quality criteria during evaluation. 

At this stage, each one of the resolved track candidates is assigned a TRT extension. First, a geometric 
extension of the Si track is built inside the TRT and compatible measurements are selected. Possible TRT- 
track extensions are constructed by combining all such TRT measurements. The full track, including any 
TRT extension, is then refitted and scored in a way analogous to that during the previous ambiguity- 
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resolving stage. If the new track has a quality score which is higher than that of the original Si track, 
the TRT extension is kept and added to the Si track, thus creating a "global" Inner Detector track. In 
other cases only the original Si track is kept, without the TRT extension. The final reconstructed tracks, 
with or without TRT extensions, are then stored in a dedicated track collection. At this stage they can be 
classified into three categories: 

1. Tracks without TRT extensions (e.g. |tj | > 2); 

2. Tracks with extensions which are used in the final fit; 

3. Tracks with extensions which are not used in the final fit (outliers). 

This last category is characteristic of tracks that have suffered large material interactions as they propa- 
gated through the tracker material. 

The inside-out track reconstruction (as described in the previous section) is a very powerful technique 
for reconstructing tracks, especially in busy environments where the high granularity of the Si sub- 
detectors (and in particular that of the pixel detector) can provide the necessary resolution for recovering 
the track-hit pattern. However, it may also lead to fake tracks if not carefully implemented. In order to 
reduce the number of fake reconstructed tracks, a minimum number of Si hits is required for a track to 
be reconstructed; in the present implementation of the algorithm this number is seven. This requirement 
immediately leads to a decreased efficiency in reconstructing tracks that originate late inside the tracker, 
i.e. in the SCT Furthermore, tracks which are present only inside the TRT will not be reconstructed at 
all. These tracks can appear in the cases of secondary decays inside the tracker (e.g. K s decays) or during 
photon conversions, the latter being of special interest to this note. 

2.2 Outside-in track reconstruction 

The outside-in track reconstruction (also referred to as back-tracking) can offer a remedy to the ineffi- 
ciency in reconstructing tracks which originate after the pixel detector. 

The starting point for this type of track reconstruction is the TRT, where initial track segments are 
formed using a histogramming technique. The TRT tracker can be divided in two parts, a barrel and an 
end-cap one, the dividing line being at the \r} \ = 0.8 pseudorapidity range. In the R - 0 plane of a TRT 
barrel sector or the R — z plane of a TRT end-cap sector, tracks which originate roughly at the primary 
interaction region appear to follow straight lines (this is exactly true in the second case). These straight- 
line patterns can be characterised by applying the Hough transform [9], which is based on the simple idea 
that in the R — 0(/? — z) plane, a straight line can be parametrised using two variables: (0o> £r) or (0o, c z ) 
respectively, where ct and c z are the corresponding azimuthal and longitudinal curvatures and 0o is the 
initial azimuthal angle. As a result, in a two-dimensional histogram formed by these two parameters, 
TRT straw hits lying on the same straight line will fall within a single cell. Straight lines can therefore be 
detected by scanning for local maxima in these histograms. To improve the accuracy in the longitudinal 
direction, the TRT is divided into 13 pseudorapidity slices on either side of the tj=0 plane. The slice 
size varies, it being smaller around the TRT barrel/end-cap transition region and bigger inside the TRT 
barrel or end-cap regions. The two- variable approximate track parameters can then be used to define a 
new set of geometric divisions inside the TRT, within which all straws that could possibly be crossed 
are included. Using the transformation described in [10], the curved trajectory suggested by the straw 
hits may be transformed into a straight line in a rotated coordinate system. This is the initial step for 
a "local" pattern recognition process, in which the best TRT segment may be chosen as the one that 
crosses the largest number of straws in this straight-line representation. A cut on the minimum number 
of straw hits necessary to consider the segment as valid is applied during this step. A final Kalman-filter 
smoother procedure is then applied to determine as accurately as possible the final track parameters of the 
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Figure 5: Track reconstruction efficiency for conversions from 20 GeV pj photons as a function of the 
conversion radius. The gain in track reconstruction efficiency when tracks reconstructed moving inwards 
from the TRT are combined with tracks reconstructed by the inside-out algorithm, is evident particularly 
at higher radial distances. 

segment. The above TRT-segment reconstruction procedure has been adopted from the original ATLAS 
track reconstruction algorithm xKalman as described in the references [11]. 

The reconstructed TRT segments are then fed into the second step of the back-tracking algorithm 
in which extensions are added to them from the Si sub-detectors. Space-point seeds are searched for in 
narrow R — (j) wedges of the Si tracker, indicated by the transverse TRT-segment track parameters derived 
in the previous step. A minimum of two space points is required in this case, the search being confined 
to the last three SCT layers. To reduce the number of space-point combinations cuts on the curvature 
are then applied, with the third measurement point provided by the first hit in the initial TRT segment. 
As soon as seeds with pairs of space points are formed, the initial-segment track parameters can then 
be significantly improved, especially the longitudinal components. A new geometric section through the 
Si-detector elements is then constructed and a combinatorial Kalman-fitter/smoother technique, as in the 
case of the inside-out tracking, is applied to produce Si-track extension candidates. The Si-track exten- 
sions provide a much improved set of track parameters, which can be used to find new TRT extensions to 
be assigned to every Si-track candidate, thus creating once more a "global" track. Ambiguity resolving 
and track refitting follow afterwards in the appropriate manner. The final set of resolved tracks from this 
process is stored in a dedicated track collection. In order to reduce the time required for the reconstruc- 
tion and minimise double counting, the outside-in tracking procedure excludes all the TRT-straw hits and 
Si-detector space points that have already been assigned to inside-out tracks. The enhancement of the 
track reconstruction efficiency after the outside-in reconstructed tracks are included is shown in Fig. 5. 
Here the track reconstruction efficiency for photon conversions is plotted as a function of the radial dis- 
tance of the conversion for the case of 20 GeV p T single photons, before and after the outside-in tracking 
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is performed. The bulk of the gain in tracking efficiency is, as expected, at larger radii. The inefficiencies 
of this method as a function of radius are discussed further in Section 2.3 and again in Sections 4.3 and 
4.4. Due to the more limited pseudorapidity coverage of the TRT tracker, the outside-in tracking can be 
used to efficiently reconstruct tracks up to a pseudorapidity value of |t]| =2.1. All the results presented 
here have therefore been restricted to within this pseudorapidity range. 

2.3 Stand-alone TRT tracks and final track collection. 




Figure 6: Track reconstruction efficiency for conversions from 20 GeV pj converted photons (left) and 
5 GeV pj converted photons (right) as a function of conversion radius. 




Figure 7: Track reconstruction efficiency for conversions from 20 GeV photons (left) and 5 GeV photons 
(right) as a function of pseudorapidity. 

After the inside-out track collection has been formed, all TRT segments that have not been assigned 
any Si extensions are then used as the basis of one more distinct track collection. These segments are 
first transformed into tracks, and the segment local parameters are used as the basis for producing the 
corresponding track parameters assigned to the surface of the first straw hit. Perigee parameters are also 
computed, but no overall track refitting is performed. These new TRT tracks are then scored and arranged 
accordingly and a final ambiguity resolving is performed in order to reject any tracks that share too many 
straw hits. Finally, these stand-alone TRT tracks are then stored in a special track collection. 
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At the end of the track reconstruction process, and before any primary or secondary vertex fitters are 
called or other post-processing tasks are executed, the three track collections described above are merged. 
One last ambiguity resolving is performed in order to select unique tracks from all three collections, 
although this is mostly for consistency since the straw hits and Si space points associated with the inside- 
out tracks have already been excluded before the outside-in track reconstruction. This merged track 
collection is then used by the photon conversion reconstruction algorithm. 

The overall tracking efficiency after all three track collections discussed above are merged, is shown 
in Fig. 6 for both the case of a 20 GeV p T single photon sample, and also for a 5 GeV single photon 
sample, which is more indicative of the case of low track momenta. Two competing effects become 
apparent as one observes these two plots. The overall track reconstruction efficiency for conversions that 
happen early inside the tracker, i.e. in R < 150 mm, is higher in the case of the 20 GeV pj photons than 
that for the 5 GeV pj ones. This is a clear indication of the larger effect that bremsstrahlung losses have 
on low pr tracks, especially on those that originate early inside the tracker. Furthermore it is possible 
that, depending on the amount of the incurred losses, only part of the track will be reconstructed, i.e. 
its TRT component, with the pattern recognition failing to recover the corresponding Si clusters. The 
small fraction of stand-alone TRT tracks that enhance the track reconstruction efficiency from early 
conversions, is primarily due to this effect. On the other hand the overall track reconstruction efficiency 
at higher radii is much better for the case of the 5 GeV p T photons. This is due to the fact that the 
radius of curvature, being much larger for those tracks, enables them to separate from each other faster 
as they traverse the tracker under the influence of the applied magnetic field. It is therefore easier in this 
case to distinguish the two tracks and reconstruct them during the pattern recognition stage. Figure 7 
shows the track reconstruction efficiency as a function of pseudorapidity, for both 20 GeV and 5 GeV 
pj photons. The overall track reconstruction efficiency is very uniform along the whole pseudorapidity 
range, starting only to significantly fall off as one approaches the limit of the TRT pseudorapidity extent 
(|tj| = 2.1). The reduction in efficiency observed around \r\\ = 1, is due to the gap at the transition from 
the barrel to the end cap TRT. The seemingly higher overall tracking efficiency in this plot compared to 
that in Fig. 6, is due to the fact that the great majority of converted photons originate from the earlier 
layers of the Si tracker. In this region, as Fig. 6 demonstrates, the converted photon track reconstruction 
efficiency is very high. 

3 Vertex fitting 

Track finding is only the first step in reconstructing photon conversions; the next step is being able to 
reconstruct the conversion vertex using the pair of tracks produced by the converted photon. Recon- 
struction of the conversion vertex is quite different from finding the primary interaction vertex, since for 
conversions additional constraints can be applied that directly relate to the fact that the converted photon 
is a massless particle. A specific vertex algorithm, appropriately modified in order to take into account 
the massless nature of the conversion vertex, has been developed for use by the photon conversion algo- 
rithm. 

The vertex fit itself is based on the fast-Kalman filtering method; different robust versions of the 
fitting functional can also be set up in order to reduce the sensitivity to outlying measurements. The 
vertex fitting procedure uses the full 3D information from the input tracks including the complete error 
matrices [12]. 

3.1 Algorithm description 

The goal of a full 3D vertex fit is to obtain the vertex position and track momenta at the vertex for all 
tracks participating in the fit as well as the corresponding error matrices. From the input tracks, the 
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helix perigee parameters defining the particle trajectory along with their weight matrix are extracted as 
described in the references [13, 14]. If one assumes that the particle is created at the vertex V, then the 
trajectory parameters qi may be expressed as a function of the vertex position and the particle momentum 
at this vertex qi = T(V,pi). A vertex is then obtained by minimising: 

X^iiqi-TiV^Oywiiqt-TiVji)), (4) 

!=1 

where w, is the 5 x 5 weight matrix from the track fit. In order to find the V and /5, which minimise the 
above % 2 , equation 4 can be linearised at some convenient point close to the vertex as: 

2 

x 2 = Y,(8qi-Di8V -Eidpi) T Wi{8qi-Di8V -EiSpi), (5) 
i=i 

where D; = (dT(V,pi))/ (dV) and E{ = (dT(V,pi))/ (dpi) are matrices of derivatives. A fast method to 
find a solution that minimises equation 5 has been proposed in the references [13, 14]. It can be shown that 
this method is completely equivalent to a Kalman-filter based approach [15], where the vertex position 
is recalculated after every new track addition. 

If the initial estimation of the vertex position is far from the fitted vertex, then the track perigee 
parameters and the error matrix are extrapolated to the fitted point, all derivatives are recalculated and 
the fitting procedure is repeated. The official tracker extrapolation engine, along with a magnetic field 
description based on the actual measurement of the ATLAS tracker solenoidal field, is used in this case. 

3.2 Vertex fit constraints 

Constraints are included in the vertex fit algorithm via the Langrange multiplier method 
can be viewed as a function 

Aj(V,pi,p2,---,p n ) = const 
which is added to the fitting function of equation 4 as 

X 2 = Xo+ "EVA? ( 7 ) 

Here %o is tne function without constraints, Xj is a Lagrange multiplier and j is the constraint number. 
Ay(...) can be linearised around some point iVo,Poi) to obtain 

Nconst 

X 2 = Xo + I h ■ (A 2 j0 +Hj8V + 8V T H j + F i j8 Pi + 8pjF u ) (8) 

7=1 

where Hj = (dAj) /(dV), F u = (dAj) /(d p t ) , A j0 is an exact value of A; at the (V 0 , pod point, SV = V - Vo 
and 8 pi = pi — poi. 

The solution of equation 8 then has the form V = Vo + V\ , pi = pot + p\u where Vo,poi is the solution 
of the corresponding problem without the constraint x 2 = Xo- ^ ne secon d component Vi,pu of the 
above solution is obtained through the normal Lagrange multiplier system of equations. In the case of 
the conversion vertex, a single angular constraint needs to be implemented. This requires that the two 
tracks produced at the vertex should have an initial difference of zero in their azimuthal and polar angles 
8(j)o,86o = 0. This is a direct consequence of having an initial massless particle, but it has the advantage 
of being much easier to implement. 



. A constraint 
(6) 
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The right-hand plot in Fig. 8 shows the reconstructed photon inverse transverse momentum after 
vertex fitting for conversions where neither of the emitted electrons suffered significant bremsstrahlung 
(less than 20% of the energy of each electron is lost in the inner detector material), while the left-hand plot 
shows the transverse momentum for the cases where significant bremsstrahlung energy losses occurred. 
Similarly, the corresponding radial position resolution for conversions with/without significant energy 
losses due to bremsstrahlung is shown in Fig. 9. Single converted photons with pj = 20 GeV were used 
for the plots above, and the emitted electron tracks were required to have at least two silicon space points. 
The angular constraints 50,50=0, implemented as described earlier, have been used throughout. The 
overall vertex reconstruction efficiency will be discussed in the following section. It is evident that the 
presence of bremsstrahlung significantly deteriorates the performance of the vertex fitter. 
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Figure 8: Reconstructed inverse transverse momentum from 20 GeV pj converted photons with 
(left) and without (right) significant energy losses due to bremsstrahlung. 

As a further check of the performance of the vertex algorithm described in this section, one can apply 
it to the case of K® —> n + K~ decays. The absence of losses due to bremsstrahlung for the pion tracks, as 
well as the non-zero opening angle, provide a good test scenario for the constrained vertex fitting. Instead 
of the angular constraint used in the case of the photon conversions, a straightforward mass constraint is 
implemented in this case. Figure 10 shows the resolution of both the reconstructed l/pr and the radial 
position for 10 GeV pj K® decays. The absence of a bremsstrahlung-related tail in the left-hand plot 
compared to those in Fig. 8 is striking. 

In a direct comparison to the converted photon case, Fig. 1 1 shows the relative l/pr resolution, with 
and without significant bremsstrahlung losses (20%) respectively, for reconstructed 20 GeV pj converted 
photons, together with that for 10 GeV pj K® decays, as a function of the radial distance from the beam 
axis. In the case of the decays, the reconstructed momentum resolution is better than 2% irrespective 
of the radial distance from the beam axis, deteriorating only slightly as one moves away from the beam 
axis. For the case of the photon conversions though, a deterioration in the transverse momentum recon- 
struction resolution due to the presence of bremsstrahlung losses, is clearly observable when compared 
to the K% case. Due to the bremsstrahlung losses, the reconstructed l/pr distribution has a non-gaussian 
shape, characterised by a tail towards the higher l/pr ranges, as shown in Fig. 8. As a result, a gaussian 
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Figure 9: Reconstructed vertex radial positions for 20 GeV pj converted photons, compared to 
their true values, with (left) and without (right) significant energy losses due to bremsstrahlung. 




Figure 10: Overall reconstructed relative l/pr resolution (left) and radial position resolution 
(right) for K° decays (to charged pions) with p T = 10 GeV. Only tracks with at least two sili- 
con space points are used. 



123 



Electrons and Photons - Reconstruction of Photon Conversions 



fit performed on the core of the l/pr distribution, will result in a worse overall reconstructed momentum 
resolution, even in the case of small bremsstrahlung losses, as Fig. 1 1 demonstrates. The effect is even 
more significant if one recalls that the reconstructed converted photons have a transverse momentum 
which is twice that of the decays shown in the same figure. The fact that the photon is a massless par- 
ticle, resulting in an extremely small angular opening of the emitted tracks, makes it also more difficult 
to reconstruct accurately the position of the conversion vertex, as shown in Fig. 12. 
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Figure 1 1 : Reconstructed relative l/pr resolution as a function of radial distance from the beam 
axis for 20 GeV pj converted photons and 10 GeV pj K® decays to charged pions. In the plot on 
the left, only converted photons, where both of the daughter electrons lost less than 20% of their 
energy due to bremsstrahlung, are shown. In the plot on the right, all conversions are included. 



4 Conversion reconstruction 

With the three track collections and the vertex fitting algorithm described in the previous two sections, 
we now have all the necessary tools in place in order to fully reconstruct photons which convert as far as 
800 mm away from the primary interaction point. Beyond that radius, the track reconstruction efficiency 
drops off dramatically due to the lack of a sufficient number of hits in any sub-detector to reliably recon- 
struct the particle trajectory and accurately predict its track parameters. The conversion reconstruction 
algorithm is run within the framework of the overall Inner Detector reconstruction software; it is one of 
the last algorithms run during the post-processing phase. The basic components of the conversion recon- 
struction are: the track selection and subsequent track classification, the formation of pairs of tracks with 
opposite charge, the vertex fitting and reconstruction of photon conversion vertex candidates, and finally 
the reconstruction of single-track conversions. The conversion candidates are then stored in a separate 
vertex collection, to be retrieved and further classified through matching with electromagnetic clusters 
during the next level of the event reconstruction. In the results presented in this section, the reconstruc- 
tion efficiency is estimated for those photon conversions that happen as far as 800 mm away from the 
primary interaction point, emit daughter electrons with each having at least p T = 0.5 GeV and are within 
the |tj| =2.1 pseudorapidity range. This amounts to ~ 77% of the total photons converted inside the 
ATLAS tracker volume in the case of the H — ► yy sample. 

4.1 Track selection 

Only a fraction of the possible track pairs reconstructed by the tracking algorithms and included in the 
final track collection come from converted photons. Although the wrong-track combinations may be 
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Figure 12: Reconstructed radial resolution as a function of radial distance from the beam axis for 
20 GeV pr converted photons and 10 GeV pj decays to charged pions. In the case of the 
converted photons, all daughter electrons regardless of bremsstrahlung losses have been included. 



Cut 


Efficiency 


Rejection 


No Cuts 


0.7378 


1.00 


Impact dO 


0.7334 


1.16 


Impact zO 


0.7316 


1.18 


TR ratio 


0.7119 


2.12 



Table 1: Track selection cuts: cumulative efficiencies and rejection rates are presented. 
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rejected later during the conversion reconstruction process or by physics specific analysis, it is impor- 
tant to remove them as efficiently as possible at an early stage, not least because of the large amount of 
CPU time involved in processing every possible track pair. Cuts on the perigee impact and longitudinal 
track parameters, as well as the transverse momentum, are first applied. Tracks that are most probably 
associated to electrons are then selected by cutting on the probability reconstructed by using the ratio of 
high-threshold TRT hits over the total number of TRT hits on each track. These cuts have been tuned 
using H — > 77 events, with background present due to the underlying event. All the efficiencies and rejec- 
tion factors due to track selection cuts which are quoted in this note refer to this physics sample. Table 1 
shows the performance of these cuts in accepting tracks produced by converted photons and rejecting 
non-conversion related tracks. The starting efficiency of ~ 74% reflects entirely the inefficiency of re- 
constructing all the conversion related tracks during tracking. After applying these cuts, the surviving 
tracks are then arranged into two groups with opposite charges. 

4.2 Track-pair selection 

At this point in the reconstruction process, all possible pairs of tracks with opposite signs are formed and 
further examined. There are three possible types of track pairs: 

1. Pairs in which both tracks have Si hits; 

2. Pairs in which one of the two tracks is a stand-alone TRT track; 

3. Pairs in which both tracks are stand-alone TRT tracks. 



Cut 


Efficiency 


Rejection 


Polar angle 


0.7070 


10.8 


Radial distance between first hits 


0.7049 


12.5 


Minimum distance 


0.6970 


16.5 


Vertex radius 


0.6959 


16.6 


Minimum arc length 


0.6935 


40.3 


Maximum arc length 


0.6890 


111.6 


Distance in z 


0.6870 


111.9 



Table 2: List of cuts employed during the track-pair selection for the three possible types of track 
pairs.The cumulative efficiencies and rejection rates are presented (see text for the definition of the cut 
variables). 

In order to reduce the combinatorial background, a series of cuts are applied during the pair for- 
mation. These are common to all three track-pair types described above, although their actual values 
may differ. Table 2 lists those cuts along with the corresponding efficiencies and rejection factors for 
selecting the correct track pairs and discarding fakes resulting from wrong track combinations. The first 
criterion for accepting a track pair is that the difference in polar angles between the two daughter tracks 
in a conversion should be small, based on the fact that the photon is massless. Furthermore, the distance 
between the first hits of the two tracks in the pair should be reasonably close; this is particularly true 
in the case where both of them are stand-alone TRT tracks. Finally, the distance of minimum approach 
between the two tracks in the pair is checked. An iterative method has been implemented that uses the 
Newton approach to find the set of two points (one on each track) which are closest to each other. The 
distance of minimum approach between the two tracks is then calculated and a cut is applied to reject 
those cases where the tracks fail to come within a specified distance from each other. 
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In order to enhance the performance of the constrained vertex fitter, it is important to begin with 
a reasonable initial estimate of the vertex position. Using the perigee parameters of the two tracks in 
the pair, the corresponding radius of curvature and the centre of curvature of the track-helix projection 
in the R — 0 plane can be derived. As this track-helix projection is circular in the case of a uniform 
magnetic field such as that of the ATLAS tracker, the estimated vertex position can be identified as 
either the point of intersection of two circles, or in the case of non-intersecting circles, as the point of 
minimum approach between two circles. If the two circles do not intersect or approach each other closer 
than a set minimum distance then the pair is discarded. In principle, two circles may intersect at two 
points. Since two tracks originating from a conversion vertex (or any vertex for that matter) should also 
intersect in the R — z plane, the correct intersection point in the R — (j) plane is then chosen to be the one 
which is closer to the point of minimum approach of the two tracks in the R — z plane. The points of 
minimum approach both in the R — 0 and the R — z planes should clearly be sufficiently close to each 
other. If they are separated by more than a set minimum distance, then the track pair is discarded. A 
cut is also applied on the arc length of the R — (j) plane projection of the two track helices between the 
line connecting the centres of curvature of the two circles and the actual intersection points. This arc 
length is required to fall within a specific range which again ideally should tend to be very small. Finally, 
the distance from the track origin (the candidate conversion vertex location) and the actual points of 
intersection should also be small. Only track pairs with intersection or minimum-approach points that 
satisfy the above criteria are further examined. Estimating the initial vertex position allows for a larger 
number of quality criteria of the track pair to be used in the overall selection process. All the cuts applied 
during this step have been tested using the 120 GeV H — > 77 physics sample; the cuts are tuned so that 
at least two orders of magnitude of the combinatorial background can be rejected at this point without 
significant loss in overall conversion reconstruction efficiency. As a consequence, cut values have been 
intentionally kept fairly loose since even correct track pairs could be characterised by less than optimal 
selection quantities. This is especially true in cases where at least one of the two tracks involved has 
only TRT hits resulting in reduced reconstructed track parameter accuracy along the z-axis, or in cases 
where the tracks have suffered substantial bremsstrahlung losses during their propagation through the 
ATLAS tracker. In general, the position of the initially estimated vertex falls within a few millimetres 
of the actual conversion vertex for the correct pair combinations, all deviations being due to the reasons 
mentioned just before. 



Cut 


Efficiency 


Rejection 


Fit convergence 


0.6870 


171.5 


Fit x 2 


0.6710 


288.9 


Invariant mass 


0.6626 


353.9 


Photon pj 


0.6625 


377.1 



Table 3: Post-vertex fit selection cuts: cumulative efficiencies and rejection rates are presented. 



4.3 Vertex fitting 

The original track perigee assigned during the track reconstruction process is set at the primary interac- 
tion point and for the case of photon conversions, especially those that happen far inside the tracker, this 
is a rather poor assignment. Using the initial estimate for the vertex position described previously, we 
can redefine the perigee at this point. The new perigee parameters need to be recomputed by carefully 
extrapolating from the first hit of each track in the pair to this new perigee, taking into account all the 
material encountered on the way. It is these tracks with their newly computed perigee parameters that 
are passed to the vertex fitter. This also has the desirable effect of avoiding long extrapolations during 
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the various iterations of the vertex fitting process, which might lead to distortions due to unaccounted-for 
material effects. At the end of the process the new vertex position along with an error matrix and a 
X 2 value for the fit are computed. A vertex candidate is then reconstructed that also contains the track 
parameters as they are redefined at the fitted conversion vertex. The fit is always successful in the case of 
the correct track pairs, and it often fails otherwise. After the fit is executed, post-selection cuts on the % 2 
of the fit, on the reconstructed photon invariant mass and on the reconstructed photon pj can be applied, 
to reduce even further the wrong pair combinations. These are listed in Table 3. 

The track pair selection and the vertex fitting process result in a reduction in the combinatorial back- 
ground rate by more than two orders of magnitude, with only a rather small loss in overall conversion 
reconstruction efficiency, amounting to ~ 8% in the case of H —>■ yy decays with ma = 120 GeV. A more 
quantitative description of the conversion reconstruction efficiency in such decays is presented in Sec- 
tion 5.3. At this stage of the conversion vertex reconstruction, which is still within the tracking software 
framework, vertices which come from the combinatorial background outnumber the correct conversion 
vertices by almost a factor of six. The main part of this remaining background consists of reconstructed 
vertices where at least one of the participating tracks is not an electron at all. This is primarily due to the 
rather weak particle identification capabilities of the tracker without any access to the electromagnetic 
calorimeter information. Part of this background can be reduced by some more stringent requirements on 
the reconstructed conversion vertices after the constrained fit is preformed. But effective improvement is 
only expected during the subsequent stages of the photon conversion reconstruction, when information 
from the calorimeter becomes available. Use of the electromagnetic calorimeter should also help to re- 
duce a different type of combinatorial background originating when two electrons from different sources 
are combined in order to form a track pair. Recent studies indicate significant reduction of both types 
of the combinatorial background, both by applying tighter vertex selection criteria after the vertex fit is 
performed and by using the electromagnetic calorimeter information, although they are beyond the scope 
of this note. The possibility of using the reconstructed photon p T in order to reduce the number of recon- 
structed fake vertices, is worth investigating. Figure 13 shows the p T distribution of the reconstructed 
conversion vertices along with the distribution for fake vertices resulting from wrong combinations. It 
is clear that the latter tend to concentrate at the lower pj region. Nevertheless a final cut on the recon- 
structed photon pj will not be as efficient as expected, due to the limited ability at present to correct 
the reconstructed track momentum for losses due to bremsstrahlung. This is evident in the figure when 
comparing the reconstructed converted photon pj distribution with (top row) and without (bottom row) 
significant bremsstrahlung losses. It becomes even more striking once it is compared to the truth pj 
distribution of the converted photon. In the remaining part of this section, the overall performance of 
the conversion reconstruction software, without utilising the electromagnetic calorimeter information, is 
examined in the case of single 20 GeV pj photons, where the combinatorial background is minimal. 

Figure 14 shows the track, track-pair, and vertex reconstruction efficiencies for conversions coming 
from 20 GeV pj photons as a function of both conversion radius and pseudorapidity. Both the track 
and track pair efficiencies shown in the figure are measured before any of the selection criteria described 
above are applied. The large drop in the efficiency at R > 400 mm is primarily due to the inefficiency 
of reconstructing both tracks in the track pair from the photon conversion. It is noteworthy that both 
the track and the conversion vertex reconstruction efficiency are essentially constant as a function of 
pseudorapidity. For completeness, Fig. 15 shows the slightly different version of the left-hand plot 
in Fig. 14 as published in Ref. [6]. 

Finally, Fig. 16 shows the overall vertex reconstruction efficiency for converted photons with low 
transverse momenta as a function of conversion radial position. The two competing effects, the brems- 
strahlung losses that affect more severely the low pj tracks, and the higher radii of curvature that result 
in increased resolving ability of the smaller pj tracks, that were discussed in Section 2.3, are once more 
evident here. 
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Figure 13: Transverse momentum distribution of reconstructed photon conversions for both cor- 
rect and wrong track pairs for all three types of pairs: Silicon-Silicon (Si, left column), TRT-TRT 
(Trt, centre column), and Silicon-TRT (ST, right column). In the top row all electron tracks re- 
gardless of bremsstrahlung energy losses are considered for the case of the correct track pairs. In 
the bottom row only track pairs where both electrons have lost less than 20% of their energy due 
to bremsstrahlung are shown. For comparison the truth pj of the converted photon is also shown. 



4.4 Single-track conversions 

Due to conversions which decay asymmetrically (as described in Section 1.1), as well as cases where the 
conversion happens so late that the two tracks are essentially merged, there are a significant number of 
conversions where only one of the two tracks from the photon conversion is reconstructed. Depending 
on the photon momentum scale, these "single-track" conversions become the majority of the cases for 
conversions that happen late in the tracker and especially inside the TRT. The ability of the TRT to resolve 
the hits from the two tracks is limited, especially if those tracks do not traverse a long enough distance 
inside the tracker for them to become fully separated. As a result, only one track is reconstructed, but it 
will still be highly desirable to recover these photon conversions. 

At the end of the vertex fitting process, all of the tracks that have been included in a pair that success- 
fully resulted in a new photon conversion vertex candidate, are marked as "assigned" to a vertex. The 
remaining tracks are then examined once more on an individual basis in order to determine whether or 
not they can be considered as products of a photon conversion. For a track to be considered, it should 
have its first hit beyond the pixel vertexing layer. Furthermore, the track should be electron-like, where 
again the probability reconstructed by using the ratio of the high threshold TRT hits over the total number 
of TRT hits (as in the initial track selection described earlier in this section, but requiring a higher value) 
is used to select likely electron tracks. At the end of this selection tracks wrongly identified as emerging 
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Figure 14: Conversion reconstruction efficiency for conversions from 20 GeV pj photons as a function 
of conversion radius (left) and pseudorapidity (right). The solid histograms show the track reconstruction 
efficiency, the dashed histograms show the track-pair reconstruction efficiency, and the points with error 
bars show the conversion vertex reconstruction efficiency. 




Figure 15: Conversion reconstruction efficiency for conversions coming from 20 GeV pj photons as a 
function of conversion radius. The solid histogram shows the track reconstruction efficiency, the dashed 
histogram shows the track-pair reconstruction efficiency, and the points with error bars show the conver- 
sion vertex reconstruction efficiency as published in Ref. [6]. 
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Figure 16: Conversion vertex reconstruction efficiency as a function of conversion radius for photons 
with transverse energy of 2 and 5 GeV. 




Figure 17: Reconstruction efficiencies for conversions from 20 GeV pj photons as a function of con- 
version radius (left) and pseudorapidity (right). The points with error bars show the total reconstruction 
efficiency, the solid histograms show the conversion vertex reconstruction efficiency, and the dashed 
histograms show the single-track conversion reconstruction efficiency. 
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Figure 18: Conversion reconstruction efficiency for conversions coming from 20 GeV pj photons as a 
function of conversion radius. The points with error bars show the total reconstruction efficiency, the 
solid histogram shows the conversion vertex reconstruction efficiency, and the dashed histogram shows 
the single-track conversion reconstruction efficiency as published in Ref. [6]. 

from photon conversions, outnumber the actual photon conversion electron tracks, by almost a factor of 
two. These are tracks which are not electrons at all, misidentified as such due to the inherent weakness 
of the particle identification process without the presence of any information from the electromagnetic 
calorimeter. 

A conversion vertex candidate is then reconstructed at the position of the first track hit. It is clear that, 
especially in the case where the first hit is inside the Si part of the tracker, the position of the conversion 
vertex reconstructed in this way can be off by as much as a detector layer. This discrepancy is normally 
much smaller in the case of a vertex inside the TRT due to the higher straw density. On the technical 
side, this type of reconstruction requires a careful transformation of the local track parameters and error 
matrix into global ones that are directly assigned to the newly defined vertex. A new vertex candidate is 
then stored, identical in structure to the one derived from a vertex fit with the important difference that 
it has only one track assigned to it. The effect of including the single-track conversions into the overall 
conversion reconstruction efficiency is significant as is shown in Fig. 17. The plot shows the conver- 
sion reconstruction efficiency for 20 GeV pj photons as a function of both radius and pseudorapidity. 
As expected, the single-track conversions become more and more dominant at higher radial positions, 
and single-track conversions are fairly uniformly distributed across the full pseudorapidity range. For 
completeness, Fig. 18 shows the slightly different version of the left-hand plot in Fig. 17 as published 
in Ref. [6]. While it is not possible to reconstruct the two merged tracks in these single-track conversions, 
it should be possible to separate such cases from very asymmetric conversions with the lower-energy part- 
ner of the pair not reconstructed: the transition radiation information should correspond on average to 
that expected from two electrons and the drift-time information should be inconsistent with that expected 
from a single track (resulting in a significant fraction of unused drift circles in the track fit). 
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5 Physics applications: low-pr conversions, 7/ 7T° separation, H — > 77 

In this section some interesting applications of the usage of the photon conversions are presented. Only 
results from photon conversions where both of the daughter electron tracks have been reconstructed are 
included. It needs to be stressed at this point, that everything that is presented here is meant only as an 
application example and that nofull-scale analysis has been made. 

5.1 Low-pr photon conversions 

Of particular interest during initial data taking is the use of the reconstruction of converted photons as 
a tool to obtain a measurement of the amount of material inside the ATLAS tracker, including passive 
material. The abundance of low-p^ neutral pions in minimum bias events represents a very rich source of 
photons and makes this approach particularly promising. The number of photon conversions measured on 
a detector volume of known x/X 0 can be used as a normalisation point to extract the amount of material 
at any other location inside the detector by counting the relative number of conversions occurring in that 
portion. To obtain an unbiased map of the tracker material it is necessary to correct the measured number 
of conversions by the conversion reconstruction efficiency. Several methods are being investigated to 
measure this efficiency from data, e.g. embedding Monte Carlo photon conversions in data or extracting 
it from the measure of decays with similar topology like K® — ► K + K~ . 




Figure 19: Reconstructed radial positions for conversions of 5 GeV pj photons. The black his- 
togram shows the truth radial position of the conversion vertices, and the gray histogram shows 
the radial positions of the reconstructed vertices, regardless of the bremsstrahlung losses of their 
daughter electrons. 

Figure 19 shows the reconstructed radial positions of photon conversions with 5 GeV pj. A few 
structures may be identified: the initial peak caused by the beampipe, the three layers of the pixel detec- 
tor and then with lower resolution and significance the SCT layers and the TRT The observed smearing 
of the reconstructed position of the conversion vertex is mainly due to bremsstrahlung effects. The po- 
sition resolutions (in the radial direction) of the reconstructed conversion vertex, for photon conversions 
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produced by the decay of neutral pions with various energies, are shown in Fig. 20 as a function of the dis- 
tance from the beam axis. All conversions regardless of the amount of energy lost due to bremsstrahlung 
by the daughter electrons, have been used. In the case of the lower pj neutral pions, more relevant in the 
case of minimum bias events, the radial position resolution improves somewhat, as might be expected 
from the larger angular separation between the produced electrons. On the other hand the use of low 
Pt tracks can be limited by the lower tracking efficiency caused by multiple scattering and especially 
bremsstrahlung. 

In order to be able to determine the amount of material at a given position, it is necessary to compare 
the number of reconstructed converted photons at that position with the number of conversions recon- 
structed at the position of some reference point. This necessitates being able to resolve the position of 
the reference, which may not be trivial. 
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Figure 20: Reconstructed radial position resolution for converted photons produced by the decay 
of neutral pions with various energies. 



5.2 y/n° Separation 

Another application of conversion reconstruction is the possibility of using the converted photons to 
identify, and subsequently remove, neutral pions in which at least one of the photons resulting from 
the decay of the pion has converted. Low multiplicity pions constitute the dominant background to the 
photon signal after all the calorimeter-specific cuts have been applied during photon identification [16]. 
In the case of converted photons from 71° decays, additional handles could be derived as soon as their 
reconstructed transverse momentum is made available. About 30% of the neutral pions will have at 
least one of their daughter photons converted and subsequently reconstructed as such, thus providing an 
estimate of their pj. 

The transverse momentum reconstruction resolution is important when attempting to use conversions 
to identify low pj neutral pions. The ratio of the reconstructed pj of a converted photon inside the 
ATLAS tracker to the Ej measured by the electromagnetic calorimeter is different for photons from n° 
decays and for prompt photons. Figure 21 shows such distributions for the case of converted 20 GeV 
Pt single photons and converted photons from the decay of a 20 GeV pj neutral pion. The photon pj 



134 



Electrons and Photons - Reconstruction of Photon Conversions 



J 


No brem cleaning 

Of j 


W3Mm 


111 


<0.6 P/ E t 




brem < 20 /o | 




CV 1 


111 


<0.6 P/ E t 




Figure 21: Pt/Ej distribution for 20 GeV converted photons and for photons from a 20 GeV 71°. 
The top row shows the distribution for all photons irrespective of the daughter electron energy losses 
due to bremsstrahlung. The bottom row shows the distribution only for those photon conversions in 
which the daughter electrons have lost less than 20% of their energy to bremsstrahlung. Three different 
pseudorapidity ranges are shown, corresponding to the barrel (left), the barrel/end-cap transition (centre) 
and the end-cap (right) regions. 

shown is that reconstructed by the conversion algorithm, while the Ej shown is taken from the truth value 
from the simulation. Three regions in pseudorapidity are shown separately, namely those corresponding 
approximately to the tracker barrel, barrel/end-cap transition and end-cap regions. The top row of plots 
include all converted photons, irrespective of losses due to bremsstrahlung of their daugter electrons, 
while the bottom row has only those converted photons where both of their daughter electrons have lost 
< 20% of their energy due to bremsstrahlung. Clearly the distinction between conversions from single 
photons and conversions from photons produced in neutral pion decays is less pronounced in the case 
of strong bremsstrahlung losses, although an effective bremsstrahlung recovery mechanism should be 
able to significantly improve the separation between the two distributions. A certain degredation is also 
evident as we move from the barrel to the end-cap tracker, due to the less accurate reconstruction of the 
transverse momentum of the daughter electron tracks at higher pseudorapidities. Figure 22 shows the 
fraction of remaining n° particles as a function of converted photon efficiency, both with and without 
significant losses due to bremsstrahlung. The overall 71° rejection corresponding to a photon acceptance 
of 90 % for the three different pseudorapidity regions, as described above, is shown in Fig. 23. Again 
a distinction is made for the cases with and without significant energy losses due to bremsstrahlung. 
Although reduced, the discriminatory power against 71° is significant even when severe losses due to 
bremsstrahlung are present. 
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Figure 22: Fraction of remaining n° as a function of converted photon efficiency with (left) and without 
(right) significant bremsstrahlung losses of the corresponding daughter electrons, for three pseudorapid- 
ity regions as described in the text. 
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Figure 23: Rejection factors against 71° corresponding to photon acceptance efficiencies of 90 %, with 
and without significant energy losses due to bremsstrahlung for the three pseudorapidity regions de- 
scribed in the text. The results are shown for converted photons and 7T° with a p T of 20 GeV. 
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5.3 H — > 77 decays 

As mentioned in the introduction, the recovery of converted photons is of primary importance in the 
search for physics processes in which photons are the primary decay product. In particular, accurate 
reconstruction of the H — > 77 process is heavily dependent on the ability to properly reconstruct photon 
conversions for the following reasons: 

1 . A significant fraction of photons will convert inside the ATLAS tracker volume. Efficient recon- 
struction of these photons will enhance the signal statistics for this process. 

2. Photon identification, using a combination of inner detector and electromagnetic calorimeter se- 
lection criteria, will be improved with effective conversion reconstruction. Even single-track con- 
versions will be useful in this context. 

3. The electromagnetic calorimeter calibration will be significantly enhanced when converted pho- 
tons are identified as such. Again, even single-track conversions will be very useful. 

4. The ability to accurately point back to the mother Higgs particle is dramatically enhanced for the 
case of reconstructed converted photons where both daughter electron tracks are properly recov- 
ered. 



It is important, therefore, to investigate the performance of the conversion reconstruction strategy in 
this case, not least because of the higher transverse momenta which characterise the photons produced 
in H — >• 77 decays. Decays to photon pairs from a Standard Model Higgs boson with 120 GeV mass have 
been studied throughout this section. 




Figure 24: Track, track-pair, and vertex reconstruction efficiencies for converted photons from H — > 77 
decays with m H = 120 GeV, as a function of radial distance from the beam axis (left) and pseudorapidity 
(right). The efficiency reduction at |tj| ~ 0.8, is due to the track reconstruction inefficiencies in the gap 
region between the TRT barrel and end-cap detectors. 

Figure 24 shows the converted photon reconstruction efficiency as a function of both radius and pseu- 
dorapidity for photons coming from H — > 77 decays. The reduced efficiency at higher radii is primarily 
due to the smaller distance which the produced electron tracks travel inside the magnetic field, reducing 
the separation between them. The conversion reconstruction efficiency as a function of pseudorapidity is 
fairly flat, independent of the material distribution inside the ATLAS tracker, as expected. The effect of 
including the single-track conversions into the overall conversion reconstruction efficiency is also signif- 
icant for H — >• 77 decays with large conversion radius and over the full pseudorapidity range, as shown 
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Figure 25: Reconstruction efficiencies for converted photons from H — > 77 decays with m# = 120 GeV, 
as a function of conversion radius (left) and pseudorapidity (right). The points with error bars show the 
total reconstruction efficiency, the solid histograms show the conversion vertex reconstruction efficiency, 
and the dashed histograms show the single-track conversion reconstruction efficiency. 




Figure 26: Reconstructed vertex radial position resolution (in mm) for converted photons from H — > 77 
decays with mn = 120 GeV. For comparison, the two cases where the participating tracks have lost 
> 20% (< 20%) of their energy due to bremsstrahlung are also shown separately. 
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Figure 27: Reconstructed polar angle resolution (in radians) for converted photons from H — > 77 decays 
with m H = 120 GeV. 

in Fig. 25. The reconstructed conversion vertex radial position resolution is shown in Fig. 26 for recon- 
structed converted Higgs photon vertices where the participating tracks have lost > 20% (< 20%) of their 
energy due to bremsstrahlung, along with all vertices put together. The results are fairly comparable to 
the ones shown for single photons in Section 4, despite the fact that the resulting photon momenta in this 
case are on average at least a factor of two bigger and the fact that the presence of the underlying event 
causes additional complications for the track reconstruction. 

Of particular interest for the reconstruction of the Higgs invariant mass is the resolution on the mea- 
surement of the polar angle of the reconstructed converted photon. This is shown in Fig. 27 for the case 
of conversions where both electron tracks have Si hits. These account for ~ 58% of the reconstructed 
converted photons inside the ATLAS tracker volume. The resulting resolution is of the order of 0.5 mrad 
regardless of the transverse momentum of the converted photon. This is an improvement of at least an or- 
der of magnitude with respect to the polar angle resolution derived using the electromagnetic calorimeter 
response [6]. 

6 Summary and conclusions 

This note has described and presented a detailed performance evaluation of the conversion reconstruc- 
tion algorithm which will be used to reconstruct and study early data at the LHC. All three types track 
collections delivered by the tracking software have been combined and used. A dedicated vertex fit al- 
gorithm has been developed for the purpose of reconstructing converted photon vertices. Special care 
has been given to flagging possible conversions where only one of the produced electron tracks has been 
reconstructed (or where the two tracks are merged into one in the case of late conversions). Combining 
all of these tools, a reconstruction efficiency of almost 80% has been achieved for conversions that occur 
up to a distance of 800 mm from the beam axis. A transverse momentum reconstruction resolution of the 
order of 5% has been found for converted single photons of various energies. This has also been shown 
to be valid for the case of photons produced by the decay of a Standard Model Higgs boson with a mass 
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of 120 GeV, as well as for those coming from the decay of low pj neutral pions. The position resolution 
is found to be better than 5 mm in the radial direction, making this a promising method for mapping the 
material inside the ATLAS inner detector. The angular resolution is found to be below 0.6 mrad, giving 
effective pointing to converted photons from physics processes. 
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Abstract 

This note discusses the reconstruction of J/y and T decays to electron pairs 
based on ATLAS Monte Carlo simulated signal and background samples. The 
possible trigger strategies are described, one geared to select two low-energy 
electromagnetic objects in direct production, the second one taking advantage 
of the possible presence of a muon in the final state in bb production followed 
by the decay of one &-quark to J/y + X. The low-energy electrons are re- 
constructed using a dedicated algorithm seeded by a track reconstructed in the 
inner detector and identified combining information from the inner detector 
and the electromagnetic calorimeter. The performance of this algorithm is pre- 
sented and the potential of using such events for early LHC data studies is 
investigated. 

1 Introduction 

When switched on, the LHC will produce charm and beauty quarks in abundance which will be col- 
lected by the ATLAS experiment [1], even during the low luminosity periods. The number of produced 
quarkonium states such as J/y and T, important for many physics studies, will be equally numerous. 
On average, one in every hundred collisions will contain a bb pair. The large bb cross-section and the 
high luminosity of the machine give therefore a high rate for B-hadrons, making B-physics an interesting 
and competitive subject at the LHC. Low energy resonances, such as J/y and T will be one of the main 
sources of isolated electrons in the early data. Both the J/y and T signal samples are important for un- 
derstanding the production of prompt quarkonia. But there is another aspect which is the main focus of 
this note: these samples are ideal to study the performances of trigger and offline reconstruction at low 
energies as well as being potentially useful for the in-situ calibration of the electromagnetic calorimeter. 

This note is organised as follows: Section 2 gives a description of the data-samples used in this 
note, Section 3 details the trigger selections, and Section 4 describes the offline electron reconstruction 
and identification procedure. Finally, in Section 5, the physics potential of these channels is explored 
with initial data, assuming an instantaneous luminosity of 10 31 cnr 2 s _1 and an integrated luminosity 
of 100 pb- 1 . 

2 Data samples 

The different data samples used in this study are summarised in Table 1. The total cross-sections for 
charm production at LHC is 7.8 mb and the one for bottom production is 0.5 mb. Quarkonium production 
was originally described by the colour singlet model which failed to reproduce the direct J/y production 
cross section measured by the CDF experiment [2]. The colour octet model [3] was proposed as a 
solution to this quarkonium deficit. Direct quarkonia Monte Carlo samples comprise of directly produced 
J/y or T in colour singlet and octet states, along with promptly-produced #'s, which decay into J/y's or 
T's [4] [5]. The inclusive production cross sections of J/y and T are respectively ~ 90/xb and ~ 0.7/xb. 
A minimum transverse momentum of 3 GeV and a pseudo-rapidity r\ <2.7 are required for the two 
electrons. The resulting cross sections for the used data samples are respectively ~ 117nb and ~ 47nb. 
Another sample used in this study is originated from Drell-Yan production. In addition to the electron 
filter also applied to the J/y and T samples, the generated di-electron invariant mass m ee (DY) has to be 
1 < m ee (DY) < 60 GeV. Studies also include non-diffractive minimum-bias events with a total assumed 
cross section of 70 mb. 
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Table 1 : D ata samples: process, production cross-section and total number of events available. 



Process 


Cross section 


Number of events (x 10 3 ) 


Direct production 


pp -»• J/yr(e3e3)X 
pp -^T(e3e3)X 
pp -► Drell-Yan(<?3<?3) 
minimum-bias 


116.3 nb 
47.6 nb 
2.9 nb 
70 mb 


160 
150 
250 
1,000 


bb production 


bB d ^n(6)J/xir(e2e2)X 


0.2 nb 


50 



For the production via the decay of bb, only J/yr events are considered. The signal sample is made 
of bB d — >• n(6)J/\j/(ee) +X events, where the ju(6) refers to a muon coming from the b quark with a 
transverse momentum above 6 GeV. A minimum transverse momentum threshold of 2 GeV is applied to 
the generated electrons. 

The simulated data have been produced for the ATLAS Computing System Commissioning [6]. All 
samples have been generated using the Pythia 6.403 [7] Monte Carlo event generator. More details on 
the Monte Carlo generators used can be found in [8]. Data have been simulated using GEANT4 [9], with 
the ATLAS software ATHENA [10], with a realistic geometry including material distortions in front of 
the electromagnetic calorimeter. Studies presented here correspond to very early data taking, with an 
initial luminosity of 10 31 cm~ 2 s _1 . No pile-up has been included. Detailed information about these 
samples is given in Table 1 . 

Signal electrons come from J/yr and T decays 1 . The background electrons arise from other direct 
(b — > e, c — > e) and cascade (b —>■ c — > e) semileptonic decays of meson with an electron in the final 
state and b — > x — > e~ decays 2 . Other background electrons arise from 71° Dalitz decays, /-conversions 
occurring in the inner detector and decays of light hadrons. Distributions of generator level transverse 
momentum pj and pseudorapidity r\ for electrons and pions are shown for the pp —>■ J/yr (ee) +X 
sample on Fig. 1. The r\ distribution of electrons from conversion reflects the amount of material in front 
of the electromagnetic calorimeter. Ref. [13] details the reconstruction of such electrons. Table 2 gives 
the mean pj for each population in the different samples. 

Fig. 2 shows the distance AR, at generator-level, between the two signal electrons from J/yr. On av- 
erage, electrons from direct reconstructed J/y/(e3e3) are separated by AR = 0.7, and are restricted from 
being produced at separations larger than 1.1. Electrons from J/yr originated from B hadrons on the con- 
trary are on average more collimated, with a mean AR = 0.6 and with a larger spread. In comparison, the 
higher mass of of T requires the electrons to have a much larger opening angle, with a broad distribution 
in AR, the two electrons being almost back-to-back. 

3 Trigger selection 
3.1 General requirements 

ATLAS has a three level trigger system which reduces the 40 MHz bunch crossing rate to about 200 Hz 
to be recorded. The first level (LI) is a hardware-based trigger which makes a fast decision (in 2.5 /xs) 
about which events are of interest for further processing, with a rate reduced down to below 40 kHz in its 

^he corresponding branching ratio [1 1] is Br(J -> ee) = (5.94 ± 0.06)% and Br(X -> ee) = (2.38 ±0.11)%. 
2 The corresponding branching ratios [12] aicBr(b -> l~) = (10.71 ±0.22)%, Br(b -> c -> 1+) = (8.01 ±0.18)%, Br(b -> 
£-►/-) = (1.62+°^)%, Br(b — > T — > e~) = (0.419±0.055)% and Br{b -> (J/yrJ) -> ee) = (0.072 ±0.006)%. 
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Figure 1: Normalised distributions of generator-level transverse momentum pj (left) and pseudorapidity 
| Tj | (right) in the pp — > J / yX sample are shown for signal electrons (hatched histograms), electrons from 
conversions (dotted line histogram), and pions (plain histograms). 




AR 



Figure 2: Distance AT? at generator-level between the two signal electrons for direct J/y events (top left), 
direct T (top right) and J/\\r from b decays (bottom). 

initial implementation. Coarse granularity information from the calorimeter and muon trigger systems 
are used at this stage of the trigger to identify regions of the detector which contain interesting signals 
corresponding to, for instance, electrons, muons, taus, and jets. These are called "Regions of Interest" 
(Rols) and are used to guide the later stages of the trigger reconstruction. The high level trigger (HLT) 
is software-based and is split into two levels. At level 2 (L2) the full granularity of the detector is used 
to confirm the LI signals and then to combine information from different sub-detectors within the Rols 
identified at LI. Fast algorithms are used for the reconstruction at this stage and the rate is reduced to 
1 — 2kHz with an average execution time of about 40ms. Lastly, at the event filter (EF), the whole event is 
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Table 2: Mean generator-level pj (in GeV) for electrons and pions having pj > 2 GeV. Typical RMS on 
distributions of Fig. 1 is 1.4 GeV. 



sample 


electrons 


pions 


signal 


B and D hadrons 


/-conversions and 71° Dalitz 


pp J/yX 


4.7 


4.7 


4.3 


4.4 


pp^TX 


4.8 


4.5 


4.0 


4.2 


pp — > Drell-Yan 




5.9 


4.9 


4.7 


minimum bias 




4.2 


4.2 


4.2 


Z?B d n(6)J/yfX 


6.3 


5.5 


4.9 


5.1 



available and "offline-like" algorithms are used along with better alignment and calibration information 
to form a final decision whether or not an event is accepted. With an execution time of about 4s, the rate 
is reduced to 200 Hz. 

The expected ATLAS trigger performance at an initial luminosity of 10 31 cm -2 s _1 is studied using 
the samples described in the previous section. Two trigger menus are considered here: the first is a 
purely electromagnetic menu which could be used only for early data taking; the second menu relies on 
the B-trigger and could be extended for data taking at low luminosity 10 33 cm~ 2 s _1 . More details about 
the overall trigger strategy, in particular for these channels, can be obtained in [14] and [15]. 




Figure 3: Distribution of the generator-level transverse momentum of the less energetic electron versus 
the transverse momentum of the most energetic electron in the direct J/y (left) and T (right) decays. 



3.2 Electron selection 

J/y —>■ ee and T —>■ ee events are very demanding for the trigger system. Due to their relatively low 
masses, the electrons produced in the J/y and T decays are very soft. Figure 3 shows the distribution of 
the transverse momentum of the less energetic electron versus the transverse momentum of the most en- 
ergetic electron in J/y (left) and T (right) decays [5]. This poses a huge challenge for the LI calorimeter 
trigger. Its performance at the low-energy end is limited by the noise of typically 0.5 GeV per Rol and a 
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3 GeV threshold is the limit of what is feasible for the LI trigger. The 6.5 kHz LI output rate for 2EM3 
(corresponding to two LI electromagnetic clusters greater than 3 GeV ) makes it one of the biggest con- 
sumers of the total bandwidth [14]. The strategy to trigger on Jly and T — > ee events is based on low E T 
LI electromagnetic Rols and further electron identification using calorimeter and inner detector informa- 
tion at the HLT The inner detector tracks are reconstructed in regions of half-size Atj x A(p = 0.1 x 0.1 
around these electromagnetic Rols. 

Figure 4 shows the distribution of the invariant mass of the pairs of electrons for signal and back- 
ground events after the LI selection. The Jly and T samples can be easily recognised by the resonance 
peaks. Table 3 gives the number of events which are expected to pass the LI selection. The LI trigger 
efficiency is calculated with respect to the number of generated events, which in particular include a 
requirement on the minimum transverse momentum of 3 GeV on each electron as detailed in Section2. 
The selected events are in the tail of the J/y distribution (see Figure 3). Additionally the requirement 
of E T > 3 GeV at LI implies E T >= 4 GeV thus starting to cut into the peak of the T distribution. The 
efficiency of this level is measured to be 27% for Jly and T events. About 43% of Drell-Yan events 
pass the LI in the generated mass range. A total of 4.2 x 10 6 Jly ->ee, 1.2 x 10 6 T -^ee and 0.12 x 10 6 
Drell-Yan events are expected after this level. For the minimum bias sample, the enhancement above 




0 5 10 15 20 25 30 35 40 
m ee (GeV) 



Figure 4: Expected differential cross section for low-mass electron pairs using the 2EM3 trigger menu 
item after LI selection for J/y decays (dotted histogram), T decays (dashed histogram), Drell-Yan pro- 
duction (solid histogram) and expected background (full circles). The invariant mass is reconstructed 
with calorimeter only information available at LI. 

6 GeV arises from the requirement of the presence of two LI clusters with energy greater than 3 GeV. 
Studies and implementation of an efficient HLT selection, based on the selection of two electrons with 
E T > 5 GeV (2e5 menu) is ongoing. Typical rates are expected to be ~ 40 Hz at L2 and 6 Hz at EE 



Table 3: Performance of the 2EM3 trigger at the luminosity of 10 31 cm~ 2 s _1 for the direct production 
of J/y, T, Drell-Yan and background events. For signal events the efficiency e is given as well as the 
number of expected events. For background the rate is provided. Quoted errors are statistical only. 





J/y 


r 


Drell-Yan 


background 




£(%) 


10 6 ev / 


e(%) 


10 b ev / 


£(%) 


10 6 ev 


Rate 






100 pb" 1 




100 pb" 1 




100 pb" 1 


(Hz) 


LI 


27.4±0.3 


4.17±0.04 


27.3±0.3 


1.22±0.01 


43.0±0.5 


0.12±0.001 


6500±27 
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3.3 5-physics 

The B-trigger is expected to account for 5-10% of the total ATLAS trigger resources. The trigger for B- 
physics is initiated by a single- or a di-muon selection at LI. At 10 31 cm -2 s~\ a threshold pj > 4 GeV 
will be used, rising to about 6 GeV at 10 33 cm~ 2 s _1 to match the rate capabilities of the HLT For 
final states such as the bBj — > n{6)J /yX events, inner detector tracks are combined to reconstruct the 
J/y/ particles. Two different strategies are used for finding the tracks, depending on luminosity [16]. 
At 10 31 cm~ 2 s _1 full reconstruction over the whole inner detector can be performed, since the LI 
muon rate is comparatively modest, while at higher luminosities reconstruction will be limited to LI 
electromagnetic Rols with Ej > 3 GeV. For the bBj — > jj,(6)J/\ffX events the LI trigger efficiency 
is ~ 88%. This latter approach has lower efficiency for selecting the signal but requires fewer HLT 
resources for a fixed LI rate. If one combines triggers for electromagnetic final states and pre-scaled 
single muon-triggers needed for trigger efficiency measurements, the overall rate for B-physics triggers 
is approximately 20 Hz at 10 31 cm~ 2 s _1 . 

4 Electron reconstruction and identification 

The standard electron reconstruction procedure [17], optimised for high energetic electrons, is based on 
calorimeter clusters to which tracks are associated in a second step. An alternative procedure will be 
used for the reconstruction of electrons originated from J/*P and T decays. It takes full advantage of the 
tracking capabilities of the inner detector as well as the granularity of the electromagnetic calorimeter. 
The method is seeded by a track which is extrapolated into the electromagnetic calorimeter and allows 
for efficient reconstruction of electrons in jets for ^-tagging purpose (cf. [18]) and very low pj electrons. 

4.1 Electron reconstruction 

The track-based algorithm could handle any charged track particles with a transverse momentum greater 
than 0.5 GeV. Still, as it will be detailed further, in order to reduce the amount of fake candidates, in 
particular in jets, only particles with p T > 2 GeV are considered. The inner detector coverage goes to 
pseudorapidity values up to 2.5, except for the transition radiation tracker (TRT) which extends up to 
2. This subdetector being crucial nonetheless in the identification procedure but also to preselect tracks, 
the electron reconstruction is limited to \r]\ < 2. Strict selection criteria, similar to the ^-tagging ones, 
are required to have at least nine precision hits (pixel and silicon detectors); at least two hits in the pixel 
detector and at least one hit in the vertexing layer. The TRT plays a central role in electron identification. 
Selection criteria are thus required to have at least 20 hits in the TRT and at least one high energy hit 
(HTR hit) in the TRT detector along the track. After these criteria, only 50% of initial tracks remain. 
All the tracks that pass these criteria are extrapolated [19] to the second sampling of the electromagnetic 
calorimeter. Around this position a cluster of size At] x A0 = 0.075 x 0.125 (3 x 5 in units of cells in 
the middle sampling) in the barrel and At] x A(j) = 0.125 x 0.125 (5 x 5) in the end-caps is built. The 
cell with the maximum energy is searched within a small r\ and 0 window, 0.075 x 0.075 in the middle 
layer, around the extrapolation point. Shower shapes are estimated with respect to this position. 

Since the algorithm is the same as for the reconstruction of electrons in jets [18], a set of preselection 
criteria are applied to decrease the number of fake candidates: 

- the fraction of energy reconstructed in the core of the shower in the first sampling E\ (core) /E >0.03; 

- the fraction of energy reconstructed in the core of the shower in the third sampling ^(core) /E <0.5; 

- the ratio of the energy E reconstructed in the electromagnetic calorimeter over the momentum p 
reconstructed in the inner detector Ej p >0.7. 
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The above selection rejects about 5% of signal electrons but also ensures that the shower shapes in the 
first sampling are correctly defined. Finally, candidates which are also reconstructed as originating from 
a conversion [13] are vetoed, corresponding to a loss of 1-3% of signal electrons and pions. 

By fitting electron tracks in such a way as to allow for bremsstrahlung, it is possible to improve 
the reconstructed track parameters, as shown in Fig. 5 on the ratio between the reconstructed and the 
true momentum for electrons. These algorithms rely exclusively on the inner detector information. The 
method of dynamic-noise-adjustment extrapolates track segments to the next silicon layer. If it finds 
a significant % 2 contribution, compatible with an energy loss by the track due to bremsstrahlung, the 
fraction of radiated energy is estimated and a corresponding additional noise term is included in the 
Kalman filter [1] [20]. 




Figure 5: Left : ratio of the reconstructed to true momentum for electrons, for the default Kalman filter 
(hatched histogram) and for bremsstrahlung recovery algorithm (plain histogram) in the J/y samples. 
Right : ratio of the reconstructed to true energy versus X] for electrons. 

Position and energy corrections are applied in the precise reconstruction of the electromagnetic clus- 
ter and are described in [21]. These corrections have been tuned for high energy clusters and are not 
optimal for low energy electrons. Moreover, they have been determined with electron samples simulated 
with a detector taken to be perfectly aligned. In Fig. 5 the ratio between the reconstructed and the true 
energy is shown as a function of \r\\ for signal electrons from J/y samples. It can be seen that these 
corrections over-estimate the electron energy except in the crack region where the effect of extra-material 
in front of the calorimeter is important. Work is on going to improve the energy reconstruction at low 
energy. 

By default the four-momentum of an electron is defined as the energy reconstructed in the calorime- 
ter, whereas direction is taken from the associated track. As in this note the main physics processes 
result in electron transverse momenta of less than 15 GeV, the tracker momentum is used instead of the 
energy unless stated otherwise. Future developments in ATLAS will ensure an optimal combination of 
calorimeter and tracker measurements in the energy definition. 

4.2 Electron identification 

The most common background processes for producing electron-like showers in the calorimeters were 
described in section 2. Because the development of showers is different for electrons and hadrons, the 
electron identification algorithm incorporates variables that describe the shower shapes, quality of the 
match between the track and its corresponding cluster and the fraction of high threshold hits in the 
transition radiation tracker. 
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4.2.1 Identification of isolated electrons 

The identification for isolated electrons is based on cuts on the shower shapes, on information from the 
reconstructed tracks and on the combined reconstruction [17]. To be consistent with the trigger selection, 
only particles having a transverse energy Ej > 5 GeV are considered in the following. Three levels of 
selection are available: 

- "loose", consisting of simple shower-shape cuts (longitudinal leakage, shower shape in the middle 
layer of the electromagnetic calorimeter) and very loose matching cuts between reconstructed 
tracks and calorimeter clusters; 

- "medium", which adds shower shape cuts making use of the important information contained in 
the first layer of the electromagnetic calorimeter and track-quality cuts; and 

- "tight", with tighter track matching criteria and the cut on the energy-to-momentum ratio. This 
selection also explicitly requires the presence of a vertexing-layer hit on the track (to further reject 
photon conversions) and a large ratio of high-threshold to low-threshold hits in the TRT detector 
(to further reject the background from charged hadrons). Additionally, further isolation of the 
electron may be required by using calorimeter energy isolation beyond the cluster itself. Two sets 
of tight selection cuts are used to estimate the overall performance of the electron identification. 
They are labeled as "tight(TRT)", in the case where a TRT cut with approximately 90% efficiency 
for electrons is applied, and as "tight(isol)", in the case where a TRT cut with approximately 95% 
efficiency is applied in combination with a calorimeter isolation cut. 

The discriminating variables show a significant dependence on the pseudorapidity and a less pronounced 
one on the transverse momentum. In tj the dependence corresponds to varying granularities, lead thick- 
ness and material in front of the electromagnetic calorimeter. The separation between the distributions 
obtained for electrons and pions can vary also with tj. The thresholds applied for cuts have been op- 
timised in five tj bins, (0,0.8), (0.8,1.37), (1.37,1.52), (1.52,1.8), (1.8,2.0), and for transverse energies 
below 7.5 GeV, between 7.5 and 15 GeV and above 15 GeV. 

The electron identification efficiency is defined as e e =N' e /N e , where N e is the number of signal elec- 
tron tracks, which pass the track cuts and N' e is the number of signal electrons which pass identification 
cuts. The charged pion rejection is defined as R n = N n /N' n , where N n is the number of good quality 
pion tracks and N' n is the number of good quality pion tracks misidentified as signal electrons. Table 4 
shows the electron identification efficiency and pion rejection factor after loose, medium, tight with no 
isolation requirement and tight selections for the different data samples. For tight selection the efficiency 
is e e ~ 65% for direct J/y production. Performance is similar for the T sample, despite the higher av- 
erage momentum of the signal electrons, due to the cut on Ej > 5 GeV. Figure 6 shows in more detail 
the overall reconstruction and identification performance: the pj and tj dependences of the efficiencies 
for electrons. There is still an important tj dependence due to a few discriminating variables and work is 
ongoing to improve it. 

4.2.2 Identification of electrons from b quark 

Another identification procedure is optimised for non-isolated electrons and is thus particularly useful 
for bb events. The trigger anticipated for these events is based on a muonic decay mode of either the 
b or the b quark. All good quality tracks are considered above a transverse momentum pj >2 GeV. 
When possible we use the same variables as for the isolated electron identification but some variables 
- like the hadronic leakage by the fraction of energy reconstructed in the third sampling - are replaced 
or only the core of the electromagnetic shower is used. In addition to the traditional cut-based analysis, 
multivariate techniques have been developed, based on the similar variables, and in particular a likelihood 
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Table 4: Expected efficiencites e e for electrons from J/i/a and T decay for the four standard levels of cuts 
used for isolated electron identification. Only electrons with Ej > 5 GeV, corresponding to the HLT 
threshold, are considered. The crack region in the electromagnetic calorimeter, between 1.37 < |tj| < 
1.52 is removed. The quoted errors are statistical only. 



Selection 


PP J/yX 


pp^tx 




e e (%) 


Rn 


S e (%) 


R% 


Loose 


84.3 ±0.1 


36 ±3 


83.7 ± 0.4 


32 ±7 


Medium 


78.4 ±0.1 


72 ±9 


78.4 ± 0.4 


49 ± 13 


Tight(TRT) 


71.4 ±0.1 


109 ± 17 


71.3 ±0.4 


57 ± 16 


Tight (isol) 


65.5 ±0.1 


900 ± 400 


66.1 ±0.5 


740 ± 300 




P T (GeV/c) 



Figure 6: Electron identification efficiency with "Tight(TRT)" cuts level as a function of the pseudora- 
pidity (left) and the transverse momentum (right) in direct J/y events. 

technique can also be used. Figure 7 shows the obtained pion rejection curve as a function of the electron 
identification efficiency. In the following the working point is an electron identification efficiency of 




0.75 0.8 0.85 0.9 0.95 1 

Electron identification efficiency 



Figure 7: Pion rejection as a function of the electron identification efficiency, in bB^ — > \iJ j\\(X sample. 
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80%, corresponding to a pion rejection factor of ~ 1300. Figure 8 shows the overall reconstruction and 
identification performance in more details: the pj and tj dependencies of the efficiencies are shown for 
electrons. 




Figure 8: Electron identification efficiency in bBj — > jJ.(6)J '/yfX sample as a function of the pseudorapid- 
ity (left) and the transverse momentum (right). The mean electron identification efficiency is e e = 80%. 



5 Expected physics studies for early data 
5.1 Number of expected events 

As described in section 3, for an initial luminosity of 10 31 cm~ 2 s -1 , the trigger seelction of two low 
energy electrons (2EM3 menu at level 1) should provide good statistics for J/y —>■ ee and T —>■ ee decays. 
Fig. 9 shows the expected differential cross-section for low-mass electron pairs using the 2EM3 trigger 
menu item and the offline selection in linear (left) and log (right) scale. The invariant mass is recon- 
structed with direction taken from the inner detector and energy from the electromagnetic calorimeter 
which allows a better reconstruction of the invariant mass than using calorimeter only information as 
done at level 1. The signal-to-background ratio obtained is greater than one at the J/y/ and T peaks. With 
an integrated luminosity of 100 pb~ ! and an efficient identification and reconstruction of these low-mass 
pairs, approximately two hundred thousand J/yr decays could be extracted (see table 5). 

Table 5: Number of expected events for direct production of J/yr, T and Drell-Yan events passing the 
2EM3 trigger and offline analysis. Numbers are given for an integrated luminosity of 100 pb -1 with 
early data taking at 10 31 cm -2 s -1 . Quoted errors are statistical only. 





J/yr 


r 


Drell-Yan 




10 3 ev / 


10 3 ev / 


10 3 ev / 




100 pb" 1 


100 pb" 1 


100 pb" 1 


offline + E T > 5 GeV 


256±9 


45 ±5 


13.9±0.3 


offline + E T > 5 GeV + LI 


230±9 


43±5 


13.3±0.3 



Moreover, the standard B-physics trigger, using a single muon above a threshold of p T > 4 GeV, can 
give access to a sample of J / yf events originating from the bb production, without possible bias on the 
selection of electromagnetic objects. Due to its lower cross-section, the expected number of events is 
much less, around 2.3 x 10 3 after offline selection and ~ 1.9 x 10 3 after trigger and offline selection, but 
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m ee (GeV) m ee (GeV) 



Figure 9: Expected differential cross section for low-mass electron pairs using the 2EM3 trigger menu 
item and the offline selection in linear (left) and log (right) scale. Shown is the invariant di-electron 
mass distribution reconstructed using tracks for J/i/a — >• ee decays (dotted histogram), T — > ee decays 
(dashed histogram) and Drell-Yan production (full histogram). Also shown is the expected background 
(full circles). The invariant mass is reconstructed with direction taken from the inner detector and energy 
from the electromagnetic calorimeter. 



without any selection on the electrons themselves. A better estimation of this number requires combining 
a single muon trigger with a trigger for electromagnetic final states as described in section 3.3. 

5.2 Quality of the mass reconstruction with initial data 

In this section, we study the offline reconstruction of the V\y and T particles from their decay products. 
After a short description of the algorithm, we study the performance of the reconstruction for J/y/s 
originating from the bb decays. The invariant mass has been reconstructed with the inner detector only, 
combining information from the inner detector and the electromagnetic calorimeter, and using only the 
latter information. For the reconstruction with inner detector information we present results with and 
without the bremsstrahlung recovery procedure included. For direct production of J/i/a and T, we only 
show results using mass reconstruction with the inner detector. 

5.2. 1 Reconstruction o(J/\j/ and T events 

The identification of electrons is performed using the electron reconstruction algorithm described above. 
Electrons are identified with either the "tight" cuts for isolated electrons, or based on the likelihood 
method tuned to an electron identification efficiency of 80%. Pairs of electrons are thus selected. These 
pairs define the overall detection efficiency of J/yf (or T) events which is the product of the losses due to 
the removal of clusters located in the crack in the electromagnetic calorimeter, the track quality cuts, and 
the electron identification efficiency. 

Pairs of reconstructed opposite-charge tracks are fitted to a common vertex. Only events with a 
quality of the fit with % 2 per degree of freedom < 6 are retained Fig. 10 shows the distribution of the 
reconstructed transverse decay length L xy for direct J/y events and events originated from B hadrons 
decay. It is defined as: = w^nj^i , where D is the distance between the primary and secondary 
vertices and pr{J/w) is the J/yr reconstructed transverse momentum. It is used to distinguish between 
the prompt J/yf, which have a pseudo-proper time of zero (L^ < 0.4 mm) , and B-hadron decays into 
J/yf+X having an exponentially decaying pseudo-proper time distribution due to the non-zero lifetime of 
the parent B-hadrons (L xy > 0.25 mm). 
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Figure 10: Distributions of the reconstructed transverse decay length direct J/i/a events (left) and J/i/a 
events originated from B hadrons decay (right). 



5.2.2 Reconstruction oiJ/y from bb decays 

After selection, only ~2000 events are reconstructed. This statistics is scaled to 1.9 x 10 3 events, corre- 
sponding to the expected statistics for an integrated luminosity of 100 pb^ 1 . 

Reconstruction in the inner detector: 
Fig. 1 1 shows the electron pair invariant mass distribution using only the inner detector information for 
signal events. The fitted function behaves as a Breit-Wigner distribution ~ Tf (Ami + (r/2) 2 ) to the left 
of the peak mo, and as a Gaussian of width a r i g h t to the right, as shown in Fig. 11. The parameter a r i g h t 




Figure 11: The electron pair invariant mass distribution for bBj — > }x(6)J /yX events. The energy and 
direction information are taken from the inner detector. An asymmetric fit is performed with a function 
which behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to the right of 
the peak. Results are shown without (crosses) and with (bullets) bremsstrahlung recovery included. 
Selection of events includes LI trigger and offline and number of events is scaled to 100 pb~ l . 



152 



Electrons and Photons - Reconstruction of Low-Mass Electron Pairs 



characterises the effective resolution in the invariant mass distribution of the pair, while F is a measure 
of the intensity of energy loss by the electrons due to the bremsstrahlung. Amo = mo — Mj/y, where 
Mj/xy = 3096 MeV is the nominal J/i/a mass. The fitted values of the parameters are shown in Table 6. 
The J/i/a reconstruction performance is assessed separately for the three cases: TRT barrel, when both 
electrons have their track pseudorapidity |?]| < 0.7, the TRT end-caps, when at least one electron has 
|tj| > 0.7, and the full r\ range. In general the quality of the fit is not very high, in particular we see 

Table 6: Results of an asymmetric fit to the invariant mass distributions for bB d — ► n(6)J/\]/X events, 
with a function that behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to 
the right of the peak. The direction and energy information are taken from the inner detector. 



brem fit 


77 range 


Am 0 (MeV) 


T (MeV) 


a rig ht (MeV) 


No 


all 
Barrel 
End-caps 


-77 ±7 
-71 ±7 
-178± 17 


557 ±20 
393 ±19 
688 ±43 


67±4 
65±4 
123±11 


Yes 


All 
Barrel 
End-caps 


-66 ±6 
-45 ±7 
-128 ±12 


540 ±18 
417±19 
657 ±33 


99±4 
77±5 
155 ±8 



difficulties with correctly reproducing the peak. Table 6 shows the results of the fit of the invariant mass. 
A shift in the reconstructed mass is measured around 77 MeV, larger in the end-caps than in the barrel. As 
mentioned in [5], such mass shifts may be due to detector alignment, material effects, magnetic field scale 
and its stability. The CDF collaboration extensively and successfully used this method but it took many 
years at the Tevatron to collect sufficient statistics to allow for the disentanglement of various detector 
effects [22]. The parameter T is around 550 MeV. The Gaussian width, estimated from the right part of 
the distribution is around 67 MeV. One can also notice the improvement in the mass reconstruction from 
bremsstrahlung recovery. Without any bremsstrahlung recovery, only 47% of events are reconstructed 
within ± 200 MeV of the nominal J fry mass, whereas with the use of the bremsstrahlung recovery, this 
fraction increases to approximately 55% for the dynamic-noise-adjustment algorithm. 

Combined reconstruction: 
The J/i/a mass can be also determined combining information from the inner detector and the electro- 
magnetic calorimeter. The energy is taken from the electromagnetic calorimeter and the direction from 
the more accurate measurements provided by the inner detector, taking into account the bremsstrahlung 
recovery procedure. Figure 12 shows the di-electron invariant mass distribution obtained from the signal 
sample only. An asymmetric gaussian function is fitted, with different width, Oi e ft and a r i g ht, either side 
of the fitted peak mass mo. It is performed in a narrow mass interval, between 2.5 and 3.6 GeV The 
parameter a r i g ht characterises the effective resolution in the invariant mass, while Oi e ft is a measure of the 
deterioration of this resolution due to bremsstrahlung. The fitted values of the parameters are shown in 
Table 7. Performance is assessed separately for the three cases: TRT barrel, when both electrons have 
their track pseudorapidity |tj| < 0.7, the TRT end-caps, when at least one electron has |t]| > 0.7, and the 
full tj range. The resolution obtained is highly asymmetric, ~ 387 MeV on the left and ~ 189 MeV on 
the right. It can be also noticed that the quality of the fit is rather poor. 

Reconstruction in the electromagnetic calorimeter: 
Finally it is interesting to investigate the performance if we rely only on the information from the elec- 
tromagnetic calorimeter. Fig. 13 shows the electron candidates invariant mass distribution obtained from 
the signal sample only. The same function defined for the combined reconstruction is used to fit the 
distributions. The fitted values of the parameters are shown in Table 8. Performance is assessed sepa- 
rately in three cases: the barrel region of the electromagnetic calorimeter, when both electrons have their 
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Figure 12: The electron pair invariant mass for bBj — > n(6)J '/yfX events. The energy is taken from the 
electromagnetic calorimeter and the direction from the inner detector (including bremstrahlung recov- 
ery). Selection of events includes LI trigger and offline and number of events is scaled to 100 pb^ 1 . 



Table 7: Asymmetric Gaussian fit results for bB^ — > n(6)J/yX events. The energy is taken from the 
electromagnetic calorimeter and the direction from the inner detector. 



n 


Am 0 (MeV) 


dieft (MeV) 


Oright (MeV) 


All 
Barrel 
End-caps 


101 ±14 
94 ±16 
113±28 


327 ± 12 
285 ±12 
385 ±27 


189±11 
183 ± 12 
191 ±21 



pseudorapidity \t] \ < 1.4; the end-cap region, when at least one electron has | X] | > 1.4; and for the full tj 
range. The resolution obtained from the width of the Gaussian is ~ 550 MeV. 



Table 8: Asymmetric Gaussian fit results for bBj — > jJ.(6)J /yX events. The energy and direction infor- 



mation is taken from the electroma, 



netic calorimeter only. 



n 


Am 0 (MeV) 


(Tieft (MeV) 


a rig ht (MeV) 


all 
barrel 
end-cap 


-17 ±54 
-9 ±62 
-68 ±102 


567 ±46 
558 ±50 
629 ±125 


541 ±53 
560 ±60 
414±74 



5.2.3 Reconstruction of direct J/y and T events 

After selection, only ~4000 events are reconstructed. This statistics is scaled to 2.3 x 10 5 events, corre- 
sponding to the expected statistics for an integrated luminosity of 100 pb~ l . Figure 14 shows the electron 
pair invariant mass distribution using only the inner detector information. The same function as defined 
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Figure 13: The electron pair invariant mass for bBj — »• jl{6)J '/yX events. The energy and direction 
are taken from the electromagnetic calorimeter. Selection of events includes LI trigger and offline and 
number of events is scaled to 100 pb^ 1 . 




Figure 14: The electron pair invariant mass distribution for pp — > J/y/X events. The energy and direction 
information are taken from the inner detector. An asymmetric fit is performed with a function which 
behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to the right of the peak. 
Selection of events includes LI trigger, offline and a cut on Ej > 5 GeV for each electron to mimic the 
HLT The number of events is scaled to 100 pb" 1 . 



previously is fitted. The fitted values of the parameters Am 0 , r and a r i g ht are shown in Table 9. The fitted 
mass value is shifted by about 100 MeV, the F factor is ~ 300 MeV and the resolution term is ~ 70 MeV. 
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Table 9: Results of an asymmetric fit to the invariant mass distributions for pp —> J /\\fX events, with a 
function that behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to the right 
of the peak. The direction and energy information are taken from the inner detector. 



n 


Am 0 (MeV) 


T (MeV) 


bright (MeV) 


All 


-98 ± 1 


298 ±2 


71 ± 1 


Barrel 


-77 ± 1 


255 ±2 


62 ± 1 


End-caps 


-142 ± 2 


354 ±3 


87 ±2 



5.2.4 T reconstruction 

After selection, only ~1000 events are reconstructed. This statistics is scaled to 4.3 x 10 4 events, corre- 
sponding to the expected statistics for an integrated luminosity of 100 pb~ ! . Fig. 15 shows the electron 
pair invariant mass distribution from the inner detector information. The fitted values of the parameters 




m ee (GeV) 

Figure 15: The electron pair invariant mass distribution for pp — > TX events. The energy and direction 
information are taken from the inner detector. An asymmetric fit is performed with a function that 
behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to the right of the peak. 
Selection of events includes LI trigger, offline and a cut on Ej > 5 GeV for each electron to mimic the 
HLT Number of events is scaled to 100 pb~ l . 

are shown in Table 10. The fitted mass value is shifted by about 180 MeV, the F factor is ~ 1 GeV and 
the resolution term is ~ 140 MeV. 

5.3 Assessment of performance in situ with initial data 

Initial studies have been performed for the J/i/a — > ee tag-and-probe method briefly outlined below, using 
events satisfying a single electron trigger with Ej > 5 GeV. Due to too high rate at LI (40 kHz) it has 
to be pre-scaled by a factor of 60, which reduces the final statistics. Those events are used to look for 
an opposite-charge electron pair identified by the offline electron reconstruction with an invariant mass 
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Table 10: Results of an asymmetric fit to the invariant mass distributions for pp — > TX events, with a 
function which behaves as a Breit-Wigner distribution to the left of the peak and as a Gaussian to the 
right of the peak. The direction and energy information are taken from the inner detector. 



77 range 


Am 0 (MeV) 


T (MeV) 


a rig ht (MeV) 


All 
Barrel 
End-caps 


-181 ±5 
-177 ±5 
-252 ± 11 


1098 ± 11 
930 ± 12 
1335 ±25 


137 ±3 
137 ±3 
166 ±7 



near the J / y peak. Using the second electron as the probe which was not required to pass any trigger 
selection, the efficiency (relative to the offline selection) of a given trigger signature can be measured. 
We expect to collect of the order of « 20 x 10 3 J/y signal events after the pre-scale with an integrated 
luminosity of 100 pb~ l . Similar studies could be performed to study the offline electron selection. 

One important ingredient in the calibration strategy for the electromagnetic calorimeter is the use of 
large statistics samples of Z — >• ee decays to perform an accurate inter-calibration of regions with a fixed 
size of At] x A<p = 0.2 x 0.4. To cross-check the calibration obtained from the Z° decays and also to 
check the linearity of the calorimeter, it is important to have calibration coefficients for a lower electron 
energy range, which can be obtained using the J/y —>■ ee and T —>■ ee decays as shown in [23]. With 
the expected statistics, a statistical precision of ~ 0.6% can be expected on the inter-calibration of the 
electromagnetic calorimeter based on 100 pb~ x . Still, more studies are needed in particular to improve 
the energy reconstruction and to disentangle effects of inter-calibration with the distribution of material 
in front of the electromagnetic calorimeter. 

More generally, these electron samples will allow us to study the performance of both the recon- 
struction of tracks in the inner detector and clusters in the electromagnetic calorimeter, as well as the 
alignment between these two detectors. All these studies are crucial for the very first measurements 
(such as, for example cross-section measurements) to be performed by the ATLAS experiment on the 
early data. 

6 Conclusion 

In this note, the strategy to reconstruct J/y and T particles, decaying into electron-positron pairs, has 
been investigated. The possible trigger strategies have also been described. For initial luminosities 
of 10 31 cm~ 2 s -1 , a trigger on low-energy di-electron pairs (2EM3 at LI) should provide good statistics 
for the direct production of these particles. Moreover the standard B-physics trigger, using a single 
muon above a certain pj threshold can give access to these events through the bb production, without 
biasing the selection of electromagnetic objects. For these studies, the electron reconstruction seeded by 
a track in the inner detector has been used. Compared to previous studies, the main improvement comes 
from the identification procedure, which can either use the standard cut-based analysis, with thresholds 
tuned at low energy, or a dedicated identification developed for non-isolated electrons. The signal-to- 
background ratio obtained is larger than one at the J/y and T peaks, but the extraction of the Drell-Yan 
signal requires further studies. With an integrated luminosity of 100 pb~ l and an efficient identification 
and reconstruction of these low-mass pairs, approximately two hundred thousand J/y decays could be 
isolated for detailed studies of the electron identification and reconstruction performance, in particular in 
terms of matching energy and momentum measurements at a scale quite different from that of the more 
commonly used Z — > ee decays. 
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Muon Reconstruction and Identification: Studies with 
Simulated Monte Carlo Samples 

Abstract 

The strategy and performance for muon identification and reconstruction in 
ATLAS are described. Performance metrics include efficiency, fake rates and 
momentum resolution. Results are based on data simulated and reconstructed 
in 2007. 

1 Introduction 

The ATLAS experiment will detect particles created in 14 TeV proton-proton collisions produced by the 
CERN LHC (Large Hadron Collider). Only a tiny fraction of these collisions will correspond to inter- 
esting standard model processes and an even smaller fraction to new physics. Muons, especially those 
with high-pr (transverse momentum) and those that are isolated (from other activity in the detector), will 
be much more common in these interesting events than in the background, and thus provide important 
means to identify such events and to determine their properties. The ATLAS detector has been designed 
to be efficient in the detection of muons and to provide precise measurement of their kinematics up to 
one TeV. 

In parallel with the construction of the detector, software has been developed to reconstruct these 
muons, i.e., for each recorded event, to identify muons and measure their position, direction and momen- 
tum. Here we describe the strategies being pursued for this reconstruction and the current performance 
characterized in terms of efficiency, fake rate and precision and accuracy of measurement. The results 
reported here are based on simulation data generated and reconstructed in 2007. 

We begin with descriptions of the detector, the reconstruction algorithms and the means by which 
we measure the performance. These are followed by sections documenting this performance for each of 
the various reconstruction strategies and finally a section summarizing results and commenting on future 
developments. 

2 Detector 

The ATLAS detector [1] has been designed to provide clean and efficient muon identification and precise 
momentum measurement over a wide range of momentum and solid angle. The primary detector system 
built to achieve this is the muon spectrometer, shown in Figure 1 . The spectrometer covers the pseudo- 
rapidity range \r}\ < 2.7 and allows identification of muons with momenta above 3 GeV/c and precise 
determination of pj up to about 1 TeV/c. 

The muon spectrometer comprises three subsystems: 

• Superconducting coils provide a toroidal magnetic field whose integral varies significantly as a 
function of both r\ and q> (azimuthal angle). The integrated bending strength (Figure 2) is roughly 
constant as a function of r\ except for a significant drop in the transition between the barrel and 
endcap toroid coils (1.4 ^>\r\\ ^1.6). 

• Precision detectors are located in three widely-separated stations at increasing distance from the 
collision region. Each station includes multiple closely -packed layers measuring the tj -coordinate, 
the direction in which most of the magnetic field deflection occurs. Monitored drift tubes provide 
these measurements everywhere except in the high-Tj (|t]| > 2.0) region of the innermost station 
where cathode strip chambers are used. The measurement precision in each layer is typically better 
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Figure 1 : The ATLAS muon spectrometer. 



than 100 j^m. The cathode strip chambers additionally provide a rough (1 cm) measurement of the 
(^-coordinate. 

• Resistive plate and thin gap chambers provide similarly rough measurements of both tj and (p near 
selected stations. 

Higher muons typically traverse all three stations but there are r\-(p regions where one, two or all 
three stations do not provide a precision measurement, e.g. those regions with support structures or 
passages for services. There are also regions where overlaps allow two measurements from a single 
station. Figure 3 shows the number of station measurements as function of tj and (p. The resolution and 
efficiency are degraded where one or more stations do not provide a measurement. 

Figure 4 shows how contributions to the muon spectrometer momentum resolution vary as a function 
of pj. At low momentum, the resolution is dominated by fluctuations in the energy loss of the muons 
traversing the material in front of the spectrometer. Multiple scattering in the spectrometer plays an 
important role in the intermediate momentum range. For pj > 300 GeV/c, the single-hit resolution, 
limited by detector characteristics, alignment and calibration, dominates. 

The other ATLAS detector systems also play important roles in achieving the ultimate performance 
for muon identification and measurement. The calorimeter, with a thickness of more than 10 interaction 
lengths, provides an effective absorber for hadrons, electrons and photons produced by proton-proton 
collisions at the center of the ATLAS detector. Energy measurements in the calorimeter can aid in muon 
identification because of their characteristic minimum ionizing signature and can provide a useful direct 
measurement of the energy loss [2] . 

A tracking system inside the calorimeters detects muons and other charged particles with hermetic 
coverage for |tj| < 2.5, providing important confirmation of muons found by the spectrometer over that 
77 range. This inner detector has three pixel layers, four stereo silicon microstrip layers, and, for |tj| < 
2.0, a straw-tube transition radiation detector that records an average of 36 additional measurements on 
each track. A 2 Tesla solenoidal magnet enables the inner detector to provide an independent precise 
momentum measurement for muons (and other charged particles). Over most of the acceptance, for pj 
roughly in the range between 30 and 200 GeV/c, the momentum measurements from the inner detector 
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Figure 2: ATLAS muon spectrometer integrated magnetic field strength as a function of \r\\ 
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Figure 4: Contributions to the momentum resolution for muons reconstructed in the Muon Spec- 
trometer as a function of transverse momentum for \r]\ < 1.5. The alignment curve is for an 
uncertainty of 30 }Xm in the chamber positions. 



and muon spectrometer may be combined to give precision better than either alone. The inner detector 
dominates below this range, and the spectrometer above it. 

3 Overview of reconstruction and identification algorithms 

ATLAS employs a variety of strategies for identifying and reconstructing muons. The direct approach is 
to reconstruct standalone muons by finding tracks in the muon spectrometer and then extrapolating these 
to the beam line. Combined muons are found by matching standalone muons to nearby inner detector 
tracks and then combining the measurements from the two systems. Tagged muons are found by ex- 
trapolating inner detector tracks to the spectrometer detectors and searching for nearby hits. Calorimeter 
tagging algorithms are also being developed to tag inner detector tracks using the presence of a mini- 
mum ionizing signal in calorimeter cells. These were not used in the data reconstruction reported here 
and their performance is documented elsewhere [2]. 

The current ATLAS baseline reconstruction includes two algorithms for each strategy. Here we 
briefly describe these algorithms. Later sections describe their performance. 

The algorithms are grouped into two families such that each family includes one algorithm for each 
strategy. The output data intended for use in physics analysis includes two collections of muons — one 
for each family — in each processed event. We refer to the collections (and families) by the names of the 
corresponding combined algorithms: Staco [3] and Muid [4]. The Staco collection is the current default 
for physics analysis. 

3.1 Standalone muons 

The standalone algorithms first build track segments in each of the three muon stations and then link the 
segments to form tracks. The Staco-family algorithm that finds the spectrometer tracks and extrapolates 
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them to the beam line is called Muonboy [3]. On the Muid side, Moore [5] is used to find the tracks and 
the first stage of Muid performs the inward extrapolation. 

The extrapolation must account for both multiple scattering and energy loss in the calorimeter. Muon- 
boy assigns energy loss based on the material crossed in the calorimeter. Muid additionally makes use 
of the calorimeter energy measurements if they are significantly larger than the most likely value and the 
muon appears to be isolated [6]. 

Standalone algorithms have the advantage of slightly greater \r] \ coverage — out to 2.7 compared to 
2.5 for the inner detector — but there are holes in the coverage at |tj| near 0.0 and 1.2 (see figure 3). 
Very low momentum muons (around a few GeV/c) may be difficult to reconstruct because they do not 
penetrate to the outermost stations. 

Muons produced in the calorimeter, e.g. from n and K decays, are likely to be found in the standalone 
reconstruction and serve as a background of "fake" muons for most physics analyses. There are a few 
exotic channels for which charged particles appearing in the calorimeter are a signal of interest. 

3.2 Inner detector 

The primary track reconstruction algorithm for the inner detector is described in Ref. [7]. Space points 
are identified in the pixel and microstrip detectors, these points are linked to form track seeds in the 
inner four layers, and tracks are found by extending these seeds to add measurements from the outer 
layers. This strategy is expected to give very high detection efficiency over the full detector acceptance, 
|i? | < 2.5. 

3.3 Combined muons 

Both of the muon combination algorithms, Staco and Muid, pair muon- spectrometer tracks with inner- 
detector tracks to identify combined muons. The match chi-square, defined as the difference between 
outer and inner track vectors weighted by their combined covariance matrix: 

Xlatch = (Tms - T ro ) T (do + Cms)" 1 (Tms - T ID ) (1) 

provides an important measure of the quality of this match and is used to decide which pairs are retained. 
Here T denotes a vector of (five) track parameters — expressed at the point of closest approach to the beam 
line — and C is its covariance matrix. The subscript ID refers to the inner detector and MS to the muon 
spectrometer (after extrapolation accounting for energy loss and multiple scattering in the calorimeter). 

Staco does a statistical combination of the inner and outer track vectors to obtain the combined track 
vector: 

T = ( c id + c ms) 1 ( c id t id + C M s Tms) (2) 
Muid does a partial refit: it does not directly use the measurements from the inner track, but starts from 
the inner track vector and covariance matrix and adds the measurements from the outer track. The fit 
accounts for the material (multiple scattering and energy loss) and magnetic field in the calorimeter and 
muon spectrometer. 

3.4 Tagged muons 

The spectrometer tagging algorithms, MuTag [3] and MuGirl [8] , propagate all inner detector tracks with 
sufficient momentum out to the first station of the muon spectrometer and search for nearby segments. 
MuTag defines a tag chi-square using the difference between any nearby segment and its prediction from 
the extrapolated track. MuGirl uses an artificial neural network to define a discriminant. In either case, 
if a segment is sufficiently close to the predicted track position, then the inner detector track is tagged as 
corresponding to a muon. 
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At present, both algorithms simply use the inner-detector track to evaluate the muon kinematics, i.e. 
the inner track and spectrometer hits are not combined to form a new track. This is not very important in 
the low-pr regime that these algorithms were originally intended to address. Both algorithms are being 
further developed to allow extrapolation to other and multiple stations and add the capability to include 
the spectrometer measurements in a track refit. 

There is an important difference in the way these algorithms are run in the standard reconstruction 
chain. MuGirl considers all inner-detector tracks and redoes segment finding in the region around the 
track. MuTag only makes use of inner-detector tracks and muon- spectrometer segments not used by 
Staco. Thus MuTag serves only to supplement Staco while MuGirl attempts to find all muons. Obviously, 
MuTag is part of the Staco family and most sensibly used in that context. MuGirl muons are recorded as 
part of the Muid family. 

3.5 Merging muons 

The muon finding efficiency (and fake rate) may be increased by including muons found by multiple 
algorithms but care must be taken to remove overlaps, i.e. cases where the same muon is identified by 
two or more algorithms. To a large extent, this is done when the collections are created. Standalone 
muons that are successfully combined are not recorded separately. In those cases where a standalone 
muon is combined with more than one inner-detector track, exactly one of the muons is flagged as "best 
match." In the Staco collection, the tagged and combined muons do not overlap by construction. In 
the Muid collection, overlaps between MuGirl and Muid muons are removed by creating a single muon 
when both have the same inner detector track. 

Analysts wishing to merge standalone and tagged muons or muons from different collections may 
make use of a muon selection tool to remove overlaps. It requires muons have different inner-detector 
tracks and merges standalone muons that are too close to one another. Closeness is defined by rj-(p 
separation with a default limit of 0.4. 

4 Tools for performance evaluation and classification of tracks 

Simulation samples were created in the ATLAS framework by running an event generator (PYTHIA [9] 
or MC@NLO [10, 11]) and using GEANT4 [12] to propagate the final-state particles using ATLAS- 
specific code to describe the geometry and response of the detector. The data were then reconstructed 
using the software based on the algorithms described in the previous chapter. 

4.1 Truth matching and track classification 

Muon reconstruction performance is evaluated for each event by comparing selected reconstructed muons 
with the true muons, i.e. those in the Monte Carlo truth record. The latter include muons created in the 
initial event generation as well as secondaries produced during propagation through the tracking volume. 
Muons produced in the calorimeter or muon spectrometer are not included in the truth record. True 
muons with transverse momentum below 2 GeV/c are also excluded to avoid spurious matches with 
candidates we do not expect to be able to reconstruct. 

For each event, a one-to-one matching is performed between the selected reconstructed muons and 
the true muons. The matching makes use of two distance metrics: D re f is the reference distance measured 
from true muon to the reconstructed muon: 




(3) 
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and D eva is the evaluation distance measured from the reconstructed muon to the true muon: 



Here pj is signed (i.e. carries the charge sign), but elsewhere in the text it denotes the magnitude 
of the transverse momentum. In the second distance equation, T again denotes the vector of (five) 
track parameters (expressed at the distance of closest approach to the beam line) and C the associated 
covariance matrix. Note that D 2 eva is a chi-square with five degrees of freedom. 

There is a maximum allowed value for each of these distances. For D eva the maximum value is 1000, 
a very loose cut. The limit for D re f is 100 and we see from equation 3 this implies the matched muons 
must be within a distance of 0.5 in r\ and q> and have the same charge sign with pireco > 0.25 prtrue or 
opposite sign with pjreco > 0.50 p Ttr ue- 

The matching is carried out by first examining each reconstructed muon and assigning it to the nearest 
true muon using the evaluation distance. The reconstructed muon is left unmatched if no distance is less 
than the maximum allowed value. The reference distance is evaluated for each match and the match is 
discarded if it exceeds the threshold for that quantity. If more than one match remains for any true muon, 
then only the match with the smallest reference distance is retained. 

True muons that are matched are said to be found and those left unmatched are lost. Found muons 
are classified as good if they have D eva < 4.5 corresponding to a chi-square probability above 0.001 1. 

Reconstructed muons are said to be real if they are matched and fake if unmatched. Note that these 
fakes may correspond to true muons produced outside the tracking volume (e.g. in the calorimeter) and 
hence not included in the truth record. 

4.2 Performance measures 

Our performance measures include efficiency, fake rate, resolutions and resolution tails. The efficiency 
or finding efficiency is defined to be the fraction of true muons that are found and is typically evaluated 
for some kinematic selection (applied after matching). The good efficiency is the fraction of true muons 
that are found and classified as good (as defined in the previous section). The good fraction is the fraction 
of found muons that are classified as good. In the sections that follow, we present the overall efficiency 
for various physics samples and the efficiency as a function of rj for the primary benchmark sample. 

The fake rate is defined to be the mean number of fake muons per event and it is presented for a 
variety of pj thresholds corresponding to the values that might be chosen for different physics analyses. 

Five kinematic variables characterize a track, but here we examine only the measurement of the 
transverse momentum. The precision and accuracy of the direction measurements are typically much 
better than that required for any physics analysis. The measurement of the initial position of the track 
(e.g. at the distance of closest approach to the beam line or vertex) is discussed in another note [13]. For 
the momentum, we use the fractional residual, Apr/pr, defined in equation 5. This distribution is fitted 
with a Gaussian and the resolution is defined to be the sigma of this fit. The tails in the distributions 
are often more important than the core resolution and we characterize these by evaluating the fraction 
of found muons in five tail categories. The first three are those for which the magnitude of this residual 
exceeds 5%, 10% or 30%. The last category is the fraction for which the charge sign is incorrectly 
measured. Finally there is an intermediate category in which either the sign is incorrect or the magnitude 
of the measured momentum is more than two times larger than the true value. 




(4) 



In the first equation, Ap T /p T is the fractional momentum resolution: 



Apr _ l/PTreco ~ 1 / ' PTtrue _ PTtrue ~ PTre 
PT 1 / PTtrue PTreco 



(5) 
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4.3 Monte Carlo samples 

Our primary benchmark sample is a collection of tt events requiring the presence of at least one lepton 
(electron, muon or tau). The initial inclusive sample was produced using MC@NLO in conjunction with 
Herwig [14]. This sample provides a variety of mechanisms for producing muons and we present results 
for two: direct muons which do not have any quarks in their ancestry and indirect muons whose ancestry 
includes a heavy quark (b or c) but not a tau. In this sample, the former are produced directly in the 
leptonic decay of a W-boson. 

Performance metrics are plotted as a function of 77 for tt direct muons. In addition, we tabulate 
efficiencies and fake rates for these muons, for tt indirect muons, and for muons in separate low- and 
high-pr samples. The low-pr sample is taken from direct PYTHIA J/lfr production with the J/\\r forced 
to decay to two muons and a filter selecting only those events where both muons have |tj| < 2.5 and 
Pt > 4 GeV/c. Muons produced by other processes in these events are suppressed by restricting the 
analyzed sample to muons that have a c-quark in their ancestry. The high-/?r sample consists of direct 
muons in PYTHIA production of Z' —>■ ^iji with a Z' mass of 2 TeV. The generation also includes Z/y 
and interference but a dimuon mass cut (m^ > 500 GeV/c) ensures that the average muon p T is above 
500 GeV/c. 

At design luminosity, ATLAS will have many interactions in each beam crossing (pileup) and there 
will be significant background in the muon chambers from low-energy photons and neutrons (cavern 
background). To get an estimate of the effect this will have on our reconstruction algorithms, we pro- 
cessed a tt sample overlaid with the backgrounds expected for a reference luminosity of 10 33 cm~ 2 s _1 . 
The cavern background was included with a safety factor of 2.0, i.e. at twice the value expected for this 
luminosity. In the following, this sample is called the high-luminosity tt sample. Low luminosity refers 
to samples without any pileup or cavern background. 

There is considerable uncertainty in the estimate of the cavern background and active development is 
underway to improve reconstruction in this environment, and so the results presented here provide only 
a rough indication of the performance we expect at high luminosity. 

Figure 5 shows the pj, r\ and isolation energy distributions for the true muons in the samples studied 
in this note. The isolation energy was obtained by summing the calorimeter transverse energy in an r\-(p 
cone of radius 0.2 about the muon. The most probable value for the muon energy loss (as discussed in 
reference [2]) is subtracted from these values. 
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5 Standalone muon performance 

5.1 Efficiencies and fake rates 

Figure 6 shows the standalone tt direct muon efficiencies and fake rates as functions of r\ at low luminos- 
ity (i.e. without any pileup or cavern background) and at our reference luminosity (10 33 /cm 2 /sec with 
cavern background safety factor 2.0). Table 1 gives the integrated efficiencies and fake rates for these 
and other samples. 



Sample 


Efficiency 
found good 


Fakes/(1000 events) above p T limit (GeV/c) 
3 i 10 | 20 | 50 


Muonboy 


it direct 
it indirect 


0.951 (1) 
0.949 (1) 


0.812(1) 
0.783 (2) 


24.0 (3) 


4.4(1) 


1.69 (7) 


0.52 (4) 


hi-Jz? tt direct 


0.950 (2) 


0.809 (3) 


53 (1) 


8.2 (4) 


3.9 (2) 


1.9 (2) 


Z' direct 


0.914 (2) 


0.781 (3) 


141 (4) 


79 (3) 


61 (3) 


37 (2) 


J/v 


0.959 (3) 


0.764 (6) 


51 (1) 


5.0 (4) 


1.6 (2) 


0.6 (1) 


Moore/Muid 


tt direct 
tt indirect 


0.943 (1) 
0.920 (2) 


0.861 (1) 
0.838 (2) 


19.8 (3) 


3.9 (1) 


1.44 (6) 


0.47 (4) 


hi-Jz? tt direct 


0.932 (2) 


0.836 (3) 


984 (4) 


301 (2) 


156 (2) 


61 (1) 


Z' direct 


0.887 (2) 


0.769 (3) 


168 (4) 


102 (3) 


75 (3) 


43 (2) 


J/Y 


0.830 (5) 


0.723 (6) 


6.7 (4) 


1.1 (2) 


0.5 (1) 


0.13 (6) 



Table 1: Muonboy and Moore/Muid efficiencies and fake rates for various samples (section 4.3). 
Efficiencies are presented both for all found muons and for those with a good truth match (D eva < 
4.5). Both are calculated for true muons with \t]\ < 2.5 and pj > 10 GeV/c. Fake rates are 
presented for a variety of pj thresholds. 



Comparing with Figure 3, we see most of the efficiency loss occurs in regions where the detector 
coverage is poor, i.e. for \r\\ around 0.0 and 1.2. Otherwise, the tt muon efficiency is close to 100% 
for Muonboy and around 99% for Moore/Muid. The Muid good fraction is significantly higher than for 
Muonboy, presumably because of better handling of the material in the calorimeter. The algorithms have 
similar fake rates at low luminosity. At the higher luminosity, the Staco rate increases significantly (by 
a factor of 2-4) while the Moore/Muid rate increases dramatically (factor of 100). In the high-pr Z', the 
efficiency falls by a few percent for both algorithms. For the low-pr (and non-isolated) // \ff muons, the 
Moore/Muid efficiency degrades significantly while Muonboy remains high. 

5.2 Resolution 

Figure 7 shows the pr resolutions and tails as functions of Tj and pj. The resolution is degraded at 
intermediate pseudorapidity (1 .2 < | r\ \ < 1 .7) because of the reduced number of measurements (figure 3), 
the low field integral in the overlap between barrel and endcap toroids (figure 2), and the material in the 
endcap toroid (figure 1). The average resolution is very similar for the two algorithms. Despite having a 
lower good fraction, Muonboy has fewer muons for which the charge sign is incorrectly measured. This 
suggests that, at least in the tails, Moore/Muid provides a better estimate of the momentum error while 
Muonboy provides a better estimate of its value. The Moore/Muid tails are likely due to the assignment 
of incorrect hits to spectrometer tracks. 
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Figure 6: Standalone efficiency and fake rate as functions of true 77 for Muonboy (left) and 
Moore/Muid (right) for direct muons in tt at low (top) and high (bottom) luminosity. In the effi- 
ciency plots, the upper curve (blue) is the efficiency to find the muon while the lower curve (green) 
additionally requires a good match (D evu < 4.5) between reconstructed and true track parameters. 
Fake rates are shown for a variety of pj thresholds. 
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Figure 7: Standalone fractional momentum resolution (Apr/pr) as function of rj (top) and pj 
(2nd row) and tails in that parameter also as functions of rj (3rd row) and pj (bottom). All are for 
both Muonboy (left) and Moore/Muid (right). The tail is the fraction of reconstructed muons with 
magnitude of Apr / Pt outside a range and is shown for a wide range of values. The last tail curve 
(red, "charge") includes only muons reconstructed with the wrong charge sign. The 4th tail curve 
(yellow, "2X high") includes these and those with momentum magnitude more than two times the 
true value. 
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6 Inner detector performance 

Figure 8 shows the efficiency for it direct muons and Table 2 gives the integrated efficiencies for all of the 
samples. The efficiency is high for all Y\ (within the acceptance) and all samples. There is no evidence 
of degradation when pileup is added. 

The inner detector momentum resolution is the same as that for tagged muons, reported later: see 
Figure 13 in section 8. 




T| T| 

Figure 8: Inner detector it direct muon efficiency as a function of true Y\ at low (left) and high 
(right) luminosity. In each figure, the upper curve (blue) is the efficiency to find the muon while 
the lower curve (green) additionally requires a good match {D eva < 4.5) between reconstructed 
and true track parameters. The efficiency is for pj > 10 GeV/c. 





Efficiency 


Sample 


found 


good 


tt direct 


0.996 (1) 


0.950 (2) 


it indirect 


0.997 (1) 


0.833 (5) 


hi-Jjf it direct 


0.995 (1) 


0.947 (2) 


Zprime direct 


0.993 (1) 


0.966 (1) 


J/Y 


0.995 (1) 


0.941 (3) 



Table 2: Inner detector efficiencies. The samples and algorithms are described in the text. Effi- 
ciencies are presented both for all found muons and for those with a good truth match (D eva < 4.5). 
Efficiencies are calculated for true muons with \ rj\ < 2.5 and pj > 10 GeV/c. 



7 Combined muon performance 

7.1 Efficiencies and fake rates 

Figure 9 shows the combined it direct muon efficiency and fake rates for each algorithm as a function of 
r\ for both low and high luminosity. Compared with the performance for standalone muons (figure 6), 
Staco shows a small drop in efficiency with little reduction of the fake rate except for the lowest pj 
threshold at high luminosity. In fact, the high-p^ fake rates increase at either luminosity because low-pr 
standalone muons are matched to high-/?/- inner-detector tracks. At low luminosity, Muid it shows a 
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small decrease in both efficiency and fake rate. When background is added, the dramatic increase in 
fakes for Moore standalone is not observed in Muid combined, i.e. the matching suppresses most of the 
fakes and the Muid high-pr fake rates are lower than those of Staco. However, the high-luminosity tt 
Muid efficiency is significantly worse than that of Staco. 

When matching inner detector and muon spectrometer tracks, both Staco and Muid calculate Xmatch 
(section 3.3) which serves as a discriminant for separating real and fake muons. The fakes include pion 
or kaon decays in or near the calorimeter. Figure 10 shows the Xmatch distributions for both direct found 
muons and fakes. We see that with a cut on this quantity, e.g. Xmatch < 100, many of the Staco high-/? r 
fakes can be suppressed with only a modest loss in efficiency. The higher Staco fake rates come from 
looser cuts during reconstruction and, if the Xmatch cuts are adjusted to give the same efficiencies, the 
Staco fake rate is lower. 

Table 3 shows the integrated Staco and Muid muon efficiencies and fake rates for all samples includ- 
ing an entry showing the effect of the above cut on Xmatch- 



Sample 


Efficiency 
found good 


Fakes/(1000 events) above p T limit (GeV/c) 
3 | 10 | 20 | 50 


Staco 


tt direct 
tt indirect 


0.943 (1) 
0.933 (1) 


0.875 (1) 
0.767 (2) 


22.0 (3) 


9.6 (2) 


3.4 (1) 


0.62 (4) 


tt direct cut 


0.924 (1) 


0.865 (1) 


14.8 (2) 


3.1 (1) 


0.39 (3) 


0.01 (1) 


hi-Jz? tt direct 


0.941 (2) 


0.871 (3) 


25.9 (7) 


11.2 (4) 


4.3 (3) 


0.7 (1) 


Z' 


0.910 (2) 


0.824 (3) 


14 (1) 


8.4 (9) 


5.2 (7) 


3.4 (6) 


J/Y 


0.943 (3) 


0.873 (4) 


0.9 (2) 


0.24 (8) 


0.11 (5) 


0.0 (0) 


Muid 


tt direct 
tt indirect 


0.926 (1) 
0.888 (2) 


0.877 (1) 
0.748 (3) 


15.4 (2) 


2.36 (9) 


0.48 (4) 


0.05 (1) 


tt direct cut 


0.917(1) 


0.871 (1) 


14.0 (2) 


1.96 (8) 


0.33 (3) 


0.03 (1) 


hi- Jz? tt direct 


0.904 (2) 


0.854 (3) 


35.5 (8) 


5.0 (3) 


1.1 (1) 


0.24 (6) 


Z' direct 


0.872 (2) 


0.811 (3) 


11 (1) 


4.5 (7) 


3.1 (6) 


2.7 (5) 


J/¥ 


0.793 (5) 


0.741 (6) 


0.8(1) 


0.03 (3) 


0.0 (0) 


0.0 (0) 



Table 3: Staco and Muid efficiencies and fake rates. The samples and algorithms are described in 
the text. Algorithm names are followed by "cut" to indicate that reconstructed muons are required 
to have Xmatch < 100 for both efficiency and fake calculations. Efficiencies are presented both for 
all found muons and for those with a good truth match. Both are calculated for true muons with 
\r} \ < 2.5 and p T > 10 GeV/c. The fake rates are presented for a variety of p T thresholds. 



7.2 Resolution 

Figure 11 shows the tt direct muon p T resolutions and tails as functions of T7 and p T . Comparing 
with the same for standalone reconstruction (figure 7), we see, as expected, the combined resolution 
is significantly better especially in the overlap region (\r]\ around 1.5) and for pj below 100 GeV/c. 
There are also significant reductions in the tails of momentum residuals. Misreconstruction and charge 
misidentification rates are around 0.01% for the combined muons instead of 0.1% for the standalone. 
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Figure 9: Combined muon efficiency and fake rate for Staco (left) and Muid (right) as functions 
of true Tj for direct muons in tt at low (top) and high (bottom) luminosity. In each efficiency plot, 
the upper curve (blue) is the efficiency to find the muon while the lower curve (green) addition- 
ally requires a good match (D evu < 4.5) between reconstructed and true track parameters. The 
efficiencies are for pj > 10 GeV/c. Fake rates are shown for a variety of pj thresholds. 
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Figure 10: Distributions of Xmatch f° r direct muons (top) and fakes (third from top). The fakes are 
shown for a variety of pr thresholds. The second row shows the efficiency as function of Xmatch 
when muons above that value are rejected. The bottom row shows the fake rates as a function 
of efficiency as that threshold is varied. All are shown for both Staco (left) and Muid (right). 
The sharp drops in the Staco % match distribution come from cuts on that quantity made during 
reconstruction, i.e. before filling the output muon collection. 
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Figure 11: Combined muon fractional momentum resolution (Apr / pr) as function of r\ (top) and 
Pt (2nd row) and tails in that parameter also as functions of rj (3rd row) and pj (bottom). All 
are for both Staco (left) and Muid (right). The tail is the fraction of reconstructed muons with 
magnitude of Apr/pr outside a range and is shown for a wide range of values. The last tail curve 
(red, "charge") includes only muons reconstructed with the wrong charge sign. The 4th tail curve 
(yellow, "2X high") includes these and those with momentum magnitude more than two times the 
true value. 



178 



Muons - Muon Reconstruction and Identification: Studies with Simulated . 



8 Tagged muon performance 

8.1 Efficiencies and fake rates 

ATLAS runs two tagging algorithms but only MuGirl attempts to find all muons. MuTag is run in a 
manner to complement Staco and the performance of the combination of these two is reported in the 
following section. 

Figure 12 shows the MuGirl direct muon efficiency and fake rates as a function of Vj in tt at low and 
high luminosity. Table 4 gives the MuGirl integrated efficiencies and fake rates for all our samples. 

Comparing with the combined muon results (figure 9 and table 3), we see that MuGirl has lower 
efficiency and a substantially higher fake rate. We also observe that its performance degrades faster 
when luminosity background is added. MuGirl has higher efficiency than Muid for reconstructing the 
low-pr muons in the J/\\r sample. 



8.2 Resolution 

Figure 13 shows the MuGirl pj resolution and tail as functions of rj and pj. MuGirl does not refit the 
tracks and so this is just the resolution of the inner detector. Comparing with the standalone (figure 7) 
and combined (figure 1 1), we see how the standalone and inner measurements complement one another 
to give high precision over the full tj and pj range of the it sample. 



ATLAS 
tt direct 
MuGirl 



ATLAS 

L33sf02 tt direct 
MuGirl 




Figure 12: MuGirl efficiency (left) and fake rates (right) as a function of true rj in tt at low (top) 
and high (bottom) luminosity. In each efficiency plot, the upper curve (blue) is the efficiency to find 
the muon while the lower curve (green) additionally requires a good match (D eva < 4.5) between 
reconstructed and true track parameters. The efficiency is for muons with true pj > 10 GeV/c. 
Fake rates are presented for a variety of pj thresholds. 
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Sample 


Effic 
found 


ency 
good 


Fakes/(10( 
3 


)0 events) al 
10 


)ove pr lin 
20 


lit (GeV/c) 
50 


tt direct 
tt indirect 


0.911 (1) 
0.899 (2) 


0.870(1) 
0.748 (3) 


105.0 (6) 


23.7 (3) 


7.3 (2) 


1.14(6) 


hi-Jzf it direct 


0.866 (3) 


0.825 (3) 


154 (2) 


26.1 (7) 


7.6 (4) 


1.2 (1) 


Z' direct 


0.802 (3) 


0.781 (3) 


57 (2) 


26 (2) 


15 (1) 


5.9 (8) 


//y/c-quark 


0.888 (4) 


0.839 (5) 


4.4 (3) 


0.11(5) 


0 (0) 


0 (0) 



Table 4: MuGirl efficiencies and fake rates. The samples and algorithms are described in the text. 
Efficiencies are presented both for all found muons and for those with a good truth match. Both 
are calculated for truth muons with | tj | < 2.5 and pr > 10 GeV/c. The fake rates are presented for 
a variety of pj thresholds. 
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Figure 13: MuGirl fractional momentum resolution (Apr/pr) as a function of r\ (top) and pj 
(bottom). Both the distribution (left) and tails (right) are shown for each. The tail is the fraction 
of reconstructed muons with magnitude of residual greater than a threshold and results are shown 
for a variety of thresholds. The last tail curve (red, "charge") includes only muons reconstructed 
with the wrong charge sign. The 4th tail curve (yellow, "2X high") includes these and those with 
momentum magnitude more than two times the true value. 
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9 Merged muon performance 

Finally we consider merging the muons produced by different algorithms. There are many possible com- 
binations but we restrict ourselves to two simple but important cases: merging the combined and tagged 
muons separately within each collection (family), i.e. we examine Staco+MuTag and Muid+MuGirl. 

Figure 14 shows the corresponding direct muon efficiencies and fake rates in tt at low and high 
luminosity. The integrated efficiencies and fake rates for all samples are summarized in table 5. One 
of the primary goals of the tagging algorithms is to reconstruct low-pj- muons which the standalone 
reconstruction misses because the energy loss in the calorimeter leaves these muons with very little 
momentum in the muon spectrometer. Figure 15 shows the low-pr efficiency as function of pj for 
combined alone and combined supplemented with tagged for each of the collections. 



Sample 


Efficiency 
found good 


Fakes/(1000 events) above p T limit (GeV/c) 
3 | 10 | 20 | 50 


Staco+MuTag 


tt direct 
tt indirect 


0.948 (1) 
0.940 (1) 


0.879 (1) 
0.772 (2) 


49.0 (4) 


14.4 (2) 


4.8 (1) 


0.86 (5) 


hi-Jz? tt direct 


0.946 (2) 


0.876 (3) 


58 (1) 


16.6 (5) 


6.1 (3) 


1.1 (1) 


Z' direct 


0.931 (2) 


0.844 (3) 


32 (2) 


14 (1) 


7.1 (9) 


4.2 (7) 


J/v 


0.954 (3) 


0.883 (4) 


2.5 (3) 


0.3 (1) 


0.11(5) 


0 (0) 


Muid+MuGirl 


tt direct 
tt indirect 


0.955 (1) 
0.946 (1) 


0.903 (1) 
0.790 (2) 


113.1 (6) 


24.9 (3) 


7.6 (2) 


1.17(6) 


hi- Jz? tt direct 


0.952 (2) 


0.898 (2) 


181 (2) 


29.8 (7) 


8.4 (4) 


1.2 (2) 


Z' direct 


0.929 (2) 


0.866 (3) 


61 (3) 


28 (2) 


16 (1) 


7.5 (9) 


J/w 


0.946 (3) 


0.885 (4) 


4.7 (4) 


0.11(5) 


0 (0) 


0 (0) 



Table 5: Staco+MuTag and Muid+MuGirl efficiencies and fake rates. The samples and algorithms 
are described in the text. Efficiencies are presented both for all found muons and for those with 
a good truth match. Both are calculated for truth muons with |tj | < 2.5 and p T > 10 GeV/c. The 
fake rates are presented for a variety of p T thresholds. 



The merge provides only a small improvement in the Staco efficiencies and a substantial increase in 
the fake rates (factor of about four). For Muid, the efficiency gains are more substantial: the indirect 
tt efficiency increases by 6% and the J/\jf by 15%. The fake rates are increased by a factor of five, 
i.e. slightly above the MuGirl rates. Overall, the Muid+MuGirl performance is very similar to that 
of Staco+MuTag. In both cases, we see the tagging algorithms do provide the significant efficiency 
improvement for p T below 10 GeV/c. 

10 Summary 
10.1 Present status 

The starting point for most ATLAS analyses are the combined muons, i.e. those muons constructed by 
combining tracks found independently in the inner detector and muon spectrometer. Their momentum 
resolution and fake rate (with appropriate quality cuts) are both significantly better than muons recon- 
structed from either the spectrometer alone or muons identified by tagging inner detector tracks. In tt 
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Figure 14: Muon efficiencies and fake rates for Staco+MuTag (left) and Muid+MuGirl (right) as 
functions of true X] in it at low (top) and high (bottom) luminosity. In each efficiency plot, the 
upper curve (blue) is the efficiency to find the muon while the lower curve (green) additionally 
requires a good match (D eva < 4.5) between reconstructed and true track parameters. The muon 
selection is described in the text. The efficiency is calculated for true pj > 10 GeV/c. The fake 
rates are presented for a variety of pj thresholds. 
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Figure 15: Low-pr muon rinding efficiencies for combined muons alone and combined plus 
tagged for the Staco (left) and Muid (right) collections. Results are show for the it indirect se- 
lection. The other samples show similar behavior but have much poorer statistics at low-py The 
efficiency is calculated for muons with \rj\ < 2.5. 



events, for muons from W —> jXV with \r\ \ < 2.5, the Staco combined muon efficiency is 94% with most 
of the loss coming from regions of the spectrometer where the detector coverage is thin. The efficiency 
falls by a few percent when the muon transverse momentum reaches the TeV scale where it is much 
more likely that a muon will radiate a substantial fraction of its energy. The it rate for fakes is a few per 
thousand events for pj > 20 GeV/c and this can be reduced by an order a of magnitude (with a 2% loss 
in efficiency) by cutting on the muon quality (Xmatcf)- The performance of the Muid algorithm is only 
slightly worse for it but it is significantly less robust, losing additional efficiency at low-pr and high-pr 
and when luminosity background is added. 

The combined muons can be supplemented with the standalone muons to extend the Tj coverage to 
2.7 and to recover the percent or so efficiency loss in combination. We do not report on this merge 
but it is clear from the standalone results that the fake rates will increase significantly especially when 
luminosity background is present. In the case of Moore, the fake rate is likely intolerable. 

We find that merging with MuTag provides only slight improvement to the Staco efficiency with a 
significant increase in fakes. This may reflect the success of Staco more than deficiencies in MuTag. 
MuGirl is able to improve the Muid efficiency, so that the merge Muid+MuGirl has performance similar 
to Staco or Staco+MuTag. By itself, the MuGirl efficiency is somewhat less than that of Staco especially 
for high-pr muons, and the fake rates are substantially higher. 

10.2 Future 

The results presented here reflect the status of the ATLAS software used to reconstruct (Monte Carlo) 
production data in 2007. Work continues both to improve the algorithms described here and to add 
new ones. The high-luminosity fake rate for Moore is being addressed by introducing timing cuts and 
investigating alternative approaches to the pattern recognition. The latter also has the goal of reducing the 
number of false hit assignments. Combined muons with large Xmatch are being studied to see if a second 
stage of pattern recognition can reduce the efficiency loss or resolution tails. Efforts are underway to 
improve or replace the existing spectrometer- tagging algorithms; in particular, code is already in place 
to extrapolate to additional stations enabling recovery of much of the standalone/combined efficiency 
loss near |tj| = 1.2. Two calorimeter-tagging algorithms have been developed and offer the possibility 
of recovering much of the efficiency loss near T] = 0. Improvements in modularity will make it possible 
to mix components from the different algorithms, (e.g. to use Muid to combine Muonboy muons) and 
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enable algorithms to share common tools such as those being developed to calculate energy loss, refit 
muon tracks, and repair muons with poor fit quality. 
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Muons in the Calorimeters: 

Energy Loss Corrections and Muon Tagging 

Abstract 

The muon spectrometer is the outermost subdetector of the ATLAS detector, 
beginning after a muon has traversed 100 radiation lengths of material. Muon 
momentum measurements must be corrected for energy loss in the calorimeters 
and the inert material before the muons reach the muon spectrometer. Energy 
lost in the calorimeters can be estimated from parameterizations or from a 
measurement of the energy deposited in the calorimeters. In addition, the muon 
energy loss measurement can be used to tag muons not reconstructed in the 
muon spectrometer due to inefficiencies, spectrometer acceptance or their low 
momenta. 

In this document we discuss different algorithms developed to perform the en- 
ergy loss correction in the muon reconstruction. We compare the performance 
of the muon reconstruction algorithms before and after the energy loss correc- 
tion is applied. In addition, we describe the muon tagging algorithms, based 
on measurements obtained in the calorimeters, and contrast their performance 
in different simulated data samples. 



1 Introduction 

Muons traverse the inner detector and the calorimeters in the ATLAS experiment before reaching the 
muon spectrometer. The material thickness traversed by the muons before reaching the muon spectrom- 
eter is over 100 radiation lengths (Xo) (see Figure 1). By passing through this material, muons undergo 
electromagnetic interactions which result in a partial loss of their energy. As over 80% of this material is 
in the instrumented areas of the calorimeters, the energy loss can be measured. Understanding how this 
energy loss happens, its magnitude and how to measure it is essential to obtain the best performance in 
muon reconstruction and identification. 

In this document we discuss the aspects of muon reconstruction and identification that make use of all 
available energy loss information in the ATLAS software. The Muonboy [1] and Muid [2] algorithms for 
muon reconstruction take into account internally the calorimeter material effects for tracks already found 
in the muon spectrometer. Algorithms that calculate the energy loss and transport the track anywhere in 
the detector are also available. The detailed computation of this correction is the main focus of Sections 2 
and 3, while Section 4 is devoted to the use of the energy loss information for muon identification. This 
note gives an overview of the current algorithms and techniques which will be used for the reconstruction 
of the first data. 



2 Algorithmic Treatment of Material Effects 

When a muon traverses the detector material, it undergoes successive deflections and a loss of energy. 
The total angular deflection is an accumulation of many small angle deflections, referred to as multiple 
(Coulomb) scattering; and it is well approximated by a gaussian distribution that is centered at a zero 
mean value. The expected root mean square of the projected scattering angle can be described by the 
formula of Highland [4] : 

. = L^6Mey^ 0Q3 
pep 
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Figure 1: Material distribution before the muon spectrometer in ATLAS as a function of T7 [3]. The 
material is expressed in radiation lengths (X 0 ). 



where t is the thickness of the traversed material in units of the radiation length X 0 . The energy loss, on 
the other hand, is non-gaussian. Throughout this document, we will study the energy loss of muons going 
through the ATLAS detector in detail. The discussion of multiple scattering, however, will be limited 
to this section, because it is simpler and it will be based on the Highland formula shown above. The 
thickness in the formula above is calculated from the geometry description for all algorithms. However, 
there are small differences in how the multiple scattering information is used in the track fitting. These 
differences are explained below, as the different track fitting strategies are discussed. 

In ATLAS track reconstruction applications, two main track fitting strategies are deployed: the classi- 
cal least squares method and the progressive method that corresponds to the Kalman filter formalism [5]. 



The least squares fit: In the global fitting technique, most material effects are directly integrated into 
the X 2 function (the energy loss may or may not be fitted). This is done by introducing the deflection 
angles and, possibly, energy losses as additional parameters to the fit. 

The contribution of the fitted scattering angles to the % 2 function has to be regulated by the expected 
range of the scattering process in the traversed material. Scattering effects are applied to the muon on two 
surfaces along its trajectory, because the scattering effects from material bulk can be accurately described 
by two scattering centers. The Muonboy algorithm iterates its calculation of the muon trajectory in a 
complex geometry with many scattering centers. The number of scattering centers is reduced to two 
after the iteration. The iteration allows for a calculation of the material traversed after the trajectory has 
been modified to account for the energy loss. On the other hand, the Muid algorithm and the ATLAS 
tracking global-^ 2 fitter [6] currently use a map of the material from the Monte Carlo on two surfaces. 
The 77 coordinate of the track on these two surfaces is then used to calculate the amount of material 
traversed by the muon. 

The least squares fit with the calorimeter energy loss as a fitted variable can only be performed in 
a combined fit including measurements of both the muon spectrometer segments and the inner detector 
hits. If no inner detector hits exist, the treatment of the energy loss effects is fundamentally equivalent 
for both a least-squares-inspired algorithm and a Kalman-filter-inspired algorithm. 

To minimize the number of degrees of freedom in the least-squares fit, the number of fitted variables 
must be minimized. In particular, one energy loss variable in a track fit is preferable. This does not mean 
that the trajectory cannot be affected smoothly by the energy loss, because an extended set of material 
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layers can be calculated using a detailed detector description as in Figure 2 and the fitted energy loss 



Figure 2: Left: 3-D view of the tracking geometry up to the muon spectrometer. Right: Example 
set of energy loss update layers (shown as additional surfaces with respect to the figure on the left; 
update positions shown as squares) created during the extrapolation of a track (black line) through the 
calorimeter. 

distributed proportionally among these layers. This is done for the purpose of transporting the track 
through the calorimeters inside the Muonboy algorithm. An alternative approach is currently taken in the 
Muid and ATLAS tracking global-^ 2 fitter. These algorithms apply the energy loss to the track on one 
surface inside the calorimeters hence approximate the rate of change of curvature within the calorimeter 
volume (i.e.: they assume the momentum of the muon changes only at one place along its trajectory). 

The effect of this simplification on the muon combined reconstruction is expected to be small, be- 
cause the energy loss only affects the trajectory of the track if the track is bending. However, the area 
where most of the energy loss happens (the calorimeter) has a small magnetic field. A quantitative es- 
timate of the effect of the simplification can be obtained by comparing the multiple scattering effects 
on the track and the bending that the track undergoes from its entrance in the calorimeters to its exit. 
The bending is shown in Figure 3. Equation 1 indicates that a 10 GeV (100 GeV) muon going through 
the calorimeters scatters following a gaussian distribution with RMS «14-20 (1.4-2) milliradians (with 
Xq = 100-200 from Figure 1). Figure 3 shows that the deviation of the track due to the magnetic field is 
comparable to the deviation expected from multiple scattering at least in the <f) direction. Algorithms that 
use one surface to apply the energy loss correction approximate the mean trajectory inside the calorime- 
ter with a systematic offset that increases with depth to a maximum value at the calorimeter centre. The 
magnitude of this offset is proportional to the magnetic bending scaled by the fraction of energy loss to 
muon energy. It thus remains small with respect to the uncertainties caused by Coulomb scattering. 

Progressive fitting techniques: In progressive fitting techniques, the particle-detector interaction is 
part of the transport process of the track to the next surface where a hit may exist (measurement sur- 
face). The transported track can then be compared (and updated) with the measurement obtained on the 
next measurement surface. In this transport process magnetic field and material effects (multiple scat- 
tering and energy loss) are applied to the parameterization of the track. Multiple scattering is applied 
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Figure 3: Calculated difference between the calorimeter entrance and exit coordinates (Atj, left, and A<f>, 
right) for 10 GeV (solid squares) and 100 GeV muons as a function of Tjo of the muon at the interaction 
point. The lack of mirror symmetry is due to the combined effect of the return flux of the solenoid 
(unidirectional) and the toroidal magnetic field (symmetric around the z axis). 



by increasing the uncertainties of the angular direction variables, while energy loss effects are taken into 
account in two ways. A mean energy loss is applied to the track parameterization, and then an uncer- 
tainty is added to the corresponding covariance matrix term to account for the stochastic behavior of the 
energy loss. The resulting increased covariance terms degrade the track prediction for the subsequent 
measurement surface. 

Progressive fitting tools rely, therefore, on a precise description of the detector material and magnetic 
field. An example is shown in Figure 2. The illustration on the right of Figure 2 shows material layers 
that are calculated dynamically during the extrapolation process into the calorimeter active volumes. 

In ATLAS, the stand-alone muon reconstruction algorithms (MOORE [7] and Muonboy) use ex- 
clusively the least-squares formalism to fit tracks in the muon spectrometer. On the other hand, the 
inner detector reconstruction uses by default the progressive fitting techniques. In combined muon re- 
construction, when the hits in the inner detector are used in combination with the muon spectrometer, 
the Muonboy-based algorithm (STACO) combines tracks reconstructed in the inner detector and muon 
spectrometer independently, therefore being a mixture of the tracking fit and the least-squares fit carried 
out by Muonboy. On the other hand, the MOORE-based algorithm (Muid) performs a least-squares fit in 
both subsystems. 

3 Corrections for the Energy Loss from the Beam Pipe to the Muon Spec- 
trometer 

In this section we describe how the energy loss is calculated from GEANT4 [8] based parameterizations 
and/or measurements of the energy loss by the calorimeters. Muon isolation is also discussed in this 
context. Finally, the energy loss corrections are validated as part of the muon reconstruction algorithms. 

3.1 Parameterizations of the Energy Loss 

Relativistic muons going through matter lose energy mostly through electromagnetic processes: ion- 
ization, e + e~ pair-production, and bremsstrahlung. Ionization energy loss dominates for muons of 
momenta <100 GeV. Bremsstrahlung and e + e~ pair-production energy losses are often jointly referred 
to as radiative energy losses. Higher energy muons lose energy mostly through radiative energy losses. 
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However, when passing through materials made of high-Z elements the radiative effects can be already 
significant for muons of energies «10 GeV [9]. 

Ionization energy losses have been studied in detail, and an expression for the mean energy loss per 
unit length as a function of muon momentum and material type exists in the form of the Bethe-Bloch 
equation [10]. Other closed-form formulae exist to describe other properties of the ionization energy loss. 
Bremsstrahlung energy losses can be well parameterized using the Bethe-Heitler equation. However, 
there is no closed-form formula that accounts for all energy losses. Nevertheless, theoretical calculations 
for the cross-sections of all these energy loss processes do exist. With these closed-form cross-sections, 
simulation software such as GEANT4 can be used to calculate the energy loss distribution for muons 
going through a specific material or set of materials. 

The fluctuations of the ionization energy loss of muons in thin layers of material are characterized 
by a Landau distribution. Here "thin" refers to any amount of material where the muon loses a small 
percentage of its energy. Once radiative effects become the main contribution to the energy loss, the 
shape of the distribution changes slowly into a distribution with a larger tail. Fits to a Landau distribution 
still characterize the distribution fairly well, with a small bias that pushes the most probable value of the 
fitted distribution to values higher than the most probable energy loss [11]. These features are shown 
for the energy loss distributions of muons going from the beam-pipe to the exit of the calorimeters in 
Figure 4. 




E,„ss(MeV) ' E loss (MeV) 



Figure 4: Distribution of the energy loss of muons passing through the calorimeters (\r]\ < 0.15) as 
obtained for 10 GeV muons (left) and 1 TeV muons (right) fitted to Landau distributions (solid line). 

As can be seen in Figure 4 the Landau distribution is highly asymmetrical with a long tail towards 
higher energy loss. For track fitting, where most of the common fitters require gaussian process noise, 
this has a non-trivial consequence: in general, a gaussian approximation has to be performed for the 
inclusion of material effects in the track fitting [12]. 

In order to express muon spectrometer tracks at the perigee, the total energy loss in the path can be 
parameterized and applied to the track at some specific position inside the calorimeters. As the detector 
is approximately symmetric in (j), parameterizations need only be done as a function of muon momentum 
and T7 . The r\ -dependence is included by performing the momentum parameterizations in different tj 
bins of width 0.1 throughout the muon spectrometer acceptance (|tj| < 2.7). The dependence of the most 
probable value of the energy loss, E^ , as a function of the muon momentum, p^, is well described by 

£ kT (Pn ) = a o PV + a T V ln P» + a 2 Py Pii > (2) 

where a™ pv describes the minimum ionizing part, af pv describes the relativistic rise, and a™ pv describes 
the radiative effects. The width parameter, (Ji oss , of the energy loss distribution is well fitted by a linear 
function O\ 0 ss(Pn) = a° + a°Pn- Some of these fits are illustrated in Figure 5. This parameterization is 
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Figure 5: Parameterization of the (left) and Oi oss (right) of the Landau distribution as a function of 
muon momentum for different 77 regions. One sees a good agreement between the GEANT4 values and 
the parameterization. 



used as part of the Muid algorithm for combined muon reconstruction [3]. 

An alternative approach exists in the ATLAS tracking. In this approach, the energy loss is param- 
eterized in each calorimeter or even calorimeter layer. The parameterization inside the calorimeters is 
applied to the muon track using the detailed geometry described in Section 2. 

The most probable value and width parameter of the Landau distribution are not affected by radiative 
energy losses in thin materials in the muon energy range of interest (~5 GeV to a few TeV). This justifies 
treating energy loss in non-instrumented material, such as support structures, up to the entrance of the 
muon spectrometer as if it was caused by ionization processes only. The most probable value of the 
distribution of energy loss by ionization can be calculated if the distribution of material is known [13]. 
Since material properties are known in each of the volumes in the geometry description used, it is easy 
to apply this correction to tracks being transported through this geometry. 

For the instrumented regions of the calorimeters, a parameterization that accounts for the large radia- 
tive energy losses is required. To provide a parameterization that is correct for the full r\ range and for 
track transport inside the calorimeters, a study of energy loss as a function of the traversed calorimeter 
thickness, x, was performed. Two parameters that characterize fully the pdf of the energy loss for muons 
were fitted satisfactorily using several fixed momentum samples as 

BZ' a (*,Pri = b^(p^x + b^(p^x\n X . (3 ) 

The momentum dependence of the bj{p^) parameters was found to follow the same form as in Equa- 
tion 2. Fits for some of the absorber materials are shown in Figure 6. These parameterizations have 
been validated over the 77 range from -3 to 3. A direct comparison of the most probable energy loss in 
GEANT4 simulation and in the geometry of the ATLAS tracking algorithms is shown in Figure 7 for 
muons propagating from the beam-pipe to the exit of the electromagnetic calorimeters and to the exit of 
the hadronic calorimeters. 

3.2 Measurements of the Energy Deposited in the Calorimeters 

In this section the measurement of the muon energy loss in the calorimeters is discussed. Understanding 
this measurement is important because it allows for an improvement in the energy loss determination. 
This section provides a basic description of the ATLAS calorimeters and their measurements which is 
important for understanding the topics discussed in Sections 3.3, 3.4 and 4. 
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Figure 6: Left: Fit to the most probable value and width of the Landau distribution as a function of 
thickness of iron for muons of momentum 200 GeV. The fitting function has the form box + bixlnx. 
Right: Fit to the parameters bo and b\ for the most probable value of the energy loss in lead as a function 
of muon momentum. 




Figure 7: Most probable value of the energy loss as parameterized in the geometry of the ATLAS tracking 
(points) and in GEANT4 for muons of momentum 10 GeV (left) and 1 TeV (right) as a function of 
pseudorapidity. The solid line and points correspond to the energy loss of muons propagating from the 
beam pipe to the exit of the hadronic calorimeters. The filled histogram and hollow points correspond to 
the energy loss of muons propagating from the beam pipe to the entrance of the hadronic calorimeters. 

3.2.1 Muons in the Liquid Argon Calorimeters 

The electromagnetic calorimeter is a lead-liquid argon sampling calorimeter with accordion shaped ab- 
sorbers and electrodes, covering the \r\ \ range up to 3.2. 

The hadronic end-cap calorimeter, also based on liquid argon technology, covers the | tj | range from 
1.5 to 3.2. The absorbers are made of parallel plates of copper. The total thickness of the hadronic 
end-cap calorimeters is 10 interaction lengths (Ai nt ). The measurement of a muon signal in the hadronic 
end-cap is complicated because the noise levels are high compared to the muon signal itself [14]. 

The detailed geometrical description of the LAr calorimeters is presented in [15]. Only the aspects 
relevant for muon studies will be recalled. Both barrel and end-cap calorimeters possess up to three 
longitudinal samplings (called strip, middle and back). They are completed by a liquid argon presampler 
detector to estimate the energy lost in upstream material. The signal and noise distributions in two 
longitudinal calorimeter samples in the barrel of the electromagnetic calorimeter are shown in Figure 8 
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Figure 8: Distribution of the muon energy deposited in one electromagnetic calorimeter cell by 150 GeV 
muons, fitted to a Landau function convolved with a gaussian [16]. The gaussians on the left of each 
plot are the distributions of the noise. Left (right): energy deposit in a cell belonging to the first (middle) 
longitudinal sampling traversed by the muon. The energy is the sum of the energies of the (up to two) 
cells belonging to the muon cluster (see Section 4). The data were collected in the 2004 Combined Test 
Beam. 

for 150 GeV muons. Further discussion of how these distributions were calculated from the combined 
test beam data can be found in Section 4. The signal can be separated from the noise, especially in the 
middle sampling. In addition, comparisons between GEANT4 simulation and test beam data show that, 
despite the high noise in the first sampling, the electromagnetic calorimeter can measure reliably the 
energy lost by muons traversing it. 

3.2.2 Muons in the Tile Calorimeter 

The Tile Calorimeter (TileCal) [17] is a plastic scintillator/steel sampling calorimeter, located in the 
region |tj| < 1.7; it is divided into three cylindrical sections, referred to as the barrel and extended 
barrels. It extends from an inner radius of 2.28 m to an outer radius of 4.25 m. Modules are segmented 
in tj and in radial depth. In the direction perpendicular to the beam axis, the three radial segments span 
1.5, 4.1 and 1.8 Ai nt in the barrel and 1.5, 2.6, 3.3 Ai nt in the extended barrels. The resulting typical 
cell dimensions are Atj x A0 = 0.1 x 0.1 (0.2 x 0.1 in the outermost layer). This segmentation defines a 
quasi-projective tower structure. 

The TileCal response to high-energy muons follows a Landau-type distribution with characteristi- 
cally long tails at high energies caused by radiative processes and energetic 5-rays. This response has 
been extensively studied in test beams with 180 GeV muons incident at projective angles. The peak 
values of the muon signals vary by more than a factor of two in projective geometry. An example of the 
muon signal, expressed in units of collected charge (pC), is shown in Figure 9, both for the whole tower 
and the last radial compartment. The signal is well separated from the noise, with a signal-to-noise ratio 
(S/N) of ~ 44 and ~ 18 respectively. The muon response was shown to be uniform in X] to within 1.9 % 
over all modules tested. The energy deposition spectrum observed in the TileCal test beams is within a 
few percent of the GEANT4 prediction. 
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Figure 9: Example of the isolated muon signal as measured at tj = 0.35 in the whole tower (left) and in 
the last radial compartment (right). The narrow peaks represent the corresponding noise. The energy is 
measured in units of collected charge. For a muon 1 pC corresponds to roughly 1 GeV, yielding a noise 
width of roughly 40 MeV for the last radial compartment. The data were collected in test beams in 2002 
and 2003. 



3.2.3 Measurements in Muon Algorithms 

The previous sections discussed the reconstruction of energy depositions at the cell level. To provide 
estimates of muon energy loss and muon isolation, several cells need to be used along the muon trajectory. 
In addition, muon calibration factors such as the e/jJ, ratio for minimum ionizing muons, need to be 
adjusted in order to find the correct energy deposition. 

The classical method for measuring the energy loss of muons in calorimeters is based on the concept 
of a calorimeter tower, where the muon is assumed to follow a straight trajectory inside the calorime- 
ters. A tower is defined as all calorimeter cells within a cone of fixed radius AR = \J kf] 1 +A0 2 cen- 
tered around the muon trajectory. Motivated by this concept, but with a few muon- specific changes, the 
Straight Line method has been developed as part of the Muid algorithm for muon reconstruction. The 
Straight Line method calculates the coordinates of the relevant track at half the depth of the calorimeter 
by transporting the track to that position. These coordinates are used to calculate the calorimeter cells 
included in the measurement cone. 

In the Track Update method, the muon trajectory through the calorimeters is extrapolated either from 
inner detector tracks or muon spectrometer tracks. Given this trajectory, the center of the measurement 
cone is recalculated at each calorimeter layer. Figure 10 shows a qualitative comparison between the 
Straight Line method and the Track Update method. 

Figure 3 illustrated the quantitative differences in the muon trajectories from the two methods. The 
difference between the Straight Line and Track Update methods can be estimated by comparing the 
coordinates of the muon at the entrance of the electromagnetic calorimeters and at the exit of the hadronic 
calorimeters. The difference is clearly negligible for muons of pj > 100 GeV, even though it can be as 
big as a third of a hadronic cell width for 10 GeV muons. 

In Figure 11, a comparison between the measured energy and the true energy loss is shown for the 
Track Update method. The average measured transverse energy loss in a cone of 0.2 around the muon 
trajectory for single muons of momentum 10, 100 and 300 GeV is shown as a function of T]. In the same 
r\ bins the average true (as obtained from the GEANT4 full simulation) transverse energy lost between 
the interaction point and the entrance of the muon spectrometer is shown. The energy lost by the muons 
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Figure 10: Illustration of the Straight Line (left) and Track Update (right) concepts. 




Figure 1 1 : Comparison between the average measured transverse energy deposition (points) and true 
energy lost between the beam-pipe and the muon spectrometer (line) for muons of momentum 10 GeV 
(left), 100 GeV (center) and 300 GeV (right). The errors shown are statistical only. 



is well estimated by measurements in the calorimeters. The region around \t] | = 1 corresponds to the 
crack in the TileCal. Consequently, the measurement underestimates the energy loss in that region. 

3.3 Muon Isolation 

The previous studies demonstrate the capabilities of the calorimeters to measure the energy lost by 
muons. However, these studies were all performed with single muon samples. In real physics sam- 
ples, muons do not reach the calorimeters alone, but are often accompanied by additional particles that 
deposit energy in the cells around the muon trajectory and contaminate the muon energy loss measure- 
ment. Therefore, in order to determine the energy loss of such muons, isolation criteria must also be 
defined and optimized for maximum reliability in the energy measurement. 

Isolation criteria can be divided into two categories: calorimeter-based and track-based. In muon 
reconstruction an area is defined around the muon trajectory with a minimum and maximum radius for 
the purpose of determining calorimeter isolation. This achieves the puipose of excluding the cells where 
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the muon deposits its energy. The size of this inner radius needs to be optimized to collect most of the 
energy lost by the muon but as little energy as possible from other particles. The energy deposited in the 
annulus between the inner radius and the outer radius, where the muon deposits little energy, is what the 
following paragraphs refer to as isolation energy. The optimal radii that define this annulus depend on 
the underlying event and luminosity. However, the muon shower is contained in a small cone of radius 
0.1 [18]. Therefore, a choice of an inner radius much bigger than 0.1 does not achieve the purpose of 
collecting the energy deposited by the muon, and it adds noise to the measurement. 

A study to determine possible isolation criteria [3] has been performed on a fully simulated it sample, 
where the W bosons were forced to decay into a muon and a neutrino. Muons produced by the semi- 
leptonic decays of b quarks tend to be non-isolated, while those from W decays tend to be isolated. In 
Figure 12 the distribution of the isolation energy for muons originating from quarks and Ws is shown 
for the electromagnetic and hadronic calorimeters. The isolation energy inner and outer radii are 0.075 
(0.15) and 0.15 (0.30) for the electromagnetic (hadronic) calorimeters, respectively. This reflects their 
different granularities. The electromagnetic isolation energy is a more powerful discriminant for the 
annuli radii chosen. 




Figure 12: Distribution of the isolation energy in the electromagnetic (0.075 < AR < 0.15) (left) and 
hadronic calorimeters (0.15 < AR < 0.30) (right) in muons from a it sample without pile-up. 

Based on this figure, for the purpose of the rest of the studies in this section, a cut of 2 GeV on electro- 
magnetic isolation was used to discriminate isolated muons from non-isolated muons. An additional cut 
of 10 GeV in hadronic isolation was used, even though this cut does not help rejecting non-isolated muons 
in the vast majority of events. These cuts were relaxed slightly with increasing muon pj to account for 
a possible slight increase of the transverse radius of the shower caused by muons in the calorimeters. 

Tracking-based criteria can be used to determine isolation cuts independent of calorimeter-based 
criteria. If used together, they can help eliminate non-isolated muons belonging to highly-collimated jets 
that escape being identified by calorimeter-based criteria. In Figure 13 the number of inner detector tracks 
around the muon are plotted for muons from quarks and Ws in the same samples used for Figure 12. 
These distributions were obtained for muons that passed the calorimeter-based isolation cuts mentioned 
above. 

The production rate of low-pr non-isolated muons from &-quark decays is expected to be very sig- 
nificant. At the same time, a typical muon from aWorZ decay, will have a pj of about 40 GeV. Thus, 
it is unlikely that low momentum muons will be isolated in a it sample. For this reason, all muons with 
a pj of less than 15 GeV were automatically tagged as non-isolated and are excluded. In most cases, the 
muon originating from a W boson is not accompanied by other tracks in the inner detector. In fact, the 
only track found in the track isolation cone is, essentially, the muon itself as reconstructed in the inner 
detector. In contrast, several inner detector tracks can be found close to muons originating from quarks. 
For an isolation cone of A/?=0.2, the most probable value is three tracks, including the muon itself, but it 
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Figure 13: Distribution of the number of inner detector tracks (including the muon track) with AR < 0.2 
around the muon spectrometer track, after the calorimeter isolation and pj threshold cuts are applied to 
muons in a it sample. 

can be much larger. A cut on tracking isolation has been applied that complements the cut on calorimeter 
isolation. This cut constrains an isolated muon track to be accompanied by at most one extra track inside 
the tracking isolation cone. Using the criteria described above (electromagnetic and hadronic calorimeter 
isolation, track isolation and pj > 15 GeV) approximately 0.2% of the muons originating from b quarks 
have an energy loss overestimated by more than 6 GeV. On the other hand, 80% of the muons originating 
from Ws are tagged as isolated. 

All the cuts mentioned above are used by default as isolation criteria in the Muid muon reconstruction 
algorithm, to establish the contamination of the calorimeter measurement. If the cuts were not chosen 
tightly enough, non-isolated muons would exhibit an artificial increase in energy loss. If this measure- 
ment was then used in muon reconstruction, the reconstructed momentum at the interaction vertex would 
be artificially increased. This could significantly deteriorate the momentum measurement. A calorimeter 
measurement of the energy loss will then only make sense if the muon is tagged as isolated. These cuts 
were considered conservative enough that they could be used by default without inducing biases in the 
momentum reconstruction [3]. These cuts have not been studied with pile-up or in other samples with 
an important source of non-isolated muons, like high-p^, bb samples. Studies of this type are impor- 
tant in order to set conservative isolation cuts as default for muon reconstruction involving calorimeter 
measurements. 

In addition, it is worth discussing the relationship between muon isolation in reconstruction and 
muon isolation in physics analyses. While both concepts represent an attempt to determine whether a 
muon is inside a jet, analysis cuts are also decided on the basis of criteria such as efficiency or fake 
rate that are not necessarily important for the momentum reconstruction. It is, however, important to 
emphasize that the optimization of the cuts on reconstruction isolation for specific analyses is possible. 
It requires, nevertheless, a refitting of the track with analysis-specific cuts; and it should, therefore, only 
be attempted when the recovery of the Landau energy loss tails is crucial for the analysis and the standard 
treatment is not adequate. 

For muon tagging, however, isolation criteria can overlap with the criteria derived for specific anal- 
yses. Isolation studies are necessary to provide a reliable muon tag and do not affect the momentum 
reconstruction, because the tracking algorithms are independent of the tagger. Default isolation criteria 
can, for example, be relaxed based on information from the physics sample and the specific analy- 
sis. As an example, Figure 14 shows the rejection on the Zbb background versus the Standard Model 
H(\30 GeV) — >• ZZ* — >• }i + }i~}i + fi~ efficiency using the calorimeter isolation cuts [19]. At this stage, 
after a preselection procedure, the four muon candidates have already been selected. Both absolute and 
normalized (with respect to muon pj) isolation are presented. In these analyses, the isolation energy was 
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Figure 14: Rejection of the Zbb background as a function of the H(\30 GeV) — >• \)X signal efficiency. 
Different radii (0.1 < AR < 0.3) are compared for absolute, left, and normalized (with respect to muon 
Pt), right, calorimeter isolation. No pile up events were simulated. 

defined using a cone of fixed radius. The isolation energy of an event was then defined as the isolation 
energy of the least isolated muon of the event. The optimum cone radius depends on signal efficiency, 
with AR of 0.2 being an efficient choice. 

3.4 Measurement/Parameterization Combination Methods 

To integrate energy loss in tracking algorithms, the energy loss is assumed to be gaussian. If the param- 
eterization is used exclusively to correct for the energy loss, any event in which muons undergo a large 
energy loss will be incorrectly reconstructed. There is, thus, an advantage in using the parameterizations 
together with measurements in the calorimeters to optimize the energy loss reconstruction. Here we 
describe two algorithms developed to use the calorimeter information as well as the energy loss param- 
eterizations for the muon reconstruction algorithms: the Hybrid Method [3] and the Bayesian Method. 
The Hybrid Method is used by default after isolation cuts as part of Muid. The Bayesian Method is used 
if specified by the user as part of Muonboy. By default, Muonboy uses a parameterization of the energy 
loss only. 

The Hybrid Method consists in fully separating the two regions of the Landau distribution: the peak 
region and the tail region. The calorimetric energy loss measurement is used when the energy deposition 
is significantly larger than the most probable value (tail region); otherwise the parameterization is used 
(peak region). The transition point between the two regions is taken as £" mpv + 2 OLandau ■ 

The Bayes Method is based on a statistical combination of the parameterization and the measurement 
in the calorimeters. This combination is performed using Bayes' theorem. This method uses informa- 
tion from the calorimeters, even when the measurement falls in the peak region. When the calorimeter 
measurement falls in the tail, the results provided by this method are similar to those obtained by the Hy- 
brid Method. If the measurements falls in the peak region, the measurement still constrains the energy 
loss pdf, improving the energy loss reconstruction resolution. In addition, this method generalizes the 
selection procedure of the hybrid method, with an event-by-event optimization and for each calorimeter 
subsystem. This generalization allows also, in principle, an automatic improvement in the energy loss 
reconstruction as the calorimeter calibration improves. 

The full validation of these methods in muon reconstruction with all muon reconstruction effects is 
shown in the next section. The most significant of these effects are the intrinsic resolutions of the inner 
detector and muon spectrometer. However, to validate the methods standalone, it is necessary to do it in 
a model free of these reconstruction effects. 
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The Hybrid Method has been validated through the energy loss reconstruction distributions in the 
ATLAS full simulation using the muon kinematics from the simulation. The ratio of the energy loss 
resolution for the Hybrid Method, Ghybrid. with respect to the parameterization alone, (J pmam , is presented 
in Figure 15. The resolution is defined as the square root of the variance of the energy loss resolution. 
For low-pr values the ratio is close to unity, as expected due to the smaller fraction of events in the 
Landau tail. For increasing pj values the ratio decreases, approaching 30% at pj = 1 TeV. Thus, using 
the Hybrid Method results in a significant improvement in the energy loss estimation with respect to the 
parameterization alone. 



Ij, o . . 




Figure 15: Ratio of the energy loss resolution for the Hybrid Method with respect to the parameterization 
alone for single muons. 

In addition, the performance of the Bayesian Method has been studied in a toy model under the 
assumption that the calorimeter calibration is understood. Muons of 1 TeV were shot through a block of 
matter representative of one of the samplings of the hadronic calorimeter. A perfect muon spectrometer 
was assumed as it is done for the Hybrid Method above. Figure 16 shows some results from this study that 
demonstrate the potential of the Bayesian Method to reconstruct energy loss. The use of the measurement 
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Figure 16: Demonstration of the potential of the statistical method to reconstruct the energy loss. The 
left plot shows the bias in the energy loss reconstruction, while the right plot shows the Ef^ — E^° dis- 
tribution. Both plots compare the energy loss reconstruction using the parameterization only (triangles, 
dotted line), the measurement only (filled circles, solid line) and both statistically combined through 
B ayes' theorem (open circles, dashed line). 

by itself biases the energy loss reconstruction. The Bayesian Method, on the other hand, shows no 
biases. Incidentally, there are no biases in the energy loss reconstruction if the muon spectrometer 
measurement is coupled to the energy loss reconstruction in these studies. In addition, the — E r ^° 
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distributions show that the resolution obtained with the Bayesian Method is better than that obtained 
using the parameterization or the measurement only. 

These studies prove the potential of these two methods. Now their performance is analyzed when 
they are used as part of the reconstruction software. 



3.5 Impact of the Energy Loss Corrections in Reconstruction 

Energy loss estimates must be validated as part of the muon reconstruction algorithms. Effects such as 
the resolution of muon reconstruction, biases intrinsic to the track transport or the effects of gaussian 
assumptions will be coupled to the energy loss reconstruction. However, the studies shown here are still 
important for understanding the energy loss correction. They also allow for an investigation of which 
data samples are most sensitive to an incorrect estimation of the energy loss. 

In Figure 17, two parameters that characterize the gaussian distributions 1 / pf co — 1/ p l ™ e are shown. 
In these plots, the label "MS" corresponds to tracks from fits in the muon spectrometer only. The label 
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Figure 17: Left: Muon reconstruction bias for different algorithms as a function of muon pj. Right: 
Muon reconstruction resolution for different algorithms as a function of muon pj. These plots were 
produced with the Muonboy/STACO algorithms for muon reconstruction [1], but similar performance is 
obtained with the MOORE/Muid algorithms [2,7]. 

"MS+^ioss correction" refers to tracks reconstructed at the muon spectrometer and transported to the 
interaction point (IP), applying an energy loss correction (parameterized or hybrid). For Figure 17 the 
energy loss correction was calculated using the Muonboy parameterization only. The label "MS+ID" 
refers to tracks reconstructed with the muon spectrometer and the inner detector. To obtain these com- 
bined tracks, the energy loss correction needs to be considered in the fit. The distributions are calculated 
in 1/pr-space because the muon spectrometer reconstruction and inner detector reconstruction have 
gaussian fluctuations in 1/ pr- These plots refer to muons reconstructed in the barrel (\r]\ < 1.0) of the 
muon spectrometer. In the left plot, the bias in the reconstruction is shown, defined as the mean of the 
distribution 




(4) 



with //™ e ' being the true pr of the muon at the IP. Clearly, in the absence of an energy loss correction, 
this reconstruction can be highly biased. Such a bias also causes a degradation in the resolution. The 
application of an energy loss correction reduces this bias and improves the resolution. Further bias 
reduction and resolution improvement is obtained by using the inner detector together with the muon 
spectrometer to reconstruct the muon track. However, the combination of inner detector tracks and muon 
spectrometer tracks is only possible if a proper energy loss correction exists. 
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Figure 18 shows the invariant mass resolution (M^° -M^" e ) for Z —> jU/x and Z'(1000 GeV) — > /x/x 
samples. The events include a generation cut that requires the pj of both decay muons to be above 7 GeV 




Figure 18: Left: Reconstruction resolution of the Z peak for different algorithms. Right: Reconstruction 
resolution of the Z' peak for a Z' —>■ jiji of mass 1 TeV for different algorithms. These plots were 
produced with the MOORE/Muid algorithms for muon reconstruction [2,7], but similar performance is 
obtained with the Muonboy/STACO algorithms [1]. 

for Z decays and above 20 GeV for Z' decays. Inner detector and muon spectrometer tracks were required 
for both reconstructed muons. The effect of the energy loss correction is most significant in the Z-mass 
reconstruction. If the energy loss is not included a shift of about 7 GeV in the mass peak and a significant 
deterioration in the resolution are visible. These effects are much less pronounced in the reconstruction 
of the Z' peak. This happens because the more energetic muons from the Z' lose a smaller fraction of 
their total energy as they pass through the calorimeters. 

An additional improvement in the Z and Z' resolution is possible if the energy loss correction uses 
the calorimeter measurement [20]. This is demonstrated in Figure 19, where a comparison is shown 




Figure 19: Left: Reconstruction resolution of the Z peak for an algorithm using muon spectrometer 
standalone tracks and the parameterized energy loss correction (filled histogram) and an algorithm using 
a combination of a parameterization and the calorimeter measurement for the energy loss correction 
(empty histogram). Right: Reconstruction resolution of the Z' peak for a Z' — > of mass 1 TeV for the 
same algorithms. 

for the muon spectrometer reconstruction with energy loss correction with and without the calorimeter 
measurement. The same samples as for Figure 1 8 were used. A few events are recovered from the tails 
and populate the peak region. This results in « 8% resolution improvement when the inner detector 
hits are not used. If the inner detector hits are used the resolution improves by pa 4%, showing that 
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the combined fit is less sensitive to the energy loss correction. These plots were produced with the 
MOORE/Muid algorithms for muon reconstruction using the Hybrid Method (see Section 3.4). A similar 
performance is expected using the Bayesian Method, implemented in the ATLAS tracking. 

4 Tagging of Muons in the Calorimeters 

In this section the different calorimeter-based muon identification algorithms are described. Section 4.1 
provides a detailed description of the algorithm that is currently part of the standard reconstruction; and 
Section 4.2 illustrates its performance in different physics samples. 

These algorithms have been developed with the main goal of complementing the muon spectrometer 
in two ways: recovering muons with low transverse momentum (jpj = 2-5 GeV), and in the regions of 
limited spectrometer acceptance (especially in the tj ~ 0 region). For completeness, algorithms used for 
commissioning or triggering are also discussed. 

There are two types of calorimeter-based muon tagging algorithms. Their main difference lies in 
how they initiate the muon search. The calorimeter-seed algorithms search for muons looking at the 
measured energy. Cells with energy depositions inside some energy range are used as seeds. The lower 
limit of this range is known as the initiation threshold. The cluster of cells used to identify the muon is 
then built up by adding cells around the seed cell whose energy is above a second lower threshold, so 
called continuation threshold. At the end of the clustering, the tj and 0 directions of the reconstructed 
cluster can be used to match a track in the inner detector. There are two algorithms that correspond to 
this description: 

• LArMuID is based on a topological clustering algorithm used by ALEPH [21]. A topological clus- 
tering algorithm groups neighboring cells whose energy is above a given threshold. Therefore, 
the resulting clusters have a variable number of cells. This algorithm was used to find muons 
using the electromagnetic calorimeter data during the test beam and the cosmic commissioning 
data analysis. It builds the cluster from a seed cell in the middle sampling of the electromagnetic 
calorimeter. Then, it creates the cluster adding another cell (if any) adjacent in 0 above the con- 
tinuation threshold. Due to the accordion structure of the electromagnetic calorimeters there are 
no more than two adjacent cells in 0 that can share the muon signal, thus the clusters consist of at 
most two cells. The efficiency was measured with a muon beam during the combined test beam 
as the fraction of events with a reconstructed muon cluster. The efficiency and the probability to 
generate a fake muon from noise fluctuations were evaluated as a function of both thresholds. The 
clustering algorithm inherently biases the energy reconstruction, so the lower the thresholds the 
better the estimation of the reconstructed energy for the muon. The spectrum of energies collected 
has been compared to and shown agreement with GEANT4. 

• TileMuId is simple and fast and is used for triggering purposes. Its clustering methods are sim- 
ilar to those of LArMuID. This algorithm, however, runs by default as part of the reconstruction 
software. It starts with a search for a "candidate" muon in the cells belonging to the last TileCal 
sampling, where the muon gives the clearest signature, due to the screening effect of the two pre- 
vious samplings. If the measured energy is between a lower and and upper threshold, it uses the 
tj and 0 coordinates of the cell to look for another cell with energy within the same thresholds in 
the central sampling. If both searches are successful it looks for a third cell with energy within the 
thresholds in the first sampling. When cells with energies within the thresholds are found in all 
three samplings, the candidate is confirmed to be a muon. Performance studies of TileMuId can 
be found in [22]. 

There are two track-seed algorithms that extrapolate inner detector tracks through the calorimeter 
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identifying those matching the energy deposition pattern of a muon. The track-seed algorithms use no 
tracking information from the muon spectrometer. The first track-seed algorithm, CaloMuonTag, will be 
described in detail in the next section. 

The second track-seed algorithm, CaloMuonLikelihoodTool, builds a likelihood ratio to discrim- 
inate muons from pions. The likelihood discriminant is built out of different energy ratios in order to 
capture the global features of the energy depositions. The discrimination power of these ratios varies 
both as a function of the momentum of the particles considered, and as a function of X] . Therefore three 
bins in r\ (barrel, crack, end-cap) and three bins in momentum (0-10 GeV, 10-50 GeV, 50-100 GeV) are 
used, and a different set of ratios is selected for each of the 9 regions. The likelihood ratio for the muon 
candidate is then defined as 

Where Pj 1 and Pf, i = \,...,N are the pdf for the energy ratios. The performance of this algorithm is still 
being studied, however, the first results show that it is comparable to the performance of CaloMuonTag 
discussed in Section 4.2. 

4.1 CaloMuonTag 

CaloMuonTag extrapolates inner detector tracks through the calorimeters, collecting the energy in the 
cell closest to the extrapolated track for each traversed sampling. The muon can deposit energy in more 
than one cell in the hadronic calorimeter, but the probability of this happening is rather low, as illustrated 
in Figure 20. For the purpose of this algorithm, it is enough to assume that all the energy deposited by 
the muon is localized in the central cell. This also minimizes the electronic noise, which is particularly 
large in the HEC. 




A track preselection is made to reduce the number of fakes in the output of the algorithms. In 
addition, this preselection reduces the time needed by the algorithm to run on events with high track 
multiplicity. The following cuts in p T , and transverse isolation energy (Ef°) inside a cone of 0.45 are 
applied: 

• p T > 2 GeV and Ef° < 10 GeV for tracks pointing to the barrel (tj < 1.6). 

• p T > 3 GeV and Ef° < 8 GeV for tracks pointing to the end-cap. 
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Tracks are rejected if any of the collected energies are above veto values defined for each sampling. As 
most fakes are seeded by low-p^ tracks, more stringent cuts can be set for low-pr (< 10 GeV) track 
candidates. These cuts can be relaxed for track candidates of higher p T . 

Once calorimeter cells along the muon trajectory have been identified, the algorithm determines the 
lower threshold energy cut that should be used for the tagging as a function of 77 : 

. £ th = g^for|T7|<1.7, 

• £ th= ( fV n 7 )2 for|T?|>1.7, 

where 6 is the polar angle. The values of E th from these two equations roughly follow the shape of the 
measured energy distributions, which increases with the path length of the muon in the cell. 

Energy depositions in the last sampling of the calorimeters give the most reliable muon signals. 
However, due to the gap between the TileCal barrel and extended barrel modules, and the transition 
region between the TileCal and the end-cap (HEC) calorimeters, it is necessary to look in the two previous 
samplings to obtain a good efficiency throughout r\. For this reason, if the energy in the last sampling, 
or one of the two previous samplings depending on the 77 of the track, is above the threshold cut, E t h, the 
track is tagged as a muon. A different tag is given depending on which sampling passes the threshold 
cut. 

4.2 Performance 

In this section the performance of CaloMuonTag is analyzed. The performance of CaloMuonTag is 
studied in some relevant physics samples: 

• pp — > J '/y ' — > jlljU. A direct production of a J/y decaying to two muons with the following cuts at 
generation level: one muon with pj > 6 GeV and the other with pj > 4 GeV. 

• H — ► ZZ* — > At. A Higgs generated with an invariant mass of 130 GeV is forced to decay into 
two Z's (one of them offshell) that decay leptonically. Only events with four muons are used for 
reconstructing the Higgs peak, but all events are used for the efficiency /fake rate calculation. 

• tt . A sample of pair produced top quarks with all decay channels allowed. 

• Zbb — ► 4£ A Z is produced in association with 2 b quarks, and is forced to decay into two charged 
leptons. The b quarks are also forced to decay into electrons or muons. 

All samples were generated with pile-up with a safety factor of 5, i.e. five times nominal value 
expected at a luminosity of \0 33 cm^ 2 s^. Except for the pp — >• J/y — >• }X}X sample, where a safety factor 
of 2 was used. 

Figure 21 shows the performance of the calorimeter muon tagger algorithm on selected samples. The 
vertical axis on the left shows the efficiency (top distributions). The vertical axis on the right (in red) 
shows the fake rate (the number of misidentified tracks per event), represented by the shaded distribution 
at the bottom of the plots. The efficiency (fake rate) is defined as the fraction of muons that are found 
by the algorithm and (not) matched with a true MC muon. Muons from minimum-bias interactions were 
not included in the efficiency calculation. 

For the tt (Zbb — > At) sample, the algorithm performs well in identifying isolated muons from Ws 
(Z's). However, due to the isolation and veto cuts applied to reduce the number of fakes, the efficiency 
for non-isolated muons from b quarks is very poor, affecting the overall efficiency. 

To reduce the fake rate due to the low-/?r tracks in the end-caps, some efficiency in that region needs 
to be sacrificed. The "peaks" in the fake rate in X] match the regions where the acceptance of the last 
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Figure 2 1 : Efficiency (and fakes per event, right axis in red and shaded histograms) vs Vj for different 
samples. Top left: pp -> J/y -> pLpL. Top right: #(130) -> ZZ* -> 4£. Bottom left: «. Bottom right: 
Zbb^AL 

calorimeter sampling is limited, and the two previous samplings are used for muon identification. Due to 
the higher electronic noise, CaloMuonTag presents a higher fake rate in the HEC than in the TileCal. The 
results for these samples, and for dijet samples generated with different pj transfers at the hard scattering 
interaction are summarized in Table 1. 



J/y->/z/x | #->4l | ft | Zbb-^M | 1120-2240 | 560-1120 | 280-560 | 70-140 | 17-35 



Eff. 


0.80 


0.86 


0.54 


0.69 












f/e 


0.05 


0.09 


0.16 


0.14 


0.11 


0.12 


0.12 


0.13 


0.12 



Table 1: Summary of the efficiencies and fakes per event (f/e) for different physics processes and dijet 
samples (top numbers show the ranges of pj transfers at the hard scattering interaction, in GeV). 



Finally, Figures 22 and 23 are used to evaluate the performance improvement when using calorimeter 
muons to reconstruct the Higgs and the J fry mass. The plots on the left show the invariant mass recon- 
structed with muons from a combined muon reconstruction algorithm, which makes use of both inner 
detector and muon spectrometer tracks. The plots on the right show the invariant mass including muons 
tagged by CaloMuonTag with momenta reconstructed by the inner detector. 

To obtain the plots in Figure 22 the same set of cuts were used as in the #(130) -> ZZ* -> 4£ stud- 
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M ww (GeV) M m ,(GeV) 



Figure 22: Reconstructed Higgs peak in the H — > Ai invariant mass reconstruction for the standard com- 
bined muons (left) and for combined muons together with inner detector muons tagged by CaloMuonTag 
in the tt. region \r\ \ < 0.1 (right). 

ies [19]. The increase on the number of reconstructed events was achieved by adding an extra muon 
found by the CaloMuonTag algorithm during the muon preselection. The extra muon was requested to 
be found in the last sampling of the TileCal and within the 1 77 1 < 0. 1 region. No loss in mass resolution or 
shift in the mean of the mass peak are observed between the two selected muon samples. The acceptance 
gap around |tj| < 0.1 represents 4% of the \ r\ \ < 2.5 region covered by the combined muon reconstruc- 
tion. Since, four muons are recontructed in this analysis, the total efficiency loss due to the gap is 16%. 
This results shows that calorimeter identification can recover almost all of the lost events (14.9%). 

For the plots shown in Figure 23 the muons used to reconstruct the invariant mass peak were matched 




M,!MeV! M.jMeV; 



Figure 23: Reconstructed J /iff peak in the J/y — > /x/J, invariant mass reconstruction for standard com- 
bined muons (left) and for combined muons together with inner detector muons tagged by CaloMuonTag 
in the 77 region |t]| < 0.1 (right). 

to the Monte Carlo truth. In this case, all muons identified by the calorimeter were added to the combined 
reconstruction muons. Again, no loss in mass resolution or shift in the mean of the mass peak are 
observed between the two selected muon samples. 

CaloMuonTag shows very good efficiency and acceptable fake rate for a high-^r analysis like H — ► 
ZZ* — ► 4/x. An additional 15% of events were reconstructed when muons identified by CaloMuonTag 
were added to the muons found by the standard reconstruction algorithm. In \ow-p T analyses, such as 
J/yf -^2/j,, tighter selection cuts need to be applied to keep an acceptable fake rate. This reduces the 
efficiency of the tagging algorithms. However, an additional 15% of events were reconstructed when 
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muons from CaloMuonTag were also used. 

5 Conclusion 

This document reviews the current status of the understanding of muon energy loss in the ATLAS 
calorimeters. Although energy losses and their distribution along the muon track have a very small 
impact on muon reconstruction inside the muon system, they play an important role in the transport of 
a reconstructed muon track to the beam pipe. During this backtracking, the muon momentum can be 
corrected using the energy measured in the calorimeter cells traversed. This procedure is justified for 
muons that have a catastrophic energy loss. In most cases, the muon momentum can be corrected using 
a parameterization of energy loss, estimated from the reconstructed momentum and the amount and the 
nature of the material traversed by the reconstructed trajectory. Techniques that attempt to combine the 
measurement with the parameterization to improve the energy loss estimate have been developed and 
validated. A performance improvement is achieved in some important analyses through the use of these 
techniques. 

Muon tagging algorithms that use calorimeter measurements and track information have been pre- 
sented. These algorithms have been developed with the main goal of complementing the muon spec- 
trometer in two ways: recovering muons with very low transverse momentum, and in the regions of 
limited spectrometer acceptance, especially in the 77 ~ 0 region. The performance of the calorimeter 
muon tagger algorithm in a few relevant data samples has been shown. 
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In-Situ Determination of the Performance of the Muon 
Spectrometer 

Abstract 

The ATLAS muon spectrometer consists of three layers of precision drift-tube 
chambers in a toroidal magnetic with a field integral between 2.5 and 6 Tm. 
Muon tracks are reconstructed with 97% efficiency and a momentum resolu- 
tion between 3% and 4% for 10 GeV< p T <500 GeV and better than 10% for 
transverse momenta up to 1 TeV. In this note, the performance of a perfectly 
calibrated and aligned muon spectrometer will be reviewed and the impact of 
deteriorations of the magnetic field, the calibration and misalignment of the 
muon chambers on the performance will be discussed. The main part of the 
note describes how the performance of the muon spectrometer can be deter- 
mined using dimuon decays of Z bosons and J/y mesons. 

1 Introduction 

Muons with a transverse momentum 1 p T greater than 3 GeV are detected in the ATLAS muon spectrom- 
eter, which is designed to measure muon momenta with a resolution between 3% and 4% for a range of 
transverse momenta of 10 GeV< pj <500 GeV and better than 10% for pj's up to 1 TeV. The muon 
spectrometer consists of a system of superconducting air-core toroid coils producing a magnetic field 
with a field integral between 2.5 and 6 Tm [1]. Three layers of chambers are used to precisely measure 
muon momenta from the deflection of the muon tracks in the magnetic field (see Figure 1). Three layers 
of trigger resistive -plate chambers (RPC) in the barrel and three layers of fast thin-gap chambers (TGC) 
in the end caps of the muon spectrometer are used for the muon trigger. The trigger chambers measure 
the muon tracks in two orthogonal projections with a spatial resolution of about 1 cm. The precision 
measurement of the muon trajectory is performed by three layers of monitored drift-tube (MDT) cham- 
bers in almost the entire muon spectrometer and by cathode-strip chambers (CSC) in the innermost layer 
of the end caps at \r] \ > 2.2. The precision muon chambers provide track points with 35 jim resolution in 
the bending plane of the magnetic field. The goal of a momentum resolution better than 10% up to 1 TeV 
scale requires the knowledge of the chamber positions with an accuracy better than 30 ;Um in addition to 
the high spatial resolution of the chambers. This is achieved by a system of optical alignment monitoring 
sensors [1]. 

In the first part of this note, we review the performance of a perfectly calibrated and aligned muon 
spectrometer and discuss the dependence of the performance on the knowledge of the following quanti- 
ties: the magnetic field, the calibration of the chambers, the alignment of the chambers, and the accuracy 
of the determination of the energy loss of the muons in the calorimeters. In the second and main part, we 
describe how the performance of the muon spectrometer can be determined by means of dimuon decays 
of Z bosons and J /\\r mesons. 

2 Performance of a perfect and deteriorated spectrometer 
2.1 Muon reconstruction 

The muon spectrometer measures the momenta of charged particles at the entrance of the muon spec- 
trometer. The energies lost by the muons on the passage through the calorimeters have to be added to the 

'The transverse momentum is denned as the components of momentum in the transverse plane. 
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Figure 1 : Sketch of a quadrant of the ATLAS muon spectrometer. 

energy measured at the entrance of the muon spectrometer in order to obtain the muon momentum at the 
primary vertex. This reconstruction strategy is called stand-alone muon reconstruction. In order to cor- 
rect for the energy loss, the expected average energy loss is used as a first estimation; in a second step the 
energy deposition measured in the calorimeters is used to account for the large energy losses of highly 
energetic muons due to bremsstrahlung and direct e + e~ pair production. One speaks of combined muon 
reconstruction when the momentum measurement of the inner detector is combined with the stand-alone 
reconstruction. In this note, muon momenta will always be given at the pp interaction point. 

The muon reconstruction is described in detail in [2, 3]. The focus of this note is the measurement of 
the performance of the stand-alone reconstruction from real data. 

2.2 Definitions 

The performance of the muon spectrometer is characterized in terms of efficiency and momentum res- 
olution. In the analysis of simulated data, let r\ rec and r\ t ruth denote the pseudorapidities and (f) rec 
and ([>t r uth denote the azimuthal angles of the reconstructed and generated muons. The distance AR = 
y/ ij]rec — fltmth) 2 + (fyrec — tytruth) 2 of a reconstructed and generated muon is shown for a Monte Carlo 
sample with muons of pr = 50 GeV at the pp interaction point in Figure 2. More than 99.7% of all 
reconstructed muons have a distance AR < 0.05. We therefore define the muon reconstruction efficiency 
as the fraction of generated muons which can be matched to a reconstructed muon within a cone of 
AR < 0.05. 

The momentum resolution is measured by comparing the deviation of the reconstructed inverse trans- 
verse momentum from the generated inverse transverse momentum: 

l i_ 

P / "" ; : " (1) 

/'; truth 

p would be normally distributed for a muon spectrometer uniform in X] and (j) . The momentum resolution 
is not independent of Y\ and 0 due to the nonuniformity of the magnetic field and the nonuniformity of the 
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Stand-alone reconstruction^ 



0 0.01 0.02 0.03 0.04 0.05 

AR 



Figure 2: Distribution of distance AR of reconstructed from generated muons in a 50 GeV single muon 
Monte Carlo sample. 



material distribution in tj and 0. This leads to non-Gaussian tails in the p distribution when integrated 
over r\ and 0, as illustrated in Figure 3. In order to minimize the effect of tails, the momentum resolution 
is determined in the following way throughout this note: In the first step, a Gaussian go is fitted to the 
distribution. In the next step i a Gaussian gi is fitted to the data between the x m j-\ ±2a,-_i, where a,_i 
is the fitted width of and x m j-i its fitted mean. The iterative procedure is terminated when the 
fit relative change of the fit parameters from one to the next iteration is less than 0.1%. The standard 
deviation of the final fit curve is taken as a measure for the momentum resolution. The mean of final fit is 
referred to as the momentum scale, which is a measure for systematic shifts of measured muon momenta 
with respect to the correct values. 

2.3 Performance of a perfect muon spectrometer 

We briefly review the performance of a perfectly calibrated and aligned muon spectrometer. We refer 
to [1] and [2] for a more detailed discussion of the performance. 

Figure 4(a) shows the reconstruction efficiency for muons with pj=5Q GeV as a function of X] and 0. 
The efficiency is close to 100% in most of the r/-0 plane. It drops significantly in the acceptance gaps of 
the muon spectrometer. The inefficiency near | r\ \ = 0 is caused by the gap for services of the calorimeters 
and the inner tracking detector. The inefficiency near \r\ \ = 1.2 will disappear after the installation of 
additional muon chambers in the transition region between the barrel and the end caps which will not be 
present in the initial phase of the LHC operation. The inefficiencies at 0 « 1.2 and 0 « 2.2 for \ r\ \ < 1.2 
are related to acceptance gaps in the feet region of the muon spectrometer. 

The stand-alone reconstruction efficiency is presented as function of pj in Figure 4(b). It rises from 
0 to its plateau value of 95% between p T = 3 GeV and 10 GeV. 

The pr -resolution is independent of (j) apart from the feet region where it is degraded due to the 
material introduced by the support structure of the detector. The resolution also depends on pseudora- 
pidity. It is almost constant in the barrel part of the spectrometer (\r] \ < 1.05). It is up to three times 
worse in the transition region between the barrel and the end caps for 1 .05 < | TJ | < 1 .7 mainly due to the 
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Figure 3: Illustration of the iterative fit of normal distributions to the fractional deviation of the recon- 
structed inverse momentum from the generated inverse momentum, go is the fitted Gaussian of iteration 
step 0. g 4 is the fitted Gaussian of final iteration step 4. 




pr=50GeV. |tj| < 2.7 and <j>. 

Figure 4: Efficiencies of the reconstruction of tracks in the muon spectrometer. 
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small integral of the magnetic field in this region. The momentum resolution becomes uniform again for 
|tj|>1.7. 



10? 

P T (GeV) p T (GeV) 



Figure 5: Stand-alone momentum resolution integrated over 77 and 0 as a function of pj for the barrel 
(5(a)) and the end-cap region (5(b)). 



The stand-alone momentum resolution varies with pj (see Figure 5). The momentum resolution in 
the barrel is dominated by fluctuations of the energy loss in the calorimeters for p T < 10 GeV where it 
is about 5% at p T = 6 GeV. It is best, 2.6% (4%) in the barrel (end cap), for p T « 50 GeV where it is 
dominated by multiple scattering in the muon spectrometer. The momentum resolution at high momenta 
is limited by the spatial resolution and the alignment of the precision chambers and approaches 10% at 
p T = 1 TeV 

2.4 Deterioration of the performance 

The performance of the stand-alone muon reconstruction is affected by the limited knowledge of the 
magnetic field in the muon spectrometer, the limited knowledge of the material distribution along the 
muon trajectory required for the calculation of the energy loss, the calibration of the position measure- 
ments by the monitored drift-tube chambers, and the alignment of the muon chambers. 

The magnetic field will be known with a relative accuracy better than 5 x 10~ 3 based on the mea- 
surements of 1840 magnetic field sensors which are mounted on the muon chambers. As a consequence 
the relative impact on the momentum resolution is less than 3% [1]. 

Studies for the technical design report of the muon spectrometer [4], which have been confirmed by 
studies in the context of this note, show that the space-drift-time relationship r(t) of the MDT chambers 
must be determined with 20 jJ,m accuracy in order to give a negligible contribution to the momentum 
resolution up to pj = 1 TeV. A strategy to calibrate r{t) with muon tracks with the required accuracy has 
been worked out and is described in detail in [5]. 

The muon chambers are installed with a positioning accuracy of 1 mm in the muon spectrometer with 
respect to global fiducials in the ATLAS cavern. The studies for the technical design report, however, 
showed that the muon chambers must be aligned with an accuracy better than 30 }Xm in the bending 
plane. A bias of 30 ;Um on the sagitta of a 1 TeV muon corresponds to a systematic shift of the measured 
momentum of 60 GeV. The alignment of the muon spectrometer is based on a system of optical alignment 
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sensors monitoring relative movements of the chambers on the level of a few micrometers. Muon tracks 
are used for the absolute calibration of the optical sensor with 30 jim accuracy. The optical system does 
not cover the whole muon spectrometer. The positions of the end caps with respect to the barrel must 
be measured with muon tracks traversing the overlap between the barrel and the end-cap part of the 
spectrometer. There are also chambers in the transition region between the barrel and end caps whose 
positions are not monitored by the optical system. These chambers will be aligned with the rest of the 
muon spectrometer by muon tracks. The alignment of the muon spectrometer is discussed in [6]. 

The expectation of the muon energy loss in the calorimeters can be checked by comparing the muon 
momentum as measured by the inner detector and the muon momentum at the entrance of the muon 
spectrometer, for instance. We shall not discuss the measurement of the muon energy loss in this article 
and refer the reader to [7]. 

The initial misalignment will be the dominant source of performance degradation. We shall show 
in the next section that Z — >• }X + }1~ will lead to a clearly visible resonance peak in the dimuon mass 
distribution even in the case of the initial misalignment. It will therefore be possible to measure the 
muon performance of a misaligned muon spectrometer with Z — > jJ. + jJ.^ events. 

Impact of misalignment on the performance 

In order to study the impact of the initial misalignment of the muon spectrometer on the performance, 
the simulated data were reconstructed with a different geometry from the one used in the simulation. 
In the reconstruction geometry, the chambers were randomly shifted from the nominal positions with 
Gaussian distribution centred at 0 and a standard deviation of 1 mm and rotated randomly with Gaussian 
distribution centred at 0 and a standard deviation of 1 mrad. Deformations of the chambers which are 
monitored by an optical system mounted on the chambers were not considered in our studies. 
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(a) Efficiency vs. 7] integrated over <j> for pr=50 GeV. 



(b) Efficiency vs. <j) integrated over 7] for pr=50 GeV. 



Figure 6: Comparison of reconstruction efficiency for an aligned muon spectrometer and a misaligned 
muon spectrometer with a average positioning uncertainty of 1 mm for a simulated single muon sample. 



Figure 6 illustrates the comparison of the stand-alone track reconstruction efficiency for 50 GeV 
muons in the aligned and the misaligned case. Only a small decrease in the reconstruction efficiency can 
be observed for muons with a momentum of 50 GeV, a momentum typical for muons originating from W 
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or Z bosons. The relatively small decrease in the reconstruction efficiency is mainly due to the fact that 
the used definition of efficiency is based on a simple r\ and (j) matching and does not take into account 
the measured transverse momentum of the muons. The reconstruction efficiency could be increased in 
the misaligned case by applying softer cuts in the pattern recognition stage of the track reconstruction. 

Figure 7(a) and 7(b) show the impact of a misaligned muon spectrometer on the fractional transverse 
momentum resolution; the resolution is highly degraded. The overall observed fractional muon spec- 
trometer resolution o tot can be expressed as the quadratic sum of the intrinsic fractional /^-resolution at 
the ideal geometry (Guieal) and the fractional resolution due to the misaligned geometry (o Alignment)- 

°tot = \J° Alignment + °ideal 

This leads to c Alignment ~ 014 for muons with pj « 50 GeV as expected from the relationship be- 
tween sagitta and momentum. The effect on the momentum scale is relatively small for the overall muon 
spectrometer, since random misalignments cancel to a certain extent. In physics signatures, such as the 
decay of a Z boson into two muons, the impact on the average momentum scale is even less, since a 
misaligned geometry has the opposite effect for opposite charged muons to first order. 




-0.4 -0.3 -0.2 -0.1 -0 0.1 0.2 0.3 0.4 u -2 -1 0 1 2 



1/ PT, r ec-( 1/ PT,,J / ( 1/ PT,»J " 
(a) Overall pj -resolution (b) pj -resolution vs. r\ 

Figure 7: Comparison of the fractional /^-resolution for an aligned muon spectrometer and a misaligned 
muon spectrometer. 

The impact of initial misalignment of the muon spectrometer on the Z resonance is shown in Figure 
8. It is expected that the mean of the invariant mass distribution does not change significantly, since the 
momentum scale of the reconstructed muon pj is hardly affected by misalignment. On the other hand a 
large broadening of the distribution due to the degradation of the pj -resolution of the muons is expected, 
which is shown in Figure 8. The dependence of the reconstructed width of the Z boson mass distribution 
on the size of the misalignment is shown in Figure 9. o%' ale is a scaling factor applied to the initial 
misalignment of 1 mm and 1 mrad. The observed dependence is the basis for the determination of the 
muon spectrometer resolution with data, which is discussed in section 4. A more detailed discussion of 
misalignment impacts on the muon spectrometer performance can be found in [8]. 
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Figure 8: Reconstructed Z boson mass distribu- 
tions for an aligned and a misaligned (o% ale = 1) 
muon spectrometer layout. 



Figure 9: Width of the Z resonance peak includ- 
ing the natural width of the Z vs. misalignment 
parameter o^ ale . 
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3 Measurement of the reconstruction efficiency from pp collision data 

The simulation of the ATLAS detector is still under development and is not expected to reproduce the 
actual performance of the detector in all details at the beginning of the LHC operation. Therefore it is 
necessary to determine all efficiencies with data in order not to rely on the simulation. 

3.1 Reconstruction efficiency from dimuon decays of the Z boson 
3.1.1 Tag-and-probe method 

The so-called "tag-and-probe" method can be used to determine the muon spectrometer reconstruction 
efficiencies from pp collision data. Muons from Z decays will be detected by the inner tracking detector 
and the muon spectrometer in the common acceptance range of | T] | < 2.5. The measurements of the inner 
detector and the muon spectrometer are independent, though not necessarily uncorrected. We require 
two reconstructed tracks in the inner detector, at least one associated track in the muon spectrometer, 
and the invariant mass of the two inner-detector tracks to be close to the mass of the Z boson. The 
last requirement ensures that the reconstructed tracks are the tracks of the decay muons of the Z boson. 
Moreover, the two inner tracks are required to be isolated to reject possible OCD background. The inner 
track which could be associated to the track in the muon spectrometer is therefore a muon and is called 
the tag muon. It is also required that the tag muon fired the 20 GeV single-muon trigger in order to 
ensure that the event is recorded. This selection ensures that aZ^ M + M decay has been detected. The 
second inner track must then be a muon, too, which is called the probe muon (see Figure 10). In the 
analysis of dimuon events from pp collisions, the probe muon plays the role of the generated muon in 
the determination of the efficiency with simulated data. 

The tag-and-probe technique is not restricted to the measurement of the stand-alone reconstruction 
efficiency. It can, for instance, be used to measure the muon reconstruction efficiency of the inner detector 
or the trigger efficiency [9]. 




Figure 10: Schematic illustration of the tag and probe method. 

Our studies show that the acceptance gaps of the muon trigger which are reflected in uncovered 
r)-(j) regions of the tag muon do not create uncovered r\-§ regions of the probe muon. The tag-and- 
probe method therefore allows us to determine the efficiency over the full Tj and 0 coverage of the inner 
detector. 
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Some systematic uncertainties of the tag-and-probe method must be considered. Muons from Z — > 
jJ. + IJ.~ decays usually fly in opposite directions in the plane transverse to the proton beam axis. Hence 
inefficiencies which are symmetric in A0 ps % may not be detected with this method. 

The topology of pp — > Z/y* — > jJ. + jJ.~ events is characterized by two highly energetic and isolated 
muons in the final state. A significant QCD-background contribution is expected due to the huge cross 
section of QCD processes. Moreover, the decay of a W ± boson into one highly energetic muon and a 
neutrino plus an additional muon from a QCD jet and the process Z — > T + T~ — > jU + v T v ;li jU _ v. r vj u were 
studied as possible background processes in our analysis. 

Because of the high collision energy of the LHC, the production of top quark pairs has a cross section 
of the order of the signal cross section. Top quarks mostly decay into a W boson and bottom quark. The 
W boson and the bottom quark can decay into muons or electrons, which also might fake the signal 
process. 

The cross section of QCD processes is far too large to be simulated within a full Monte Carlo simu- 
lation of the ATLAS detector. Hence it is assumed that the dominant contribution from highly energetic 
muons is due to the decay of ^-mesons. A more detailed discussion of the selection of pp — > Z — > jJ. + jJ.~ 
events and the background processes which must be considered can be found in [9]. 



3.1.2 Selection of candidate tracks 

Figure 11 shows the invariant dimuon mass and the transverse momenta of the selected muon track 
candidates for the signal and the chosen background processes. 

The following cuts have been applied to get a clean track selection. Tracks of opposite charge and 
a difference in their (j> coordinates greater than 2.0 rad are selected. The rapidity of the tracks is limited 
to a rapidity coverage of the inner detector of |tj | < 2.5. Each of the selected muon candidate tracks is 
required to have pj > 20 GeV. The invariant mass of the two muon candidate tracks must agree with 
the Z mass within ±10 GeV, i.e.\M m - 91.2 GeV\ < 10 GeV.The following isolation cuts are applied to 
the selected tracks: 

• number of reconstructed tracks in the inner detector within a hollow cone around the candidate 
muon: Arracks < 5 

• sum of the pr's of reconstructed tracks in the inner detector within a hollow cone around the 
candidate muon: £ p l p Tracks < 8 GeV 

r { <r<r 2 

• sum of reconstructed energy in the cells of the calorimeter within a hollow cone around the candi- 
date muon: £ E T < 6 GeV 

n<r<r 2 

• energy of a possible reconstructed jet within a hollow cone around the candidate muon: £"^ 2 Energy < 
15 GeV 

These isolation variables are defined within a hollow cone in the T7— and <j>— plane of the reconstructed 
muon track, 

n < 7(%-Tj iC ) 2 +(^-0« c ) 2 < r 2 (2) 

where n and r 2 are the inner and the outer radius of the cone. The index stands for the reconstructed 
muon track while the index ic labels the isolation criteria. The smaller radius is set to r\ = 0.05 and is 
introduced to exclude the candidate muon track from the calculations of the isolation quantities. The 
specific value of the outer radius r 2 has only a minor effect on the signal and background separation, as 
long it is large enough to contain a significant amount of data for the definition of isolation variables, 
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(a) Invariant mass of Z candidates. (b) Transverse momentum distribution. 




(c) Number of reconstructed tracks within a cone of AR =0.5 (d) Sum of transverse momenta of all tracks within a cone of 
around the candidate track. AR =0.5 around the candidate track. 



Figure 1 1 : Reconstructed quantities for Z candidate events only using inner detector tracks with a trans- 
verse momentum above 6 GeV and no further cuts for signal and background processes. 
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i.e. r2 > 0.3. Our choice of r2 = 0.5 is the same as used in the measurement of the cross section of the 
process pp — >• Z — ► jJ. + jJ.~ (see [9]). The isolation criteria listed here are optimized for events without 
pile-up of inelastic pp collisions in a selected event. Pile-up of inelastic pp collisions is expected for the 
operation of the LHC at a luminosity of 10 33 cm~ 2 s _1 and will lead to more energy in a cone around 
the muons. It was checked that the efficiency of our event selection is reduced by less than 5% in the 
presence of pile-up and that the purity of our selected samples is not affected by the presence of pile-up. 

The distributions of the first two isolation variables for signal and background processes normalized 
to their cross sections are presented in Figure 11(c) and 11(d) in absence of pile-up. The selection of 
isolated high-/? r muons allows for a substantial suppression of the background. 




Cut Flow 

Figure 12: Cut-flow diagram for probe muon tracks: (0) opposite charge requirement, (1) invariant mass 
requirement, (2) kinematic cuts, (3) isolation requirements, (4) electron veto, (5) found at least one track 
in the muon spectrometer. 

The cut-flow diagram for probe muons is shown in Figure 12. The QCD background can be rejected 
with isolation cuts. More problematic in this selection is the W — > jJ.V background, and those ff-events 
in which at least one W boson decays into a muon and a neutrino. These processes produce one highly 
energetic isolated muon track which passes all selection cuts for a tag muon. A further track in the 
inner detector which passes the other cuts and is not a muon will decrease the measured efficiency. Such 
a track is most likely caused by an electron, since it is expected that electrons also appear as isolated 
tracks in the inner detector. Therefore it is required that no reconstructed electromagnetic jet in the 
electromagnetic calorimeter can be matched to an inner track as an additional selection requirement. This 
applies especially for probe tracks stemming from a tt-event. Here, again, one has to distinguish between 
inner tracks, which result from the decay of the bottom quark or simple QCD-interactions and those, 
which result from the decay of the W boson. The first case is suppressed by the isolation requirement 
and can be neglected. The second case can lead to a highly energetic isolated electron, stemming from 
the decay of the second W boson. These electrons are expected to be vetoed. The cut-flow diagram also 
shows that the probe muon candidates from the background processes can also be associated to a muon 
spectrometer track and hence have no negative effect on the efficiency determination. 

An overview of the remaining background expected from Monte Carlo is shown in Table 1; there 
we have assumed at least three events surviving the cuts as a systematic uncertainty in order not to 
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Table 1: Fractional background contribution in % based on Monte Carlo prediction including estimated 
systematic and statistical uncertainties. 



bb — > mi 


W ± -*n ± v 


Z/f -» zz 


a->W + bW~b 


Overall 


« 0 + 0.03 


« 0 + 0.06 


«0 


« 0.02 ±0.01 


0.02 + 0.1 



underestimate the background contribution. After all selection cuts, the purity of our sample is high: less 
than 0.1% of the selected dimuon events are from background processes. 

Our results are stable against variations of the track matching distance AR from 0.05 to 0.3. The 
larger track matching cut of AR=0.3 takes account for possible misalignment effects in the first phase of 
LHC. The robustness of our results against the AR matching cut indicates that our selected data sample 
will allow an efficiency determination which is not significantly affected by background processes even 
with a possible misalignment of the muon spectrometer. 

3.1.3 Determination of the stand-alone reconstruction efficiency 

The stand-alone reconstruction efficiency depends on pj, v\ and 0 of the muons. Hence, one should 
determine the efficiency in appropriate bins in these quantities. The lower value of the /77-binning is 
given by the selection cut of 20 GeV. The highest value is set to 70 GeV and 10 bins are used to ensure 
high enough statistics within each bin. For larger statistics also values above 100 GeV can be considered. 

A natural binning in X] and 0 is given by the geometry of the muon spectrometer. The muon spec- 
trometer consists of 16 sectors in the (j) plane, small and large MDT chambers sequentially ordered as 
illustrated in Figure 13(a). Therefore 16 bins in <j> are used. The same geometrical argument applies to 
the T] -plane of the detector. Three MDT-chambers which are projective to the interaction point define 
one tower. Twenty towers are defined in 77 which are the basis for the chosen binning (Figure 13(b)). In 
total 320 regions are defined in the xj — (j) plane. 




r|=<>.25 r|=(>-45 n=0.«) r|=0.75 r|=0.9 



(a) 0 -binning (b) f] -binning 

Figure 13: Illustration of the choosen <j> and tj -binning of the muon spectrometer 



It is important to note that the dominant effect of losing reconstruction efficiency is the acceptance 
gap due to the absence of MDT chambers. Hence it is a pure geometrical effect mainly in the 77 -direction. 
Therefore different physics samples with different Tj- and to a certain extent also different (j)- and pj- 
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Table 2: Overall reconstruction efficiencies for different physics processes. Efficiencies with respect to 
the Monte Carlo truth information are quoted for the sample of events that pass the single muon trigger. 



Sample 


AR = 0.05 


AR = 0.075 


AT? = 0.15 




0.952 


0.956 


0.958 


W ± -► /i±v 


0.953 


0.958 


0.960 


tt — > W + W~bb 


0.943 


0.948 


0.950 




0.930 


0.944 


0.952 



distributions will lead to different overall reconstruction efficiencies. An overview of the overall recon- 
struction efficiencies for different physics samples and track matching distances is shown in Table 2. 
Hence the in-situ determined efficiencies must be applied in an appropriate binning for different physics 
samples. 

The comparison of the efficiencies determined with Monte Carlo truth information and the trag- 
and-probe method is shown in Figure 14 for T7 and pj, assuming an aligned muon spectrometer. A 
track matching distance of AR < 0.075 was chosen. The efficiencies determined in both ways coincide 
within their statistical uncertainty for an integrated luminosity of 100 pb" 1 . This proves that possible 
correlations between tag and probe muons are small and can be neglected to a good extent. 



u 0.95 
0.! 



Monte Carlo truth 
Tag & Probe method 



Stand-alone reconstruction 



0.95 
0.9 



• Monte Carlo truth 
1 Tag & Probe method 



Stand-alone reconstruction 



60 70 
P T [GeV] 



(a) Efficiency integrated over <j> and pj in Z — > [l + fl 



(b) Efficiency integrated over <j> and 77 in Z —> [l + fl 



Figure 14: Comparison of the muon reconstruction efficiency of the muon spectrometer vs. Tj and pj 
determined by the tag and probe method and via the Monte Carlo truth information. 



The statistical error on the reconstruction efficiency e can be calculated (for large N) by 

where Af is the number of tag muons. Note that both muons can, and will, be chosen as tag muons 
in most cases, as the muon spectrometer is expected to have a reconstruction efficiency of 95% on 
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average. Figure 15 shows the distribution of the in-situ determined efficiencies for all 320 regions. The 
overall reconstruction efficiency can be determined to a high statistical precision even for relatively low 
integrated luminosities. A statistical precision of 1% of the overall muon spectrometer reconstruction 
efficiency can be reached with less than 1 pb _1 . Figure 16 illustrates the statistical uncertainty averaged 
over all 320 regions versus the integrated luminosity. 




Stand-alone reconstruction 



60 80 100 120 140 160 
Integrated Luminosity [pb 1 ] 



Figure 15: Distribution of muon reconstruction 
efficiency of the 320 muon spectrometer regions. 



Figure 16: Average statistical error of reconstruc- 
tion efficiency of the 320 regions vs. integrated 
luminosity. 



A possible correlation between tag and probe muons could be caused by the trigger. The probability 
of reconstructing a muon is significantly higher if it was triggered, as shown in Figure 17. Hence, it might 
be suspected that this correlation implies also a correlation in real data, since data events must contain 
at least one muon which has been triggered. This is not a problem as long as the trigger requirement is 
only applied on the tag muon. 

In Section 3.1.1 it was already mentioned that the tag and probe approach has problems in detecting 
inefficiencies which have a <j> « % symmetry. Dividing the data sample in two parts differing in the 
angle A<I> could overcome this problem. One part contains reconstructed tag and probe muons with 
AO < 2.8 rad the second sample with A<& > 2.8 rad. The chosen value of 2.8 rad leads to roughly 
equally sized samples. Applying the tag and probe method on both sub-samples will lead to different 
efficiency distributions in case of (j) -symmetric inefficiencies. Monte Carlo studies showed that for the 
presently simulated detector layout we expect only small differences (Fig. 18). 

Table 3 summarizes statistical and systematic uncertainties of the in-situ determined stand-alone 
reconstruction efficiency for two different integrated luminosities. The difference in | £,•„_,„*„ — e true \ is 
calculated via 

\ein-situ-e trU e\ = f ^|4-rf B " 4*1 (4) 

where the index i runs over all bins in r\ -direction. This is treated as primary source of systematic 
uncertainty. One should note that the given systematic error has a strong statistical component from the 
Monte Carlo statistics which is reflected in the large decrease of the systematic uncertainty in Table 3. 
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Figure 17: Reconstruction efficiency of the muon 
spectrometer for muon tracks which have been 
triggered and muon tracks which have not been 
triggered. 



Figure 18: Comparison of muon reconstruction 
efficiencies determined via tag and probe ap- 
proach for two sets of muons differing by A(j) . 



Table 3: Estimated uncertainties of in-situ determined muon spectrometer reconstruction efficiencies 
for muons in a p^-range between 20 GeV and 70 GeV and within an Tj-range smaller than 2.5 from a 
Z — > mi decay. 



/JSf 


Statistical 


\£in-situ ~ £true\ 


Background 


Overall 




Uncertainty 




Contribution 


Systematic 


100 pb- 1 


0.08% 


0.9% 


0.02% 


« 1% 


lfb- 1 


0.03% 


0.1% 


0.02% 


w0.1% 



We take the difference between the efficiency obtained for the misaligned layout and the efficiency 
obtained for the aligned layout as a conservative estimate of the precision which can be achieved with the 
tag-and-probe method in case of small unresolved misalignments. The difference in both efficiencies for 
the different A0 -sample is comparable within its statistical uncertainties. The background contribution 
is only estimated by the Monte Carlo prediction and treated as a systematic uncertainty. 

The Gaussian sum of the two systematic uncertainties, namely |e/ n _« fM — e true \ and the background 
contribution, is defined as the overall systematic uncertainty. 

The given uncertainty estimation assumes that nearly all MDT chambers work and £ true « 96%. A 
lower value of e trU e will lead to an increase of the statistical uncertainty via Equation (3) and also to a 
higher systematic uncertainty. For real data a conservative estimate of the systematic uncertainty would 
be the difference of the Monte Carlo prediction for the efficiency and the efficiency determined with 
collision data. Moreover, it should be noted that the given uncertainties apply for muons in a p^-range 
between 20 GeV and 60 GeV and within an tj -range smaller than 2.5. 
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3.1.4 Alternative approach 

The tag-and-probe analysis presented above uses isolation cuts to reject background events and assumes 
that a negligable background contribution remains. In this section, we explore the possibility of determin- 
ing the reconstruction efficiency from collision data without isolation cuts and determine the background 
contribution directly in data. We apply only cuts on the transverse momenta, e.g. pj > 10 GeV. This 
leads to a dominant background contribution in the lower invariant dimuon-mass region. 

In this approach, a tag muon is defined as a muon spectrometer and inner detector combined muon 
track, with pj > 10 GeV. A probe muon is defined as any inner detector track, also with pr > 10 GeV. 
An invariant mass is then calculated from every combinatoric tag and probe pair with opposite charges. 
The size of this sample is denoted as N in the following (Figure 19(b)). Finally, we select a subsample 
requiring that the probe muon also be a combined muon track. The size of this sample is denoted as n 
(Figure 19(a)). 




50 60 70 80 90 100 110 50 60 70 80 90 100 110 



nv [GeV] m w [GeV] 

(a) At least one of the muons is matched to a muon spectrom- (b) Both muons are matched to muon spectrometer tracks. 

Figure 19: Expected invariant Masses resulting from two inner tracks where both muons must be 
matched to a muon spectrometer track (a) or at least one of the muons must be matched to a muon 
spectrometer tracks (b). 

The track-finding effciency of the muon spectrometer, e, is then defined as n/N. Missing tracks in 
the muon spectrometer will result in n < N and thus effciency loss. The main difference from the ap- 
proach presented in Section 3.1 is that no isolation cuts are used for the background rejection but instead 
the background is directly estimated from data via side band subtraction. In this approach an exponential 
function is fitted to the invariant mass region between ~ 40 GeV to ~ 60 GeV, where it is assumed that 
the background contribution is dominating. The exponential function is then extrapolated to the invariant 
mass region between ~ 81 GeV to ~ 101 GeV and used for subtraction of the background in this region. 
The remaining number of events between ~ 81 GeV to ~ 101 GeV define n and N, respectively. In this 
way, the background contribution is accounted for implicitly in data and no further assumptions on the 
Monte Carlo predictions are made. The disadvantage of this procedure are the systematic uncertainties 
of the fitting procedure and the choice of the fitting function. One possible improvement with higher 
statistics of the background sample would be that the Monte Carlo prediction of the shape of the back- 
ground distribution could be used to obtain a better fit function than the pure exponential for the side 
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band subtraction. 

The systematic uncertainty of this method is again estimated by the residual difference of the in-situ 
determined efficiency and the true efficiency. For a simulated data sample corresponding to an integrated 
luminosity of / Ldt = 100 pb~ ! it is expected to determine the efficiency with this approach up to a 
precision of 

Ae = ±0.05 (sys.) (5) 

The relative large systematic uncertainty arises mainly from the limited available statistics of background 
Monte Carlo samples which has a direct impact on goodness of applied fit. Hence, further improvements 
are likely to be achieved in future studies. 

3.2 Determination of the reconstruction efficiency with J '/W events 

The reconstruction efficiency for muons with transverse momenta less than 20 GeV is not determined 
from Z — ► H + H~ events due to the cuts on the transverse momenta of the muons. Muons from J/y — > 
jU + ;U~ decays populate the momentum range below 20 GeV. We explored the possibility of using the 
tag-and-probe method on J/\\r —>■ events for the measurement of the reconstruction efficiency 

at low transverse momenta. The method works well on signal events. Yet the huge QCD background 
contaminates the selected dimuon data sets so much that a reliable efficiency measurement becomes very 
difficult. Studies using muon isolation techniques have started. The muon reconstruction efficiency of 
low- pr muons must therefore be extracted from Monte Carlo simulations and not be determined easily 
from data. 

4 Measurement of the momentum resolution and momentum scale 

The muon momentum measurement will be affected by the limited knowledge of the magnetic field, the 
uncertainty in the energy loss of the muons, and the alignment of the muon spectrometer as discussed in 
Section 2.4. 

The analysis of the measurements of the optical alignment sensors and the collision data with the 
switched-off toroid coils will provide the position of the muon chambers with an accuracy better than 
100 /xm at the start-up of the LHC [1]. A systematic error of 100 jl(m on the sagitta corresponds to an 
additional systematic error in the muon momentum of about 0.1 TeV -1 • p 2 which amounts to 250 MeV 
for p=50 GeV. 

Muons with energies below 100 GeV lose on average about 3 GeV of their energy on their passage 
through the calorimeters almost independently of their energy. The material distribution of the ATLAS 
detector is modelled in the detector simulation with an accuracy better than a few percent [1]. A 5% 
uncertainty in the amount of the material traversed by the muons would reflect in a 5% uncertainty of the 
energy loss, that is an uncertainty of the average energy loss of ±150 MeV. 

The uncertainty in the bending power of the toroidal field will lead to a momentum uncertainty 
which is significantly smaller than the energy loss uncertainty and the impact of the misalignment on the 
momentum measurement. It can therefore be neglected with respect to energy loss uncertainties and the 
misalignment of the spectrometer. 

A bias in the measured muon momentum translates into a bias in the measurement of the dimuon 
mass in Z —>■ jJ. + jJ.~ decays. An tj, (j), and momentum dependent bias will also broaden the dimuon mass 
peak. The shape of the dimuon invariant mass distribution for Z — > jJ. + jJ.~ decays can therefore be used 
to measure the accuracy of the momentum measurement with collision data. 

As the momentum bias caused by misalignment is of the same magnitude, but of opposite sign for 
jl + and jl leptons while the energy loss uncertainty has the same sign and magnitude for and jl~ 
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leptons, it is possible to disentangle the effect of misalignment and the effect of energy loss errors on the 
reconstructed Z mass. The the sensitivities of the dimuon invariant mass spectrum to misalignment and 
errors in the energy-loss correction were therefore studied separately to get a first insight. 

4.1 Determination of the energy-loss uncertainty with Z — > events 

We begin with the determination of the energy-loss uncertainty with Z — > events. We assume 

that the detector is aligned and that the magnetic field is known with the expected accuracy such that its 
impact on the momentum scale can be neglected. We allow for an error in the energy-loss and, therefore, 
correct the reconstructed muon energy in each of the 320 spectrometer towers by a tower-dependent 
constant 8E reCitower : 

Erectower E reCjtower + 8E rec j ower . (6) 

We determine the 400 constants 8E rec tower by minimizing 

v 2_ V [(Pcorr, + ,k+Pcorr,-Jc) 2 -M*\ 2 

dimuon pairs k 

where p cor r,±,k denotes the corrected measured /x* momentum and Cfy the expected dimuon mass reso- 
lution. To estimate the sensitivity to the energy-loss correction, we applied this fit to 40,000 simulated 
Z —> H + H~ events (corresponding to an integrated luminosity of 50 pb _1 ). The fit gives 8E recfower with 
a bias of 100 MeV and a stastitical error of the same size. Studies to improve the check of the energy-loss 
correction with collision data are ongoing. 



4.2 Determination of the momentum scale and resolution for a misaligned spectrometer 

In a second step, we assume that the energy-loss correction is right and consider the misalignment of the 
muon spectrometer as the only source of a deterioration of the momentum measurement. 

If the Monte Carlo simulation describes the detector correctly, it also predicts the shape of the re- 
constructed dimuon mass spectrum for Z —> jJ. + jJ.~ events correctly. The misalignment of the muon 
chambers causes a deviation of the measured from the predicted shape of the invariant dimuon mass 
spectrum. In order to match the Monte Carlo prediction with the experimental measurement, the recon- 
structed simulated muon momenta must be smeared and shifted. The following procedure was adopted 
in our analysis: A random number Sp normally distributed around 0 with standard deviation a res was 
added to the reconstructed simulated muon momenta p rec ,MC and multiplied by a scale factor a: 

Pcorr = a{prec,MC-8p). (8) 

The inclusion of dp corrects for an underestimation of the momentum resolution. The scale factor a 
takes care of systematic shifts between the reconstructed momenta in the experiment and the simulation. 
a and o res are determined by a fit of the corrected simulated invariant dimuon mass spectrum to the 
experimentally measured spectrum. 

To test this approach, the existing Z — »■ !X + jX~ Monte Carlo data set was divided into two subsamples 
of equal size corresponding to an integrated luminosity of 50 pb" 1 . The one sample serves as Monte 
Carlo reference for an aligned muon spectrometer, the other plays the role of the experimental data set. 
Two scenarios were investigated: 

1. The Monte Carlo reference sample and the experimental sample were simulated and reconstructed 
with the same (aligned) geometry. a res was fixed to 0 in the analysis of this scenario. Separate 
scale factors ag and (Xe were applied to muon in the barrel (\t]\ < 1) and the end-cap region 
(1<|T7|<2.7). 
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2. The Monte Carlo reference sample was simulated and reconstructed with the same (aligned) ge- 
ometry. But the experimental sample was reconstructed with a different geometry misaligned as 
described in Section 2.4. In this scenario, two scale factors Ofe and (Xe for the barrel and end-cap 
parts of the muon spectrometer and a global standard deviation o res were used as fit parameters. 



Table 4: Fit results for the scale and resolution parameters for an integrated luminosity of 50 pb 



Layout 




1-flfe 




Aligned 
Misaligned 


(4±14)1(T 4 
(6±2)1CT 3 


(l±i3)icr 4 
(5±2)1CT 3 


(11.6±0.3) % 



The results of the tests are summarized in Table 4. In the ideal case in which the reference and 
the experimental sample are statistically independent, but equivalent otherwise, the fit gives factors a# 
and <Xe equal to 1 within the statistical errors as expected. In the second scenario, the uncorrected 
misalignment in the experimental sample leads to a systematic shift of the reconstructed momenta, hence 
as and a<E differ from 1 slightly, but significantly, and a large degradation of the momentum resolution 
from 3.5% to 12% is observed which is consistent with the degradation presented in Section 6. A large 
Z — >• }X + IX~ sample would clearly allow for a finer segmentation than the division in barrel and end-cap 
parts for the scale factors and lead to smaller values of a res . 
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Figure 20: Dependence of < p corr - p re c,MC > /PrecMC on tj integrated over p T and 0 for Z — >• }X + }X 
events in the second scenario of a misaligned detector. 



The mean value of 1 + Pcorr p P ™ MC is presented in Figure 20 as a function of r\ for the second scenario. 
The mean values are spread around 1 with a standard deviation of 0.3%. The maximum deviation from 1 
is less than 1%. This results indicates that the Z-mass distribution permits the detection of imperfections 
in the momentum reconstruction. Studies which use a more refined parametrization of the momentum 
correction and take into account energy-loss and alignment corrections at the same time are in progess. 

We conclude from the studies in this section that it should be possible to control the muon momentum 
and energy scale on the level of 0.5 GeV for 50 GeV muons with 40,000 Z — »■ events corresponding 
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to an integrated luminosity of 50 pb 1 . 

5 Conclusions 

The performance of the ATLAS muon spectrometer can be predicted by Monte Carlo simulations. The 
performance of the spectrometer will, however, differ from the prediction due to the initial misalignment 
of the muon chambers and imperfections in the corrections of the muon energy-loss. It is therefore 
important to measure the performance with collision data. 

We showed in the present article that it is possible to measure the muon reconstruction efficiency 
with Z — > jJ. + jJ.~ events with an accuracy better than 1% with an integrated luminosity of 100 pb -1 . 
Selection cuts and the pj spectrum of the Z decay muons limit the momentum measurement to range of 
20 GeV< pj <70 GeV. The efficiency measurement can be extended to higher momenta with increased 
luminosity when the tails of the p T spectrum get populated. 

We explored the possibility of measuring the efficiency at low transverse momenta with J/\j/ — > 
jJ. + IJ.~ events. Our studies show that a reliable efficiency measurement will be difficult due to large 
irreducible QCD background. 

We finally addressed the question of how the momentum and energy scale can be measured with 
Z — >• According to our feasibility study it will be possible to control the energy-loss correction on 

the level of 100 MeV and the momentum scale on the level of 1% for an integrated luminosity of about 
100 pb -1 . More detailed studies are needed to obtain a better estimate of the achievable precision. 
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Abstract 

In this note the overall performance of the ATLAS detector is discussed for 
the identification of and measurements with hadronic decays of x leptons in 
a wide dynamic range of transverse energies, spanning from 10-15 GeV up 
to at least 500 GeV. In general, hadronically decaying x leptons are recon- 
structed by matching narrow calorimetric clusters with a small number of 
tracks. Two complementary approaches, a calorimeter-seeded and a track- 
seeded algorithm, have been developed to efficiently reconstruct these de- 
cays while providing the necessary large rejection against jets from QCD pro- 
cesses. The performance of these algorithms in terms of efficiency and rejec- 
tion against jets is discussed. In addition, the prospects for the determination 
of fake x rates as well as the extraction of x lepton signals from W and Z boson 
decays and from it events in early ATLAS data corresponding to an integrated 
luminosity of 100 pb _1 are discussed. 

1 Introduction 

Tau leptons, and particularly their hadronic decays, will play an important role at the LHC. They will 
provide an excellent probe in searches for new phenomena: the Standard Model Higgs boson at low 
masses, the MSSM Higgs boson or Supersymmetry (SUSY). Therefore, understanding their selection 
efficiencies and the cross-sections at which they will be produced is essential for discovering new physics. 

Tau leptons are massive particles with a measurable lifetime undergoing electroweak interactions 
only. The production and the decay of x leptons are well separated in time and space (r T /m T ~ 10~ n ), 
providing potential for unbiased measurements of the polarisation, spin correlations, and the parity of 
the resonances decaying into x leptons. The excellent knowledge of x decay modes from low energy 
experiments indeed makes this an ideal signature for the observations of new physics. 

The interesting transverse momentum range of x leptons spans from below 10 GeV up to at least 
500 GeV. Experiments at the LHC will thus have to identify them in a wide momentum range. The low 
energy range should be optimized for analyses related to W and Z boson observability with x decays and 
also to Higgs boson searches and SUSY cascade decays. The higher energy range is mostly of interest 
in searches for heavy Higgs bosons in MSSM models and for extra heavy W and Z gauge bosons. For 
illustration, Fig. 1 shows the transverse energy spectrum of the visible decay products of x leptons from 
different processes of interest normalized to the predicted cross-section with which they will be produced 
at the LHC and to an integrated luminosity of 10 fb _1 . 

The reconstruction of x leptons is usually understood as a reconstruction of the hadronic decay 
modes, since it would be difficult to distinguish leptonic modes from primary electrons and muons. 
Despite a strong physics motivation for exploring data with x leptons in the final state, their reconstruc- 
tion at hadron colliders remains a very difficult task in terms of distinguishing interesting events from 
background processes dominated by QCD multi-jet production. Another related challenge is providing 
efficient triggering for these events while keeping trigger rates at manageable levels. 

The availability of various decay modes makes x leptons a rich but not totally unique signature. 
Hadronically decaying x leptons 1 are distinguished from QCD jets on the basis of low track multiplicities 
contained in a narrow cone, characteristics of the track system and the shapes of the calorimetric showers. 
Isolation from the rest of the event is required both in the inner detector and the calorimeter. From this 

will often use notation Tf, a d in this note when discussing objects reconstructed from the visible part of the hadronic 
decay products of a T lepton. 
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Figure 1: The visible transverse energy of x leptons from different physics processes: top quark decays, 
W/Z production, Standard Model vector boson fusion Higgs boson production for m# = 120 GeV with 
H — >• xx, for x leptons from low energy Supersymmetry with a light stau (SU1 sample), heavy Z' bosons, 
and heavy Higgs bosons from bbH production in the MSSM with tan/3 = 20(45) for masses of 400 GeV 
(800 GeV). 

information, a set of identification variables is built, to which either a traditional cut-based selection or 
multi-variate discrimination techniques are applied. 

The inner detector provides information on the charged hadronic track or the collimated multi-track 
system reconstructed in isolation from the rest of the event. These tracks should neither match track seg- 
ments in the muon spectrometer nor reveal features characteristic of an electron track (e.g. high threshold 
hits in the Transition Radiation Tracker). In the case of a multi-track system, they should be well col- 
limated in (tj,0) space and the invariant mass of the system should be below the x lepton mass. The 
charge of the decaying x lepton can be directly determined from the charge(s) of its decay product(s). 

Calorimetry provides information on the energy deposit from the visible decay products (i.e. all 
decay products excluding neutrinos). Hadronically decaying x leptons are well collimated (with an 
opening angle limited by the ratio m T /E T ) leading to a relatively narrow shower in the electromagnetic 
(EM) calorimeter with, for single-prong decays with one or few 71° 's, a significant pure electromagnetic 
component. On average in this case about 55% of the energy is carried by 7T°s present among the decay 
products. 

The calorimeter and tracking information should match, with narrow calorimeter cluster being found 
close to the track(s) impact point in the calorimeter. Furthermore, the invariant mass of the cluster should 
be small and the cluster should be isolated from the rest of the event. 

The algorithms for the reconstruction of hadronically decaying x leptons are considered higher level 
reconstruction as they use components provided by algorithms specific to different subdetectors like track 
reconstruction in the inner detector or topological clustering of the energy deposits in the calorimeter. At 
present, two complementary algorithms have been implemented into the ATLAS offline reconstruction 
software. 

• The calorimetry-based algorithm starts from clusters reconstructed in the hadronic and electromag- 
netic calorimeters and builds the identification variables based on information from the tracker and 
the calorimeter. 

• The track-based algorithm starts from seeds built from few (low multiplicity) high quality tracks 
collimated around the leading one. The energy is calculated with an energy-flow algorithm based 
only on tracks and the energy in the electromagnetic calorimeter. All identification variables are 
built using information from the tracker and the calorimeter. 
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A short overview of the features of x lepton decays is included in Section 2. In Section 3 selected 
topics on the performance of the detector directly relevant to the reconstruction and identification of the 
hadronically decaying x leptons are discussed. Offline reconstruction algorithms and performance results 
are described in Section 4. In the remaining part of the note strategies for analyses using x leptons with 
the first 100 pb _1 of data are presented. 

2 Topology of t leptons in LHC collisions 

The transverse momentum range of interest spans from below 10 GeV up to 500 GeV. x leptons decay 
hadronically in 64.8% of all cases, while in ~ 17.8% (17.4%) of the cases they decay to an electron 
(muon) [1]. From the detection point of view, hadronic modes are divided by the number of charged 7is 
among the decay products into single-prong (one charged %) and three -prong (three charged ns) decays. 
The small fraction (0.1%) of five-prong decays is usually too hard to detect in a jet environment. The 
X — > n ± v mode contributes 22.4% to single-prong hadronic decays and the x — >• «7r°7r ± v modes 73.5%. 
For three-prong decays, the x — > 3n ± v decay contributes 61.6%, and the x — > nn°3n ± v mode only 
33.7%. In general, one- and three-prong modes are dominated by final states consisting of n ± and 7T°. 
There is a small percentage of decays containing K* 1 which nevertheless can be identified using the same 
technique as for states with n ± from the ATLAS detector point of view. A small percentage of states 
with cannot be easily classified as belonging to either the single -prong or three-prongs categories as 
the number of registered prongs depends on the actual K$ interaction within the detector. Unless specific 
studies are done, other multi-prong hadronic modes can be safely neglected. 

The lifetime of the X lepton (cx = SI. Hum) in principle allows for the reconstruction of its decay 
vertex in the case of three-prong decays. The flight path in the detector increases with the Lorentz 
boost of the x lepton, but at the same time the angular separation of the decay products decreases. A 
resulting transverse impact parameter of the x decay products can be used to distinguish them from 
objects originating from the production vertex. 

The incorporation of spin effects in x lepton decays is often of importance. This was done within the 
framework of the ATLAS Monte Carlo simulation and events were generated using PYTHIA [2] interfaced 
with TAU0LA [3]. The generation process has correctly included full spin correlations in production and 
decays of the X leptons. Tau leptons from the decay of gauge bosons, Higgs bosons or in SUSY cascade 
decays will carry information on the polarisation of the decaying resonance and in the case of pair 
production also some information on the spin correlations. Tau leptons from W — ► TV and — > xv 
will be 100% longitudinally polarised, with P r = +1.0 and P x = -1.0 respectively, resulting in different 
distributions of the charged to total visible energy for single-prong decays in the center-of-mass system 
of the decaying resonance. At the LHC this effect can be used to suppress W — ► XV background and to 
increase the — > xv observability [4]. The x polarisation could also be used as a tool to discriminate 
between MSSM versus Extra Dimension scenarios [5]. The longitudinal polarisation of x leptons from 
neutral Higgs boson decays will be democratic with 50% probability, thus x leptons from Higgs boson 
decays are effectively not polarized. The polarisation of x leptons from Z boson decays will be a more 
complicated function of the center-of-mass energy of the system and the angle of the decay products [6]. 
In the cleaner environment of the ILC and also perhaps at the sLHC, building variables sensitive to the 
longitudinal and transverse spin correlations may lead to a CP measurement of the Higgs boson [7,8]. 

3 Performance of the ATLAS detector for t identification 

Hadronic x decays can be efficiently reconstructed and identified using information from the inner de- 
tector and from the calorimeter. Reconstruction is done only for the visible part of the decay products, 
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however, for specific analyses like H — > xx the complete invariant mass of the xx system may be re- 
constructed using the collinear approximation [9] (neutrino momenta parallel to that of the visible decay 
products). A few selected topics related to the performance of the detector are discussed below before 
turning to the reconstruction algorithms. 

3.1 Tracking and vertexing 

The reconstruction of tracks from charged pion decays is an important ingredient of the Xhad reconstruc- 
tion algorithms. The track-based algorithm is seeded by one or more good quality tracks which allow 
for the calculation of the Xhad energy with the so called energy-flow scheme. Both the calo-based and 
the track-based algorithm determine the charge of the Xhad candidate by summing up the charge(s) of 
the tracks reconstructed in the x had core region 2 . The tracking information is further used to identify 
hadronically decaying x leptons and to discriminate them against the background from hadronic jets 
by considering the track multiplicity, the impact parameter and the transverse flight path in the case of 
multi-track candidates. The track selection should therefore ensure high efficiency and quality of the 
reconstructed tracks over a broad momentum range from 1 GeV to a few hundred GeV. 

3.1.1 Reconstruction efficiency and track quality 

The efficiency for track reconstruction in x decays is defined as the probability for a given charged % 
from a x decay to be reconstructed as a track. With respect to the reference tracking performance of 
the detector established for single muons in the low p T range a degradation due to hadronic interactions 
(a charged % interacting with the material of the inner detector) is expected. In the higher pj range a 
degradation is caused by the strong collimation of the multi-track system for three-prong decays. 

Good quality tracks reconstructed with pj as low as 1 GeV are required by the track-based algorithm, 
while the calorimeter-based algorithm accepts any track with pj > 2 GeV. A standard quality selection 
has been defined in Ref. [10]. However, for the reconstruction of x leptons a somewhat stricter selection 
has been applied. Good quality tracks are required to satisfy # 2 /n.d.f < 1.7, to have a number of pixel 
and SCT hits > 8 and transverse impact parameters do < 1mm. For the leading track in addition the 
number of low threshold TRT hits has to be larger than 10 in a pseudorapidity T7 range up to 1.9, while 
for the second or third track the presence of a B -Layer hit and ratio of the of high- to-low threshold hits of 
smaller than 0.2 are required. Both requirements were added to minimize the number of accepted tracks 
from conversions. A dedicated veto against electron tracks being used as leading tracks is not applied at 
the reconstruction level. This will be taken care of separately as part of the identification procedure. 

Figure 2 shows the reconstruction efficiency for pj = 1 — 50 GeV using the standard quality selection 
as defined in Ref. [10]. Adding the additional quality criteria as described above, the overall efficiency 
for reconstructing good quality tracks from x lepton hadronic decays is reduced to 82 — 83%. The 
reconstruction efficiency is slightly higher for tracks from single prong decays compared to three-prong 
decays, where tracks could be very collimated particularly for boosted x leptons. 

3.1.2 Charge misidentification 

The charge of the x lepton is calculated as the sum of the charges of the reconstructed tracks. For the 
leading track, which is required (e.g. by the track-based algorithm) to have a transverse momentum 3 
larger than 9 GeV, charge mis-identification is limited to ~ 0.2% using the quality cuts described above. 

2 The core region for the track-based (calo-based) algorithm is understood here as AR < 0.2(0.3) cone in (rj, <j>) around the 
reconstructed direction of the visible decay products. 

3 The threshold p T > 9 GeV on the leading track was used for results presented here, while it was lowered to 6 GeV in the 
more recent software releases. 
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Figure 2: Reconstruction efficiency for tracks from charged ns for one- and three-prong hadronic x 
decays from W — > XV and Z — > xx signal samples as a function of the transverse momentum of the track 
(left) and of the pseudorapidity for three different ranges of track p T (right). 



The overall charge mis-identification probability for the hadronically decaying x lepton is however dom- 
inated by combinatorial effects: single-prong decays may migrate to the three -prong category due to 
photon conversions or the presence of additional tracks from the underlying event. A three-prong decay 
might be reconstructed as a single-prong decay due to inefficiencies of the track reconstruction and selec- 
tion. This overall charge mis-identification is estimated to be below ~ 3.6% without requiring additional 
quality cuts. In fact, the X charge misidentification is dominated by a combination of effects, but the 
contributions from the charge misidentification of the individual tracks should not be neglected. 

Table 1 shows the percentage of contamination for one- and three -prong candidates using the afore- 
mentioned quality criteria for tracks in the core region. For the roughly 3.9% contamination of the 
single-track candidates from three-prong decays, about 85% are due to hadronic interactions. A 3.8% 
contamination of three-track candidates from one-prong decays is observed with 70% of them being due 
to conversions. The percentage of the overall charge misidentification is also shown. Requiring at least 
one B -Layer hit reduces the charge misidentification both in the case of electron tracks from conver- 
sions and in the case of hadronic interactions at low radii. However, this happens at the expense of an 
additional loss in efficiency in particular for three -prong decays. 

3.1.3 Tracks from conversions 

Photons from 71° decays might convert in the material of the inner detector and then contribute additional 
tracks to the core or isolation region of the Xhad candidate. This could result in one-prongs being recon- 
structed as three-prong candidates, in an inefficiency of the reconstruction and identification criteria and 
in a degradation of the energy resolution as calculated from the energy-flow algorithm. 

A large fraction of reconstructed Xhad candidates are accompanied by conversions. In 1.5% of the 
cases a conversion electron is reconstructed as the leading track of the one -prong candidate, while 5.7% 
of the three -prong candidates contain one reconstructed track coming from a conversion electron. In 
Table 1 the effects of charge misidentification and contamination from photon conversions are quantified. 

3.1.4 Impact parameter 

The mean proper lifetime of the x lepton is about 0.29 ps. Although the lifetime of the x lepton is about 
five times shorter than that of the ft-quark, the transverse impact parameters of its decay products are 
still useful for x identification. The impact parameters, do and zosin(O), have been studied for one-prong 
candidates reconstructed by the track-based algorithm. The transverse impact parameter do is defined 
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Table 1: Percentage of one- and three prong x lepton hadronic decays within reconstructed one-, two- 
and three-prong Xhad candidates by the track-based algorithm, matched to true x decays. Tracks in 
a cone of AR = 0.2 around the leading good quality track are considered. A transverse momentum 
of pr > 9 GeV is required for the leading track. An estimate for electron contamination and charge 
misidentification is given in addition. Separately specified are results for a subsample where no hadronic 
secondary interaction of primary charged % was recorded inside the inner detector volume. Events from 
Z — > xx and W — > XV samples were used. 



Seeds for track-based 


Reconstructed as 


Reconstructed as 


Reconstructed as 


?had -candidates 


single-prong 


three-prong 


two-prong 


Electron contamination 








(from conversion) 


1.5% 


5.7% 


2.9% 


x — ► n ± nn°v 


96.1% 


3.8% 


23.8% 


x -> l^niPv 


3.9% 


96.2% 


76.2% 


Charge misid. 


1.7% 


3.6% 




(no had. interact.) 


0.4% 


2.1% 
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Figure 3: Transverse (left) and longitudinal (right) impact parameter resolution as a function of | T] | from 
a one -prong Z — > TT sample. The open (full) circles are from t — »■ 7r(7r°)v (T — > /xvv) events. 



as the smallest distance in the transverse plane between the track and the reconstructed primary vertex. 
The impact parameter zo is given by the distance in z-direction between the reconstructed primary vertex 
and the point of closest approach in the transverse plane of the track multiplied by sin(0) to obtain the 
component transverse to the track direction. Tracks assigned to the T had candidate are not used in the 
primary vertex fit. 

In Fig. 3 the resolution of the transverse (left) and longitudinal (right) impact parameters are shown as 
a function of |tj |. The resolution for final state muons and pions from T —> fiw and x —> n(n°)v decays 
are similar: about 13 jJ,m for |t] | < 1.0 and about 50 jum for \r} \ > 1.0 for the transverse and longitudinal 
impact parameters, respectively. No degradation related to hadronic interactions for x — ► 7TV decays 
is observed. This has been verified by studying events with elastic interaction in the Inner Detector 
(hadronic interaction), defined as events where the outgoing 71* carries more than 90 % of the transverse 
energy of the incoming n ± . This observation is consistent with that presented in Ref. [11]. 

In Fig. 4 distributions of the significances of the impact parameters are presented. The significance 
is defined as the impact parameters divided by its estimated error. The distribution shows a moderate 
discrimination power between one-prong candidates reconstructed from hadronic x decays and fake one- 
prong candidates. Due to the limited resolution of the longitudinal impact parameter, the separation 
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Figure 4: Significances of the impact parameters do (left) and zo sin(0) (right) for 1 -prong Xh a d candidates 
reconstructed by the track-based algorithm. Distributions are shown for x had candidates reconstructed 
from x decays and for fake candidates which do not originate from the decays of b- or c-hadrons. 

between the two classes is less significant in that case. 

3.1.5 Secondary vertex reconstruction and transverse flight path 

The significant lifetime of the x lepton (cx = 87.11/xm) allows the reconstruction of its decay vertex 
for three-prong decays. Currently, five vertex fitting algorithms [12-14] are implemented in the ATLAS 
reconstruction framework. Among these the adaptive vertex fitter [13], an iterative re-weighted fit which 
down-weights tracks according to their weighted distance to the vertex, was found to give the optimal 
performance. 

To estimate its performance, secondary vertex fits were performed using tracks associated with Xhad 
candidates from Z — >• xx and W — > XV events. The quality criteria applied in the track-based reconstruc- 
tion were required to be met by the tracks. The X\ m d candidates associated with a true hadronic x decay 
were divided into two classes. Candidates with three tracks successfully matched to true particles com- 
ing from the same true three-prong hadronic x decays were used as a reference. These candidates are 
denoted hereafter as fully-matched. The second class is composed of the remaining candidates with at 
least two tracks of which at least one is matched to a true particle coming from a hadronic X decay. These 
candidates are denoted hereafter as partially matched. 

The resolution of the secondary vertex position varies strongly if measured in the perpendicular or 
parallel direction with respect the momentum of Xi M d candidate. The resolution on the position of the 
secondary vertex calculated in the plane perpendicular to the momentum of the Xhad candidate is expected 
to be better than in the parallel direction due to the collimation of tracks. To estimate the resolution in the 
transverse plane, the residuals of the vertex position in the direction perpendicular to both momentum of 
a Xhad candidate and the beam axis were calculated. The distributions were approximated by a double 
Gaussian fit. For fully matched three -prong X] w d candidates there is no significant difference between the 
distributions obtained with different fitters. The distributions of residuals of the secondary vertex position 
obtained with the adaptive fitter, parallel and perpendicular to the direction of flight of the Xhad candidate 
are presented in Fig. 5. Shown in Table 2 are the resolution 4 , the mean values of the fit and 68.3 % and 
95.0 % coverages 5 for the fully matched, partially matched and combined samples. As expected, the 
transverse resolution (a ~ 10 }Xm) is far more accurate than the parallel one (a ~ 600 jlm). The non- 
Gaussian tails are significant in both cases, but far more important in the case of the parallel component. 
A precise reconstruction of the transverse flight path is therefore possible, which is important for further 

4 ln the case of a double Gaussian fit, the width of the central Gaussian will be quoted as the resolution hereafter. 
5 The coverage is the half- width of a symmetric interval covering a given percentage of the distribution. 
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Figure 5: Residuals of the secondary vertex position parallel and perpendicular to the direction of flight 
of the Thad candidate using the adaptive vertex fitter. Fully (solid) and partially (open) matched three- 
prong Xhad candidates reconstructed with the track-based algorithm from Z — > tt and W — ► TV processes 
are used. 



Table 2: Resolution and mean of the distribution of residuals of the secondary vertex position in the 
directions parallel and transverse to that of the reconstructed momentum vector of the Xhad candidate as 
obtained from the adaptive vertex fitter. Candidates with up to three associated tracks reconstructed by 
the track-based algorithm were used. The resolution quoted is the a of the core Gaussian of a double 
Gaussian fit in the range [—4 mm, 4 mm] in the parallel direction and [—50 Aim, 50 Aim] in the transverse 
direction. The 68.3% and 95% coverages are also quoted. 





Resolution 


Mean 


68.3% 


95% 




Parallel 


Fully matched 3-prong 
Partially matched 
Combined 


0.593 ±0.008 mm 
0.703 ± 0.030mm 
0.613 ±0.008 mm 


0.006 ±0.006 mm 
-0.035 ± 0.020mm 
0.004 ±0.006 mm 


1.27 mm 
3.83mm 
1.89mm 


5.33 mm 
> 15 mm 
11.37 mm 




Transverse 


Fully matched 3-prong 
Partially matched 
Combined 


10.1 ±0.2 jum 
11.3 ±0.5 Jim 
10.5 ±0.2 Aim 


0.2±0.lAim 
-0.1 ±0.2 Jim 
0.1 ±0.1 Aim 


14.4 Aim 
20.9 Aim 
16.4Aim 


36.9 Aim 
72.2 Aim 
48.1 Aim 



rejection of the QCD background. It may also be possible to obtain a competitive measurement of the T 
lifetime, which requires a measurement of the flight path and momentum of the T lepton. 

Shown in Fig. 6 is the resolution on the transverse flight path as a function of the transverse mo- 
mentum and the pseudorapidity of the %had candidates. The resolution was obtained from a Gaussian fit 
to a central interval covering 80% of distributions of residuals of a transverse flight path for the adap- 
tive vertex fitter for fully matched three-prong candidates. In addition the 68.3% and 95% coverages of 
distributions of residuals are presented. 

Distributions of the significance of the transverse flight path, for different classes of three-prong can- 
didates are shown in Fig. 7. This distribution might be used to discriminate between true Xhad candidates 
and fake candidates from light jets. The discrimination in the case of b- and c- jets seems however to be 
difficult. 

An efficient rejection of tracks coming from photon conversions, decays of long-lived particles and 
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Figure 6: Resolution on the transverse flight path reconstructed with the adaptive vertex fitter for fully 
matched three -prong T/ !a j candidates as a function of the transverse momentum (left) and the pseudora- 
pidity (right). Standard deviations of Gaussians fitted to central intervals covering 80% of the residual 
distributions are shown (black points). In addition the 68.3 % and 95 % coverages of the distributions of 
residuals of the secondary vertex position are shown (dashed and dot-dashed lines). 
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a* %\ — fake (b/c jets) 
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Transverse flight path significance 

Figure 7: Significance of the transverse flight path for fully matched and partially matched three-prong 
and for fake candidates with and without hadrons containing b or c quarks (the contribution from semilep- 
tonic decays of b/c jets into % leptons was not subtracted). 
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Table 3: Single prong candidates: fractions with zero, one and two or more reconstructed n° subclusters. 



decay mode 


no 7T° subclusters 


1 71° subcluster 


> 2 7T° subclusters 


all t — ► hadv 


32% 


35% 




X — ► 7TV 


65% 


20% 


15% 


X — > pv 


15% 


50% 


35% 




9% 


34% 


57% 



hadronic interactions in the material reduce the number of one-prong candidates wrongly reconstructed as 
two- or three-prong candidates and improve the separation between correctly reconstructed three-prong 
candidates and candidates from light jets. It is a subject of further studies currently in progress. 

3.2 Reconstruction of 7T° subclusters 

The high granularity of the electromagnetic calorimeter in ATLAS allows for the identification of isolated 
subclusters from 7T°s inside the core region of the reconstructed x lepton hadronic decays. 

Studies have been performed based on the topological clustering algorithm [15] with only the mid- 
dle layer of the calorimeter used for finding primary maxima, and the strip layer used for finding the 
secondary maxima. The clustering was performed based on cells in a region AR < 0.4 around the direc- 
tion of the leading track satisfying pj > 9 GeV and only subclusters with center within AR < 0.2 were 
taken. A subtraction procedure was applied first to reduce the impact from energy deposits of nearby 
7T ± 's and of energy double-counting when adding the latter (track + 71° clusters) to reconstruct the visible 
Thad energy. Namely, before clustering procedure, cells being closest to the impact point of the track 
(AR < 0.0375) were removed. The subtraction was stopped when the subtracted energy exceeded 70% 
of the track momentum. In the case of coincidence of large energy deposits in the hadronic calorime- 
ter (above 40% of the track momentum) and in the presampler+strip layer close to the track, indicating 
superposition of 7T° and %^ showers, cells were subtracted only from the middle layer up to the point 
where the transverse energy of the remaining cells exceeded 2.5 • E T collected in the presampler+strip 
layer (always counted in AR < 0.0375 from the track impact point). 

Reconstructed subclusters were required to have Ej > 1 GeV and be separated by AR > 0.0375 from 
the impact point of the track in the middle layer. In addition, subclusters were accepted if their recon- 
structed energy in the strip+presampler layers exceeded 10% of their total energy. These requirements 
efficiently removed about 50% of satellite clusters from charged 7Ts in the case of x — ► 7TV decays. Ta- 
ble 3 summarizes the results in terms of the fraction of one-prong candidates reconstructed with a given 
multiplicity of 7T° subclusters. 

Reconstructing the track and 7T° subclusters for single-prong decays allows for the definition of the 
energy and visible mass of the hadronic x decays from the vector sum of both components. The procedure 
was evaluated for one -prong decays from W — ► XV events, i.e. for x leptons with visible transverse 
momenta below 50 GeV. Figure 8 shows the response and resolution obtained by this algorithm for 
reconstructing the visible energy in decays of type X — > pv from W — > XV events in case at least one 7T° 
subcluster is reconstructed. A Gaussian fit to the core region of the distribution yields a resolution of 
4.6% with an effective shift of —2.4%, dominated by the calibration of the electromagnetic calorimeter 
not being optimal for the 7T° subcluster reconstruction. 

As a final benchmark for the quality of the 7T° cluster reconstruction discussed above, the invariant 
mass of T — > pv — > K°KV decays is reconstructed from the track + 7T° subcluster system which is more 
difficult than the reconstruction of the transverse energy only since the resolution is dominated by the 
precision of the reconstruction of the angle between the charged and the neutral pion. Figure 8 (right) 
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Figure 8: The energy response obtained for the visible energy from x — > pv events using candidates 
with one n° subcluster (left). The invariant mass of the visible decay products for hadronic single-prong 
x — ► pv, x — > ci\ (— > 2n°n) v, and x — > nv decays using candidates from W — > xv events with at least one 
n° subcluster reconstructed (right). 




Figure 9: The efficiency of the electron veto algorithm for W — ► XV (rectangles) and W —> ev (triangles) 
events as a function of | r\ | and p T of the leading track. 

shows the reconstructed visible mass for x — > pv, X — > a\(— > 2n Q n)v, and x — > KV decays. The relative 
contributions are proportional to the branching fractions convoluted with the experimental efficiencies 
of the algorithm applied to inclusive hadronic decays of the x lepton. If more than one 71° subcluster is 
reconstructed the energy weighted barycenter of the cluster system is taken. 

3.2.1 Combined veto on electron tracks 

An efficient rejection of tracks originating from isolated electrons is important for rejecting backgrounds 
for example from W —>■ ev and Z —>■ ee events. One possibility would be to reject tracks that have been 
identified as good electron candidates by the standard electron reconstruction algorithm. With the so 
called tight selection this algorithm is found to reject ~ 85% of all electrons from W — > ev events with a 
loss of efficiency for true hadronic x decays with p l j, ack > 9 GeV of less than 1%. 

To achieve a more stringent selection, a dedicated algorithm to veto electrons has been developed 
aiming at a higher rejection rate while, at the same time, retaining a high fraction of hadronic x decays. 
It is based on the following variables: 

• The energy deposited in the hadronic part of the calorimeter (£" HCAL ). 

• The energy not associated with a charged track in the strip compartment of the electromagnetic 
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Table 4: Efficiency for hadronically decaying x leptons and true electrons from W — > TV for passing 
the electron veto algorithm. The numbers given are normalized to true electrons with pj > 9 GeV and 
1 77 1 < 2.5 (vs. true e) and to reconstructed one-prong or three-prong candidates with the leading track 
being matched to a n from W — > TV events (vs. reconstructed Thud)- The probability that an electron from 
W — > ev events with pj > 9 GeV and \ r\ \ < 2.5 is reconstructed as one-prong (three -prong) candidate is 
~ 70% (~ 0.7%). In addition the performance of the standard algorithm for electron reconstruction [16] 
is shown. The statistical uncertainty on the numbers presented here is at the level of 0.1 - 0.5%. 





Reconstructed as 


Reconstructed as 


Overall 


Candidates 


single-prong 


three-prong 




Electron- veto algorithm 








T from W — > TV (vs reconstructed z had ) 


94.1% 


96.2% 


94.9% 


Electron from W — > ev (vs true e) 


1.5% 


< 0.1% 


1.6% 


Standard algorithm (tight selection) 








T from W — ► TV (vs reconstructed Xh a d) 


99.9% 


99.9% 


99.9% 


Electron from W — ► ev (vs true e) 


15.6% 


0.4% 


16.4% 


Standard algorithm (medium selection) 








T from W — ► TV (vs reconstructed Xhad) 


90.6% 


95.1% 


92.1% 


Electron from W — > ev (vs true e) 


4.2% 


0.2% 


4.6% 



part of the calorimeter (£max )• 

• The ratio in the transverse plane of the associated energy in the electromagnetic calorimeter and 
the track momentum (Ej /pt)- 

• The ratio of the number of high threshold to low threshold hits (including outliers) in the TRT 
(Nht/Nlt)- 

The first two variables are used to divide tracks into categories in which discrimination is provided by 
fixed cuts on the remaining two variables. 

The algorithm yields a rejection factor of 60 against electrons from W — > ev events at the expense of 
losing 5% of the signal from W —> TV events. The efficiency of the algorithm 6 as a function of \r) \ and 
Pt is shown in Fig. 9 and its performance is summarized in Table 4. For completeness results from the 
standard electron reconstruction algorithm are also shown. The dedicated electron-veto described here 
gives much better efficiency for rejecting isolated electrons from W decay than the standard electron 
reconstruction algorithm for comparable loss in accepting true hadronic T decays. 

4 Offline algorithms for t reconstruction 

Two complementary algorithms for the reconstruction of hadronic x decays have been implemented 
in the ATLAS offline reconstruction software. Each algorithm is discussed separately below and their 
performance is compared. 

6 For hadronic decays of t leptons the efficiency is defined w.r.t. the reconstructed % ad candidates and for electrons w.r.t. to 
all electrons inside |?7| < 2.5 with p T > 9 GeV. 
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Figure 10: The ratio of the reconstructed (Ej) and the true (Ej~ vls ) transverse energy of the hadronic 
X decay products is shown as a function of the visible true transverse energy Ej Vls (left), calculated in 
1 77 1 < 2.5 and |tj| (right) for taus from Z — > xx (triangles) and A — > xx with m A = 800 GeV (squares) 
decays. The ordinate value is the mean and the error bars correspond to the sigma of the Gaussian fit 
performed in the range 0.8 < E T /Ej ,vls < 1 .2. The results are obtained after applying the loose likelihood 
selection, see below. 

4.1 The calorimeter-based algorithm 

In this approach [17], hadronically decaying x candidates are reconstructed using calorimeter clusters as 
seeds. They are obtained from a sliding window clustering algorithm applied to so called calorimeter 
towers which are formed from cells of all calorimeter layers on a grid of size Atj x A(j) = 0.1 x 2tt/64. 
The energy and position are calculated from the clusters, while all cells with the full granularity of the 
corresponding calorimeters are used to calculate the quantities involved in x identification as described 
in the following. Only clusters with a transverse energy Ej > 15 GeV are used. The probability for a 
true T to be reconstructed as a cluster increases from 20% to 68% over the visible x transverse energy 
range from 15 to 20 GeV and saturates at 98% for E T > 30 GeV. 

All cells within AR < 0.4 around the barycenter of the cluster are then calibrated with an HI -style 
calibration [18]. The cell weights are a function of the cell energy density, r\ and the calorimeter region. 
These weights have been optimized for jets [18] and only approximately for hadronic X decays. The 
mean and sigma of a Gaussian fit to the ratio of the reconstructed and the generated energy of the visible 
X decay products, Ej~ vls , in the range from 0.8 to 1.2 is shown in Fig. 10 as a function of Ej~ vls and 
r\. The resolution is of the order of 10% and an offset in the range from +5 to -7% is observed in the x 
energy range from 20 to 50 GeV, while at larger energies the offset is of the order of -3 to -5%. 

Several quantities that exploit the x lepton properties have been combined in a likelihood function 
to discriminate hadronic x decays from fake candidates originating from QCD jets. These quantities are 
described in the following: 

• The electromagnetic radius R em : 

To exploit the smaller transverse shower profile in x decays, the electromagnetic radius R em is 
used, defined as 



where i runs over all cell in the electromagnetic calorimeter in a cluster with AR < 0.4. The 
quantities r\i, fa, and Etj denote their position and transverse energy in cell i. Cells may have 
different sizes depending on the layer and their 77 value. The size varies from At] x A(j> = 0.003 x 




(i) 



242 



Tau Leptons - Reconstruction and Identification of Hadronic t Decays 



0.1 in the tj -strip region of the barrel to 0.025 x 0.025 for the second calorimeter layer. This leads 
to a dependence of the performance on r\ . This variable shows good discrimination power at low 
E T but becomes less effective at higher E T . 

• Isolation in the calorimeter: 

Clusters built from hadronic x decays are well collimated and therefore rather tight isolation crite- 
ria can be used. Here a ring of 0.1 < AR < 0.2 was chosen as the isolation region and the quantity 

AE? = ^, (2) 

is calculated, where the indices i and j run over all electromagnetic calorimeter cells in a cone 
around the cluster axis with 0.1 < AR < 0.2 and AR < 0.4, respectively, and E T)i and E T j denote 
the transverse cell energies. 

Like R em , the AE^ 2 distribution shows an Ej dependence and becomes narrower with increasing 
Ej. This variable also depends on the event type and is expected to be less effective for events 
with higher hadronic activity, like e.g. tt events. 

• Charge of the x candidate: 

The charge of a x candidate is defined as the sum over the charge(s) of the associated track(s). The 
misidentification of the charge on the level of a few percent shows almost no Ej dependence. 

• Number of associated tracks: 

The number of tracks, N tr , associated with a given cluster within AR < 0.3. The tracks are required 
to have pr>2 GeV and no specific requirements on the quality of the track reconstruction is made. 
A significant fraction of events with zero, two, and even four tracks is observed for true hadronic 
x decays. 

• Number of hits in the tj strip layer: 

The number of hits in tj direction in the finely segmented strip detector, N str i p , in the first layer of 
the electromagnetic barrel calorimeter is also used in the likelihood discrimination. Cells in the 
tj strip layer within AR < 0.4 around the cluster axis are counted as hits if the energy deposited 
exceeds 200 MeV. In contrast to jets, a significant fraction of X leptons deposit nearly no energy in 
the tj strip layer (t — »■ nv decays) and the number of corresponding hits is small. 

• Transverse energy width in the tj strip layer 

The transverse energy width Atj is defined as 



Atj = , 



lLl4T P (H,-Hcluster)2 n , 



where the sum runs over all strip cells in a cone with AR < 0.4 around the cluster axis and E Ti p 
is the corresponding strip transverse energy. Like R em it is a powerful discriminator at low E T but 
loses discrimination power with increasing Ej for higher collimated high Ej jets. 

» Lifetime signed pseudo impact parameter significance: 

At present only a 2-dimensional impact parameter, also called the pseudo impact parameter, is 
used. It is defined as the distance from the beam axis to the point of closest approach of the track 
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Figure 11: The distributions of a few discriminating variables (electromagnetic radius, energy isolation, 
transverse energy width in the r\ strip layer and Ej over pj\ of the leading track) used in the calorimeter- 
based tau identification for true tau decays and jets with visible transverse cluster energies Ej in the 
range from 40 to 60 GeV and track multiplicities between 1 and 3. 

in the plane perpendicular to the beam axis. From this information and from the jet axis, a quantity 
denoted as lifetime signed pseudo impact parameter significance, defined as sig rfo = do/ dj 0 where 
a is the impact parameter resolution, is calculated. 

• E T over p T of the leading track: E T /p T \ : 

For t decays a large fraction of the energy is expected to be carried by the leading track and the 
ratio of the cluster energy Ej to the momentum of the leading track pj\ is expected to be large, 
close to 1. This provides another discrimination against QCD jets, which are expected to have a 
more uniform distribution of pj among the tracks. They are also expected to have more additional 
neutral particles. Values above one are also expected from z decay modes involving additional 7T°s 
and for three -prong decays. The Ej dependence is rather modest for % decays but more pronounced 
for QCD jets, which tend to become more signal like with higher Ej. 

In Fig. 1 1 the distributions of a few discriminating variables are shown for signal and backgrounds 
for transverse cluster energies Ej in the range between 40 and 60 GeV and for candidates with 1 or 3 
tracks. 

For the calorimeter-based algorithm the z identification is based on a one-dimensional likelihood 
ratio constructed from three discrete variables (A^, AWp and the charge of the t lepton) and five continu- 
ous variables (R em , A&p, At], sig d(j , and Ej/pt. i). For the discrete variables the ratios are directly taken 
from the reference histograms. For the continuous variables fits of appropriate functions to each variable 
for all E T bins have been performed. The distribution of the likelihood for taus and jets are shown in 
Fig. 12. Despite any limitation from using only one-dimensional distributions it shows a good separation 
power. 
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Likelihood Efficiency 

Figure 12: Left: The log likelihood (LLH) distribution for % leptons (solid) and jets from QCD production 
(dashed). The likelihood is applied after a preselection on the number of associated tracks, i.e. requiring 
1 <N tr <3. (Candidates with LLH < — 10 had variables outside the boundaries of histograms used when 
obtaining the PDFs for the likelihood calculation). Right: Efficiency for t leptons and rejection against 
jets for different Ej ranges, achieved with the likelihood selection. 

It should be noted that the z identification efficiency chosen to keep enough signal events and to 
achieve the necessary rejection against background depends on the physics channel. Despite the use of 
Ej bins the likelihood discrimination shows a residual Ej dependence. Therefore, a fixed cut on the 
likelihood value neither will result in a generally flat efficiency, nor will it be optimal. 

4.2 The track-based algorithm 

In this approach [19], the visible part of the hadronically decaying % lepton is seen as a very well colli - 
mated object consisting of charged and neutral pions, with the charged component being the leading one, 
i.e. reproducing well the direction of the visible decay products and having significant transverse mo- 
mentum. This assumption is followed by the requirement of a low multiplicity of tracks reconstructed in 
the region considered the core of the Thad candidate, and the requirement of only minimal energy deposit 
in the isolation region around the core. The energy-scale of the object and the calorimetric variables 
used in the identification are built following this picture. For most of the analyses only one-track and 
three-track candidates should be used. Including candidates with two tracks helps to recover a large 
fraction of lost three-prong candidates, however it also significantly increases the background from QCD 
events, in particular for T leptons with visible transverse momentum below 30 GeV. Candidates with 
track multiplicities larger than three should be used for monitoring the level of fake candidates only. 

The reconstruction step consists of identifying and qualifying a leading hadronic track 7 which be- 
comes a seed for building the T candidate. Then up to six additional tracks are allowed in the core 
region. The (tj, 0) position of the candidate is taken from the direction of the track at the vertex or the 
track-/??- weighted bary-center in the case of multi-track candidates and the energy of the candidate is 
calculated from the energy flow method. In addition charge ±1 or 0 is required in the case of multi- 
prong candidates. The identification step consists of calculating calorimetric and tracking quantities 
and then providing a decision either based on selection with cuts or a discriminating variable based on 
multi-variate techniques. 

7 The threshold p T > 9 GeV on the leading track was used for results presented here, while it was lowered to 6 GeV in the 
more recent software releases. 
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4.2.1 The energy - flow approach 

The energy scale of the hadronic Thad candidate is defined using an energy flow algorithm. The energy 
deposit in cells is divided into categories. 

• The pure electromagnetic energy, £^ mcl : 

The energy is seeded by an isolated electromagnetic cluster which is isolated from the good quality 
tracks and which has no substantial hadronic leakage. The energy is collected in a narrow window 
around the seed. Only presampler, strip and middle layers are used. 

• The charged electromagnetic energy, Ej IgEM , ££ hrgHAD : 

The energy is seeded by the impact point of the track(s) in each layer and the energy is collected 
in a narrow window around it. 

• The neutral electromagnetic energy, Ef euEM : 

The energy is seeded by the (t], <p) of the track at the vertex and in each layer the closest cell is 
searched for. The energy is collected from not yet used cells in a cone of AR = 0.2 with respect to 
the cell closest to the impact point. Only presampler, strip and middle layers are used. 

In the energy-flow approach the charged energy deposits E^ rgEM + £'£ hrgHAD are replaced by the 
track(s) momenta (no hadronic neutrals) in order to define the energy scale of the Thad- The contribution 
from ;r°'s is included in £| mcl and Ef euEM ; the effects of 7T° and n ± depositing energy in the same 
calorimeter cells or charged energy leakage outside a narrow cone around the track are corrected by 
adding two terms: y2res£"y hrgEM and resEj euEM . The complete definition for the energy scale £|. flow reads 
as follows: 

E f ow = £f mcl + ££ euEM + ^pf Ck + £W? rgEMtrk + res£r EM (4) 
The fractional energy response, calculated as (E rec - E truth ) /E truth , for one and three prong candidates is 
shown in Fig. 13. 

The evident advantage from using the above approach for defining the energy scale comes from the 
fact that while performing well for true hadronic decays of T leptons, it significantly underestimates the 
nominal energy of fake T/^s from jets. This effect is rather obvious since a cone of AR = 0.2 is too 
narrow to efficiently collect the energy of a QCD jet (particularly with low transverse momentum) and 
also since a large fraction of the neutral hadronic component is largely omitted in the definition itself (as 
the energy deposit in the hadronic calorimeter does not contribute to the energy calculations). This leads 
to a faster falling background spectrum as a function of E T compared to that using calibrated calorimetric 
clusters as implemented in the calorimeter-based algorithm (Section 4.1). This method leads however to 
more non-Gaussian tails in the fractional energy response than the more conventional energy estimates 
from calorimetry only. 

4.2.2 Identification with calorimetric and tracking variables 

Several calorimetric and tracking variables are used to discriminate a narrow, low track multiplicity 
Thad cluster from a hadronic cluster originating from quarks or gluons. If not stated otherwise, the 
calorimetric and tracking identification quantities are calculated from cells/tracks within a core cone 
of AR = 0.2 around the seed. The isolation criteria used here are checked in an isolation cone AR = 
0.2 — 0.4. Please note, that although some definitions are very similar for the calorimeter-based and 
track-based algorithms, in case of the latter the narrower core cone is often used for the calculation of 
calorimetric quantities and a more explicit distinction between core and isolation cone is made. 

Not all discriminating quantities discussed in Section 3 have been already implemented in the iden- 
tification procedure. In particular transverse impact parameter, transverse flight path and categorizing 
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Figure 13: The fractional energy response for single-prong (left) and three-prong (right) true %had candi- 
dates reconstructed with the track-based algorithm. Events from a W — > TV sample are shown. 

single-prong candidates using 71° subclusters have been added only in the current releases of the recon- 
struction software. Therefore they are not used for the results presented below. 

• Tracking quantities 

- The variance Wj acks (for multi -prong candidates only), defined as 

I(AT7^ track ) 2 -^ r track _ (LAT7^ track ^ r track ) 2 
^tracks £pr track (LPr track ) 2 ' 

- The invariant mass of the tracks system (for multi-track candidates), m t rkip, 

- The number of tracks in the isolation cone. 

• Calorimetric quantities 

- The electromagnetic radius of the x^ad candidate, R x 6m , as defined in Eq. (1) for the calorimeter- 
based algorithm, but calculated from cells around the seed belonging to the first three sam- 
plings of the electromagnetic calorimeter only (presampler, strips and middle layer). 

- The number of X] strips, N^ ps , with energy deposits above a certain threshold. 

- The width of the energy deposit in the strips, as defined in Eq. (3) for the calorimeter based 
algorithm but calculated in the core cone only. 

- The fraction of the transverse energy, frac£y S12 , deposited in a cone of radius 0.1 < AR < 0.2 
with respect to the total energy in a cone of AR = 0.2. Cells belonging to all layers of the 
calorimeter are used: 

= E^(iP-<o.2)-E^(y--<ai) 

T ZEf\R z ' cen < 0.2) V ' 

- The transverse energy, Ej om , at the EM scale deposited inside the core cone. 

- The transverse energy, E l j° l and Ef olHAD , at the EM scale, deposited inside the isolation 
cone. 

• Tracking and calorimetric quantities 

- The ratio of transverse energy deposited in the hadronic calorimeter in the core region (at the 
EM scale), £'£ hrgHAD , with respect to the sum of the transverse momenta of the tracks. 
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Figure 14: The distributions for signal and backgrounds for the visible mass m e J' ow and ratio of the 
transverse energy in the isolation and core region Ej° l /Ej ore for single -prong candidates, and variance 
^tracks an< ^ invariant mass of the track system m trk3p for three-prong candidates. Distributions are shown 
for the candidates in the transverse energy range Ej = 20 — 40 GeV. 

- The visible mass m e J'" w calculated from cells used for the energy-flow calculation and tracks. 
In case of multi-prong candidates, where this mass is smaller than that calculated from the 
four-momenta of the tracks, the invariant mass of the track system is taken instead. 

In Fig. 14 as an example the distributions for signal and backgrounds for the m e J L l" w , the ratio 
Ej° l /Ef re for single-prong candidates, the variance Wj acks and the invariant mass for three-prong 

candidates are shown. Note that the m e v {j ow distribution shows a double peak structure coming from 
T — >• 7T ± V and T — >• p(al)v decays. The separation for candidates with and without 71° clusters was not 
done for the distribution shown. 

4.2.3 Overall efficiency and rejection 

The identification step is done by calculating discriminants using basic cut methods, cut methods opti- 
mized by the TMVA package [20], multi-variate analyses based on neural network technique, and from 
PDRS discrimination [21]. 

The rejection power expected from the identification step only is quite modest, given that a quite 
good rejection is already achieved in the reconstruction step. The overall performance is summarized 
in Table 5. For an efficiency of about 30% with respect to all hadronic decays in the energy range 
10 — 30 GeV, rejection rates of 200/360 for one-prong/three-prong hadronic T decays can be achieved 
with the cut based selection and of 500/700 with multi-variate selection techniques. 
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Table 5: Efficiencies and rejection rates for different discrimination techniques for the track-based al- 
gorithm for fixed efficiencies. The efficiencies are normalized to all hadronic x decays. The rejection 
rates are calculated with respect to jets reconstructed from true particles in the Monte Carlo. Events from 
Z — > xx signal samples and QCD dijets were used. The errors given are statistical only. 



Selection 


Efficiency 


Rejection 


Rejection 


Rejection 


Rejection 






cuts 


TMVA cuts 


NN 


PDRS 


|| E T = 10-30 GeV 


one-prong 


0.33 


225 ± 10 


435 ± 30 


510 ±40 


460 ± 40 


three-prong 


0.28 


360 ± 25 


470 ± 40 


740 ± 70 


670 ± 60 




E T = 30-60 GeV 


one-prong 


0.42 


140 ± 10 


170 ± 10 


440 ± 40 


320 ± 30 


three-prong 


0.45 


60 ±2 


90± 10 


160 ± 10 


130 ± 10 



Table 6: Rejection against jets from Monte Carlo true particles for a 30% efficiency and separately for 
the one-prong (lp) and three-prong (3p) candidates. The efficiencies are normalized to true hadronic x 
decays. For the signal Z — > xx events and events from bbH, H — > xx with m# = 800 GeV were used; for 
the background QCD dijet-samples were used. The errors given are statistical only. 



Algorithm 


E T = 10-30 GeV 


E T = 30-60 GeV 


£ T = 60-100 GeV 


E T > 100 GeV 


Track-based 
(neural network) 


lp: 740 ± 70 
3p: 590 ± 50 


lp: 1030 ± 160 
3p: 590 ± 70 






Calo-based 
(likelihood) 




lp: 1130 ±50 
3p: 187 ± 3 


lp: 2240 ± 140 
3p: 310 ±7 


lp: 4370 ± 280 
3p: 423 ± 8 



4.3 Comparison of the two algorithms 

Figure 15 shows the expected performance of the two algorithms, illustrated as curves describing the jet 
rejection versus the efficiency, separately for one and three-prong hadronic T-decays and for different 
ranges of the visible transverse energy. The jet rejections are computed with respect to jets reconstructed 
from true particles in the Monte Carlo. The rejections obtained are between a factor of two and ten higher 
for one-prong decays than for three-prong decays, depending on the algorithm and on the transverse 
energy range considered. For an efficiency of 30% for one-prong decays, the rejection against jets 
is typically between 500 and 5000, as illustrated more quantitatively and as a function of the visible 
transverse energy in Table 6. 

Figure 16 shows the normalized track-multiplicity spectra for hadronic x candidates with visible 
transverse energies above 20 GeV, from Z — > xx decays and from jets, as reconstructed by the track- 
based algorithm. The distributions are shown after the reconstruction step, after a cut-based identification 
algorithm and finally after applying a neural network discrimination. The track multiplicity in the jet 
sample is quite different from that in the signal sample, independently from the cuts applied. 

At the same time, Figure 16 indicates that the purity of one-prong and three-prong Xhad candidates 
improves in the signal sample since the expected fractions of single versus three prongs are reproduced. 
For one-prong (three-prong) candidates, the purity improves from 87% (74%) after reconstruction to 91% 
(86%) after cut-based identification and to 92% (93%) after applying the neural-network discrimination 
technique. 

Figure 16 also shows that using candidates with track multiplicities above three to normalize the 
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Efficiency Efficiency 

Figure 15: Expected performance for the track-based algorithm with a neural-network selection (left) 
and the calorimeter-based algorithm with the likelihood selection (right). The rejection rates against jets 
from Monte-Calo particles as a function of the efficiency for hadronic x decays for various ranges of the 
visible transverse energy are shown. For signal events Z — > xx and bbH,H — > xx with m# = 800 GeV 
were used, for the background QCD dijet samples were used. 




Track multiplicity Track multiplicity 



Figure 16: Track multiplicity distributions obtained for hadronic x decays with a visible transverse energy 
above 20 GeV and below 60 GeV using the track-based x identification algorithm. The distributions are 
shown after reconstruction, after cut-based identification and finally after applying the neural network 
(NN) discrimination technique for an efficiency of 30% for the signal (left) and the background (right). 
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Figure 17: Track multiplicity distributions obtained for hadronic T^-decays with visible transverse en- 
ergy above 20 GeV and below 60 GeV using the calorimeter-based z identification. The distributions are 
shown after reconstruction and after applying the likelihood discrimination technique (medium selection) 
for the signal (left) and the background (right). 

QCD background will allow a reasonably precise calibration of the performance using real data, provided 
the rejection against QCD jets is proven to be sufficient to extract a clean signal in the one-prong and 
three-prong categories. The sensitivity of such a method can be enhanced by also studying the track 
multiplicity outside the narrow cone used for T-identification and combining this information with that 
presented in Fig. 16. 

The corresponding spectra for the calorimeter-based algorithm are shown in Figure 17 after recon- 
struction and after applying the likelihood discriminant. The optimization of the likelihood results in a 
lower efficiency for accepting three-prong decays which biases the track multiplicity spectra. One should 
note that candidates with 1-3 tracks are accepted as good Zhad candidates. 

The application of these z identification algorithms to extract z signatures from physics processes 
and to reject the large backgrounds from QCD processes is discussed in Section 6. 

5 Fake-rates from QCD di-jet samples 

This study demonstrates a simple and generic method to determine the Zhad fake rate from jets in early 
data. Since fake rates are expected to be in the range of 10~ 3 to few 10~ 2 for low pi z leptons, and 
since the expected rate of QCD jets far exceeds the rate of hadronically decaying z leptons, a precise 
measurement of fake rates is expected to be crucial for many analyses, and also for a further optimization 
of the Zhad identification algorithms. In the following, the method and the results are described and 
statistical and systematic uncertainties are discussed. 

The method proposed here uses a very clean sample of QCD jets, with no significant contamination 
from true z leptons. This is achieved by selecting dijet events with two jets having similar pj and being 
back-to-back in 0 (see Fig. 18). One of the two jets is randomly chosen as the so-called 'tag jet', for 
which a cut on the number of tracks («Trk > 4 for pj < 50 GeV + 1 track for each additional 50 GeV 
interval in p T ) ensures that it is not a true hadronic Zhad decay. If this cut is fulfilled, the other jet, 
called 'probe jet', can be used to measure the fake rate from QCD jets 8 . This is performed both for the 
calorimeter-based and the track-based Zhad reconstruction algorithm, with identification according to the 
medium likelihood selection and cut discriminant, respectively. Note also that, in order to avoid a direct 
dependence on the trigger, the probe jet should be required to not have caused the event to be triggered. 

8 The fake rate is determined as the number of probe jets identified as x had divided by the number of probe jets. 
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Figure 18: Example of selections on a MC dijet sample, generated with 70 < pj < 140 GeV. The two 
jets have to fulfill A</> > (n — 0.3) in order to be back to back in <j> (left) and have similar pj values (right). 

Table 7: The Xhad fake rate from QCD jets and its statistical uncertainty for the available Monte Carlo 
statistics and for expected 100pb _1 of data in bins of pj for both x^ad reconstruction algorithms. 





Calorimeter-based algorithm 


Track-based algorithm 


p T range 


MC stat. 


Expected stat. error 


MC stat. 


Expected stat. error 


(GeV) 


(%) 


for 100 pb" 1 (%) 


(%) 


for 100 pb" 1 (%) 


15-40 


2.3 ± 0.3 


±0.02 


2.5±0.5 


±0.02 


40-80 


5.2 ± 2.2 


±0.01 


6.7±2.2 


±0.01 


80-120 


0.5 ± 0.2 


± 0.001 


1.8±0.6 


± 0.002 


120-160 


0.2 ± 0.2 


± 0.002 


1.4±0.6 


± 0.004 



This method, which achieves low statistical uncertainties even for small datasets, only relies on the 
dijet and tag jet selection to acquire a clean QCD jet sample and it does not depend, to first order, on 
the number of true x leptons present in the sample nor on the Xhad efficiencies. Also, the selection can 
be easily adapted to select probe jets in an environment similar to the one of any given physics analysis 
using Xi m d candidates. 

To perform these studies, Monte Carlo dijet samples (generated in various pj ranges) and samples 
containing true x leptons (Z — > xx, W — ► XV) were used. Proper weighting of the samples, including 
trigger prescales for running at Jz? = 10 31 cm _2 s _1 , has been applied. To cross-check the method, it has 
been verified that there is very good agreement between the values obtained with all selected jets and 
those coming from jets matched to Monte Carlo particles jets only, which indicates a very high jet purity. 

The numerical results of the fake rate determination using the available Monte Carlo statistics for the 
two algorithms can be found in Table 7. Systematic uncertainties (discussed below) are not included. 
Note that the uncertainty in the range 40 < pj < 80 GeV, for both algorithms using available Monte 
Carlo statistics, is dominated by one dijet event (with large weight) in one of the Monte Carlo samples. 
More interesting are the expected statistical uncertainties in data, which are at the percent or sub-percent 
level for 100 pb~\ and even for 10 pb _1 of integrated luminosity. This demonstrates the relevance of 
this method for very early data. 

Given the statistical precision of the fake rate results, this measurement of the systematics of other 
measurements is going to be limited by its own systematic uncertainties. In the following, an outline of 
the necessary systematic studies is given, including results where possible. 

The presence of true hadronically decaying x leptons in the selected Monte Carlo sample is not 
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statistically significant enough to alter the results within their uncertainties. When more statistics are 
available, tighter cuts on the tag side can be applied in order to remove more efficiently these types of 
events. 

A slight tendency was observed for the jet samples with lower hard scattering transverse momenta, to 
show higher fake rates. This can be explained by the fact that the jet characteristics depend on the degree 
of parton showering. Also, jets from gluons are wider and on average have higher tracks multiplicity 
than jets from quarks. Then, the fake rate of gluon-jets should be smaller than that of quark-jets. Data 
triggered with the jet-triggers, that has more gluon-jets than quark-jets, and with single photon triggers, 
where the quark-jets very likely will be dominant, can be used to understand this effect. However, this 
type of systematic uncertainty will be small as long as the distributions of observable quantities (such as 
the number of tracks and the jet isolation) of the probe jets studied for the fake rate is comparable with 
the properties of the jets faking hadronically decaying x leptons in the physics analysis. 

In case there is a physical correlation between the properties of the tag and the probe jet, the selection 
of the tag jet would directly influence the properties of the probe jet, and hence distort the results. As 
an example of a possible observable correlation, the number of tracks per jet was studied and it has been 
found that the correlations were in the sub-percent range. Hence it is concluded that given the current 
knowledge from Monte Carlo, this source of uncertainty can be neglected. For data, this can be revisited. 

6 Tau leptons in Standard Model processes 

The goals for early x physics in ATLAS include acquiring a sample of x leptons from data with a pu- 
rity as high possible so that the x had identification efficiency can be measured and the simulation can 
be tuned. Collecting data with an integrated luminosity of 100 pb -1 at an instantaneous luminosity of 
10 31 cm _2 s _1 will provide a unique opportunity to access and understand statistically significant x sam- 
ples from Standard Model processes at relatively low transverse momenta. Processes, like the production 
of W and Z bosons and top quark pairs with their huge cross section will lead to samples of a few hun- 
dred to a few thousand identified hadronic x decays. Hadronically decaying x leptons will then become 
a well understood probe for discovery physics like searches for Higgs bosons, SUSY, or unexpected phe- 
nomena. Below we present feasibility studies for analyses which can be envisaged with an integrated 
luminosity of 100 pb -1 . 

6.1 The W — >■ TV inclusive production 

The W — > XV signal will be produced with o x BR = 1.7 • 10 4 pb, and will be dominated by events with 
low pr of the W-boson resulting in soft x leptons with low missing transverse energy. The expected 
cross-section of the dominant background from hadronic jets is ~ 10 10 pb (calculated for hard-scattering 
phard > g Q e y), about 6 orders of magnitude larger than the signal production. 

The analysis is very sensitive to the performance of the hadronic T-trigger [22]. These events will 
have to be triggered with a x+E™ ss trigger, with a configuration adequate to fit into the allowed budget 
for trigger rates. Given the fact that the physics motivation is to get signal events with x^ad candidates 
at low transverse momenta, the present base-line configuration for this analysis is to use t20i+EFxE30 9 , 
with the Ej us trigger applied only at the Event Filter level. The present trigger optimization gives an 
overall ~ 70% trigger efficiency with respect to off-line analysis. 

The signal will be extracted requiring one identified hadronic x decay with transverse energy Ej = 
20 — 60 GeV and observing the characteristic track multiplicity spectrum of identified hadronic x decays. 
In the present study the track-based algorithm was used with the medium identification, corresponding 

9 ln this notation, T20i+EFxE30 denotes a trigger which requires at least one Xhad candidate with a transverse energy above 
20 GeV and "EFxE30" is short for Ef ss > 30 GeV at the Event Filter level. 
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to rejection of 700-1000 for the single- and 600 for the three-prong x selection against jets and 30% effi- 
ciency for true hadronic X/ m d decays (see Table 6). The overwhelming background from QCD events will 
be further suppressed by vetoing events with an additional isolated electrons or muons and by increasing 
the threshold on Ef lss . An additional handle is to select only events with the topological configuration 
of the Xhad, Ef lss or additional jets optimal for suppressing events with large fake Ej lss , i.e. excluding 
those where the Ef lss direction is close to the direction of the identified Xhad or an additional jets (see 
Ref. [23]). 

The QCD background has been estimated with a mixture of full and fast simulation, taking into 
account the necessary corrections for the different slopes in the Ef lss distribution in the full and fast 
simulations. It should be noted also that predictions for this overwhelming background are subject to 
large uncertainties. 

An important background is also expected from W — > ev events, where an electron passes the 
hadronic T-trigger selection criteria. This channel, with an initial production cross-section of the same 
order as the signal, but contributing almost exclusively to the single-prong mode, will exceed the signal 
rates by some factor, before a dedicated electron veto is applied to the single-prong candidates. With the 
expected performance of such an algorithm, as discussed in Section 3.2.1, this background can be effi- 
ciently suppressed, also providing a control channel for the topology of the hadronic part of the W — > XV 
events passing the trigger and offline selections. Backgrounds from Z — >• xx, tt,Z — >• ee,W — > jiv events 
were also considered and it was estimated that they will be suppressed with the offline selection below a 
few percent of the signal. 

Table 8 summarises the expected event yield at the various stages of the selection. With an E™ ss 
threshold of 50 GeV, the expected signal-to-background ratio is 1 : 1 and about 3240 signal events would 
be observed. Increasing the threshold to 60 GeV would reduce the number of accepted signal events to 
1550 but increase the signal-to-background ratio to 3:1. Figure 19 shows the expected track multiplicity 
spectrum after the final selection. The optimisation of this selection can be performed using control sam- 
ples, i.e. W — >• ev events extracted from the same filter stream, hence modeling of the W-recoil part of the 
event will be possible directly from the data. The final optimisation of the chosen efficiency/rejection and 
offline threshold on Ef' ss will have to be tuned with data, given the large uncertainties on Monte Carlo 
predictions for the background from hadronic jets. The control on the QCD background normalization 
will be possible with fake Xhad with track multiplicity above 3, i.e in the signal-free region. 

Table 8: Expected number of events in 100 pb _1 of data for signal and background after subsequent steps 
of the selection. The track-based algorithm has been used for x^ad reconstruction. The QCD background 
has been estimated combining fast and full simulation. Given are the expected number of events of track 
multiplicity one to three, i.e. contributing to signal region only. 



Selection 


W ^xv 


W ^ev 


W ^jUV 


QCD dijet 


it, Z — > ee, Z — > xx 


Trigger t20/+EFxE30 


8.8-10 4 


6.1-10 5 


3.2-10 4 


4.8-10 8 


3.0-10 5 


Identified x + Ef lss > 30 GeV 


2.0- 10 4 


2600 


200 


3.0 -10 b 


1600 


Ef ss > 50 GeV 


4200 


530 


90 


5.0- 10 4 


550 


Veto fake E™ lss topology 


3600 


500 


80 


1.8-10 4 


150 


Require jet pj > 15 GeV 


3240 


450 


60 


3200 


80 


Increase to Ef lss > 60 GeV 


1550 


150 


25 


500 


30 
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Track multiplicity Track multiplicity 



Figure 19: The track multiplicity spectrum of accepted % a d candidates after selection as described in the 
text with thresholds respectively E™ ISS > 50 GeV (left) and E™" s > 60 GeV (right). The expected event 
numbers are given for an integrated luminosity of 100 pb _1 . 

6.2 The Z — ► tt inclusive production 

The inclusive Z — > tt process will provide a ten times lower rate compared to W — ► TV, but will have 
more robust prospects for analysis. It will be possible to cross-check channels with e Xhad and }X Xhad 
final states to control the background, comparing the number of events observed in the same-sign and 
opposite-sign samples. Moreover, events will be primarily triggered with lepton triggers providing an 
unbiased sample of hadronic x decays which could also serve to understand efficiencies of the hadronic 
T-trigger. The measured cross-section for the Z — > xx process will be an excellent check on the x iden- 
tification efficiencies, while the lepton identification and trigger efficiencies will be measured first from 
Z — >• U channels. A measurement of the visible mass of the i Xf, a d system at low background levels will 
have sensitivity to the energy scale of the reconstructed T/^s. 

The analysis presented here is designed to select in the first 100 pb _1 of data a sufficient number of 
Z — > xx — ► £vv XhadV events with very low background, which then can be used to determine the Xhad 
energy scale from the reconstructed Hhad visible mass and to determine the Ef' ss scale [23] from the 
reconstructed complete invariant mass of the xx pair (including neutrinos). Events, which have been 
selected by the single electron or single muon trigger stream are analysed and as a first step an isolated 
lepton (electron or muon) with pj > 15 GeV is required. Then, the set of basic selection cuts is applied. 
It requires a missing transverse energy E™ m > 20 GeV (to suppress Z — > tl and QCD backgrounds), 
transverse mass of the lepton and E™ 1SS system rnj, Ej < 30 GeV (against W — > £v background), total 
transverse energy deposited in the calorimeter LEj aI ° < 400 GeV (against tt and QCD backgrounds) and 
finally no identified b-jet (against tt and QCD backgrounds). In the next step, events with an identified 
?had with pr > 15 GeV are selected. The track multiplicity of identified Xhad is required to be one or three. 
In addition the angular separation between the isolated lepton and X is imposed, requiring A(f>(£, Xhad) to 
be in the ranges between 1.0 - 3.1 or 3.2-5.3 . 

The analysis was performed using Xhad reconstructed with the calorimeter-based algorithm and the 
identification with the likelihood discriminant. The thresholds on the likelihood discriminant were opti- 
mized for this analysis and correspond to an overall efficiency of ~ 35% with respect to all hadronic x 
decays. The QCD background has been estimated with a mixture of full and fast simulation which as- 
sumes uncorrected probabilities for a jet to produce an isolated lepton candidate (predominantly leptons 
from heavy flavor decays with a small contribution from fakes in the electron case) and for the second 
jet to produce a fake Xhad candidate. 

Table 9 gives the expected number of signal and background events passing the selection criteria for 
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Figure 20: Left: The reconstructed visible mass of the {tXhad) pair for Z — ► xx decays (solid line) and 
QCD, W —>■ £v, Z — > ££ backgrounds (dashed line). Right: The reconstructed visible mass of the (£Xj wl j) 
pair from Z — > xx decays as a function of the Xi m d energy scale (right). The dashed lines correspond 
to ±la and ±3 a with respect to the reconstructed peak position. The results were obtained with the 
calorimeter-based algorithm. 

Table 9: Expected number of events in 100 pb _1 of data for signal and background after reconstruction 
of the X candidate with the calorimeter-based algorithm and after application of the selection cuts for the 
Z — > xx channel. The QCD background has been estimated combining fast and full simulation. 



Selection 


Z^xx 


W->£v 


QCD dijet 




Z^££ 


Isolated lepton 


1.5-10 4 


^•lO 5 


1.110 7 


2.6- 10 4 


2.2- 10 3 


Ef lss > 20 GeV 


4750 


14.3-10 5 


3.2-10 3 


2.4- 10 4 


1.0- 10 4 


mf"" < 30GeV 


3200 


2.6- 10 4 


1.8-10 5 


3650 


3200 


LE T < 400 GeV 


3000 


2.4- 10 4 


1.7-10 3 


1280 


2800 


b-jet veto 


2780 


2.4- 10 4 


2.7-10 4 


135 


2600 


x had -id + A<t>(lx had ) cuts 


630±30 


210 ±10 


74±11 


10±2 


30±5 


OS events, m 1 ^ = 37-75 GeV 


520±30 


45 ±5 


29±5 


<5 


10 ±5 



a data sample of 100 pb _1 . About 520 signal events are expected in the visible mass m^ had window 
between 37 — 75 GeV. The expected background levels are 10% from W —> £v events and about 5% 
background from QCD events. The reconstructed visible mass of the £Xf, a d pair is shown in Figure 20 for 
opposite- sign events. 

This selection for Z — > xx events will provide access to signal-suppressed and signal-enriched sam- 
ples. The same-sign events (e Xhad and jl X] W( s) will be essentially signal-free. This will allow a study 
of Xhad identification and mistagging efficiencies. The mistagging efficiency will be estimated from the 
ratio of accepted to all candidates in different categories (one-prong with and without tt° subclusters, 
multi-prong). Then, this estimate can be used to predict the background component in opposite-sign 
events and to tune the Monte Carlo predictions for the identification variables. It will allow a confirma- 
tion of the overall consistency and an estimate of the relative error on the background predictions in the 
signal enriched sample. Finally, a measurement of the cross-section for Z — > xx events relative to the 
Z — > ee,\i\L channels will provide a cross-checks on the associated efficiencies. 

Once the lepton energy scale is determined with the very first data, the selected Z — > xx events can 
be used to determine the X] w d energy scale in-situ. Subtracting the estimated background from opposite- 
sign events, as measured with the same-sign events, will allow for better estimates on the energy scale 
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Table 10: Expected number of events in 100 pb _1 of data for tt — »■ W(£v)W(x had , v % )bb signal and 
background after subsequent steps in the selection. The track-based algorithm has been used for Xhad 
reconstruction. 



Selection 




W->£v + 3jets 


single t 


It + 2 jets 


Isolated lepton p T > 20 GeV 


1300 


3.9 TO 5 


4300 


630 


Identified x had p T > 15 GeV 


190 


22000 


210 


120 


1st jet E T > 50 GeV, 2nd jet E T > 30 GeV 


170 


4000 


170 


35 


E™ ss > 25 GeV 


150 


3400 


150 


15 


ZE T > 250 GeV 


150 


1750 


130 


10 


Opposite-sign events 


130 


850 


54 


< 10 


1 b-jet tag 


67 


28 


20 





from the shape of the distribution of the visible mass. In Fig. 20 the sensitivity of the measured visible 
Z boson mass, as obtained from the reconstructed x pairs, on the absolute X energy scale is shown, 
assuming only signal events.The statistics correspond to 100 pb _1 of data. Taking into account only the 
statistical uncertainties, the Xhad energy scale could be determined with a precision of ~ 3%. 

6.3 The T leptons from it production 

With a cross-section of 833 pb [24], about 16500 events are expected in 100 pb _1 with a W boson de- 
caying into a x lepton. Due to the increased center of mass energy available at the LHC the cross section 
for tt production increases by nearly two orders of magnitude over what is available at the Tevatron. The 
tt channel is discussed here as an additional source for x leptons from the SM processes, supplementing 
samples expected from W — ► XV and Z — > xx process. 

The decay chain tt —>■ W (qq')W (XhadV)bb requires events triggered using x + Ef us triggers, x triggers 
and multi-jets triggers. In the latter case this will lead to an unbiased sample of Xhad candidates. The 
event is required to have at least two light quark jets, two ^-tagged jets, and an identified hadronic x 
decay. If the event has more than two light quark jets, the pair with the invariant mass closest to the 
nominal mass of the W bosons is chosen which is then combined with the closest b jet to constitute 
the hadronically decaying top quark. With 100 pb -1 of data about 300 tt —>■ W (qq')W (XhadV)bb signal 
events with S:B of 20:1 are expected. These events can be used to study the Xhad reconstruction and 
identification performance and to commission the x trigger. The pj range of identified x leptons will 
be complementary to that available from the inclusive W and Z boson production. A more detailed 
description of the analysis is included in Ref. [25]. 

The decay chain tt — >• W(ev e ,nv^)W(Xhad, v T )bb is also interesting for both its physics potential and 
the possibility of using this channel to understand Xhad identification. These events will be triggered 
with single lepton triggers and the main background will come primarily from W(—> ^v)+jets, single top 
production and from Z(— > TT)+jets production. The analysis requires an isolated lepton and identified 
?had- To suppress backgrounds from W+jets and Z+jets events it requires two additional high Ej jets, 
significant energy deposition in the calorimeter YEj > 250 GeV and Ef lss > 25 GeV. Additional back- 
ground suppression can be achieved by requiring that one or two jets are b-tagged. Table 10 summarizes 
cut flow of the analysis. 

In the first 100 pb -1 of data, 54 ±4 signal events in the eX] m d channel are expected, with a signal- 
to-background ratio (S:B) 1:10. The use of b-tagging rejects considerably the dominant W + 3 jets 
background (see Fig. 21). If at least one tight b-tag jet is required the expected number of signal events 
decreases to 28 ±3 with S:B improving to 1:1. When requiring a jet in the event that passes the tight b 



257 



Tau Leptons - Reconstruction and Identification of Hadronic t Decays 




b-tagging weight b-tagging weight 

Figure 21: Combined ^-tagging weights using impact parameter and secondary vertex information for 
the first two leading E T jets, both in tt —> W(ev e ,ljLV^)W(XhadVT:)bb and W + 3 jets background. The e X 
(/I x) channel is shown on the left (right). The cut value of 7 on the b-tagging weight is indicated with 
the arrows. An integrated luminosity of 100 pb _1 of data is assumed. 

tagging criteria, the dominant source of background is still £v)+jets, however the composition of 
the background changes and single top production starts contributing significantly, with quark or gluon 
jets faking Xi w ,d and true b-jets from b-quark fragmentation. 

7 Summary 

Two complementary algorithms for the identification of hadronic x decays in the ATLAS experiment 
have been developed. The first one (calorimeter based) is seeded from a reconstructed cluster in the 
calorimeter, the second one (track based) relies on seeds built from reconstructed tracks in the inner 
detector. Several discrimination methods have been established, including a simple cut-based selection 
as well as multivariate selections based on likelihood, neutral network, and probability range search 
techniques. Rejection factors against jets from QCD processes of a few hundred up to a few thousand 
can be achieved for a x efficiency of 30% in the pj range between 10 to 60 GeV. In addition, a dedicated 
algorithm has been developed to reject electrons that pass the x identification criteria. In the low energy 
range, rejection factors of the order of 50 and higher against electrons from W and Z bosons decays are 
achieved at the expense of a 5% efficiency loss for hadronic x decays. 

It has also been estimated that the expected performance in the ATLAS experiment will be adequate 
to extract x signals in early LHC data from W — > TV and Z — > xx decays. These signals are important to 
establish and calibrate the x identification performance with early data. The study of dijet events from 
QCD processes will allow a determination of x fake rates. It is expected that such rates can be measured 
with a statistical precision at the percent level or better already with data corresponding to an integrated 
luminosity of 100 pb _1 . 
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Abstract 

This section summarizes the general aspects of jet reconstruction with the AT- 
LAS detector. General but brief descriptions of the available jet algorithms are 
provided, together with a discussion of the performance expectations for the 
various algorithm configurations in different detector regions and for different 
physics environments. The emphasis is on realistic estimates for the initial jet 
reconstruction performance, determined in the absence of experimental data. 
The corresponding expectations for important jet reconstruction parameters 
like signal linearity and uniformity, the relative energy resolution, and the jet 
reconstruction efficiency and purity, are presented. 

1 Introduction 

High quality and highly efficient jet reconstruction is an important tool for almost all physics analyses 
to be performed with the ATLAS experiment at the Large Hadron Collider (LHC) at CERN. The re- 
quirements especially for the absolute precision on the jet energy scale often exceed the corresponding 
achieved performance in previous experiments. Typically, an absolute systematic uncertainty of better 
than 1% is desirable for precision physics like the measurement of the top quark mass, and the recon- 
struction of some SUSY final states. 

The principal detector for jet reconstruction is the ATLAS calorimeter system, with its basic compo- 
nents depicted in Fig. 1. It provides near hermetic coverage in a pseudorapidity range —4.9 < tj < 4.9. 
The technology choices are well suited for high quality jet reconstruction in the challenging environ- 
ment of the proton-proton (pp) collisions at y/s = 14 TeV at the LHC. The electromagnetic liquid ar- 



Tile barrel Tile extended barrel 




Figure 1: The ATLAS calorimeter system. 
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gon/lead calorimeters feature an accordion geometry homogeneous in azimuthal coverage for 1 17 1 < 3.2. 
The hadronic calorimeters surrounding them are iron with scintillating tile readout in the central region 
(|tj| < 1.7) and parallel plate liquid argon/copper in the endcap region (1.7 < \r}\ < 3.2). The forward 
region is covered by liquid argon/copper and liquid argon/tungsten calorimetry with a tubular electrode 
readout accommodating the high ionization rates expected at LHC at design luminosity. The readout 
of the calorimeters is highly granular for the electromagnetic devices, with typically three longitudinal 
segments with varying lateral cell sizes, e.g. AT7 x A(j) = 0.025 x 0.025 in the second segment con- 
taining the electromagnetic shower maximum. The hadronic calorimeters are coarser, with typically 
A 77 x A0 = 0.1 x 0.1, but have also at least three longitudinal segments. The total number of readout 
cells in the ATLAS calorimeter system is close to 200, 000. The total thickness of the ATLAS calorimeter 
system for hadrons is at least 10 absorption lengths over the whole acceptance region. More details on 
the calorimeters, and any other detectors in ATLAS, can be found in Ref. [1]. 

In this note the approaches used by the ATLAS collaboration to achieve the challenging perfor- 
mance goals are discussed. First, the most commonly used jet finders are briefly introduced in Section 2, 
together with the theoretical and experimental guidelines for the ATLAS implementations. Then, the ex- 
pectations for the performance of these algorithms using different detector signals are shown in Section 3. 
As the focus is on the upcoming initial data taking period of ATLAS, some distortions in the detector 
alignment and material distributions have been included but not corrected in the results presented here. 
Finally, special challenges to jet reconstruction like forward going jets and jets in minimum bias events 
are discussed in Section 4, followed by conclusions in Section 5. 

2 Jet algorithms in ATLAS 

In general an attempt is made to provide implementations of all relevant jet finding algorithms in AT- 
LAS. These include fixed sized cone algorithms as well as sequential recombination algorithms and an 
algorithm based on event shape analysis. This approach is a response to the fact that there is no universal 
jet finder for the hadronic final state in all topologies of interest. For example, for the measurement of the 
inclusive QCD jet cross-sections wider jets are typically preferred to capture the hard scattered parton 
kinematics, including possible small angle gluon radiation, completely. On the other hand, to reconstruct 
a W boson decaying into two jets or to find jets in very busy final states like ft production or possible 
SUSY signatures, narrow jets are preferred. 

The common feature of all jet finder implementations in ATLAS is full four-momentum recombina- 
tion whenever the constituents of a jet change, either through adding a new constituent, or by removing 
one, or by changing the kinematic contribution of a given constituent to the jet. Also, in the ATLAS re- 
construction software framework Athena, the same jet finder code can be run on objects like calorimeter 
signal towers, topological cell clusters in the calorimeters, reconstructed tracks, and generated particles 
and partons. 

In this section the basics of the present default jet algorithms used in ATLAS are discussed after 
a brief summary of the theoretical and experimental guidelines for jet algorithm implementation. In 
addition, some features of jet finders not included in the more comprehensive presentation of jet recon- 
struction performance in Section 3 are shown. 

2.1 Guidelines for jet reconstruction 

The basic guidelines for jet reconstruction in ATLAS have been extracted from Ref. [2]. They also 
reflect the concept of jet definition discussed in Ref. [3], which is an attempt to provide a common 
understanding between experiments and theory on how a given jet finding strategy should be specified to 
assure the highest level of comparability between the results from various sources. 
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2.1.1 Theoretical guidelines 

The major theoretical guidelines for jet reconstruction are: 

Infrared safety: The presence of additional soft particles between two particles belonging to the same 
jet should not affect the recombination of these two particles into a jet. In the same sense, the 
absence of additional particles between these two should not disturb the correct reconstruction of 
the jet. Generally, any soft particles not coming from the fragmentation of a hard scattered parton 
should not effect the number of jets produced. 

Collinear safety: A jet should be reconstructed independent of the fact that a certain amount of trans- 
verse momentum is carried by one particle, or if a particle is split into two collinear particles. 

Order independence: The same hard scattering should be reconstructed independently at parton-, par- 
ticle- or detector level. 

Note that from the perspective of experimental data the particles mentioned in these guidelines can, 
to a point, be replaced by four-momentum type objects reconstructed from detector signals, see e.g. 
Section 2.5 and Section 2.6 below. 

2.1.2 Experimental guidelines 

Additional aspects of jet reconstruction in ATLAS include features also reflected in the design of the 
detector. They can be divided into three classes. 

Detector technology independence: The reconstructed jet and its kinematic variables should not de- 
pendent on the signal source, i.e. all detector specific signal characteristics and inefficiencies must 
be calibrated out or corrected as much as possible. 

Detector resolution: contributions from the finite spatial and energy resolution must be at a min- 
imum; 

Detector environment: effects from the detector environment like electronics noise, signal losses 
in un-instrumented (inactive) materials and cracks between detectors must be at a minimum; 

Stable signals: the detector signal reconstruction and calibration must provide a stable input sig- 
nal to jet reconstruction. 

Environment independence: The jet reconstruction environment is characterized by the additional ac- 
tivity in the collision event due to multiple interactions and pile-up, the source of the jet, the 
underlying event activity, and other features of the pp collisions at LHC. 

Stability: a jet should be found and reconstructed safely even in the case of changing underly- 
ing event activity and changing instantaneous luminosity, thus changing number of multiple 
interactions; 

Efficiency: all physically interesting jets from energetic partons must be identified with high effi- 
ciency. 

Implementation: The jet algorithm implementation must be fully specified in that the jet definition, 
which consists of the jet finder and its configuration together with the choice of kinematic recom- 
bination given in Ref. [3], must be complete. Also included must be all selections and, if important 
for the measured jet, the signal choices. In addition, the implementation of the jet reconstruction 
must make efficient use of computing resources, i.e. it must be sufficiently fast and avoid excessive 
memory consumption. 
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The most commonly used jet finder implementations in ATLAS are a seeded fixed cone finder with split 
and merge (see below), and a kj algorithm [4, 5] implementation, with an initial implementation as 
described in Ref. [6], but later replaced by a faster implementation similar to the one in the FastJet 
package [7]. It is anticipated that for the first experimental collision data all implementations of the 
FastJet library (kj, anti-^T, Cambridge flavor kj [8]) will be available, as well as the seedless infrared- 
safe cone algorithm SISCONE [9]. As there are no complete systematic evaluations of these algorithms 
available for this note, they are excluded from further discussions here. 



2.2 Fixed cone jet finder in ATLAS 

The ATLAS implementation of the iterative seeded fixed-cone jet finder follows the algorithm description 
of Ref. [2]. First, all input is ordered in decreasing order in transverse momentum pj. If the object with 
the highest p T i s above the seed threshold, all objects within a cone in pseudorapidity T7 and azimuth 
0 with AR = ^Atj 2 + A0 2 < R con e, where R cone is the fixed cone radius, are combined with the seed. 
A new direction is calculated from the four-momenta inside the initial cone and a new cone is centered 
around it. Objects are then (re-)collected in this new cone, and again the direction is updated. This 
process continues until the direction of the cone does not change anymore after recombination, at which 
point the cone is considered stable and is called a jet. At this point the next seed is taken from the input 
list and a new cone jet is formed with the same iterative procedure. This continues until no more seeds 
are available. The jets found this way can share constituents, and signal objects contributing to the cone 
at some iteration maybe lost again due to the recalculation of the direction at a later iteration. 

This algorithm is not infrared safe, which can be (at least) partly recovered by introducing a split 
and merge step after the jet formation is done. Jets which share constituents with more than a certain 
fraction / sm of the pj of the less energetic jet are merged, while they are split if the amount of shared pj 
is below / sm , with f sm = 0.5 in ATLAS. Other important parameters of the ATLAS cone jet finder are a 
seed threshold of pj > 1 GeV, and a narrow (R cone = 0.4) and a wide cone jet (R cone = 0.7) option. 

From a theoretical standpoint this particular cone jet finder is by design only meaningful to leading 
order for inclusive jet cross-section measurements and final states like W /Z + 1 jet, but is not meaningful 
at any order for 3-jet final states, W/Z + 2 jets, and for the measurement of the dijet invariant mass in 2 
jets +X final states [10]. 



2.3 Sequential recombination algorithms 

The default implementation of a sequential recombination jet finder in ATLAS is the kj algorithm. Here 
all pairs ij of input objects (partons, particles, reconstructed detector objects with four-momentum repre- 
sentation) are analyzed with respect to their relative transverse momentum squared, defined by 

ARl Aij/j + A^j? 

= min(p Tii , Ptj)^t = min (PT,i,PTj) — ~jp — > 

and the squared p T of object i relative to the beam d t = p\ •. The minimum <i min of all dy and d t is found. 
If d min is a dy, the corresponding objects i and j are combined into a new object k using four-momentum 
recombination. Both objects i and j are removed from the list, and the new object k is added to it. 

If dmin is a di, the object / is considered to be a jet by itself and removed from the list. This procedure 
is repeated for the resulting new sets of dij and di until all objects are removed from the list. This means 
that all original input objects end up to be either part of a jet or to be jets by themselves. Contrary to the 
cone algorithm described earlier, no objects are shared between jets. The procedure is infrared safe. As 
it does not use seeds, it is also collinear safe. The distance parameter R, which is the only free parameter 
besides the choice of recombination scheme in this inclusive implementation of the kj algorithm, allows 



265 



Jets and Missing E t - Jet Reconstruction Performance 



Table 1: Default jet finder configurations used in ATLAS. 



Algorithm 


Main parameter 


Clients 


Seeded fixed cone 


R cons = 0.4 


W -► jj in ft, SUSY 


(seed p T > 1 GeV) 


#cone = 0.7 


inclusive jet cross-section, Z' — > jj 


k T 


R = 0A 


W -f jj in r/, SUSY 




R = 0.6 


inclusive jet cross-section, Z' — ► 77 



some control on the size of the jets. Default configurations in ATLAS are R = 0.4 for narrow and R = 0.6 
for wide jets. 

Table 1 summarizes the algorithms and configurations which have been used by ATLAS for basically 
all pre-collision physics studies. Thus, these are the base for most predictions related to the performance 
of the hadronic final state reconstruction for all studied physics channels available to date. 

2.4 Alternative jet finders 

As jet finders others than the "default" algorithms and configurations discussed above can be more appro- 
priate for the precision analysis of specific final states, additional jet algorithms are available in ATLAS 
for application at the analysis stage. Those are the mid-point variant of the fixed cone algorithm originally 
introduced by CDF in Ref. [2, 11], and the "optimal jet finder" discussed in Ref. [12]. Both algorithms 
have been studied for ATLAS and in general provide a very similar performance when compared to the 
default seeded cone and kj implementations. This can be seen in Fig. 2 for the mid-point algorithm, 
which, as already said, is a flavour of the fixed cone algorithm with the modification that the seeds are 
placed between two particles with significant pj, rather than just using an individual particle pj as seed 
directly. Besides a slightly smaller efficiency in the central region, there are no significant differences 
between the studied jet finders observed in this non-comprehensive investigation with simulated ttbb 
events. 

The optimal jet finder is a departure from the traditional approach of reconstructing each jet rather 
independent from previously reconstructed jets in the same event. Here the basic scheme is to calculate 
a weight for each particle reflecting the contribution to any jet by minimizing a test function event by 
event. This function actually includes weights for contributions to transverse momentum not clustered 
into any jet at all, thus using the overall event shape when reconstructing the jets. Parameters of the 
algorithm are a jet cone size and a threshold for the test function. A more exclusive mode is available 
where the number of jets to be reconstructed can be fixed beforehand. In general this jet finder works 
well in busy final states like full hadronic top-quark decays in it production. Predictions for the relative 
top mass resolution, for example, are basically identical for the default kj implementation with R = 0.6 
and the optimal jet finder for the same jet size. 

2.5 Calorimeter jets 

The most important detectors for jet reconstruction are the ATLAS calorimeters. In this section the input 
signals and default calibration schemes for calorimeter jets are briefly described. 

The ATLAS calorimeter system [13] has about 200,000 individual cells of various sizes and with 
different readout technologies and electrode geometries. For jet finding it is necessary to first combine 
these cell signals into larger signal objects with physically meaningful four-momenta. The two concepts 
available are calorimeter signal towers and topological cell clusters. 
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Figure 2: Performance comparison of the ATLAS seeded cone, kj, and the mid-point seeded cone for 
ttbb events, as calculated from simulations. Shown are the distributions for jet multiplicity (top left), the 
pj spectrum for all jets (top right), the rapidity distribution (bottom left) and the pj spectrum for the 
leading jets (bottom right). 
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2.5.1 Calorimeter tower signals 

In case of the towers, the cells are projected onto a fixed grid in pseudorapidity (17) and azimuth (</>). The 
tower bin size is At] x A(j) = 0. 1 x 0. 1 in the whole acceptance region of the calorimeters, i.e. in |rj | < 5 
and —K < (j) < K with 100 x 64 = 6,400 towers in total. Projective calorimeter cells which completely 
fit inside a tower contribute their total signal, as reconstructed on a basic electromagnetic energy scale 1 , 
to the tower signal. Non-projective cells and projective cells larger than the tower bin size contribute 
a fraction of their signal to several towers, depending on the overlap fraction of the cell area with the 
towers, see Fig. 3 for illustration. 
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Figure 3: Calorimeter cell signal contributions to towers on a regular Ar) x A(j) = 0.1 x 0.1 grid, for 
projective and non-projective cells. The signal contribution is expressed as a geometrical weight and is 
calculated as the ratio of the tower bin area over the projective cell area in Tj and 0. 

Thus, the tower signal is the nondiscriminatory sum of possibly weighted cell signals (all cells are 
included). As the cell signals are on the basic electromagnetic energy scale, the resulting tower signal is 
on the same scale. No further corrections or calibrations are applied at this stage. 

2.5.2 Topological cell clusters 

The alternative representation of the calorimeter signals for jet reconstruction are topological cell clus- 
ters, which are basically an attempt to reconstruct three-dimensional "energy blobs" representing the 
showers developing for each particle entering the calorimeter. The clustering starts with seed cells 
with a signal-to-noise ratio, or signal significance T = -Eceii/cWse.ceii, above a certain threshold S, i.e. 
|T| > S = 4. All directly neighbouring cells of these seed cells, in all three dimensions, are collected into 
the cluster. Neighbours of neighbours are considered for those added cells which have F above a certain 
secondary threshold N (|T| > N = 2). Finally, a ring of guard cells with signal significances above a 
basic threshold |T| > P = 0 is added to the cluster. After the initial clusters are formed, they are analyzed 
for local signal maximums by a splitting algorithm, and split between those maximums if any are found 
[15]. 

'This is the raw signal from the ATLAS calorimeters. The nomenclature indicates that this scale has been derived from 
electron signals, but it lacks all corrections applied in high precision electron or photon reconstruction as described in Ref. [14]. 
It typically includes all electronic corrections and the geometrically motivated corrections for high voltage problems, like 
inactive electrode sub-gaps and similar. 
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Figure 4 shows the average number of particles in Monte Carlo generated jets from QCD dijet pro- 
duction together with the number of topological cell clusters in the corresponding simulated calorimeter 
jets. The figure indicates that the clustering algorithm resolves the particle content of the jet in the pseu- 
dorapidity range 1.5 < \rj\ < 2.5. The shower overlap between the particles in the jet cannot be resolved 
as well in the central region \r]\ < 1.5, where the calorimeter cell sizes are a bit larger on the scale of 
the hadronic shower. Here the ratio of number of particles per cluster is approximately 1.6. Similarly, 
in the forward region \r}\ > 2.5 both the increase in shower overlap, due to the decreasing linear distance 
between jet particles, and the increase of the cell sizes reduce the resolution power of the clustering 
algorithm for individual particle showers. 




Jet pseudorapidity 

Figure 4: Estimates for the average number of particles in seeded fixed size cone jets with R cone = 0.7 
in fully simulated QCD dijet production, shown as function of the pseudorapidity tj of the jet. Also 
shown is the corresponding average number of topological clusters in matching jets found with the same 
algorithm in the ATLAS calorimeters. 

Like towers, clusters are initially formed using the basic electromagnetic energy scale cell signals. 
These clusters can already be used for jet reconstruction. In addition, clusters can be calibrated to a local 
hadronic energy scale. This calibration starts with a classification step characterizing clusters as electro- 
magnetic, hadronic, or noise, based on their location and shape. After that, cell signals inside hadronic 
clusters are weighted with functions depending on cluster location, energy, and the cell signal density. 
Then, a correction for energy losses in inactive materials close to or inside the cluster is applied. Finally, 
a correction for signal losses due to the clustering itself (out-of-cluster correction) is applied. Note that 
all calibrations and corrections for topological clusters are derived from single particle simulations and 
do not use the jet context. 

2.5.3 Characteristics of calorimeter input to jet finding 

There are attempts in ATLAS to go beyond the high quality reconstruction of the total jet signal with the 
best possible resolution. Especially the reconstruction of jet structure, including the lateral and longitu- 
dinal signal distributions, can be useful to apply jet energy scale corrections jet by jet, or reconstruct the 
origin of a given jet. The ability to reconstruct this structure depends on the choice of the calorimeter 
signal definition used in jet finding, as is qualitatively indicated in the simulated high pj QCD event in 
Fig. 5. A given calorimeter signal definition like clusters may reproduce the jet shape at particle level 
better in certain regions of the calorimeters than in others. For example, from the depicted event in this 
figure the cluster signals represent the transverse energy flow of particles inside a jet better than the tower 
jets in the central and endcap regions, while in the forward region the clusters cannot resolve individual 
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Figure 5 : A simulated QCD dijet event with four jets in the final state, as seen at particle level and in the 
ATLAS calorimeters when using towers or clusters (extracted from Ref. [16]). 

showers anymore and thus cannot reproduce the jet shape very well. Here the towers still reflect some 
spatial structure of the incoming particles. 

Another important difference between tower and cluster jets is the number of calorimeter cells used 
in the jet. While towers include all cells of the ATLAS calorimeters, topological clustering actually 
applies noise suppression due to the cell signal significance cuts used. This means that many fewer cells 
contribute to jets in case of clusters, and that the noise contribution per jet is also smaller for cluster jets 
than for tower jets, see Section 3.3.1 for further discussion. 

Jet finding needs physical four-momenta on input. Thus, both towers and clusters are defined as 
massless pseudo-particles with a four-momentum (E,p), reconstructed from the reconstructed energy E 
(either electromagnetic or hadronic scale, see above), and the directions X] and 0: 

\P\ = \IPI+Py+Pl 

COS0 
P " 

coshrj 
sin0 
P coshrj 
p ■ tanh tj . 

The directions are fixed to the bin center in the (t] , <j)) grid for each tower, while they are reconstructed 
from the energy-weighted barycenter for topological clusters. 



E = 

Px = 

Py = 

Pz = 



270 



Jets and Missing E t - Jet Reconstruction Performance 



Jet Reconstruction Sequences 

II. Cluster Jets 



et Energy Scale Corrections 
,,se,p ll e-up,al g p effects, et c.) 



III. Cluster Jets 




I 


1 






Calorimeter Jets 

(em scale) 




Calorimeter Jets 






1 

et Based Hadronic Calibration Je 


1 

t Based Hadronic Calibration 






(cell signal weighting in jets etc.) 








Calorimeter Jets 




Calorimeter Jets 




Calorimeter Jets 



et Energy Scale Corrections 
„se,p l le-up,algon,t 1m effects, et c.) 



et Energy Scale Corre 
..se.p.le-up.a.gonthmeffe 



o 

D 

o 
3 



ln-situ Calibration 

ng event, physics environm 
i 



Refined Physics Jet 



situ Calibr 

int,p T' 



Refined Physics Jet 



Refined Physics Jet 



Figure 6: Schematic view on the reconstruction sequences for jets from calorimeter towers (left), uncali- 
brated (center) and calibrated (right) topological calorimeter cell clusters in ATLAS. The reconstruction 
(software) domains are also indicated. 
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2.5.4 Reconstruction flow for calorimeter jets 

The general reconstruction algorithm flows for calorimeter jets are summarized in Fig. 6. They reflect 
the differences between tower and cluster signals. As already discussed, tower signals are on the elec- 
tromagnetic energy scale while topological clusters are either on this scale or are calibrated on a local 
hadronic energy scale. Also, tower signals do not include noise suppression, while topological clustering 
has noise suppression built in. 

Reconstruction sequence I: tower jets Jet reconstruction from calorimeter towers (left diagram in 
Fig. 6) starts with a re-summation step, which addresses a possible unphysical four-momentum due to 
a negative (net) tower signal E tower < 0. This can be generated by signal fluctuations from noise (elec- 
tronics and physics from pile-up) in the cells entering into the corresponding towers. Simply ignoring 
the negative signal towers enhances the contribution of positive noise fluctuations, but combining nega- 
tive signal towers with nearby positive signals such that the combined four-momentum is physical with 
£"tower > 0, leads to cancellations of some of the noise fluctuations and avoids signal biases. Only negative 
signal towers without nearby positive signals are completely dropped. 

The resulting "protojets" represent either one or a few towers, and have all physically valid four- 
momenta. They are the input to the actual jet finding algorithm like seeded fixed cone or kj. The outputs 
of the jet finder are then jets with energies on the electromagnetic energy scale. Their constituents are 
the original calorimeter towers. They are subjected to a cell signal based calibration discussed below in 
Section 2.5.5. After calibration, jets with p T < 7 GeV are discarded. 

More refined corrections are needed to calibrate the tower jets to the particle level. Those include 
corrections for residual non-linearities in the jet response due to algorithm effects, like missing energy 
from the jet, or adding energy not belonging to the jet, in the jet clustering procedure. Other corrections 
include suppression of signal contributions from the underlying event and/or pile-up. Most of these can 
only be addressed in the context of a specific physics analysis. 

Reconstruction sequence II: cluster jets When topological clusters on electromagnetic energy scale 
are used for jet reconstruction, the reconstruction flow is rather similar to the tower jet reconstruction, see 
center diagram in Fig. 6. The main difference is the treatment of negative signals. Due to the symmetric 
noise cut applied in the cell selection in the clustering step, some clusters may have net negative signal as 
well. These can be ignored for jet reconstruction without significantly biasing the jet signal by positive 
noise contributions, because the noise suppression applied by the cell clustering already severely reduces 
any noise contribution. Some additional average cancellation is achieved by the symmetric noise cut, 
which allows inclusion of some negative cell signals even into positive (physical) clusters. 

The cluster jets are initially on the electromagnetic energy scale as well. The same cell signal weight- 
ing functions used for tower jets are applied to initially calibrate these jets, with some additional correc- 
tions for the fact that these calibration functions have not been optimized for the cluster signals, see 
Section 2.5.5 for more details. Like in reconstruction sequence I, all jets with pj < 7 GeV after calibra- 
tion are discarded. 

Reconstruction sequence III: locally calibrated cluster jets In this sequence the input objects to 
jet finding are already calibrated to the local hadronic energy scale [17]. This means that after the jets 
are formed, they are also calibrated on this scale. Additional corrections needed are related to the fact 
that all calibrations and corrections for this particular scale have been derived from single pion response 
only. Additional jet energy losses due to loss of particles in the magnetic field in ATLAS, or in inactive 
material without leaving any signal above clustering threshold in the calorimeters, have to be corrected 
in the jet context itself, in addition to the corrections for the physics environment contributions already 



272 



Jets and Missing E t - Jet Reconstruction Performance 



discussed for the tower jets. See the right hand side diagram in Fig. 6 for a schematic overview. Fully 
calibrated jets from this sequence with pj < 1 GeV are again discarded. 

2.5.5 Calorimeter jet calibration 

The long standing calibration scheme for calorimeter jets in ATLAS is based on cell signal weighting. 
It can be applied to both tower and cluster jets from the reconstruction sequences I and II, respectively. 
The basic idea behind this approach, which was originally developed for the CDHS experiment [18] and 
further refined for the HI experiment Ref. [19], is that low signal densities in calorimeter cells indicate 
a hadronic signal in a non-compensating calorimeter and thus need a signal weight for compensation of 
the order of the electron/pion signal ratio e/n, while high signal densities are more likely generated by 
electromagnetic showers and therefore do not need additional signal weighting. 

To apply the cell signal weighting, first all calorimeter cell signals contributing to a jet are retrieved. 
This is possible even if the jets have not directly been reconstructed from these cells. The signal in each 
cell i in the jet is weighted by a function depending on the cell location % and the cell signal density 
Pi = Ei/Vj, with Ei being the electromagnetic energy signal of the cell, and Vi being its volume. The 
weighting factor is 1 for high density signals and rising up to 1.5, the typical e/n for the ATLAS 
calorimeters, with decreasing cell signal densities. The weighting functions are universal in that they 
do not depend on any jet feature or variable. The calibrated jet four-momentum (£j e t,caio,Preco) is then 
recalculated from the weighted cell signals, which are treated as massless four-momenta (Ei,pi) with 
fixed directions: 

fecalo^calo) = f £ w{pi&)Ei, £ w{ Pi ,Xi)Pi \ • (D 

The signal weighting functions have been determined using seeded fixed size cone jets (/? C one = 0.7) 
in fully simulated QCD dijet events by fits of reconstructed calorimeter tower jet energies to matching 
Monte Carlo truth particle jet energies. Residual non-linearities (as function of pj) and non-uniformities 
(as function of Tj) are corrected by an additional calibration function parametrized in both variables. 
These corrections have also been calculated for other standard jet finding configurations and calorimeter 
signals. The calibration has been determined using the ideal, non-distorted detector geometries. See 
Ref. [17] for more details. 

2.6 Track jets 

Finding jets in reconstructed inner detector tracks is useful to recover possible inefficiencies of the 
calorimeter signals, especially for jets pointing to transition or crack regions. Even though the scheme of 
seeding calorimeter jets with track jets in these regions has not been completely evaluated, some recov- 
ery may be possible. At least tagging of suspicious event topologies using track jets without matching 
calorimeter jet, or matching a poorly reconstructed calorimeter jet, allows a suppression of events with 
significant fake missing transverse momentum in the hadronic final state reconstruction, thus improving 
the quality of any given sample for physics analysis. 

Matching the track jets with calorimeter jets is another promising approach to refine the jet energy 
measurement jet by jet, and the hadronic final state reconstruction in general. First, the pj fraction 
carried by tracks, defined as 

r /^T.track 

/trk = , (2) 

PTcalo 

with ^T.track being the transverse momentum from tracks and px,caio being the one reconstructed by the 
calorimeter, can be measured for each jet within the inner detector acceptance \r\\< 2.5. It can then be 
used to (relatively) improve the jet energy measurement, as indicated in Fig. 7. The figure shows results 
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obtained from simulations of QCD dijet processes of jets within |t]| < 0.7 and 40 < pj < 60 GeV. 
Even though the calorimeter jet calibration performs well on average, there are residual dependencies of 
the individual jet signal reconstruction quality on the jet fragmentation, i.e. the particle composition of 
the jet, for which / tt k provides a handle for a relative correction which can be applied jet by jet. This 
fragmentation dependency can be understood in that jets with large / tr k have a larger amount of their 
energy carried by charged hadrons, which tend to generate a smaller signal in the non-compensating 
ATLAS calorimeters. The standard jet calibration based on the calorimeter cell signals alone cannot 
completely recover the corresponding signal loss. This is certainly one of the promising techniques for a 
relative jet energy scale correction in the context of a refined jet calibration, with the immediate goal of 
improving the relative jet energy resolution. 

Jet finders usually cluster four- momentum in two dimensions, like azimuth (j) and pseudorapidity r\. 
Inner detector tracks in ATLAS have a reconstructed vertex associated, thus the z vtx coordinate can be 
added as a third dimension to jet finding from these tracks, see Fig. 8. Jet clustering in (j>, r\, and z vtx then 
allows the assignment of a vertex to a matching calorimeter jet, which is especially of interest in events 
with multiple interactions from pile-up. In this case jets not associated with the primary vertex can be 
tagged as such, and removed from the final state in jet counting experiments, e.g. a W + n jets analysis. 

2.7 Energy flow jets 

In Section 2.6 some aspects of the use of jets from charged tracks reconstructed with the inner detector 
are discussed. Another approach to improve the jet energy measurement is to match the calorimeter 
response in towers or clusters with a charged track pointing to it, and use the track kinematics if the track 
momentum resolution is better than the calorimeter energy resolution for the matched cluster, which for 
hadrons in ATLAS is the case up to pj ~ 140 GeV at r\ = 0. The principal method is referred to as 
energy flow reconstruction, and was pioneered at LEP [20] and is in use in hadron colliders; for example, 
the CDF application is described in Ref. [21]. 

The most important feature of energy flow reconstruction is the removal of the calorimeter signal 
generated by an accepted track. In ATLAS this has been studied using a track-cluster match approach. 
Figure 9 shows the relative variation of the pj resolution for cone jets from energy flow objects and 
topological clusters in QCD dijet production, both without final energy scale corrections. From this there 




Figure 7: The difference between jet pj reconstructed from the calorimeter and the matching truth par- 
ticle jet pj, for all jets with \t]\ < 0.7 and 40 < pj < 60 GeV, and jets from this sample in two regions 
of the track pj fraction / t rk defined in Eq.(2) (left). The plot in the center shows the relative differ- 
ence Apx/j'T, truth for various bins of / t rk and as function of the calorimeter jet pj. The distributions of 
Apj after applying these corrections are shown in the right figure. The lightly shaded area indicates the 
distribution for all jets, without any selection based on f tr ^. 
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z (cm) z (cm) 

Figure 8: Track jets in the horizontal (x,z) plane of a simulated event with multiple vertices. The left fig- 
ure shows tracks assigned to cone jets with pj > 5 GeV using the usual two-dimensional clustering algo- 
rithm in X] and 0, with R con e = 0.4 (same line patterns and shades indicate same jet). Clearly tracks from 
different vertices can end up in the same jet. The right figure shows cone jets from three-dimensional 
clustering for the same event, explicitly including the track vertices {R cone = 0.4, Az = 10mm). 
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Figure 9: The ratio of the relative pi resolutions for energy flow and cluster cone jets Ceflow/Ccaio, as de- 
termined with fully simulated dijet events, as function of the jet pj. Results for both narrow (R CO ne = 0.4) 
and wide (R cons = 0.7) jets within < 1.8 are shown. 
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are indications that for jets with pj < 80 GeV in this environment, which is characterized by typically 
low event activity in general, the jet energy resolution can be improved by the energy flow technique. 
The expected gain is most significant at lower jet pi, e.g. about 15% (relative) at p T = 40 GeV. 

2.8 Particle jets 

Particle jets are only available in simulated events. They are built from stable particles produced by the 
fragmentation model in the physics generator. Stable particles in ATLAS are those with a laboratory 
frame lifetime of about 10 picoseconds or more, thus typically including electrons, muons, photons, 
charged pions, kaons, protons, neutrons, neutrinos, and their corresponding antiparticles. These particles 
represent the "truth" reference of a hard scattering process for performance studies and simulation based 
calibration approaches. The particle jets are therefore referred to as truth particle jets, or truth jets, in 
the following sections of this note. 

Neutrinos and muons generated in the collision are excluded from these truth jets, as they have their 
own observables, i.e. missing transverse momentum for neutrinos and explicitly reconstructed tracks for 
muons. Jet finding with generated particles uses the same code as calorimeter reconstruction, obviously 
excluding the signal preparation for towers in reconstruction sequence I and all calibration steps. 

3 Jet reconstruction performance in ATLAS 

The performance of the ATLAS detector for jet reconstruction has been evaluated within the present 
day limitations of physics generators, mostly PYTHIA [22], and detector response simulations, all 
performed with GEANT4 [23, 24]. All results presented here should therefore be considered to be 
expectations and of preliminary nature. 

3.1 Preliminaries 

The comparisons of calibrated jet features discussed in this section uses calorimeter jets to which the 
cell signal based calibration has been applied, in addition to the overall scale correction for different 
jet finder configurations and calorimeter signal choices, i.e. jets from reconstruction sequence I and II 
introduced in Section 2.5.4, with calibrations applied as described in Section 2.5.5. As already discussed 
in that section, the parameters of the corresponding calibration functions have been determined using 
simulations of dijets from QCD processes with the ideal detector geometry, meaning no misalignment 
between detectors or detector elements, no detector shape distortions, and assuming a perfect knowledge 
of the material distribution from supports, services, cryostats, etc., in the complex ATLAS geometry. 

The simulations performed for the evaluation of the jet reconstruction performance use the same 
generated physics, meaning the same events at particle level, but include small shifts in relative detector 
positioning and small changes to the amount (increased) and location (more realistic asymmetric distri- 
bution in pseudorapidity and azimuth) of dead material in the detector description. As a consequence, 
the calorimeters do not respond optimally for this evaluation with respect to jet signals, but the esti- 
mated performance is likely closer to the initial one in ATLAS, with some a priori unknown distortions, 
misalignments, and other imperfections. 

Final corrections can be derived once experimental data from the detector become available with suf- 
ficient statistics by e.g. following the strategies lined out in Ref. [25]. For the performance expectations 
presented here these corrections, which could have been derived rather straight forwardly by using the 
misaligned detector geometry in the simulations for calibration as well, are intentionally not applied to 
probe some of the initial systematic uncertainties to be expected for the initial running of ATLAS. Note 
that none of the results presented here include pile-up. 
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3.2 Comparing jet reconstruction algorithm performances 

Jet reconstruction performance is typically expressed in terms of expected or measured signal linearity, 
i.e. flatness of the detector response to particle jets over the whole kinematic range of interest at the LHC 
(from pi « 10 GeV to a few TeV), signal uniformity in pseudorapidity 77 and azimuth 0 over the whole 
detector system coverage, and the achievable energy resolution. Additional features of jet reconstruction 
performance are the efficiency to find jets, and the purity of the found jet sample, which is of course 
related to the number of fake jets reconstructed. 

Other jet features studied with the ATLAS detector are the possibility to reconstruct the original 
source of a given jet. This is particularly interesting for heavily boosted heavy particle decays like for 
top quarks, where the final state will likely be reconstructed as just one (narrow) jet. The measurement 
of the jet mass and a substructure analysis are experimental tools which could address this question. 

The quality of the reconstructed variables depends on the choice of the calorimeter signal (towers or 
clusters), the choice of the jet finder and its configuration (wide/narrow jets), and the ability to unfold as 
much as possible the physics environment, like reflected in underlying event and pile-up contributions. 
A high precision analysis of a given event topology may require different configurations than offered 
in default jet reconstruction to optimize the signal. In this section the expected effect of the choices 
discussed above on the performance variables is shown for selected configurations. 

In most cases the performance is evaluated using a truth reference provided in the simulation by 
jets reconstructed at particle level, see Section 2.8. The detector and truth jets are associated through 
directional matching. The following list of variables is used: 

Jet directions are given by pseudorapidity 77 and azimuth 0. As 77 is directly related to the polar angle 
6, and thus useful (even for massive jets) to understand the variations of the detector response, the 
rapidity y can be used for physics analysis motivated selections. The variables are defined as: 

' - -H££)~ fc KD] 
' - -H§3) 

0 = arctan (^j . (5) 

Matching radius R m is defined by the directional distance between truth and detector jets, i.e. 

R m = V/ATJ2 + A0 2 . (6) 

Two jets are matched if the radial distance between them is R m < 0.2, if not stated otherwise. Only 
one match is allowed for each reference (truth) jet. In case of two or more nearby jets, the one 
closest to the reference is taken for efficiency and purity studies. For signal linearity and uniformity 
studies the reference and the calorimeter jets are omitted in this case. 

Signal linearity can be determined by the ratio X of the energy reconstructed for the calorimeter jet 
£jet,caio and the matched truth jet energy £j e t, truth, 

X = , (7) 

£jet, truth 

when X can be calculated as function of £j e t, truth or £j e t,caio- 

Signal uniformity is measured by the variation of the signal as function of the jet direction in the de- 
tector frame, as given by r\ in Eq.(3). The variable X defined in Eq.(7) can be used to estimate the 
uniformity from simulations, if calculated as function of r\ . 
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Relative energy resolution is given by the width of the distribution of the relative difference between 

£jet,calo an d £jet, truth : 



'£jet,calo — -Ejet, truth \ \ / £jet,calo — -Ejet truth \ ^ 
\ \ V ^jet, truth J J \ £jet, truth / 

Jet reconstruction efficiency e is defined by the following ratio: 

, . _ # matches of truth particle jets with reconstructed jets _ Nm ts (R m ) 
m '~ # truth particle jets ~~ 

It depends on the matching radius R m and is determined as function of the true jet energy, pi, or 

Jet reconstruction purity n is given by 

R _ # matches of truth particle jets with reconstructed jets _ Nm ts (R m ) 
# reconstructed jets N^f 0 

and also depends on the choice for R m . The fake rate f(R m ) is then given by f(R m ) = 1 — ft(Rm)- 
Purity is calculated as function of the reconstructed jet energy, p T , and/or direction. 

Additional variables reconstructed from jets, like substructure and shape measures, are discussed in 
Section 3.6. 

3.3 Comparisons of basic jet signal features 

The main data source for the evaluation of the performance of the ATLAS calorimeters for jet recon- 
struction are fully simulated QCD dijet events with at least one of the hard scattered partons having a pj 
within a certain bin. Eight bins are defined, with incrementing delimiters approximately following a 2" 
power law: 17 -> 35 GeV, 35 -> 70 GeV, etc., up to the final bin p T > 2240 GeV with the upper limit set 
by the kinematic limit introduced by the parton direction and the 14 TeV center-of-mass energy in the pp 
collisions at LHC. 



3.3.1 Signal linearity and resolution 

Signal linearity has been studied in detail for the evaluation of the various calorimeter jet calibration 
schemes under discussion in ATLAS [17]. The general expectation from these studies with the distorted 
detector is a response flat within ±1% for jets with pi > 50 GeV, and a slightly larger deviation from 
linearity for jets with lower transverse momentum down to ~ 10 GeV [17]. As the calibration functions 
are determined using one jet finder configuration and one simulated calibration sample in a perfect de- 
tector model (see discussion in Section 2.5.5), one can estimate the shift from a response flat within the 
margins introduced by the distorted detector when other configurations and/or a different calorimeter 
signal basis are considered. This shift can be expressed as the ratio of X a u, as given in Eq.(7), from a 
given alternative jet reconstruction to X re f, the reference from the same jet reconstruction configuration 
used to derive calibration functions in the ideal detector: 




278 



Jets and Missing E t - Jet Reconstruction Performance 



0.2 < \y\ < 0.4 



2.0 < \y\ < 2.2 



Seeded Cone Jets ; 



Seeded Cone Jets 



T + 

+ 



Cluster 



1000 100 
Truth jet transverse momentum (GeV) 



Figure 10: Residual calibration uncertainties for reconstructed jets from various jet finder configurations 
and the two calorimeter signals, as function of the truth jet pj and in two bins of jet rapidity y. The refer- 
ence configuration is seeded fixed cone tower jets with R COBS = 0.7. The residual calibration uncertainty 
is given by £ , as defined in Eq.(l 1). 
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Figure 11: Residual calibration uncertainty calculated relative to the fixed seeded cone (R CO ne = 0.7) 
tower jet calibration as described by Eq.(ll), as function of the truth jet p T in the forward direction of 
ATLAS. 
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The quantity £ can be viewed as a measure of the residual calibration uncertainty with respect to the best 
calibrated jet reconstruction configuration, and is thus an estimate of one of the systematic uncertainty 
contributions in the general jet reconstruction. Note that 

palt 
e ^ ^jetcalo 
<= ~ pref 

■^jetcalo 

when the alternative reconstruction uses the same jet finder with the same parameters as the reference, 
because tmth « E v ^ ttuth in this case. The prime example here is the comparison of seeded fixed cone 
tower jets with R CO m = 0.7 (the reference) with cluster jets found with the same configuration. 

Figure 10 shows expectations for £, as function of the jet pj in two different regions of jet rapidity. 
From this simulation based study it can be concluded that the cell signal based jet calibration, with the ad- 
ditional overall scale corrections discussed in Section 2.5.5 applied, is universal for the distorted detector 
at a level of about 2% for the studied QCD jets with pj > 20 GeV and the particular choice of distor- 
tions implemented in the detector description of the simulation program. As these distortions include 
additional inactive material between the electromagnetic liquid argon and the hadronic tile calorimeter 
(about 10% increase in nuclear absorption length), the effect is in particular emphasized for low pj jets 
in the central region, which mostly occupy the 0.4 < \y\ < 0.4 bin 2 . Here the sensitivity to the jet algo- 
rithm (seeded cone or k T ), its configuration (narrow or wide jets), and the choice of calorimeter signals 
(clusters or tower) is also largest, see Fig. 10. 

Sensitivities to signal and jet finder choices are stronger for jets within 3.7 < \y\ < 3.9. The calorime- 
ter jet shape in the corresponding forward region is dominated by (lateral) hadronic shower extension 
rather than the particle flow in a cone in (T] , 0), meaning that a considerable part of the calorimeter signal 
can be outside of a chosen jet cone and therefore be lost for the total jet signal. Jets from each of the 
jet finder configurations and calorimeter signal choices have been individually corrected for these signal 
losses using simulations with the ideal detector geometry. The effect of the distorted detector, which 
includes a change of the relative z position of the endcap and forward calorimeters in ATLAS and thus 
a change of the aspect ratio of the particle level jet shape to the calorimeter jet shape and consequently 
different energy losses, can be significant especially for the kj jets, as indicated in Fig. 11. Again using 
the seeded fixed-size cone tower jets with R con e = 0.7 as a reference, the cluster jets reconstructed with 
the same algorithm show a similar response, indicated by \E, — 1| < 2% in the whole kinematic range 
studied here. The narrow cone cluster jets lose about 1% of their energy, rather independent of their 
pj, which can be expected due to the lateral hadronic shower size varying only slightly with energy. 
The larger effects of the distorted detector on the kj jets, like the prediction of a nearly linear rise of 
E, with logpT for pi < 100 GeV for all considered kj jets, but most pronounced for wide kj tower jets, 
require more investigation of the distortion effects and their reflection in towers and clusters on the more 
complex dynamics driving the kj algorithm. 

The jet energy resolution is the other important contribution to precision jet reconstruction. It has 
been evaluated in the distorted detector geometry with the same QCD sample. The results are again dis- 
cussed in more detail in Ref. [17]. A typical relative energy resolution, calculated as described by Eq.(8), 
and achieved without particular corrections for the distorted detector, has a stochastic term of about 
60%/ GeV) and a high energy limit of about 3% in the central region of ATLAS, all determined 
with these simulated events. 

It is expected that the relative energy resolution depends on the calorimeter signal choice, the jet al- 
gorithm, the underlying event and pile-up activity, and the general particle density and flow of the physics 
environment. Using QCD dijet simulations, with electronics noise included in the detector simulation but 
without any pile-up activity, the difference in resolution between tower and cluster jets can be estimated 

2 Note that for the QCD jets under consideration here the jet mass m is generally small, in particular m<^p, i.e. y « r\. 
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Figure 12: The difference in relative energy resolution \y a (see Eq.(12)) between seeded cone cluster and 
tower jets, and kj cluster and tower jets, respectively, as function of the matching particle jet energy, and 
in two different regions of jet rapidity y. A negative value for \ffa indicates a better resolution for cluster 
jets. 

with the test variable xp a , which uses the fractional difference A a in the energy resolution 

A * = (?) 2 -(?) 2 ■ 

\EJ cluster \EJ tower 

with the relative resolutions o JE as given in Eq.(8). can then be defined as 

(12) 



Figure 12 shows the prediction for \jf a for various jet configurations in two different kinematic regimes 
defined by the jet rapidity. As this variable mainly folds differences in the stochastic and noise contri- 
bution to the jet energy resolution, the effect of noise suppression implicit for cluster jets is particularly 
visible at low energies and for wider jets, where more calorimeter cells with noise contribute to the tower 
jets than in narrow jets. At higher jet energies, the energy resolution contribution introduced by sig- 
nal fluctuations from electronics noise is significantly reduced and \y a is comparable with zero in both 
kinematic regimes. 

The relative difference y a can be transformed into an energy-equivalent difference in a straight 
forward manner: 

Ao abfi (E) = y a (E)-E. (13) 

Figure 13 shows predictions for this difference for the QCD jets discussed here. The general flat (energy 
independent) behaviour indicates that the differences between cluster and tower jets indeed are mostly 
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Figure 13: Estimates for the absolute difference in resolution, as defined in Eq.(13) between seeded cone 
cluster and tower jets (R con e = 0.7, open circles o). The filled circles (•) show the same difference for kj 
cluster and tower jets, with R = 0.6. The solid and dashed lines show results and the range of fits to the 
data points. Note the linear energy scale and the limited energy range, compared to Fig. 12. 



related to electronics noise. The average noise contribution (Aa a b s ) has been determined by straight line 
fits to the flat regions. Seeded cone tower jets with R cone = 0.7 in the lower rapidity regime 0.2 < \y\ < 0.4 
yield | (Aa abs )| ps 2.5 GeV more noise than the corresponding cluster jets. This contribution is estimated 
to be higher for k T jets with R = 0.6, at approximately 4 GeV. | (A<7 abs ) | increases in the higher rapidity 
region 2.0 < \y\ < 2.2 to about 6 GeV for cone and 8 GeV for kj jets, respectively. This observation 
already reflects the larger electronics noise in the endcap calorimeters, where most of the jets in this 
rapidity range go. The evaluation of \j/ a as function of the jet direction, as shown in Fig. 15 in Section 3.4, 
explores this observation further. 

Figure 13 also suggests that for E < 50 GeV, kj jets made from clusters in the lower rapidity regime 
are subjected to fluctuations from other sources than electronics noise, which partly cancel the noise 
reduction as |Aa a b s | gets smaller with decreasing energy. These fluctuations are not present in kj jets 
made from towers and in seeded cone jets reconstructed from towers or clusters in the same kinematic 
region, and need further exploration. 



3.4 Jet signal uniformity 

The variation of the jet response as function of the jet direction is a measure of the uniformity of the jet 
signals and more powerful to diagnose possible detector inefficiencies than the linearity studies discussed 
in the previous section. Figure 14 shows the variation of the response to wide and narrow seeded cone 
and kj tower jets as function of the jet pseudorapidity 3 for simulated QCD dijet events. The additional 
inactive material introduced between the electromagnetic and hadronic calorimeters in the central region, 
which is part of the distorted detector geometry model, especially affects the response to the lower energy 
jets. The drop in the jet response is also more enhanced for lower energetic jets in the crack region around 
|t]| ps 1.5, with up to 10% of signal loss, again due to more inactive material in this region. The response 
to the higher energetic jets is more uniform, as expected, with a signal drop in the cracks around 4%, 
which is also observed for the crack at | T7 1 ps 3.2. 

3 Contrary to the preceding discussion of the signal linearity and energy resolution, where the rapidity y is used to categorize 
different kinematic regimes for jets, in the discussion of the signal uniformity the pseudorapidity 77 is preferred, as it directly 
reflects a direction in ATLAS (see Eq.(3) in Section 3.2) and thus is more appropriate to characterize the response in a certain 
calorimeter region. 
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Figure 14: The uniformity of the ATLAS calorimeter response for wide (R CO ne = 0.7) and narrow 
(^cone = 0.4) seeded fixed sized cone jets, and for wide (R = 0.6) and narrow (R = 0.4) kj jets. The 
upper two plots show the response variation as function of the jet pseudorapidity tj for tower jets in two 
different jet energy ranges, while the lower plots show the signal difference between cluster and tower 
jets A£j et ca i 0 = £jet. duster — £jet, tower as function of tj, again in two jet energy bins. The pi variation for 
the two energy bins is shown in the middle section, for comparisons. 
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There are only relatively small difference between the seeded cone and kj jet signal uniformity, 
which are more pronounced at lower jet energies. Using clusters instead of towers as the calorimeter 
signal basis does not affect the signal uniformity for the higher energetic jets, but is expected to recover 
some signal losses, of the order of 1%, for the lower energetic jets. 

Predictions for the dependency of the relative energy resolution on the jet direction is shown in 
Fig. 15, for the same QCD jet sample. From this study there are significant differences expected between 
the seeded cone and the kj algorithm for lower jet energies. Both for wide and narrow jets the cone 
algorithm seems to perform better, especially in the endcap calorimeter region. For higher energetic jets 
these differences become insignificant. 

Testing the difference between cluster and tower jet energy resolution has been done using the rel- 
ative resolution difference \)/ a , as defined in Eq.(12). For the lower energetic jet sample the noise-like 
contribution to \y a already observed in Fig. 12 is confirmed, with the additional observation that in the 
endcap calorimeters clusters seem to reduce these fluctuations even more than in the central region. High 
energetic cluster and tower jets perform very similarly. From the absolute differences between the reso- 
lutions shown in Fig. 13, which is about 2.5 — 8 GeV depending on the kinematic regime and the chosen 
jet finder and its configuration, the expected value for \j/ a for higher jet energies is about 0.2 - 0.7%, 
which is in agreement with the predictions for the high energy jet sample shown in Fig. 15. The larger 
and positive y a at higher pseudorapidities observed especially for narrow k T jets indicates signal fluctu- 

narrowjets wide jets 




0 0.5 1 1.5 2 2.5 3 3.5 4 0 0.5 1 1.5 2 2.5 3 3.5 4 
Truth particle jet pseudorapidity 



Figure 15: Predictions for the direction dependence of the relative jet energy resolution for seeded cone 
(wide, R cone = 0.7, left, and narrow, R cone = 0.4, right) and wide (R = 0.6) and narrow (R = 0.4) kj tower 
jets in two different jet energy bins (upper plots). The relative resolution difference to cluster jets y a , as 
defined in Eq.(12), is shown for the same jet samples in the bottom plots. 
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ations introduced by clustering in the forward region. A more detailed investigation of the kj algorithm 
performance when configured for narrow jets in this region is under way. 

3.5 Efficiency and purity 

The efficiency of the jet reconstruction in ATLAS depends not only on the jet pj but also on the jet 
direction and the physics environment. For basic performance evaluation it has been studied using QCD 
dijet production. Figures 16, 17, and 18 show the expectations for seeded cone jets with R con e = 0.7 
and 0.4, and kj jets with R = 0.4 and 0.6, respectively, in various kinematic regimes defined by the jet 
rapidity y. It is noticeable that the efficiency at lower pj is rather low, with the wide kj jets performing 
better than the corresponding wide cone jets. Both jet algorithms have similar efficiencies for narrow jets 
for 0.2 < \y\ < 0.4, with increasingly better efficiencies predicted for kj jets at higher rapidities. 

Figures 16 through 18 also show the estimated differences in the jet reconstruction efficiency between 
tower and cluster jets, defined as Ae = £ c iuster — £tower, with £ c i us ter and Ctower defined in Eq.(9). In general 
cluster jets are expected to be reconstructed with higher efficiency in the low pj regime, especially in 
case of wide jets and in the forward direction. Narrow seeded cone jets are reconstructed with basically 
the same efficiency for both calorimeter signal choices, except at pj < 50 GeV in the forward region, 
where cluster jets are again more efficient. 

The predictions for Ae for wide kj indicate similar behaviour as the seeded cone jets in the central 
calorimeter region, but a much less pronounced difference between the tower and cluster jets in the 
forward region. There are also indications the narrow kj jet reconstruction efficiency is slightly improved 
when using clusters, see again Fig. 16 through Fig. 18. No significant variation of the jet reconstruction 
efficiency as function of the jet direction is expected, especially not for higher pi jets. 

The fake rate / = 1 — n (see also Eq.(10)) for jet reconstruction in QCD dijet events has also been 
studied. The rate drops quickly from more than 50% at jet pj of 10 GeV to around 1% for jets with 
pj > 50 GeV. There are indications that wide jets reconstructed from topological clusters have a lower 
number of fakes at low pj, especially in the central region. Also, the kj algorithm seems to generate 
fewer wide jets than the cone algorithm in this region, independent of the calorimeter signal choice. 
These differences are expected to be much smaller for narrow jets. 

3.6 Jet composition and mass 

The reconstruction of jet masses and substructure has gained interest at LHC, especially because of the 
expected production of heavy particles with masses of ^(100) GeV and transverse momenta of several 
hundred GeV. These may decay hadronically into very collimated final states. An important standard 
model example is the W boson [26]. Other examples include SUSY [27], exited heavy quarks [28], and 
in exotic final states involving extra dimensions [29]. 

In any case, the complete final state of a heavy and boosted particle may be reconstructed into one jet, 
which in the example of a fully hadronically decaying top quark actually contains three highly collimated 
jets. The mass of the reconstructed jet is then one of the indicators of its origin, in addition to a possible 
substructure reconstruction reflecting the three "internal" jets. 

The reconstruction of the jet mass is inherently difficult from calorimeter signals, as the shower 
development washes out the directions and energies of individual particles in the jet. In addition, the true 
jet mass is of course best reconstructed if all of its original particles can be measured at high precision. 
The mass reconstruction is thus disturbed by undetected particles, like the ones curling in the solenoidal 
magnetic field in the inner detector cavity of ATLAS, and the ones losing too much energy in upstream 
inactive material to generate a signal above threshold in the calorimeters. 

Figure 19 shows the expected composition of kj jets with R = 0.6 built from particles, towers, and 
clusters in three different rapidity regions. The number of constituents shown in this figure is of course 
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Figure 16: The calorimeter jet reconstruction efficiency e in ATLAS, calculated from simulations using 
Eq.(9), for seeded cone tower jets with R CO ne = 0.7 and R CO ne = 0.4, as function of the truth particle jet 
pi, in 0.2 < \y\ < 0.4 (top left). The plot on the top right shows predictions for wide (R = 0.6) and 
narrow (R = 0.4) kj tower jet reconstruction efficiencies in the same kinematic regime. The difference 
in efficiency between cluster and tower jets, defined as Ae = £ c iuster — £tower, is shown in the lower left 
plot for cone and in the lower right plot for kj jets. 
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Figure 17: Predictions for the calorimeter jet reconstruction efficiency in ATLAS for wide and narrow 
seeded cone (top left) and kj (top right) tower jets with 2.0 < \y\ < 2.2, shown together with correspond- 
ing differences in efficiency between cluster and tower jets (bottom plots), again as function of the truth 
particle jet pj (see caption of Fig. 16 for more details). 
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Figure 18: Jet reconstruction efficiencies e as function of the truth particle jet pi, for wide and narrow 
seeded cone and kj tower jets in ATLAS (top), estimated with simulations of QCD dijet processes, in the 
jet rapidity range 3.7 < \y\ < 3.9. The corresponding difference in efficiency between cluster and tower 
jets is shown in the bottom plots. See caption of Fig. 16 for more details. 



depending on the calorimeter signal choice, i.e. towers or clusters. The variation in the number of clusters 
between the rapidity regions reflects the changing spatial resolution power of the calorimeter with respect 
to resolving individual showers inside a jet. The observation that the number of particles inside jets seems 
to drop at highest pi indicates a change in the origin of the jets. Most jets at lower pi in the studied QCD 
sample are gluon jets, while for higher pi a significant fraction of jets is produced by quarks. As gluons 
have a larger probability to radiate off other gluons than quarks have, one can expect more particles 
inside gluon jets, and even more jets in the final state in case of gluons. 

The relative sensitivity of the jet mass reconstruction to low signal contributions has been studied 
for cluster, tower and particle jets with QCD dijet simulations. The relative change in the jet mass if pi 
thresholds are applied to the constituents, is defined as 

^ = mJ &?- 

Here m(pi > Pj m ) means the jet mass recalculated using only jet constituents with a transverse momen- 
tum above p™ m , while m(pi > 0) is the jet mass using all constituents. Figure 20 shows the expectations 
for the variation of Am re i as a function of the reconstructed m(pi > 0), for different pj m . Especially 
for high mass jets the cluster jets show a similar behaviour as the particle jets, indicating that clus- 
ters are probably better reflecting the particle composition in these jets than the tower jets. Note that 
pi > 400 MeV is around the threshold for charged particles to reach the calorimeter in the magnetic field 
of the inner detector, and is also close to the general signal threshold for the calorimeters in the central 
region of ATLAS. 
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Figure 19: The average number of constituents of kj jets with R = 0.6 as function of the jet pj, in 
three different regions of ATLAS, as predicted by simulations of QCD dijet events (figure adapted from 
Ref. [16]). 
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Figure 20: The variation of the jet mass re-reconstructed from its constituents with different levels of 
biases in pj, as function of the least-biased mass using all constituents, for simulated tower, cluster, and 
particle level jets in QCD dijet events in ATLAS (figure adapted from Ref. [16]). 
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3.7 Jet substructure 

The sensitivity in the mass reconstruction observed in Fig. 20 suggests the need for jet variables less 
sensitive to the soft particle contribution and response, yet providing sensitivity to the possible origin of 
the jet. One of the interesting variables is the characteristic scale j sca i e for sub-jet splitting in kj jets, i.e. 
the pi scale at which the n last recombinations of the kj algorithm are undone. It can be defined using 
y n such that 

yia\ e = PT x yn- 

For example, n = 2 means splitting into two sub-jets, identical to undoing the last recombination in the 
kj algorithm. y sca i e is logarithmically below the jet pj in gluon and quark jets, due to the strongly ordered 
(in kj) QCD evolution, see e.g. Ref. [30]. In case of a strongly boosted W boson decaying into quark 
and anti-quark, on the other hand, y sca i e is closer to the W mass oif. The left plot in Fig. 21 shows a 
qualitative comparison of the y sca i e spectrum for jets with masses m^ t > 40 GeV in simulated QCD dijet 
processes with the one for jets from simulated boosted W boson decays with the same mass cut applied. 
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Figure 21: Distributions of the scale variable j sca i e indicating the threshold for splitting a given jet with 
mass mj e t > 40 GeV into two sub-jets, for simulated QCD dijet processes and boosted W bosons decaying 
hadronically (left figure). The right figure shows predictions of the resolution power for y sca \ e for the 
boosted W bosons. The filled area distribution shows the spectrum for A;y sca i e = j S caie,reco — Jscaie, truth 
obtained with fully detailed simulations, while the solid distribution shows the optimistic estimate from 
a smeared particle-level calculation. Both distributions are normalized to unity. 

The resolution power of ATLAS for a measurement of y sca i e has been studied for splitting into two 
jets, i.e.: 

Jscale = X PT , 

with a sample of simulated highly boosted W decaying hadronically. For these the corresponding j sca i e 
distribution approximately peaks at mw/2, as expected, see Fig. 21. The resolution power for y sca i e can 
be evaluated by comparing this variable reconstructed from the calorimeter jet with the one from the 
matching particle level jet. The right plot in Fig. 21 shows the resolution for full simulation and for a 
smeared particle-level fast simulation. Pending a full scale evaluation in the context of a physics analy- 
sis, the present observation based on the the similarity of both distributions is that the reconstruction of 
Jscale does not seem to be too sensitive to details affecting the calorimeter signal, like showering, limited 
acceptance for low energetic particles, and similar. The prediction for the experimental y sca [ e resolu- 
tion from this figure is about 12%, while the optimistic simulation based on four-momentum smearing 
predicts about 9%. 
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4 Forward jet reconstruction and jets in minimum bias events 

The ATLAS detector provides near hermetic coverage within pseudorapidities of approximately 
—4.9 < tj < 4.9. The forward region is particularly challenging for jet reconstruction, as the jets of 
interest often have rather low transverse momentum (at high energy) and are thus closer to the fluctua- 
tions introduced by pile-up at design luminosity at LHC. For example, in a jet cone with R con e = 0.7 one 
expects fluctuations in transverse momentum of the order of 12 GeV [31], meaning that the minimum pj 
which can safely be reconstructed is around 40 GeV. The estimates for jet performance in the forward 
region presented here have been calculated without pile-up, i.e. only electronics noise is folded into the 
reconstructed signals. 

Events with depleted hadronic activity in the central region of the pp collisions are important signa- 
tures for discoveries, including but not limited to leptonically decaying Higgs bosons produced in vector 
boson fusion (VBF) events, like WW scattering. To reconstruct this signal with significant efficiency a 
central jet veto can be applied. The effectiveness of this veto can be understood from the efficiency to 
reconstruct jets in minimum bias events without hard scattering, and the rate for fake jet reconstruction. 
This has been studied with simulations for ATLAS and the results are presented in Section 4.2. 

4.1 Forward jets 

Recent studies indicate that the detection of forward going (light quark) jets with pseudorapidities 
2.7 < |t]| < 4.9 helps to significantly increase the discovery potential not only for heavy Higgs bosons 
[13], but also for intermediate mass Higgs bosons produced in VBF [32]. This region in the ATLAS de- 
tector features a complex calorimeter geometry in the transition region from the end-cap to the forward 
calorimeters. This leads to a loss of precision from the changing readout geometries. Some aspects of 
the performance of the ATLAS detector for these forward jets are discussed in this section. 

The VBF events have a specific topology in that on average two hard tag jets are produced by the two 
quarks radiated off the vector boson. Figure 22 shows predictions for the jet multiplicity distributions 
in these events, both over the whole detector acceptance (|tj| < 4.9), where most often the two tag jets 
are found, and in the forward direction (2.7 < \rj\ < 4.9) only. From this the expectation is in most 
events only one of the jets is going into the forward direction, at least for the particular Higgs boson 
mass considered here (m# = 120 GeV). This observation is independent of the jet size for seeded cone 
and kj jets. 

The relative transverse momentum resolution for both forward seeded fixed cone and kj jets in VBF 
produced H — > TT — ► }X}X events is estimated at about 9% for 20 < pj < 120 GeV. Signal linearity is 
expected to be within ±2% for these jets, at least for pj > 30 — 40 GeV, see Fig. 23. The precision 
achieved in the reconstruction of the jet kinematics for even lower pj, an attempt only realistic in low 
luminosity running at LHC, seems to be limited from this study to w 5%. 

Predictions for the efficiency and purity of forward jets reconstructed with the seeded cone 
(^cone = 0.4) finder are shown in Fig. 24. Here the indication is that using topological cell clusters 
as calorimeter signals for jet reconstruction makes jet reconstruction more efficient at the likely less rel- 
evant lower end of the jet pj spectrum (pj < 30 — 40 GeV), but generates more fake jets in the same pj 
range, i.e. has a lower purity in this region for this event sample. As both calorimeter signals reconstruct 
kj jets with basically the same efficiency and purity in the whole kinematic range of interest for these 
VBF events, there is an indication that the splitting of cell signals to fill towers, which is predominant in 
the forward region, generates less seeds than the more integrating cell clustering, where cell signals are 
summed up rather than split and thus create more likely signal objects above the seed threshold. See also 
text and Fig. 3 in Section 2.5.1 for tower formation, and Section 2.5.2 for clustering. 
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Figure 22: Simulated jet multiplicities for seeded cone jets (/? CO ne = 0.4) with pj > 20 GeV, built from 
topological cell clusters and generated particles, respectively, in VBF produced Higgs boson events (left). 
The Higgs boson decays as H — > xx — > The right plot shows the multiplicities of forward going jets 
with rapidities 2.7 < \y\ < 4.9. 
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Figure 23: Relative deviation of reconstructed transverse momentum pj from calorimeter jets and the pj 
from the matched particle jet in VBF Higgs boson production, for various jet finder configurations and 
the two calorimeter signal definitions (towers and clusters). 
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Figure 24: Estimated efficiency to reconstruct narrow seeded cone tower and cluster jets {R CO ne = 0.4) 
generated in VBF Higgs boson production in the forward direction (2.7 < |rj| < 4.9), as function of the 
truth jet transverse momentum (left). The plot on the right shows the corresponding fake reconstruction 
rate as function of the transverse momentum of the calorimeter jet. 



4.2 Jets in minimum bias events 

Soft underlying physics, as generated by the underlying event in hadron colliders and the multiple soft 
interactions at high luminosity is an important source of jet production not directly related to the triggered 
hard scattering process of interest. For efficient application of a central jet veto, which is an important 
tool in background suppression in VBF produced Higgs boson events, it is crucial to understand the jet 
rate from soft interactions in this region, and the particular characteristics of these jets. The latter point 
is subject to ongoing studies, but first estimates on the jet multiplicity and rate from simulated single 
minimum bias events are available. 

Figure 25 shows the expected average number of jets in these events as function of the pi threshold 
applied in the final jet selection, for various calorimeter signal definitions and the most commonly used 
jet finder configurations. The kj jet multiplicity for narrow (R = 0.4) jets is rather independent of the 
calorimeter signal choice, while for wider jets (R = 0.6) the cluster jets have a lower average multiplicity, 
i.e. are less problematic for a central jet veto. For seeded cone jets, the wider (R CO ne = 0.7) tower and 
cluster jets have very similar multiplicities, while here the narrow tower jets (R con e = 0.4) have a lower 
multiplicity than the cluster jets. 

Predictions for the probability P (A^ et > l,pj > 20 GeV, Tj range ) of reconstructing at least one jet per 
single minimum bias event with pj > 20 GeV within a pseudorapidity range of | tj | < T] rang e is shown in 
Fig. 26. Narrow jets from towers and clusters behave rather similarly for both the seeded cone and the kj 
algorithm. Wider jets are more often found in calorimeter tower jets than in topological cluster jets. In 
general the kj algorithm is less likely to reconstruct wider jets with R = 0.6 than the seeded cone is with 
^cone = 0.7. Here P (Nj et > l,p T > 20 GeV, T] ran g e ) has been calculated including the occasional jet from 
the minimum bias (soft or semi-hard) interaction as well as "true" fake jets from calorimeter signal fluc- 
tuations due to noise. Both lead to efficiency losses in an analysis selecting final states with no hadronic 
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Figure 25: Estimates for the average number of jets per minimum bias event, as function of the pj thresh- 
old applied to the final jet. Shown are results for wide and narrow cluster and tower jets in minimum bias 
simulations. 
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Figure 26: Prediction from full simulations for the probability P (/Vj et > 1,/?t > 20 GeV,T] range ) to re- 
construct at least one jet with pj > 20 GeV within a pseudorapidity range \r] \ < i] mnge considered for jet 
reconstruction in single minimum bias events, as function of rj ran ge- The jet veto efficiency in the region 
I V I < grange can be estimated from these curves as 1 - P (Nj et > l,pj>20 GeV, rj mnge ) . 
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activity in a given region |tj| < T7 range . Note that the expectations for the jet veto efficiency discussed 
here are estimated by 1 — P (/Vj et > l,pj > 20 GeV, Tj range ) under the assumption that the minimum bias 
events generate a similar underlying activity as can be expected in signal events. An additional loss of 
signal events is associated with the multiple soft interactions from pile-up. One expects ~ 25 minimum 
bias interactions per bunch crossing at the design luminosity of Jz? = 10 34 cm _1 s _1 . Experimental effects 
introduced by the calorimeter readout can increase this number by another factor of 2, giving jet veto effi- 
ciencies in the order of 95(50)% for narrow jets with pj > 20 GeV at Jzf = 10 33 cm~ 2 s _1 (10 34 cm _2 s _1 ). 

Reconstructing jets in minimum bias events can provide important information for the modeling of 
soft physics and on the underlying event for hard scattering processes if correlation effects are neglected. 
For example, the pseudorapidity distributions of kj jets with pj > 10 GeV are shown in Fig. 27. Most 
jets above threshold are centrally produced with \t] \ < 1 and reconstructed with some limited efficiency, 
both for towers and clusters. The fake rate for jet reconstruction at this pj threshold can be considerable 
(^(30%) in the same central region), but decreases quickly with increasing jet pj, thus likely restricting 
the accessible experimental phase space to test jet production in soft or semi-hard proton collisions with 
precision on production rates or cross section. 




Figure 27: The pseudorapidity distribution of jets with pi > 10 GeV in single minimum bias events in 
ATLAS (top plots for kj with R = 0.6 and R = 0.4, respectively). The distributions for the calorimeter 
jets exclude fake jets, i.e. each reconstructed calorimeter jet is matched with a truth particle jet. The 
efficiency for jet reconstruction in these events is shown in the bottom plots. 



5 Conclusions and outlook 

ATLAS supports a highly configurable and flexible jet reconstruction framework, which can easily be 
adapted to accommodate new jet algorithms or signal definitions from the detectors. At the same time, 
a default set of configurations for jet finding strategies is provided: a seeded fixed cone algorithm with 
split and merge, and the kj algorithm, both with two different parameters controlling the size of the 
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reconstructed jet. For the seeded cone, cone radii of R con e = 0.7 and/? cone = 0.4 are available. Similarly, 
the default choices for narrow and wide kj jets are distance parameters R = 0.4 and R = 0.6, respectively. 
For the detector jets, which are reconstructed from projective calorimeter towers or topological cell 
clusters, a calorimeter cell signal weighting based calibration is applied, with calibration weights derived 
from simulations with an ideal ATLAS detector geometry model, together with an additional jet energy 
scale correction parametrized in pseudorapidity and transverse momentum. 

Performance predictions for a slightly misaligned and distorted ATLAS detector system have been 
extracted mostly from PYTHIA generated QCD dijet processes simulated through the detector with 
GEANT4. The emphasis in these studies was to estimate possible deviations from signal linearity and 
uniformity, the deterioration of the relative jet energy resolution, the effect on jet reconstruction efficien- 
cies and the fake jet reconstruction rate for the first collisions, when the inactive material distributions 
and the alignment of detector components may not be well known. The corresponding pre-collision 
physics data estimates have been derived from Monte Carlo for the default jet configurations for the 
two considered calorimeter signal definitions (tower and cluster). A preliminary conclusion from these 
studies is that the signal linearity can likely be controlled at the level of 2 — 3%, depending on the de- 
tector region. The effect of the combination of a particular calorimeter signal choice with a given jet 
reconstruction configuration has been found to be consistent with expectations, i.e. the noise suppression 
intrinsic to clusters can be observed. In general there are strong indications that the effect of uncertainties 
in the knowledge of the detector geometry only leads to a modest degradation of the jet reconstruction 
performance. 

Special challenges to jet reconstruction in the forward direction or jet vetoes in the central region, 
have been evaluated and found to be of acceptable performance for physics analyses like vector-boson 
fusion produced Higgs boson events. Pile-up at high LHC luminosities leads to considerable degradation 
especially of the jet veto efficiency. Due the large uncertainties in the modeling of soft physics under- 
lying and overlapping the pp collisions, a precise quantitative evaluation of this degradation needs to 
be postponed until experimental minimum bias data and other event topologies become available with 
sufficient data quality and statistics. 

In preparation for the experimental data ATLAS is now focusing on "data only" calibration ap- 
proaches using tools like pj balance in prompt photon production and in QCD dijets. Additional efforts 
are concentrating on the subtraction of the pile-up contribution to jets, which can be estimated measuring 
the (transverse) energy scattered into a given area in pseudorapidity and azimuth in minimum bias events. 
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Abstract 

The jet energy scale is proven to be an important issue for many different 
physics analyses. It is the largest systematic uncertainty for the top mass mea- 
surement at Tevatron, it is one of the largest uncertainties in the inclusive jet 
cross section measurement, whose understanding is the first step towards new 
physics searches. Finally, it is an important ingredient of many standard model 
analyses. This note discusses different strategies to correct the jet energy for 
detector level effects. 

1 Introduction 

The jet calibration process can be seen as a two-step procedure. In the first step, the jet reconstructed 
from the calorimeters is corrected to remove all the effects due to the detector itself (nonlinearities due 
to the non-compensating ATLAS calorimeters, the presence of dead material, cracks in the calorimeters 
and tracks bending in/out the jet cone due to the solenoidal magnetic field). This calibrates the jet to the 
particle level, i.e. to the corresponding jet obtained running the same reconstruction algorithm directly 
on the final state Monte Carlo particles. The second step, is the correction of the jet energy back to the 
parton level, which will not be discussed in this section. 

There are currently several calibration approaches studied in the ATLAS collaboration based on the 
calorimeter response on the cell level or layer level and either in the context of jets or of clusters. 

The first part of the section describes a possible approach for the calibration to the particle jet. The 
energy of the jet is corrected using cell weights. The weights are computed by minimizing the resolution 
of the energy measurement with respect to the particle jet. The performance of the calibration in terms 
of jet linearity and resolution is assessed in a variety of events (QCD dijets, top-pairs and SUSY events). 
The different structure of these events (different color structure, different underlying event) will manifest 
itself as a variation in the quality of the calibration. This method has been the most widely used so far in 
the ATLAS collaboration. 

Other methods have also been studied. Here we discuss one alternative global calibration approach, 
which makes use of the longitudinal development of the shower to correct for calorimeter non-compen- 
sation. The jet energy is corrected weighting its energy deposits in the longitudinal calorimeter samples. 
Although the resolution improvement is smaller with respect to other methods, this method is simple and 
less demanding in terms of agreement between the detector simulation predictions and real data. 

The second part of this section describes the concept of local hadronic calibration. First clusters are 
reconstructed in the calorimeters with an algorithm to optimize noise suppression and particle separa- 
tion. Shower shapes and other cluster characteristics are then used to classify the clusters as hadronic 
or electromagnetic in nature. The hadronic clusters are subject to a cell weighting procedure to com- 
pensate for the different response to hadrons compared to electrons and for energy deposits outside the 
calorimeter. In contrast to the cell weights mentioned above no minimization is performed and the actual 
visible and invisible energy deposits in active and inactive calorimeter material as predicted by Monte 
Carlo simulations are used to derive the weights. One of the advantages of this method is that the jet 
reconstruction runs over objects which have the proper scale (in contrast to the global approach, where 
the scale corrections are applied after the jet is reconstructed from uncorrected objects). 

The third part of the note describes refinements of the jet calibration that can be done using the 
tracker information: the residual dependence of the jet scale on the jet charged fraction can be accounted 
for improving the jet resolution. An algorithm to correct the b-jet scale in case of semileptonic decays 
will also be discussed. 
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2 The calibration to the truth jet 

According to the perturbative QCD, jets are the manifestations of scattered partons (quarks and gluons). 
After undergoing fragmentation, a collimated collection of hadrons emerges and its energy is measured 
in the calorimeter system. In addition to this hard scattering, the final state also contains energy coming 
from multiple proton-proton (pile-up) interactions and the underlying event. 

The typical output of an event generator will provide theoretical predictions about the particle con- 
tent and spectra at this stage, the so called particle level. Jets resulting from the application of a jet 
reconstruction algorithm at the particle level are thus relevant as "truth", since they represent the final 
state jets that ideally must be reconstructed starting from the detector level. In the following we refer to 
them using the expression "truth jets". 

Since jet fragmentation functions are independent of jet energy, the fraction of the total jet energy 
carried by the different particle types in a jet is basically independent of energy. Figure 1 shows the 
relative contribution of the different particle types to the jet energy as a function of the jet Ej. About 




Figure 1 : Left: fractional energy carried by different particle types as a function of the jet energy. Right: 
fraction of true energy deposited in the different calorimeter samplings for a jet in the central (\t] \ < 0.7) 
calorimeter region as a function of its true energy. 

40% of the total energy is carried by charged pions, 25% is carried by photons (mainly coming from the 
7T° decay), another 20% is accounted for by kaons, nearly 10% by protons and neutrons. Therefore, 25% 
of the energy deposits in the calorimeters come directly from pure electromagnetic showers. The right 
plot of Fig. 1 shows the average fractional energy deposit in the different calorimeter samplings with 
respect to the true jet energy in the central calorimeter regions (|tj| < 0.7). Most of the energy (about 
2/3 of the reconstructed energy) is measured by the electromagnetic calorimeter. The total reconstructed 
energy differs significantly from the true jet energy. This is because of a number of detector effects: 

• if the calorimeters are non-compensating (as in ATLAS), their response to hadrons is lower than 
that to electrons and photons, and is non-linear with the hadron energy. 

• part of the energy is lost because of dead material, cracks and gaps in the calorimeters, and is also 
non-linear with hadron energy. 

• The solenoidal magnetic field will bend low energy charged particles outside the jet cone. 

The reconstructed jet energy must be corrected for these effects to obtain the best estimator for true 
jet energy. 

In the following we will discuss two possible strategies. The first one (referred to as global calibra- 
tion) aims to provide calibration coefficients at jet level; the second one, the local calibration, provides 
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calibration constants at the jet constituent level. The performance of both the approaches will be dis- 
cussed in detail. Both methods use simulated events to obtain the calibration coefficients. 

3 An energy density based cell calibration 

The shower produced by a jet impinging on the calorimeters is composed of an electromagnetic and 
a hadronic component. The electromagnetic component is characterized by a compact, highly dense, 
energy deposit, while the hadronic one is broader and less dense. This fact can be used to correct the 
energy measurement to recover for the non-linear calorimeter response to the hadrons. 

After jet reconstruction using calorimeter cells calibrated at the electromagnetic scale, the total en- 
ergy of a jet is reconstructed by summing the energies of its constituent cells multiplied by a weight 
which depends on the energy density of the cell itself. We thus define the EM scale jet energy as: 

E em = £ Ei (1) 

where Ej is the energy in the cell / for the considered jet. We then define a jet weighted 4- vector 

E= £ WiEi P= £ WiP (2) 

(=cells (=cells 

where Ei, Pi are the i-th cell energy and momentum (whose direction is defined by the position in the 
calorimeter and whose magnitude is equal to Ei), and w,- are correction factors that need to be determined. 
They depend on the cell energy density Z?,-/Vj, where V; is the volume of the /-th cell. 
In order to reduce the number of weights to be computed, the following steps are done: 

• The energy density distributions of the cells are divided into different bins with width increasing 
logarithmically with the cell energy density. 

• The calorimeters are subdivided into several regions k. The longitudinal segmentation is partially 
exploited. Broad pseudorapidity bins are also defined. Table 1 shows the defined regions. 

• The weight in the k-th calorimeter region, in the j-th energy density bin is defined to be: 

wf J) ^t^log-OE/V); (3) 

m=0 

where N p (the number of parameters used in the fit) is a number which depends on the region k consid- 
ered. The value of log(E/V)j is defined at the lower edge of the j-th bin. 

With this procedure, the number of independent parameters to be determined is significantly reduced 
(see Table 1). The pre-sampler and the strip-layer of the EM calorimeter have a single weight, constant 
with respect to the density of the energy deposits. The last three rows of the table refer to three energy 
terms which are also corrected with a single multiplicative factor: they are the cryostat term, the scin- 
tillator term and the gap term. The cryostat term is computed as the geometrical average of the energy 
deposited in the back of the electromagnetic barrel and the first layer of the TileCal barrel. It was in fact 
found in the past [1] that this gives a good estimate of the energy loss in the cryostat. The scintillator and 
gap terms correspond to the energy deposited in the scintillation counters in the region between the Tile 
barrel and extended barrel [2]. The weights applied to these terms are meant to recover for the presence 
of a large amount of dead material. 

The parameters have been determined considering QCD dijet events, simulated with PHYTHIA6.4 [3] 
with the ATLAS settings [4], and the detector simulated using GEANT4. The events have been generated 
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Table 1: Definition of regions defined for the minimization that determines the cell weights. The third 
column shows the number of parameters used in the minimization. 



Region name 


Longitudinal Sample 


Number of parameters Np 


wembO 


Barrel pre-sampler 


1 


wembl 


Barrel EM strips 


1 


wemeO 


End-Cap pre-sampler 


1 


wemel 


End-Cap EM strips 


1 


embO 


Barrel middle and back sample, |t] < 0.8 


4 


embl 


Barrel middle and back sample, |t] > 0.8 


4 


emeO 


Endcap middle and back sample, tj < 2.5 


4 


emel 


Endcap middle and back sample, tj > 2.5 


4 


tilO 


Barrel 


4 


till 


Extended Barrel 


4 


hecO 


Hadronic End-Cap, \r\\ < 2.5 


4 


heel 


Hadronic End-Cap, |tj| > 2.5 


4 


fern 


FCal first layer 


3 


fhad 


FCal second and third layer 


3 


cryo 


Cryostat term 


1 


scint 


Scintillator term 


1 


gap 


Gap term 


1 



Total | 45 



Table 2: List of the QCD dijet events used to compute the calibration constants listed in the text. 



Sample Tag 


Pt cut 


Jl 


17 GeV < Pl < 35 GeV 


J2 


35 GeV < p T < 70 GeV 


J3 


70 GeV < p T < 140 GeV 


J4 


140 GeV < p T < 280 GeV 


J5 


280 GeV < pt < 560 GeV 


J6 


560 GeV<p T < 1120 GeV 


J7 


1 120 GeV <p T < 2240 GeV 


J8 


Pt > 2240 GeV 



in bins of the partonic pt, as illustrated in Table 2. Approximately 10k events have been used for each 
bin. The jets have been reconstructed using calorimeter towers as input. The jet reconstruction algorithm 
used is a seeded cone algorithm with a seed threshold of Et = 1 GeV, and a cone size R cone = 0.7. 
Jets with a reconstructed axis lying close to the gap region (1.3 < |tj| < 1.5), or to the crack region 
(3.0 < |tj| < 3.5), or in the very forward region (|tj| > 4.4), are excluded from the minimization. 

Reconstructed jets are associated to the nearest truth jet (in 0 — tj space), obtained, as discussed in 
Section 2, running the same reconstruction algorithm on the final state particles from the event generator. 

The following quantity is minimized using MINUIT: 



2 = I 



E (e)_ E ie 
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where the sum runs over the considered events e, is defined in equation (2) and E^uth is tne energy 
of the matched truth jet. 

It should be noted that this approach partially absorbs effects that are not purely calorimetric into the 
weights. In particular, the energy smearing introduced by the central solenoidal magnetic field, which 
bends low pj particles in and out of the jet cone, is not treated separately, but the effect is proven to be 
small for the cone R cone = 0.7 jets used for the minimization. 

In order to correct for residual non-linearities in the jet response, a further, reconstruction algorithm 
dependent, correction function is introduced. The final 4-vector of a jet is thus defined as 

E s = Ps (E T , 7])E p s = Ps {E T , T])P (5) 

where 8 indicates the dependence on the jet reconstruction algorithm and Ej and tj are the transverse 
energy and the pseudorapidity of the 4-vector p^ . The scale factor p§ (Ej , tj ) is obtained fitting the ratio 
£'t/£'t, truth in 44 bins of tj as a function of Ej with the following function: 

3 

/(£ T ) = 5>log ( £T (6) 

i=0 

and, for a given 8 and tj bin, 

P = 1//- (V) 
The scale factor corrects for the residual non-linearity introduced by the cracks and gaps in the 
calorimeter and for differences introduced by the use of different reconstruction algorithms, finally re- 
covering the truth jet scale. The size of this final correction is at the level of few percent (up to 5%) in 
the crack and gap calorimeter region, while it is of the order of 1-2% (depending on the jet algorithm) in 
the rest of the pj-rj phase space. 

Therefore, the complete set of calibration parameters for a given reconstruction algorithm includes 
the cell energy density dependent weights obtained with cone R cone = 0.7 jets, plus specific scale factors. 

3.1 Results on dijet events 

All the jet corrections computed as described in the previous section (scale factor included) have been 
applied to dijet events. The parameters that are considered in order to assess the quality of the calibration 
are the jet linearity, defined as the ratio between the reconstructed jet energy and the corresponding truth 
jet energy (as defined in Section 2) and the energy resolution. 

The matching between the reconstructed jets and the truth jets is done considering their separation in 
a tj — (j) plane, defined as 

R cone = VA^ + At? 2 (8) 

A truth jet is matched with a reconstructed jet if R con e < 0.2. 

Once the matching is done, the E — tj phase space of the truth jets is subdivided into bins. For each 
bin in energy and pseudorapidity, a histogram is filled with the ratio between the reconstructed energy and 
the truth energy E ISC /E trut h- The resulting histogram is first fitted with a gaussian in the whole histogram 
range. This provides two estimates (/x raw , Craw) for the mean value and the width of the distribution. The 
fit is then repeated in the range ;U raw ± 2a raw . This provides the final values (jj,, a) that are used in the 
summary plots. Two examples of such histograms are shown in Fig. 2 for two different pseudorapidity 
and energy bins. 

Figure 3 shows the dependence of (^rec/^truth) (linearity, in the following) on the truth jet energy 
£"truth for jets reconstructed from calorimeter towers. The plot on the left refers to jets reconstructed with 
a cone algorithm with radius of 0.7 while the one on the right is for kj algorithm with R = 0.6 [5]. 
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8.2 0.4 0.6 0.8 1 1.2* 1.4 1.6 1.8 8.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 

E rSC /E ,m ' h E rec /E l,uth 

Figure 2: Two example histograms of E mc /E lm{h . On the left, the histogram is done for 
88 GeV < £ trath < 107 GeV and \r\\ < 0.5, on the right for 158 GeV < E mih < 191 GeV and 
1.0 < < 1.5. 




' ATLAS 




. K T R=0.6 0.00<T|<0.50 _ 




K T R=0.6 1 .50<T)<2.00 j 



















Figure 3: Dependence of the ratio E rec /E tmt ^ on £"truth for jets reconstructed with a cone algorithm with 
^cone = 0.7 and with a k T algorithm with R = 0.6. The black (white) dots refer to jet with |tj| < 0.5 
(1.5 < |?]| < 2). An ideal detector geometry has been used to simulate the events. 
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The results show that the linearity is recovered over a wide energy range, both in the central ( |t] | < 0.5) 
and in the intermediate (1.5 < \rf \ < 2) regions. For the cone algorithm, at low energy (E = 20 — 30GeV), 
the linearity differs by up to 5% from 1 in the central region. At low energy, there is a 5% residual non 
linearity, not fully recovered by the parametrization chosen for the scale factor. 

Concerning the intermediate pseudorapidity region, we can see a similar behavior around 100 GeV 
(note that in this region E ~ 100 GeV corresponds to Ej = Ej coshrj ~ 35 GeV). 

The linearity plot for the kj algorithm shows a more pronounced deviation from 1 at low energy 
((£ rec /£truth) = 5% at 50GeV, 8% at 30GeV). The linearity is fully recovered above ~ lOOGeV in the 
central region, ~ 300 GeV in the intermediate region. 

The uniformity of the response over pseudorapidity is also satisfactory. Figure 4 shows the depen- 
dence of the ratio Ef c /E^" th on the pseudorapidity of the matched truth jet for three different transverse 
energy bins. Again, the left plot refers to cone 0.7 jets, while the right one refers to kj jets with R = 0.6. 
We can observe that for the lowest considered transverse energy bin, the ratio increases with the pseudo- 
rapidity. This is a consequence of the fact that energy increases with tj at fixed Ej and that the linearity 
improves with increasing energy. 




Figure 4: Dependence of the ratio Ej°/E^ ath on the pseudorapidity for the cone algorithm with 
^cone = 0.7 (on the left) and for the kj algorithm with R = 0.6 (on the right). An ideal detector geometry 
has been used to simulate the events. 

We can also observe (in particular for the kj algorithm) the remnants of the calorimeter structure, 
which is not completely corrected by the procedure. There is a first, small dip at |tj| ~ 1.5, in corre- 
spondence with the gap between the TileCal barrel and extended barrel [2]. A second dip is observed in 
correspondence with the calorimeter crack between the End-Cap and the forward calorimeters. 

Even if the effect is smaller when higher Ej bins are considered, it is still present in the crack region. 
Jets with Ej ~ 400 GeV still show a slight tj dependence in their response. As a last indicator of the 
quality of the correction factors, we consider the energy resolution o(E mc )/E rec . The dependence of the 
energy resolution on the jet energy is shown in Fig. 5 for the cone (left) and k T (right) algorithms in two 
pseudorapidity bins. The fit to the data is done considering three terms contributing independently to the 
resolution: 

= ©^e^. (9) 



E y/E (GeV) £' 
The sampling term (a) is due to the statistical, poissonian, fluctuations in the energy deposits in the 
calorimeters. The constant term (b) reflects the effect of the calorimeter non-compensation and all the 
detector non- uniformities involved in the energy measurement. The noise term (c) is introduced to de- 
scribe the noise contribution to the energy measurement. Although the physics origin of the different 
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terms is quite clear, it should be kept in mind that many different ways of combining them have been 
used in literature. In particular, the three parameters are correlated, and their values depend on the par- 
ticular functional form used to parameterize the resolution. 





Figure 5: Energy resolution as a function of the jet energy for the cone algorithm with R CO ne = 0.7 (on 
the left) and for the kj algorithm with R = 0.6 (on the right). The black (white) dots refer to jets with 
| T] | < 0.5 (1 .5 < | T] | < 2). The smooth curves correspond to a fit done using the parametrization of Eq. 9. 
An ideal detector geometry has been used to simulate the events. 

The fit is performed between 30 GeV and 1 TeV. In the central region, the resolution of the kj 
algorithm is significantly worse than that of the cone algorithm at low energy, while it is similar in the 
high energy region. The dependence of the resolution on the pseudorapidity is shown in Fig. 6. Again, 
the effect of the gap and crack regions are visible in particular for the lowest Z? T bin, where a clear 
worsening of the resolution is present in particular around tj =3.2. 



TCAR=0.7 39<E T < 48 Gel 
ieAR=0.7 107<E T < 130G6V J 
ieAR=0.7 336<E T < 405 GeV J 



K T R=0.6 39<E T < 48 GeV 
K T R=0.6 107<E T < 130 GeV J 
K T R=0.6 336<E T < 405 GeV J 



Figure 6: Energy resolution as a function of the pseudorapidity for the cone algorithm with R CO ne = 0.7 
(on the left) and for the kj algorithm with R = 0.6. An ideal detector geometry has been used to simulate 
the events. 

The analysis has been repeated also for jets obtained using the uncalibrated topological clusters (see 
Section 5) as input for the jet reconstruction algorithms, obtaining very similar results for the linearity 
and the uniformity. The values of the parameters obtained with the fit of the function expressed in 
equation (9) to the simulation results are given in Table 3. 

First, at fixed pseudorapidity, the sampling term is almost constant for the different reconstruction 
algorithms and for the different inputs. The sampling term in the central calorimeter region is about 65%, 
in rough agreement with what has been found previously with similar calibration approaches [6]. The 
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constant term is also independent of the reconstruction algorithm and input type. The noise term on the 
other hand shows a slight variation between jets reconstructed from towers or from topological clusters, 
and is a significant contribution to the jet energy resolution for energies below 100 GeV. In particular, the 
noise term is lower if topoclusters are used. A significant difference is also observed between the cone 
and the kj algorithm, the latter showing a larger noise value. 



Table 3: Parameters of the resolution parametrization described in the text as obtained with the fit of the 
res olution curves of Fig.9. 



Reconstruction Algorithm 




0 < 77 < 0.5 




1.5 <T] <2.5 




a(%) 


b (%) c (GeV) 


a(%) 


b(%) 


c (GeV) 


Cone tf co „e = 0.7 Tower 


64±4 


2.6±0.1 4.9±0.5 


103 ±10 


2.6±0.8 


8±1 


kjR = 0.6 Tower 


68±5 


2.5 ±0.2 6.3 ±0.5 


110±1 


1±1 


12.2±2.5 


Cone /?cone = 0.7 Topo 


63±4 


2.7±0.1 4.2±0.5 


107 ±8 


1±1 


6.5±1.5 


k T R = 0.6 Topo 


64±5 


2.7 ±0.2 5.4 ±0.5 


112±4 


1±1 


10.0±1.5 



3.2 Results on tt and SUSY events 

We now verify the quality of the jet calibration on two radically different physics samples. We consider 
first a tt sample, generated with MC@NLO [7]. The event generator used to produce the final state par- 
ticles is HERWIG interfaced with JIMMY [8] , whose fragmentation model is different from the PHYTHIA 
one considered so far. Moreover, most of the jets in tt events are produced by the fragmentation of 
quarks, while the QCD dijet events contain mostly gluon jets for moderate Ej. The simulation layout 
used for the tt events is the same distorted layout discussed in [9], i.e. a geometry with increased dead 
material in particular in the gap region. 

We consider a cone algorithm with R CO m = 0.4 since it is the one most widely used for analyses 
concerning the top quark in ATLAS. The performance on QCD dijet events has been checked and the 
results are very similar to those discussed in Section 3.1 and in [9] for the cone with R CO ne = 0.7 




E Tm,h [GeV] if" 



Figure 7: Linearity as a function of energy for three pseudorapidity regions (on the left) and of the 
pseudorapidity for three transverse energy bins (on the right) for cone 0.4 tower jets in tt events. 

The linearity as a function of the energy and the pseudorapidity is shown in Fig. 7. Despite the 
fundamentally different event structure, an overall acceptable linearity is found also in top events. 

We finally consider SUSY events. The sample has been generated with HERWIG/JIMMY and sim- 
ulated with a distorted geometry. Such events are characterized by a high multiplicity of quark jets 
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I 




Figure 8: Linearity as a function of energy for three pseudorapidity regions (on the left) and of the 
pseudorapidity for three transverse energy bins (on the right) for cone 0.4 tower jets in SUSY events. 

(the SUSY point chosen is discussed in [10]). They are thus useful to asses the performance of the jet 
calibration in busy events. 

The linearity (Fig. 8) is overall good. In the central region, a deviation from 1 of maximum 4% is 
observed at low jet energy. Apart from the expected dip at tj = 1.5, we observe, also, a good uniformity 
of the linearity as a function of the pseudorapidity. At large pseudorapidity (\r]\ > 3.5) , the linearity is 
off by 5-6%. 

3.3 A check of the systematics with real data 

We applied the method discussed so far to single pions from the ATLAS combined test beam of the year 
2004 [11]. The weights have been computed on events fully simulated with GEANT4 using the test beam 
geometry. 

Positively charged beams of different energy impinging with an incident angle of 20 degrees on 
the calorimeters surface have been considered. The beams are composed of pions, protons, positrons 
and muons. Signals from scintillators present upstream and downstream of the calorimeters are used as 
vetoes to reject early showering particles and muons, respectively. To reject the electrons we required an 
energy deposit in the first two layers of the electromagnetic calorimeter of less than a certain threshold 
(75% - 90% of the beam energy). A fraction of protons equal to that expected for the chosen beam line 
has been added to the simulated sample. 

The energy distributions obtained for each energy point at the electromagnetic scale and after the 
calibration are fitted with a Gaussian function. The Gaussian mean values ((E)) are used to evaluate the 
calibration procedure. 

In Fig. 9 (left) the ratios (E) /£"beam are shown as a function of the beam energy. The black dots refer 
to the Monte Carlo at the electromagnetic scale. The black squares refer to the Monte Carlo after the 
weighting. The procedure restores the linearity at the 2% level for the simulation. 

In the same figure the results on the real data are also shown (gray markers). In this case the linearity 
is also restored to within a few percent. 

To evaluate these differences, we define the ratio R = (fidata/^Mc)- Fig. 9 (right) shows the double 
ratio /?had/^em of the points of Fig. 9 (left). This plot is showing the effect of the calibration procedure 
on the agreement between data and simulation. The agreement is worse after the application of the 
calibration procedure by maximum 4%. 
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Figure 9: On the left: (E) /E beam for simulated (black points) and real (gray points) data at the EM (dots) 
and calibrated (squares) scales. On the right: Double ratio /?had/^em- 



3.4 Summary 

The tests discussed in this Section are meant to demonstrate the robustness of the jet corrections com- 
puted as discussed at the beginning of the present Section. Summarizing, we can say that the discussed 
strategy is able to recover the linearity of the jet energy measurement over a wide energy range, invoking 
a relatively low number of parameters (of the order of 50) constrained by a fit on QCD dijet events. 
The fact that the jet corrections can be applied with success to events generated with different shower- 
ing models, with a very different quark-gluon jet ratio and different topologies, gives confidence in the 
correction strategy as a method to remove the detector effects. 
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4 Alternative global calibration methods 

Although the discussed calibration scheme is the most widely used at present in the ATLAS collabora- 
tion, it is not the only one that has been investigated. 

4.1 Longitudinal shower development 

On average, the early part of a hadron shower is dominated by electromagnetic energy deposited by 
neutral pions and the ratio of visible to invisible energy is large. In the deeper part of the shower this ratio 
becomes smaller and more of the hadron shower goes undetected. This can be seen in the a quantitative 
study that was carried out by the ATLAS TileCal collaboration in the 1996 test-beam [12]. It shows that 
in the first interaction length of the calorimeter approximately 70% of the energy of the hadron shower 
is deposited as visible electromagnetic energy. The fraction falls off with depth in the calorimeter and 
at 6A only 25% the energy of the hadron shower is deposited as electromagnetic energy. Therefore a 
longitudinal weighting of energy deposition as a function of depth can provide improved resolution and 
linearity [13]. Figure 1 (right) shows the fraction of energy deposited by a hadronic jet at different depths 
in the calorimeter. The layers used in this weighting scheme are defined below based on the properties 
and geometry of different calorimeters. 

The above motivation for longitudinal weighting is based on the average shower behavior. Hadron 
showers fluctuate event-by-event and in a jet, the incoming particle type and energy also varies depend- 
ing on how the jet fragmentation proceeds. Figure 1 (left) shows the average energy carried by different 
particle types in a jet. To better account for these differences in shower fluctuation and electromagnetic 
content of a hadronic jet, the longitudinal weighting is performed in bins of the fraction of energy de- 
posited in the LAr calorimeter. Furthermore, the Atlas calorimeter has a significant variation in geometry 
as a function of pseudo-rapidity and we therefore fit the parameters in independent bins of jet tj. 

As shown below, a longitudinal weighting based on the above properties of hadron shower devel- 
opment and jets shows a significant improvement in jet energy resolution and linearity with respect to 
uncorrected jet energy. 

Longitudinal weighting method 

In general the choice of energy layers to be weighted are motivated from the following. In the barrel 
LAr calorimeter, the first three depths (presampler, EMB1, EMB2) provide a total of 24Xq. This gives 
99% containment for photons with energies up to 140 GeV [14]. For hadronic jets, the energy in these 
layers is expected to be predominately from neutral pions. The presampler and EMB1 are defined as a 
single layer for longitudinal weighting purposes. The weight for this layer is expected to be sensitive to 
energy losses in the inner detector. EMB2, which has 18Xo is weighted alone. The EMB3 is thin and has 
small energy deposit. Therefore the energy in this layer is added to the energy in the first layer of the Tile 
calorimeter. This allows for simulations to provide an average correction for energy loss in the cryostat. 
Depending on the jet pseudorapidity, a different number of calorimeter layers are used in the fitting. Up 
to a pseudorapidity of 1.5 the jet energy is fitted in four layers in the calorimeter defined as follows: 



For a jet tj between 1.5 and 3.2 the jet is fitted for two layers in the calorimeter defined as follows. 



E 0 
Ex 

E 2 
E 3 



= ^presampler + ^EMB 1 

= £EMB2 

= £EMB3 + -ETilel 

= £'Tile2 + Enfe?, + .EhCAL • 



Eq — ^presampler + ^LArEMl +£"LArEM2 
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E\ — £LArEM3 + ^Tilel + £Tile2 + £Tile3 + ^HCAL + ^FCAL 

Beyond 77 of 3.2 up to 4.4 the jet is not divided into calorimeter layer segments and the full jet energy is 
fitted. 

The general strategy of deriving weights for different layers is to minimize the function: 

5 = I [ W -K £C ) 2 + h " C') ] ( 10) 

with 

c c = !>£,- (id 

where the w, are weights assigned to the elements £, of a calorimeter layer in a jet. E^ ef , the true energy 
to which we want to calibrate, is defined as the energy of all the MC generated particles contained 
in the cone of the reconstructed jet. The Lagrange multiplier A constrains the minimization such that 
(£' Ref — E rec ) = 0. The minimization is performed separately for jets classified in bins of eta (44 eta bins 
of size 0.1), three fractional energies (j em ) deposited in the EM calorimeter and two energy bins. The 
fractional energy / em is defined as 

fern = (-Epresampler + ^LArEM 1 + ^LArEM2 ) / E TSC . (12) 

Three bins in f em are chosen such that each bin has roughly the same statistics. At high energies 
the three bins are (small: 0.0-0.65), (mid: 0.65-0.75) and (large: 0.75-1.0). At low energies they are 





Figure 10: The longitudinal weights as a function of jet energy for four layers in three f em bins and for 
central jet X] . 
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(0.0-0.75), (0.75-0.85) and (0.85-1.0). The bin size varies with energy since more energetic jets deposit 
more energy in the deeper part of the calorimeter. Bins in f em are used only for jets with tj < 3.2. 
For each layer, weights are chosen to have the following dependence on the true jet energy: 

w = a + blog{E/E C ut), (13) 

where a and b are the parameters to be determined by minimization. When applying these weights to 
jets, the uncorrected jet energy is used instead of the true jet energy. The result is iterated until a stable 
value of the corrected jet energy is obtained. 

In the above equation, £cut is an arbitrary energy chosen according to the following criteria. Since the 
energy range covered is quite large (25 GeV - 2 TeV) the fit is performed in two independent energy bins. 
For | tj | < 1 .2, E C ut = 300 GeV, for 1 .2 < | tj | < 3.2, S^t = 450 and for | tj | > 3.2 it is set to 35 • cosh(Tj ) 
GeV. By choosing £cut to be the bin boundary one forces the weight to be equal to the value of a at the 
boundary. The energy range below £cut is fitted with a fixed value of a. This ensures a smooth behavior 
of weights across Ec u t and reduces the lower energy fit from a two parameter to a one parameter fit. The 
same smoothness in the behavior of weights across tj and f em is not strictly imposed in the present fit, 
although the weights do not have strong variation across the bin boundaries. 

The fitting procedure was applied to a fully simulated and reconstructed QCD dijet sample. To 
suppress noise, topological clusters were used. The jet algorithm (cone R con s = 0.7) is run on calorimeter 
towers which contain only cells which are included in the reconstructed topological clusters. Half of the 
events in the sample were used to determine the layer weights. These weights were then applied to the 
other half of the events to determine the effect of the weights on jet energy linearity and resolution. 

Figure 10 shows the behavior of the weights in bins of f em . A common feature in all the weight 
distributions is a small variation with respect to the jet energy, especially for high energies. This ensures 
insensitivity to the use of the true jet energy in equation 13. Layer 2, which measures the bulk of the 
jet energy has a weight close to 1 when / em is large i.e when the jet is predominantly electromagnetic in 
nature. When f em is large i.e when the jet is predominantly hadronic, the layer 2 weights are around 1.4. 
Layer 1 acquires a generally higher weight due to losses in the inner detector, even though it is within the 
early part of the jets. Layer 3 and 4 get weights larger than 1 corresponding to jets being predominantly 
hadronic in these layers. Layer 3 gets larger weights than layer 2 since it also corrects for energy lost in 
the cryostat. 

Figure 1 1 shows the jet energy scale linearity as a function of jet energy (left). The corrected jet 
energy scale is linear to about 2% with the largest non-linearity coming from low energies where the 
uncorrected non-linearity is approximately 30%. Figure 1 1 (right) shows the corresponding linearity as a 
function of detector pseudo-rapidity for jets of 1000 GeV in energy. The typical non-uniformity is about 
1%, increasing to about 2% in the region of tj ~ 3.0. Jet energy resolutions as a function of the jet energy 
scale is shown in Fig. 12 for two different jet eta regions. At high energy the jet resolution approaches 
about 4%. 

In terms of jet energy linearity, the longitudinal weighting and Hl-style weighting scheme (Section 
3) have comparable performance, although the Hl-style weighting scheme shows a slightly better reso- 
lution. This is expected since Hl-style weighting uses local cell energy density to discriminate between 
EM and non-EM like energy deposits. In contrast, longitudinal weighting is less sensitive to local energy 
fluctuations, which may be an advantage in the early data taking period, when the simulation of energy 
deposition in the calorimeter may not accurately reproduce the data. 

5 Local hadron calibration 

In contrast to the global calibration method just described, where first jets are made from towers or 
clusters on the electromagnetic scale and the calibration is applied after jet-making on cell or sampling 
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Figure 11: Jet energy linearity as a function of jet energy (left), and as a function of jet pseudorapidity 
(right). The points are for jets reconstructed at the electromagnetic scale (EM), for the global weighting 
scheme described here (Samp) and for the Hl-style calibration described in the previous Section. The 
jets have a cone radius of R CO ne = 0.7. 
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Figure 12: Jet energy resolution for jets with a cone radius of 0.7 for two regions in pseudorapidity. 
The three sets of point show the resolution at the detector (EM) scale, after Hl-style and longitudinal 
weighting. 
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level, the local hadron calibrated jets are made from clusters which are already calibrated to the hadronic 
scale. 

5.1 Topological clusters 

The cluster algorithm used is described in detail in Ref. [15]. Clusters grow dynamically around seed 
cells based on noise thresholds and are re-grouped in a second splitter step around local maxima. 
The aim of the clustering step before the actual jet making is two-fold: 

1 . To suppress noise from electronics and pile-up by reducing the number of cells included in the jets 
via noise-driven clustering thresholds. 

2. To improve the correspondence between clusters and particles. Due to the dynamic nature of the 
cluster growing, individual clusters correspond better to stable particles than towers or cells and 
the jet constituents can serve to further study the substructure of jets. 

To illustrate the effect of noise reduction by using topological clusters as input to jets the amount 
of noise at the electromagnetic scale and the number of cells per jet for cone jets with R CO ne = 0.7 is 
compared in Fig. 13 for jets from dijet simulations with towers as input and with topological clusters as 
input. The noise reduction is a direct consequence of selecting fewer cells with topological clusters. The 
effect is largest for low energetic jets since the size and number of signal clusters becomes small. The 
number of cells per jet for tower jets does not depend on the energy since no threshold for the towers is 
applied. Subsequently the noise changes only if a cell included in a tower jet switches to a lower gain. 
For the displayed energies this effect is visible in the forward region only. For jets from topological 
clusters the noise increases with energy and at transverse energies of 150GeV it is typically a factor of 
2 lower than for a corresponding tower jet except for the very forward region where the signals are so 
dense that the topological clusters again include almost all cells and the noise level reaches that of the 
tower jets. 

Figure 14 shows the correspondence of clusters and stable truth particles in a dijet simulation. The 
sample shown is a PYTHIA QCD dijet sample with the transverse energy of the leading jet between 
140GeV and 280GeV. Roughly 1.6 truth particles correspond to each of the 65 ± 30 clusters for a cut at 
1 GeV in transverse energy, but the ratio does not depend on the cut. It is close to the expected ratio of 
~ 4/3 since about 1/2 of the stable particles in jets are photons from 7T°-decays, which usually merge to 
one cluster. 




Figure 13: Noise contents (left) and number of cells per jet (right) of Cone jets with R con s = 0.7 for 
different energies for towers as input (open symbols) and topological clusters as input (filled symbols). 
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1 cluster corresponds to 1.6 truth particles 




truth particle Ej> 1 GeV 



Figure 14: Number of topo clusters with E± > 1 GeV vs. number of stable truth particles with 
Ej_ > 1 GeV from a QCD dijet simulation. 

5.2 Cluster Calibration 

The local hadron calibration of topological clusters is described in detail in Ref. [16]. The calibration 
starts by classifying clusters as mainly electromagnetic, hadronic, or unknown depending on cluster 
shape variables, moments derived from the positive cell contents of the cluster and the cluster energy. 
The classification is based on predictions from GEANT4 [17, 18] simulations for charged and neutral 
pions. The expected phase space population in logarithmic bins of the cluster energy, cluster depth in the 
calorimeter, and average cell energy density and linear bins in |t]| from neutral and charged pions with 
a ratio of 1 : 2 is converted to a classification weight, reflecting the a-priori assumption that 2/3 of the 
pions should be charged. 

Roughly 90% of the energy of charged pions is classified as hadronic by this procedure for all en- 
ergies, while for neutral pions 90% of their energy is classified as electromagnetic on average beyond 
100 GeV and the performance drops with the logarithm of the pion energy to about 50% at 10 GeV. 
The ideal fraction of 100% is not reached for the charged pions as sometimes the shower is split into 
more than one cluster with one of them being predominantly electromagnetic in nature. At low energies 
neutral pion clusters occupy the same phase space as charged pion clusters and the a-priori precedence 
for charged pions makes the classification as electromagnetic less likely. This leads to the high fraction 
of neutral pion energy classified as hadronic at low energies which is still acceptable, since the weights 
applied here are close to 1. Clusters classified as hadronic receive cell weights derived from detailed 
GEANT4 simulations of charged pions with so-called calibration hits in active and inactive calorimeter 
materials, which contain the energy from ionization losses and also from invisible processes, such as 
nuclear excitation, and from escaping particles, such as neutrinos. Cells in individual calorimeter sam- 
plings are treated in 0.2-wide 1 17 1 -bins. The weights are binned logarithmically in cluster energy and cell 
energy density. A flat distribution in the logarithm of the particle energy was used to generate the single 
pion events. 

Out-of-cluster (OOC) corrections are applied to correct for energy deposits inside the calorimeter but 
outside calorimeter clusters due to the noise thresholds applied during cluster making. These corrections 
depend on | r\ | , cluster energy and the cluster depth in the calorimeter. 

Dead material (DM) corrections are applied to compensate for energy deposits in materials outside 
of the calorimeters. For deposits in upstream material like the inner wall of the cryostat the presampler 
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signals are found to be highly correlated with the lost energy and the corrections are derived from the 
sum of calibration hit energies in the upstream regions and the presampler signal. 

The correction for energy deposited in the outer cryostat wall between the electromagnetic and 
hadronic barrel calorimeters is based on the geometrical mean of the energies in the samplings just 
before and just beyond the cryostat wall. Corrections for other energy deposits without clear correlations 
to cluster observables are obtained from lookup tables binned in cluster energy, 1 77 1 , and shower depth. 

5.3 Performance for jets 

The aim in this section is to evaluate the degree of completeness of the local hadron calibration when 
applied to jets. The performance of the local hadronic calibration scheme was evaluated using the dijet 
samples listed in Table 2 and two methods: by comparison to particle jets as described above in section 
3.1, and by comparison to the calibration hits in the GEANT4 record. Since no truth matching occurs 
in the derivation of the calibration constants genuine jet-level effects are expected to be visible once the 
reconstructed jet is compared to the matching jet made of stable truth particles. The main sources of 
remaining energy corrections are: 

Misclassification Hadronic energy deposits which are treated as electromagnetic lead to a lower energy 
response, while electromagnetic energy deposits wrongly treated as hadronic lead to a higher en- 
ergy response. The effect of energy underestimation dominates and is roughly 3% for ~ 150GeV. 

Lost Particles Low energetic particles might be bent outside the acceptance cone of the reconstructed 
jet or reach the calorimeter inside the acceptance cone but leaving a signal below threshold for the 
clustering. Both effects are estimated to add up to 5% for p± ~ 150GeV, with 3% stemming from 
low energy deposits not included in the clusters and 2% from particles bent outside the acceptance 
cone. 

Jets formed from topological clusters, calibrated using the local hadronic calibration scheme, were 
compared to the truth jets as done with the previous calibration methods. Figure 15 shows the linearity 
for Cone R C0De = 0.7 and ki R = 0.6 dijets for 3 different \r\ \ regions as a function of the jet energy. The 
performance in the forward region is especially low because of a scale error of 10% introduced in the 
simulations 1 . The forward scale error highlights another strength of the calibration hits - they can in fact 
reveal that there is a problem in the predicted reconstructed energy. All calibration methods discussed 
in this note would yield an overestimation of the jet energy in the forward region in real data if this 
simulation problem would not be fixed in the samples needed to derive the calibration constants. 

In the other pseudo-rapidity regions the linearity is rising from 80% at 30GeV to over 95% at 1 TeV. 
Figure 16 shows the linearity as a function of the true jet |tj| for 4 different jet energies. Dips in the 
linearity are clearly visible for the transition regions between the calorimeter systems at the gap region 
(1 .3 < I T] I < 1 .5) and the crack region (3.0 < | r\ \ < 3.5). For the forward region, the mentioned scale error 
is very clearly visible: the linearity cannot be recovered and the scale is off by 10%. The dependency of 
the linearity on the jet energies can also be observed in these plots: while the linearity for jets with about 
lOOGeV can be recovered up to about 90% (85% in the gap region), high energy jets (of about ITeV) 
show a linearity of about 95 — 97% which is compatible with the 3% loss due to misclassification as 
discussed above. Although a simple scale function, such as given in Eq. (5) on the jet level, would restore 
the linearity and give a comparable performance as the global calibration, our goal is to understand and 
correct for these effects in order to recover the linearity instead by correction functions based on the jet 
constituents. 

^he assumed sampling fraction did not correspond to the actual sampling fraction in the FCal region and thus the assump- 
tion that electromagnetic showers can remain un-scaled leads to an underestimation of the energy. 
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Figure 15: Linearity for Cone jets with R cone = 0.7 (left) and Kt jets with/? = 0.6 (right), both calibrated 
with the local hadron calibration method (LC), using truth particle jets (MC) as reference. The linearity 
is shown as a function of the matched truth jet energy. 
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Figure 16: Linearity for Cone jets with/? cone = 0.7 (left) and Kt jets with/? = 0.6 (right), both calibrated 
with the local hadron calibration method, using truth particle jets as reference. The linearity is shown as 
a function of the matched truth jet 1 7] | . 
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The jet energy resolution is shown in Fig. 17 as a function of the true jet energy. Table 4 shows the 
parameterised resolution obtained using this method as a function of energy and rapidity. It's perfor- 
mance is typically 20% or more above that obtained using the global calibration method. We discuss 
some possible improvements to the local hadronic calibration method below. 



C7 LC/MC Jets0< hl<0 
C7 LC/MC Jets 1.5 < In I < 



Figure 17: Resolution for Cone jets normalized to the reconstructed jet energy with R cone = 0.7 (left) and 
Kt jets with R = 0.6 (right), both calibrated with the local hadron calibration method, using truth particle 
jets as reference. The resolution is shown as a function of the matched truth jet energy. 



Table 4: Resolution as function of E true for jet with the local hadron calibration applied. 



Reconstruction Algorithm 




0< |T]| <0.5 


1.5 < |t]| <2.0 




a(%) 


b (%) c (GeV) 


a (%) b (%) c (GeV) 


Cone R cone = 0.7 LC 


78±8 


3.5±0.8 2.3±0.9 


98±14 7.7±1.7 3.3±0.7 


kj R = 0.6 LC 


79±8 


4.7±0.7 2.4±0.6 


117 ± 15 9.7±1.9 1.2±2.3 



A detailed analysis of the performance of the local calibration when applied to jets is also presented 
in Ref. [19], where different local calibration approaches are compared to the performance of the HI 
global calibration. 

5.4 Further Improvement 

As seen in the previous section several jet-level corrections need to be applied in order to bring jets made 
of local hadron calibrated topological clusters to the truth particle scale. However, the global method is 
seen to exhibit somewhat superior performance indicating that further improvements should be possible 
since both methods use shower development as their fundamental basis. 

Figure 1 8 (left) shows the ratio of the corrected energy as obtained from the reconstructed calorimeter 
cells to the energy obtained from the GEANT4 calibration hits in the clusters and in dead material. A 
significant deficit is seen at low energy. Figure 18 (right) shows the ratio of the the energy obtained 
from the GEANT4 calibration hits in the clusters and in dead material to the energy of the nearest truth 
particle jet. In this case we see roughly unity at high energy and only a 10% deficit at low energy. Since 
this comparison is to the truth particle jet, we attribute this deficit to out of cone energy. We therefore 
conclude that the dominant effects on the non-linearity at low energies seen in Fig. 18 stem from particles 
lost in the dead material upstream of the calorimeters. These low energy pions deposit most of their 
energy in upstream materials and often do not leave a sufficiently large signal in the calorimeters to 
cause a cluster to be formed. At present, no good observable on the local cluster level aids in recovering 



317 



Jets and Missing E t - Detector Level Jet Corrections 




• C7 0.2 < In I < 0.4 

□ C7 2.0 < h I < 2.2 

* C7 3.7 < In I < 3.9 
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Figure 18: Weighted + DM corrected / £ in cluster truth + DM truth' the reconstructed weighted and 
dead-material corrected energy over the the predicted true energy inside clusters and associated dead 
material regions (left) and E m c i uster tmt ^ + DM truth / £ truth> the ratio of the predicted true energy in- 
side clusters and associated dead material regions (the denominator in the left plot) over the energy of the 
matched truth particle jet (right) as function of the matched truth jet energy for cone jets with R CO ne = 0.7. 



this lost energy and the local calibration method can not account for it. Corrections for these effects are 
currently being studied. A scaling function like Eq. (5) which is used in the global method would help 
to restore the linearity in Fig. 15 but would not improve the resolution. The generalization of cluster 
shape variables to the jet level (number of low energetic constituent clusters, energy distribution of the 
constituent clusters, etc.) might help in order to obtain correction procedures that depend only indirectly 
on the used jet algorithm, restore the linearity and improve the resolution. The missing energy content 
can for example be estimated by extrapolating the actual distribution of constituent cluster energies to 
zero GeV to recover the lost contributions from low energetic particles. The in-situ methods as discussed 
in Ref. [20] can be used to validate the corrections obtained and to possibly compensate residual non- 
linearities. 



6 Track-based improvement in the jet energy resolution 

We present a track-based method for improving the jet energy resolution in ATLAS. Unlike energy-flow 
techniques reference, information is added to the reconstructed jet, after the global jet energy scale cor- 
rections have been implemented, and the track-based correction is applied based on the fraction of jet 
momentum carried by charged tracks associated with the jet. Using this correction, a ~ 20% improve- 
ment in jet energy resolution at low energy is achieved. 

In this chapter we describe a technique that uses tracks in jets to extract information from the jet 
topology and fragmentation in order to improve the jet energy resolution. The approach is conceptually 
different from more traditional energy flow methods, where precise track momentum measurements re- 
place calorimeter clusters. In the proposed technique, tracks are used to correct the response of jets as a 
function of the jet particle composition, specifically using the ratio of track to calorimeter transverse mo- 
mentum (/ t rk = cdorimeter )- Using / tr k provides an improvement in jet energy resolution without changing 
p T 

the jet energy scale applied during reconstruction. 

In general, jets are composed primarily of neutral and charged pions. Charged particles leave tracks 
in the detectors, and so one might naively expect that approximately two-thirds of the jet energy will 
be carried by tracks associated with that jet. Monte Carlo QCD dijet samples show Gaussian / tt k distri- 
butions centered around 0.66, with small tails extending above 1. The tails are more prominent at low 
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energies and include, for example, jets with a true / t rk near one and one or more tracks with incorrectly 
measured momenta. 

The fractional jet energy resolution, a ^ PT ptni f T - , is proportional to the width of the jet energy re- 
sponse in bins of transverse energy, normalized to the average jet energy in a bin. If the response of 
these jets varies significantly with / tr k, the transverse jet energy resolution will be artificially broadened, 
as shown in Fig. 19. One sees that the total measured transverse jet energy resolution is considerably 
wider than either of the constituents corresponding to jets with different charged particle fractions. By 
correcting the jet response as a function of jet pj and / trk we reduce the overall broadening of the energy 
distribution and, hence, improve the jet energy resolution. 




Ef-Er c ' e (GeV) 

Figure 19: Black: difference between reconstructed and truth jet transverse energy for jets with 
0 < |tj| < 0.7 and 40 GeV < p T < 200 GeV. The mean (width) of this distribution is proportional to 
the jet energy response (resolution). Since jets with different / tt k have different responses, the transverse 
energy resolution is artificially broadened because of the offset of the distributions for each / t rk bin. The 
normalization is arbitrary. 



6.1 Monte Carlo samples and event selection 

Track based jet corrections were determined using QCD dijet events. The MC events used are the same 
J1-J4 samples described in Table 2. The reconstructed jets in the samples ranged in pj from 7 GeV to 
280 GeV. Only jets with energies above 45 GeV were used in the fits. 

Reconstructed 0.4 cone, tower-seeded jets were selected from the event and separated into bins of 
pseudorapidity. Fits were formed for the central (|t]| < 1.2), transition (1.5< |t]| <1.8) and end cap 
(1.9< |t]| <2.2) regions of the calorimeters. Because the ATLAS tracker acceptance ends at |t]|=2.5, jets 
beyond |tj | = 2.0 were not considered for fits. 

Tracks within a cone of radius 0.4 in r\ — (j> around a jet axis were included in the calculation of f tl ^. 
In order to remove jets with a single poorly measured track, jets were required to be associated with at 
least two tracks, and each track was required to have % 2 /DoF < 3.0. Less than 1% of the tracks and 
jets were rejected by these cuts. The requirement of two tracks was only applied to derive the correction 
and not to evaluate the performance. In order to be as inclusive as possible, all calorimeter jets were 
considered. To find the track-based energy corrections, jets were also required to be isolated in a cone of 
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0.8 in r\ — (j) space, to avoid jet-jet contamination which skew / tt t considerably for low energy jets. After 
cuts, each jet contained typically 5 to 8 tracks. 

Jets were also required to have a truth jet matched within 0.1 in rj — (j> space, and were required not 
to be matched to a truth &-quark. Jets with ft-quarks could have different energy responses compared to 
light quark jets and were excluded from the fits. Incorporating a track-based b-jet energy correction is a 
topic for future studies. 

6.2 Track-based jet energy response parameterization 

Fits to the jet energy response are made as functions of f t ± and pj. The fits were also binned in regions 
with flat response as a function of pseudorapidity. Three such regions were identified: one in the barrel 
calorimeter, one in the endcap calorimeter, and one in the transition region between the two. The fits are 
then extended to nearby regions so that all jets are eventually corrected. Since the transitions between 
the calorimeters are discontinuous, the transitions between the corrections were discontinuous for this 
first version of track corrections. 

One dimensional fits of the response as a function of jet pj were performed in bins of / trk with the 
function R ftA (p T ) = E^/E^ = a (l -e^'*^). Sample fits are shown in Fig. 20 (left). The bins in 
/irk were adjusted so that each bin contained approximately the same number of jets, and no single bin 
spanned a large range of ft±. The pj used for each point of the fit was taken to be the average pi of jets 
in that bin, and the f tr ^ recorded for the bin was the average / t rk of all jets in that bin. 




100 200 300 400 500 600 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 

Reco jet p T (GeV) f, rk 



Figure 20: On the left, fits as a function of reconstructed jet pj for central (| r\ \ < 0.7) jets in bins of /irk- 
On the right, straight line fit of one of the parameters determined by pj fits, as a function of / tr k- 

The jet pj fit parameters were then extracted and fitted with a straight line function of / tr k, as shown 
in Fig. 20 (right) for parameter a. The 2-dimensional track-jet energy corrections, R(pj, /irk) can be 
applied to reconstructed jets to improve their average response. 

6.3 Algorithm performance 

The track-jet response correction R(pj, /irk) described previously was first applied to jets selected for 
determining the parameterization. The jet energy response was then checked as a function of f tT ^. Fig- 
ure 2 1 (left) shows the dependence of the response on / t rk before and after applying the track-jet correc- 
tion. Track-jet corrections flatten the response, Rj tA (pj) = Ej l /Ej ue , to a mean value of 1 for all / t rk- 
An overcorrection at high f tl ± appears. As the overlaid distribution of / tr k demonstrates, very few jets lie 
in this region and so the total energy resolution is not altered. 
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As described earlier and shown in Fig. 19, the jet resolution before the R(pj, / tr k) correction can be 
thought of as a convolution of distributions consisting of the sum of several offset Gaussians with dif- 
ferent /trk. By re-centering the underlying distributions we improve the jet transverse energy resolution. 
Fig. 21 (right) shows the overlapping distributions after re-centering. 




Figure 21: On the left: absolute jet energy response as a function of / t rk, before and after applying 
the track-jet response correction. The distribution of / tr k has been overlaid to show the jet distribution. 
All jets used in the fits are included. On the right: jet transverse energy response after the track-jet 
response correction for jets with 0 < |t]| < 0.7 and 40 GeV < pj < 200 GeV. The underlying Gaussian 
distributions are now overlapping, and the measured jet transverse energy resolution has been reduced. 

The fitted corrections were applied to all jets above 40 GeV in the dijet samples. The following stud- 
ies show the improvement of jet transverse energy resolution and £"™ ss distributions after the corrections 
are applied. 

Jet transverse energy resolution 

The transverse jet energy resolution is considerably improved at low jet pj, as demonstrated in Fig. 22. 
Fits are shown both before and after the corrections are applied. A ~ 15% improvement in energy 
resolution is achieved at 60 GeV. 

gmiss reso i u tion 

Track-based jet corrections can also be used to improve the scale of the missing transverse energy. The 
track-corrected Ej lss is computed as 

g T con = g T - 2>V* C °" - p T ) (14) 

where the sum refers to the two leading jets, and p* T trk ' corr = p T /R(p Tj f trk ) is the jet transverse momen- 
tum corrected as described in Section 6.2. 

Figure 23 (left) shows the mean value of the £"™ ss as a function of the / tr k difference between the two 
leading jets in a dijet sample. Both leading jets were required to have | tj | < 1 .2 for this study. Figure 23 
(left) shows a large imbalance of energy when the two jets have large differences in ff,^. When A/ tr k < 0, 
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Figure 23: Average missing transverse energy as a function of the / tt t difference between the two leading 
jets in a dijet sample before (left) and after (right) track-based jet energy corrections. The width and tails 
are improved. 

/trk > /trk an< ^ Pt * s underestimated resulting in a positive bias on Z?™ ss . Similar argument explains a 
negative £"™ ss bias for A/ tr k > 0. 

Figure 23 (right) shows that the £"™ ss scale is properly corrected after applying the track-based re- 
sponse correction to the leading two jets, and the Ej lss bias has been removed. 

6.4 Conclusions and future studies 

Although the corrections in this section were calculated only for cone jets with AR = 0.4, they can be 
trivially extended to any other jet collection, including kj jets. 

We introduced a track-based method for correcting the response of jets in ATLAS that provides a 
~ 20% improvement in jet energy resolution at 50 GeV. The corrections also improve missing energy 
distributions. These corrections do not require new jet energy scale corrections and can be applied 
after the standard reconstruction. By systematically adding information from the tracker to jets already 
reconstructed based on calorimeter information, considerable improvements can be made. 

There are several additions being explored to further improve the jet energy resolution using this 
technique and other similar track-based variable methods. This technique will be expanded to correct 
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Figure 24: Fraction of p T carried by the neutrino in b jets decaying semileptonically (b — >• }lX or 
b — >• c — >• }lX). The abscissa corresponds to the total transverse component for the jet (all interacting 
particles except muons), muon and neutrino momenta. We use b jets from QCD dijet samples as de- 
scribed in Section 7.1. 

&-jets in the same way that light quark jets have already been corrected. The radius used for track- 
jet association may be adjusted to improve the performance. Track-based response corrections will be 
expanded to include additional variables such as the fraction of transverse momentum carried by the 
leading track (/^ k ) and track multiplicity (« t rk)- 

7 Jet energy scale corrections to semileptonic b jets 

In this section, we discuss a possible strategy to correct the b-jet energy in case of semileptonic decays 
of the b quark. 

The decay of b quarks usually produces a c quark, which subsequently decays to a d quark. The b 
quark decays into a muon and a neutrino k, 10% of the time. As a result, a b jet is accompanied by a 
neutrino and a muon ps 19% of the time and by two neutrinos and two muons « 1% of the time. These 
neutrinos carry away a fraction of the jet energy, introducing a systematic underestimation of the energy 
of such jets. In this document, we concentrate on b jets that contain only one neutrino inside. The 
neutrino from the semileptonic b jet decay carries over 10% of the total jet pj. 

However, these jets can be tagged by the presence of a muon, if the muon is energetic enough to 
reach the Muon Spectrometer. Upon a successful tag, the jet energy scale can be corrected through a 
parameterization of the energy carried by the neutrino. In the following sections, a correction of the jet 
energy scale as a function of jet and muon pj for semileptonic b jets is presented and validated. 

7.1 Monte Carlo event selection 

For the studies in this document, two data samples were used (250 k it and the dijet samples described 
in Table 2) 

In addition, for the present studies, semileptonic b jets were required to be tagged by the soft-fc 
tagger [2 1 ] and be contained within | tj | < 1 .2. The r\ cut is required because the jet response changes for 
higher T7. Studies in larger r\ regions were not possible due to a lack of statistics. For the dijet sample, 
only events that had two b jets with A(j) > 1 .0 were used. This provides a sample composed mostly of bb 
events as well as a few gg events where one of the gluons decays to bb. 
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Figure 25: Left: Jet response as a function of jet pj for semileptonic b jets containing one muon for dif- 
ferent values of x (equation 15). Here p^^ +v re f ers t0 the transverse component of the vector addition 
of the true jet momentum (all interacting particles except muons included), the true muon momentum 
and the neutrino momentum. Right: Value of the results from straight line fits to the points in the left 
plot as a function of x. The fit function C(x) =a + be~ cx is also shown. 



7.2 Derivation of the jet energy scale correction 

The jet energy scale correction was derived using semileptonic b jets from the it sample. The jet response 
was studied as a function of jet and muon pj. In particular, it was found that a correlation exists between 
the jet response and the quantity 

X = -PT,reco// 7 T.reco' ( 15 ) 

where pj reco is the reconstructed muon pj, and Pj\ eco is the reconstructed jet pj, which does not include 
the muon contribution. The jet responses for samples with different values of x are shown in the left plot 
of Figure 25. The responses are shown together with the constant fits used to determine the correlation 
between the jet response and the quantity x. The fits do well in correcting for the missing neutrino 
energy, and, therefore, no further dependence on jet pj has been considered so far. A better modeling of 
the dependence of the response as a function of jet pi in the different samples could help improve the 
correction. The values of the constant fits (R, in the figure above) were then used to parameterize the jet 
response as a function of x. The values of R for samples with different values of x are shown on the right 
plot of Figure 25. These values were fit to the function C(x) =a + be~ cx . This fit is shown on the same 
plot and resulted in the following values for the parameters: a = 0.846, b = 0.1 1 and c = 6. 

This function was used to correct the jet energy scale. In particular, the corrected pj of the jet was 
calculated as: 

/C^cwry T x (i6) 

The momentum of the jet was increased accordingly along the direction of the vector addition of the 
muon and jet momenta. 

7.3 Validation 

The jet response before and after applying the correction for the it sample is shown in the left plot 
of Figure 26. The jet response is corrected appropriately with an uncertainty of m 2% on the correc- 
tion. The right plot shows the jet response in the dijet samples. The performance of the correction on 
this sample is comparable to that shown in the ti sample. The relative jet pj resolution, a r / r where 
r = ^Trtco/ ^Trtco^' was a l so studied for these two samples before and after the correction, but it is not 
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Figure 26: Response of semileptonic b jets before and after applying the neutrino correction to jets from 
a tt sample (left) and a bb sample (right). 

shown. The effect of the correction on this quantity is not noticeable within the statistical uncertainties 
of these studies. 

7.4 Conclusion on the corrections to semileptonic b jets 

In this section, a procedure for correcting the jet energy scale of semileptonic b jets decaying to a muon 
has been presented. The procedure has been validated on semileptonic b jets from two data samples 
showing a noticeable improvement in the jet response, while the relative pi resolution remains un- 
changed. To conclude, it should be emphasized that the correction of semileptonic b jets is strongly 
coupled to our ability to tag them. For this reason, studies in specific physics analyses need to be done 
to set the operating point of the soft-b tagger. For example, in a bb sample, where there are no light 
jets, we can benefit from this correction the most, while in a tt sample the operating point of the soft b- 
tagger might need to be adjusted and could have too low an efficiency for the correction to be noticeable. 
Further studies on this subject are required in order to determine when the correction is desirable. 

Conclusion 

In the first part of the article, we discussed two different ways of using the full ATLAS simulation to 
calibrate jets. The global method is proven to recover the linearity of the energy measurement while 
improving the resolution in a wide energy range on Monte Carlo samples. We proved its robustness 
over different quark content, shower model, and event complexity. The local hadron calibration has been 
shown to almost fully recover the linearity with respect to the jet calorimeter energy deposits. Even 
though a correction step to go back to the truth jet scale is missing, the method is promising. In the last 
part of the article, we discussed the possible use of the tracker information of the jet to improve the jet 
resolution. After the application of the global calibration, the method is able to further improve the jet 
energy resolution especially at low Ej. Finally we discussed a possible way of recovering the neutrino 
energy in semileptonic &-quark decays. 

The calibration methods discussed in this note will be used to provide jet corrections for the ATLAS 
detector. We stress that the validation of the corrections heavily relies on in-situ measurements as dis- 
cussed in [20]. 
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Abstract 

The precise measurement of the momentum of hadrons in the tracking detec- 
tors, compared to the energy E measured in the calorimeters can provide a 
validation of the hadronic calibration of the response of the calorimeter to pi- 
ons at the percent level. The E/p ratio allows the validation of the absolute 
energy scale calibration over the pseudorapidity range covered by the tracking 
detectors. Since the selected hadrons are mainly pions, the results obtained for 
pion response from test beam data are almost directly comparable. The single 
hadron E/p performances are studied using minimum bias events. 

1 Introduction 

The tracking system can be used to verify the hadronic calorimeter calibration in the energy range be- 
tween 400 MeV and 200 GeV. The inner tracking will be able to reach a 1% precision on the transverse 
momentum, pj , with a few months of data taking at low luminosity. The energy over momentum, E/p, 
ratio allows the validation of the absolute energy scale calibration over the pseudorapidity range of the 
tracking detectors which covers the Barrel Tile Calorimeter (\r] \ < 1), the Extended Barrel Tile Calorime- 
ter (1< \r}\ <1.7) and the Liquid Argon Hadronic Endcap Calorimeter up to |t]|=2.5. Furthermore, since 
the selected hadrons are almost exclusively pions, the results obtained for pion response from test beam 
data are almost directly comparable. 

The energy range for which the hadronic calibration must be checked can be determined by jet 
fragmentation studies at the generator level. It is instructive to investigate the lowest energy needed to 
obtain a given fraction of the jet energy, denoted x. For this reason, it is useful to define the lowest 
energy jet particles (LEJPs). Using the highest pj jet in QCD dijet events, the lowest energy jet particles 
(LEJPs) are determined by ordering in energy the stable particles (ct > 10 mm) in the jet. Starting with 
the highest energy, the particle energies are summed up until a fraction x of the jet energy is reached. The 
particle that pushes this sum over the threshold is the LEJP. Figure 1 shows the mean LEJP energy for a 
value of x = 0.95 asafunction of the jet energy for central (|tj| < 1.5) and endcap (1.5 < \r\\ <3.2)jets. 
To reconstruct 95% of the jet energy in the central region we have to measure particle energies down to 
800 MeV. To reconstruct 99% of the jet energy we need to reach energies down to 200 MeV. The energy 
of the highest energy particle in the jet ranges from 10 GeV to 550 GeV. 

Any hadronic calibration scheme is required to be robust over the energy range 200 MeV to 550 GeV 
to ensure the correct jet energies scale. Furthermore a large fraction of particles in jets have energy below 
10 GeV and a large proportion of them are charged pions. Hence it is important to check the hadronic 
calibration at low particle energies using the E/p method. The robustness of the hadronic calibration, the 
validity of the hadronic shower model in Monte Carlo, and the single charged hadron E/ p performance 
can be studied in several data samples which cover different energy ranges. The minimum bias sample 
can reach down to a momentum of 400 MeV and the remainder of this note deals with studies of E/p in 
the minimum bias sample. 

2 Study of E/p using Minimum Bias events 

As discussed above, low energy hadrons carry a large portion of the energy in a jet. It is therefore 
important to study the E/p performance at low energies to obtain an ultimate calibration of the jet 
energy scale to 1% precision. In early data, when a precise hadronic calibration will not be available, this 
study will be performed using the electromagnetic scale. Subsequently, each step of the local hadronic 
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calibration can be cross-checked and improved. These steps include removal of noise via the use of 
topological clusters, cluster classification as electromagnetic or hadronic, and calibration of hadronic 
clusters to the local hadronic energy scale. 

Minimum bias samples were used to determine the feasibility of using charged hadrons in the pj 
range between 1 to 10 GeV to cross-check the single hadron energy scale for topological clusters cali- 
brated to the local hadronic energy scale [1]. The simulation used for this study replicates data collected 
over only a few days at a trigger bandwidth of 10 Hz. A year of data-taking at low luminosity will reduce 
the statistical uncertainty to less than 1%. The data collected in one year should allow the local hadronic 
calibration to be checked as a function of both r\ and 0. 

The calibration of topological clusters calibrated to the local hadronic energy includes corrections 
for invisible and escaped energy in the hadronic shower as well as energy lost in dead material, such 
as the material of the inner detector. It does not include corrections for energy lost in cells outside the 
cluster (out-of-cluster). In the very low energy regime considered by this study, there are significant 
energy losses in dead material and from out-of-cluster effects. Even with cluster calibration, it is not 
possible to completely recover these losses, since many low energy particles do not leave sufficient 
energy in the calorimeter to meet the cluster reconstruction thresholds. As a consequence, we expect 
the calibrated value of {E/p) to be below 1. However, as p T increases, these energy losses diminish 
and therefore {E/p) approaches 1. Measuring the E/p distributions for single hadrons can provide a 
way of determining the size of these energy losses in-situ. It can also be directly compared to the single 
pion Monte Carlo used to derive the weights and cluster classification used as part of the local hadronic 
calibration. 

2.1 Isolated pions in Minimum Bias events 

To eliminate fake tracks and ensure an accurate momentum measurement, only tracks satisfying the 
following criteria were considered in this study: 

• At least one hit in the B Layer of the pixel detector. 

• No more than one missing hit in the other pixel and SCT layers. 

• Good quality track fit satisfying # 2 /ndof < 1.5. 

Approximately 76% of all reconstructed tracks pass these criteria. The pj range for pions surviving the 
track quality constraints is given by the dashed curve in Fig. 2. This shows that minimum bias events can 




Figure 1: Mean LEJP energy (x = 0.95) as a function of the jet energy for jets in central (left) and endcap 
(right) regions. The error bars represent the Full- Width-Half-Maximum of the LEJP energy spectrum. 
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Figure 2: Distribution of pj of all tracks (solid line) and selected tracks (dashed line) in minimum bias 
events, scaled to 1 ub _1 . 

provide a source of pions to check the hadronic energy scale from 500 MeV, the nominal lower energy 
required for a reconstructed track, up to energies of the order 10 GeV. 

The pion energy deposited in the calorimeter is found by extrapolating the track direction to the 
second sampling layer of the EM calorimeter. From this position a cone is defined, AR cons = 1.0, and 
the energy of each charged hadron is calculated as the sum of all topological clusters within this cone. A 
topological cluster is considered to be inside the cone if its barycenter is within this A/? cone = 1.0 around 
the extrapolated track position. 

In minimum bias events, energy from the underlying event inside the region defined by A/? cone con- 
siderably biases the measured pion energies. Photons originating from the decay of neutral pions are the 
predominant source of background. Charged particles also contribute to the background as the recon- 
struction efficiency of tracks below 5 GeV can be lower than 80% [2]. Selection criteria were used to 
reduce the contamination from the underlying event. These criteria differ slightly for charged hadrons 
with pi < 3 GeV and Pt > 3 GeV due to the low multiplicity of higher pj tracks in the event and the 
different hadronic shower shapes in the calorimeter. The selection criteria used to minimise the amount 
of energy from the underlying event are: 

• Isolated from other tracks by at least AR = 0.4. The track positions were taken at the second layer 
of the EM calorimeter. 

• The charged hadron must be one of the harder particles in the event. 

- Pt/ £ p T >0.1for/?T<3GeV. 

all tracks 

- Pt/ £ Pt > 0.3 for pj > 3 GeV. 

all tracks 

• The energy in the hadronic calorimeter must be isolated by requiring the energy in the outer region 
0.4 < A/? cone < 1.0 of the cone to be small. 

- £hadi. 0 -o.4 < 0.01 x p Xmck for p T < 3 GeV. 

- .^had! 0-0.4 < 0.05 x p tmck for p T > 3 GeV. 
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Figure 3: The E/p distributions for pions from minimum bias events before cuts (dotted line), after cuts 
(dark solid line) and for a reference sample of single particles (dashed line) for tracks in the pj range 
0.8 - 1.2 GeV (top left), 1.6 - 2.4 GeV (top right), 4-6 GeV (bottom left) and 8-12 GeV (bottom 
right). All distributions are normalised to unity. The negative E/p ratios are a consequence of clusters 
with negative energy resulting from noise. 

Figure 3 shows the E/p distribution in minimum bias events before and after cuts, and compares it to 
single pions. All results shown here and in Section 2.2 are for tracks associated with real pions (studies 
of the impact of non-pion ttacks are in progress). These selection criteria approximately halve the shift 
in (E/p) caused by other particles in the cone. 

The shift of (E/p) caused by the selection is primarily due to the isolation cut in the hadronic 
calorimeter. The shift was estimated using the pion from single particle Monte Carlo and was found 
to change (E/p) by 0.006 for 1 GeV pions, 0.002 for 2 GeV pions, negligible for 5 GeV pions and 0.001 
for 10 GeV pions. The remaining background, due to energy from the underlying event, cannot be re- 
moved with cuts. This is justified by the absence of isolated charged hadrons in minimum bias events. 

2.2 Data-driven unfolding procedure 

The remaining background, present inside the region defined by AR cone , was estimated using a data- 
driven method described in Ref. [3]. This was then unfolded from the pion E/p distribution. Most of the 
energy coming from the underlying event is deposited in the electromagnetic calorimeter. Charged pions 
which deposit most of their energy in the hadronic calorimeter were therefore selected in order to estimate 
the energy contamination in the electromagnetic calorimeter. These are labelled as late showering pions. 
The energy from the underlying event was studied by dividing AR cone into the following two regions: 

• A region where very little background energy is deposited, consisting of: 

- The hadronic calorimeter and 
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Figure 4: The total, (E/p) meas , (left) and underlying event, (E/p) contam , (right) distributions used to 
extract the E/p of isolated charged hadrons in the range 0.8 - 1.2 GeV. Error bars show the statistical 
error on the distribution before fitting (solid line). 

- A core region close to the track: 

* A/? core = 0.1for/? T <3GeV. 

* AJ? core = 0.05 for pj > 3 GeV. 

• A ring surrounding the assumed trajectory of the pion through the electromagnetic calorimeter 
contains energy of the underlying event, £outer-cone> defined to be AR core < AR < 1 in the electro- 
magnetic calorimeter. 

We use the energy in the Eouter-cone of late-showering pions to estimate the underlying event energy. 
For this study we assume that the energy in ^outer-cone f° r late- showering charged particles is zero as they 
act as minimum ionising particles (mips) in the electromagnetic calorimeter. 1 

Late-showering charged pions were identified by two criteria based on the energy in a narrow cone 
of AR < 0.05 around the track: 

• Energy in the electromagnetic calorimeter < 0.5 x ptrack- 

• Energy in the hadronic calorimeter > 0.5 x ;? t rack ■ 

The background estimated using late-showering pions is applied to all pions (late and early showering 
ones). Figure 4 shows the underlying event energy obtained by this method for tracks in the pi range 
0.8 — 1.2 GeV. The underlying event energy divided by the track momentum is defined as (E/p ) contam . 

The (E/p) meas distributions shown by the solid curves in Fig. 3 are a convolution of the E/p distri- 
butions for isolated pions, (E/p) 1 ™, and the underlying event energy. By deconvoluting the background 
from the measured E/p, we recovered (E/p) lso . 

The convolution can be written in terms of the number of entries in each bin i in the measured E/p 
histogram: 

(E/ P yr as = , £p ij x(E/pyj o , (i) 

where the elements of the matrix Py represent the probability of background contamination shifting the 
energy from bin j to bin i. 

Each element of Py is taken from the energy deposited in ^outer-cone (shown in Fig. 4 for pj = I GeV 
pions). The matrix elements are defined as Py = (E/p)™ nta ™, i.e. Py is given by the z' th — j th bin con- 
tent of the normalised histogram in the region ^outer-cone- The probability that each bin contributes to 

'We assumed that the mips penetrating depth was uncorrelated with the background. 
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the (E/p) meafi value is given by /^/det(Py). The diagonal elements of the normalised matrix give the 
probability that a measurement is free from contaminating energy. 

The distribution (E/p) mo is derived by solving the set of linear equations described above via matrix 
inversion. The unfolding method is described in Ref. [4] and Ref. [5]. The background is fitted with two 
exponential functions: one above zero and one below. The measured E/p is fitted with an exponential 
function below zero, a 7th order polynomial function above zero up to half the maximum height, and an 
exponential function for the high E/p tail. The result of these fits for 1 GeV pions is shown in Fig. 4. 
The final distributions obtained for (E / p) lso are shown in Fig. 5. The subtraction: 

(E/ P r = <£/p} meas - (E/p)""*" (2) 

is used to examine the shift caused by any remaining contamination or by the method itself. Results 
are shown in Table 1 . The E/p values are consistent with the single particle mean, showing that this 
procedure is promising. However, larger statistics are required to properly assess the ultimate precision 
of this method. 

We compared charged hadrons in minimum bias Monte Carlo events of pj = 0.8 — 1.2 GeV, 
1.6 - 2.4 GeV, 4-6 GeV and 8-12 GeV with single pion samples of 50000 events generated with 
Pt = 1 GeV, 2 GeV, 5 GeV and 10 GeV. As both minimum bias and single particle simulations are done 
with the same detector geometry and showering model, the E/p distributions are identical, providing a 
way to test how well we can retrieve the single pion E/p distribution in minimum bias events. 

Both positive and negative pions were used, and the results were combined to improve the statistical 




Figure 5: E/p distributions obtained from deconvolution for pions in minimum bias events (solid line) 
and single pions (dashed line) for the track momentum ranges 0.8 GeV < pj < 1.2 GeV (top left), 
1.6 GeV <p T < 2.4 GeV (top right), 4 GeV < p T < 6 GeV (bottom left) and 8 GeV < p T < 12 GeV 
(bottom right). The deconvolution also removes the effect of 'fake' clusters in £ 0 uter-cone> reconstructed 
from electronics noise. 
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Table 1: (E/p) for pions at pj = 1, 2, 5 and 10 GeV compared to pions in single particle events. 



Pion pi Range 


Minimum Bias 


Single Particles 


0.8 -1.2 GeV 
1.6 -2.4 GeV 
4 - 6 GeV 
8-12 GeV 


0.429 ±0.052 
0.622 ±0.043 
0.818±0.015 
0.870 ±0.036 


0.449 ±0.003 
0.609 ±0.003 
0.786 ±0.002 
0.869 ±0.001 



precision. Due to limited Monte Carlo statistics, here we bin only in pj . Ultimately, the E/p response 
will be studied in bins of both energy and pseudorapidity, because the dead material distribution varies 
dramatically across \r]\. In minimum bias events, the mean charged particle multiplicity is relatively flat 
in tj for a given p T . Single particle samples are weighted to have the same |tj | distribution as minimum 
bias tracks. 

2.3 Sources of systematic uncertainties 

We studied a number of effects which biased the E/ p distribution. In our study we assumed that all 
charged hadrons were pions. In minimum bias events, about 75% of all charged particles are pions, with 
a small number of kaons and protons. Less than 2% of the tracks are due to heavier hadrons, electrons, 
muons or are fake. However, the mix of particle types is different for late-showering hadrons. Biases 
arising from this different particle mixture were studied by comparing the (E/p) calculated using all 
track types with the (E/p) derived frompion tracks alone. Within the available statistical precision, we 
can put an upper limit of 10% on the shift. We use the energy in the ^outer-cone of late- showering pions 
to estimate the contaminating energy. For this study we assume that the energy in ^outer-cone for late- 
showering charged particles is zero as they act as mips through the electromagnetic calorimeter. This 
assumption translates to a shift on (E/p) of less than 4%. The energy from the underlying event in the 
hadron calorimeter and in the AR core region cannot be measured by this in-situ method. The shift of the 
(E/p) due to this background is estimated using single pions and was found to be less than 4%. The 
uncertainty in the energy scale is dominated by the unmeasured contaminating energy and the effect of 
non-pion tracks. All the above systematic uncertainties are statistically dominated. 

3 Conclusions 

Jet fragmentation studies show that low energy hadrons carry a large portion of the energy in a jet. This 
study shows that isolated charged hadrons produced in minimum bias events can be used to study the 
E/p performance in the pj range from 1 to 10 GeV to obtain an ultimate calibration of the jet energy 
scale. This sample will provide an in-situ test of the cluster level hadronic calibration down to 1 GeV. 
Within a year at low luminosity, it will be possible to reduce the statistical uncertainty to less than 1%. 

With the available Monte Carlo statistics we obtained a statistical uncertainty of 10% at 2 GeV which 
goes down to 3% at 10 GeV, mostly dominated by the statistics of the late showering pion control sam- 
ple. Systematic biases from several sources have been studied. Effects, such as unpredicted detector 
inhomogeneities, will be studied once data are available. 
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Abstract 

This note outlines procedures for the in-situ determination of the jet energy 
scale and resolution of the ATLAS calorimeters. The jet energy scale is evalu- 
ated using energy -balanced processes: y/Z+ jet, dijet, and multijet events. The 
jet energy resolution is obtained from dijet events. Generators using leading- 
order multiparton matrix element calculations merged with parton showers 
along with leading-logarithmic Monte Carlo models are compared. Effects 
on the energy-balance due to hadronization effects are also studied. 

1 Introduction 

Precise reconstruction of the jet energy is a fundamental ingredient of many physics analyses at the 
LHC, such as the determination of the top quark mass, the reconstruction of dijet resonances, and the 
measurement of the inclusive jet cross-section. Moreover the performance obtained on jet reconstruction 
has a direct impact on the quality of the measurement of the missing transverse energy which will play a 
decisive role in many searches for new physics at the LHC. 

The ultimate goal of the jet energy measurement is, in most cases, the reconstruction of the initial 
parton momentum. On the other hand, the measurement in the ATLAS detector starts from signals 
recorded in the calorimeter cells which have been calibrated at the electromagnetic (EM) scale. This 
scale is set in test beams and is defined to reproduce correctly the electron energy in the beams. 

A subsequent software jet calibration procedure is performed in two major steps. First, corrections 
are made for detector effects, in particular calorimeter non-compensation, noise, losses in dead material 
and cracks, longitudinal leakage and particle deflection in the magnetic field. The procedure is described 
in Ref. [1]. After this step the hadronic scale which provides the jet energy at the particle level is 
obtained, i.e. it should correspond to the jet energy obtained after running the same jet algorithm over all 
true momenta of the final state particles in the event. Throughout this note, we will refer to these jets as 
"particle-level jets" or "truth jets". At the second step physics effects, such as clustering, fragmentation, 
initial and final state radiation (ISR and FSR), underlying event (UE) and pile-up are considered. After 
that the final scale is reached which corresponds to the energy at the parton level. These effects can 
depend on the type of interaction, e.g. they can differ for quark and gluon jets, can depend on the parton 
momentum scales and multiparton interactions, so that an individual study may be necessary for each 
data analysis involving jets. 

The validation of the whole jet calibration has to be performed in-situ using suitable physics pro- 
cesses. In the course of in-situ validation (also called in-situ calibration) the systematic uncertainty of 
the hadronic energy scale is determined, and the final tuning of this scale is possibly performed. Fur- 
thermore, the level of physics effects affecting the final scale is estimated. In addition, the jet energy 
resolution can be determined in-situ. 

The validation procedure will start with QCD dijet events, which allow us to check the uniformity of 
the calibration as a function of azimuth 0 and of pseudo-rapidity tj . The uniformity in 0 can be checked 
by studying jet rates; the uniformity in tj can be validated using pj balance between the jets. The dijet 
events also open two ways of determining the jet energy resolution. 

After a uniform detector response is obtained, the absolute hadronic energy scale will be studied 
using 7 or Z+jet events, in which the Z boson is reconstructed via the Z — >• e + e~ or }X + }X~ decay. The 
pj balance between the jet and the boson in such events will be used to relate the hadronic scale of the 
jets to the well understood energy of electromagnetic objects. 
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At very high jet energies, the statistics of y/Z + jet events vanishes. After the validation of the absolute 
jet energy scale (JES) with these events in a limited pj range, the scale at higher pj will be validated 
using QCD multijet events by balancing the momentum of the leading jet to the momentum sum of the 
other jets. Alternatively, it can be inferred from the opening angle between the two leading charged 
particle tracks associated with the jet. 

These methods are discussed in the present note. Expectations of several Monte Carlo programs 
are compared for various jet algorithms. The statistical and systematic uncertainties of the methods are 
assessed with an emphasis on the first lOOpb -1 and 1 fb 1 of ATLAS data. It is important to note that 
one missing contribution is the precision with which the Monte Carlo is tuned to and reproduces the data. 
This, naturally, can only be determined once data are available. 

Several related studies are covered in other notes. The determination of JES from the invariant 
mass of the W boson in W — > qq decays is covered in Ref. [2]. The study of jet fragmentation and 
of the underlying event using tracks associated with the jets and in the complete event is discussed in 
Ref. [3]. The measurement of E/p for isolated charged particle tracks in order to study the response of 
the calorimeter is considered in [4]. 

This note is organized as follows. First, the Monte Carlo programs used for the present studies are 
briefly described. Then, the standard ATLAS jet reconstruction and calibration procedure is outlined. Af- 
terward, we discuss the basic features of / and Z+ jet events and of the pj balance calibration technique, 
the physics effects influencing the calibration procedure, the limitations of the underlying models and 
the expected systematic and statistical uncertainties of the jet energy scale. Subsequently, we consider 
the calibration techniques in QCD dijet and multijet events, followed by the methods of determination of 
the jet energy resolution. A summary is given at the end of the note. 

2 Monte Carlo event generators 

The general-purpose Monte Carlo programs HERWIG, PYTHIA and ALPGEN are used here to model the 
final states at the LHC energies. The event generation relies on phenomenological approaches to describe 
the processes which occur at all levels apart from calculation of the matrix elements. This is done by 
factorizing the event generation into several stages: 

• hard subprocess at a fixed order of perturbation theory, 

• initial and final state QCD radiation using partem shower models, 

• multiple parton interactions (MI) contributing to the underlying event (UE), 

• hadronization (fragmentation). 

PYTHIA and HERWIG include leading order (2 — > 2) matrix elements for generating y/Z + jet and QCD 
dijet events. Higher order QCD effects are modeled using parton showers, which can be insufficient to 
describe hard QCD radiation. These effects may significantly affect the event kinematics and thus spoil 
the determination of the jet energy scale and resolution. They are checked using ALPGEN which includes 
leading order matrix elements for generating multiparton final states. ALPGEN simulations are also used 
in addition to PYTHIA to study multijet events at very high transverse momenta, where such events 
provide the main means to determine the jet energy scale. 

2.1 PYTHIA 

PYTHIA 6.4 [5] is used with the on-shell leading order (LO) matrix element to model the final states. 
Higher-order QCD effects are simulated in the leading-logarithmic approximation with initial- and final- 
state radiation following the DGLAP evolution [6]. Coherence effects from soft-gluon interference are 
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included. The underlying event has been tuned to reproduce the CDF data. The parton density functions 
(PDF) CTEQ611 [7] are used for the proton. 

The initial-state shower is based on backward evolution to the shower initiators. Initial and final 
state showers are matched to each other by maximum emission cones. The Lund string fragmentation 
model is used to produce the final state hadrons. The string model is based on linear confinement, where 
(anti-)quarks or other color (anti-)triplets are located at the ends of the string, and gluons are energy 
momentum carrying kinks on the string. When the invariant mass of a string piece gets small enough, it 
is identified as a hadron, thus the whole system eventually evolves into hadrons. 

2.2 HERWIG 

HERWIG 6.510 [8] is used in conjunction with the underlying event generator JIMMY 4.31 [9]. As with 
PYTHIA, the CTEQ611 set is used for the proton PDFs. HERWIG uses the parton shower approach for 
initial and final state QCD radiation, including color coherence effects and azimuthal correlations both 
within and between the jets. It includes the angular ordered parton shower algorithm which re-sums 
both soft and collinear singularities. HERWIG uses the cluster fragmentation model [10, 11], where all 
the outgoing gluons are first split into quark/anti-quark or diquark/anti-diquark pairs. Then, quarks are 
combined with their nearest neighbor (in the color field) anti-quark or diquark to form color singlet 
clusters. These clusters have mass and spatial distributions peaked at relatively low values. For large 
cluster masses, the g— distributions fall rapidly and are asymptotically independent of the hard subprocess 
scale. If a cluster is too light to decay into two hadrons, it is allowed to become the lightest hadron of the 
relevant flavor. A similar cluster model is also used to model soft and underlying hadronic events. 

2.3 ALPGEN 

ALPGEN 2.06 [12] is used with the HERWIG parton shower and the JIMMY underlying event model and 
with the subsequent HERWIG cluster fragmentation. Similarly to PYTHIA and HERWIG simulations, the 
CTEQ611 set is used for the proton PDFs. Each final state parton multiplicity is generated individually 
by ALPGEN. A matching using a given scheme between a generated parton with the parton shower is 
performed in order to avoid double counting of the jet multiplicity. We use the MLM matching scheme. 

3 Jet reconstruction in ATLAS 

Several jet collections are built during event reconstruction in ATLAS, varying the input to the jet finder 
and the jet finding algorithm. The inputs considered for the jet finder are "calorimeter towers" (calo- 
towers) and "topological clusters" (topoclusters). In addition, "truth particles" are used for simulated 
events. Calorimeter towers are built from all calorimeter cells contained in a region of (drj x d0) of size 
(0.1 x 0.1). The initial cell energy is calibrated at the EM scale. Topological clusters are built according 
to criteria that identify significant energy deposits in topologically connected cells. Three different levels 
of signal significance are applied to the seed, the neighboring and peripheral cells. Currently, the settings 
of (4,2,0) in units of sigma of noise are used [1]. 

The jet finder methods used are cone and kj algorithms. The seeded cone algorithm is used with radii 
of R = 0.4 or 0.7, p T of seeds = 1 GeV, and a split/merge fraction of 0.5. The inclusive k T algorithm 
(fast version) is run with a D parameter of 0.4, 0.6 or 1.0 [3]. 

Two calibration approaches are developed in ATLAS, the "global" and the "local" schemes [1]. The 
global scheme uses Hl-style weights [1] to correct calorimeter cells after jet finding. The weights are 
based on the energy density in a cell. This calibration is specific to each type of jet finder. In the local 
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scheme, jets are built from pre-calibrated topoclusters, which already include hadronic calibration, as 
well as dead material and out-of-cluster corrections. In this note, we report on the "global" scheme 1 . 

For most of the studies, a minimum transverse momentum pj of at least 10 GeV is required for a jet 
to be considered in the data analysis. 



4 In-situ studies using y/Z + jet events 
4.1 Electromagnetic final state in y/Z + jet events 

The Z boson is observed via its Z — ► ee or Z — > jiji. decay products. The good lepton identification 
capability of the ATLAS detector allows reconstruction of Z decays with very low background [13]. 

Before comparing the jet energy scales in data and in simulation using y/Z + jet events, basic pi and 
tj spectra of the vector bosons have to be checked. These spectra are affected mainly by the efficiency 
of photon or lepton identification, and by the trigger efficiency. In the case of y production, significant 
background due to misidentified jets in QCD dijet events remains despite the rejection power of the 
detector and of the reconstruction algorithms. The theoretical systematic uncertainties associated with 
the vector bosons are expected to be small at all levels. The effects on transverse momentum due to 
additional photon radiation at the hadron level are predicted to be negligible. 



4.2 Use of pj balance 

In leading order of perturbation theory the final state of ylZ + jet events can be considered as a two- 
body system in which the transverse momentum of the jet prjet is exactly balanced by the transverse 
momentum of the vector boson pj r or pr,z- The pj balance can thus be defined as 



_ gTJet _ 
PJ, 7 /Z 



1 The local scheme approach was not yet fully developed when studies reported here w 
and compared to the global scheme in the future. 



(1) 



e done; this approach will be studied 
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Figure 1: Mean jet multiplicity for jets with pj > 10 GeV as a function of pj of the Z boson in Z+jet 
events. 
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Figure 2: A<j) between the photon and the jet for a) cone algorithm with R = 0.4, and b) kj algorithm 
withD= 1. 

However in reality, these events contain more than one jet in the final state at partem, particle and detector 
level, so that considering only the leading (highest pj) jet is in general too crude an approximation of 
the event kinematics. This is one of the major issues in this study and requires a detailed understanding 
of jet multiplicities, as well as their p T and angular spectra. The Monte Carlo models will have to be 
tuned to reproduce these jet properties and their energy dependence as observed with the data. A striking 
example of the current uncertainty in modeling these quantities is shown in Fig. 1 , in which the average 
multiplicity of jets in Z+jet events, generated using PYTHIA and HERWIG, is depicted as a function of 
the transverse momentum of the Z boson. Cone jets reconstructed with R = 0.7 at the particle level with 
pj > 10 GeV in the pseudorapidity range \t]\ < 5 are used. HERWIG predicts a larger number of jets than 
PYTHIA mainly due to its mechanism of cluster formation and the assumptions used to model soft and 
underlying hadronic events. Further extensive studies have been performed in order to understand the 
differences in the basic jet distributions at the parton and the particle level for various jet algorithms, and 
to quantify the influence of initial and final state radiation and of the underlying event model. 

A very useful quantity to consider is the azimuthal angular difference A(j) between the boson and 
the leading jet. It provides information about the event topology and is sensitive to additional physics 
effects, like initial state radiation and final state radiation. Figure 2 shows the A0 distribution at parton 
and at hadron level for three jet algorithms in 7+ jet events with multiple interactions switched on and 
off, respectively. Although the hadronization effects are apparent for cone jets with R = 0.4, the resulting 
leading jet in most cases is in a back-to-back topology with the photon. In order to reduce the topological 
bias due to additional radiation, the boson and jets in both y and Z+jet events are typically required in 
our studies to be back-to-back within A0 of ±0.2. 

In principle, one can consider two extreme ways of taking the physics effects into account: 

• Select only those events where the Z or y is back-to-back to only one jet in the event. Require 
any other jet to have a small transverse momentum. The jet energy correction factors can then be 
determined from the requirement fii = 0 as a function of energy and pseudorapidity. The strong 
back-to-back cut and the requirement against other jets in the event will severely cut the statistics 
in such an analysis. 

• The other extreme is to take all jets in the event that pass loose selection cuts and balance the sum 
of their momenta to the Z or y momentum. The pj balance can then be defined as 

Bz= \i^M_ lt (2) 
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Figure 3: Left in all rows: the mean value of the fitted pj balance Bz as a function of /?t,z in Z+jet 
events. Particle level jets (squares) and jets reconstructed from detector signals (circles) are shown. 
Middle in all rows: distribution for pj z ~ 50 GeV for truth jets. Right in all rows: B^ distribution 
for pt z ~ 50 GeV for reconstructed jets. Upper row: all jets with pj > 1 GeV are taken into account. 
Middle row: only jets with pj > 10 GeV are used and the requirement | n — A(j) | < 0.2 is imposed. Lower 
row: in addition, no further jet with pj > 10 GeV is allowed. 

This approach should be less sensitive to issues related to physics modeling. However, it is more 
difficult in this case to relate directly the measured balance to the energy scales of specific jets. 

Any kinematic selection cut will affect the global pj balance between the boson and the hadronic 
system. This is illustrated in Fig. 3. The central figure in the upper row shows the momentum balance Bz 
obtained by summing all particle level jets with pj > 1 GeV for events with pj z ~ 50 GeV. The actual 
balance fluctuates around a mean value close to zero. The right plot shows the corresponding distribution 
for reconstructed (HI) jets which is broader due to the detector resolution effects. 

The left plot shows the pj balance obtained by fitting a Gaussian to the pj balance distribution in the 
various pj z bins 2 . The fitted mean shows a small negative bias (of the order of 1-3%) which decreases 
with increasing pj z- Adding the pj of neutrinos and muons to that of the jets reduces the bias by about 
1%, so this cannot be the origin of this systematic effect, which is further discussed later on. 

The effect of selection cuts is studied further, as shown in the middle row of Fig. 3, which presents the 

throughout this note we use the mean value of a Gaussian distribution fitted within a limited sigma range (typically ±lcr) 
to characterize the pj balance. We refer to it also as the most probable value. 
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Pj balance for particle jets with pj > 10 GeV. In addition, the leading jet is required to be back-to-back 
with the Z within \AQ\ = (ft ±0.2). The bias increases and the spread deteriorates after increasing the 
pj threshold for jets. The effects are still strong even though the A0 cut is imposed. They become less 
significant as pj z increases. The main effect is due to fluctuations of the fragmentation combined with 
out-of-cone losses and with jet splitting. In addition, residual ISR/FSR contributes significantly to the 
spread. The underlying event may also contribute particles to the clustered jet. Studies performed with 
PYTHIA 7+ jet events [14, 15] indicate that the underlying event contributes ~ 20 MeV on average to the 
transverse energy E T at truth particle level in each (At] x A<j>) region of (0.1 x 0.1) which corresponds to 
one calorimeter tower. This results in ~ 1 GeV contribution per cone jet with R = 0.4 and ~ 3 GeV per 
cone jet with R = 0.7. The lower row of Fig. 3 shows the same distributions after an additional require- 
ment to have no further jet with pi > 10 GeV in the event. The spread of the pj balance distributions is 
reduced. The negative bias is also reduced but remains significant, especially at low pj z values. 

Furthermore, these effects strongly depend on the chosen jet algorithm. In Fig. 4 the pj balance for 
the leading jet relative to the photon is depicted as a function of photon pj for cone jets with R = 0.4, 
R = 0.7 and for kj jets with D = 1. The jets are reconstructed at the truth particle level and at the parton 
level after parton showering in HERWIG 7+ jet events and after applying the A0 cut. The kj jets on 
parton level reveal an essentially perfect balance, as expected from theory. Cone jets are subject to out- 
of-cone losses due to parton showering. The relative losses increase with decreasing pj, and are bigger 
for cone jets with smaller radii. They are much larger for particle level cone jets due to the lateral spread 
of the fragmentation. On the contrary, the kj algorithm tends to include particles originating from other 
partons, which leads to a significant positive bias at low pj. 

A further important issue in the use of the pj balance is the quantity used as a reference to define the 
pj ranges. This is illustrated in Fig. 5 in which PYTHIA 7+ jet events are used which were generated with 
a minimum pj of 30 GeV for the hard scattering process. The scatter plot shows the photon pj versus 
the pj of the parton as produced in the hard interaction. The events are distributed around the diagonal 
as a result of ISR. The cut at the generator stage is visible. If we choose p T ,7 bins to study p T balance 
distributions (e.g. the set of events with 30 GeV < pj, r < 40 GeV contained between the two horizontal 
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Figure 4: Mean value of the fitted p T balance (fii + 1) as a function of pi )7 in 7+ jet HERWIG events for 
various jet algorithms. The points correspond to particle level and parton level jets. 
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Figure 5: The pj of the partem versus the pj of the photon as produced in the hard interaction in 7+ jet 
events. 

dashed lines), we will observe a bias towards lower pj balance values, since in this case we populate 
the distribution with significantly more events from the left side of the diagonal than from the right side. 
The events on the left side are generated on average in interactions with smaller center-of-mass energy, 
hence with higher cross-section. The cross-section decreases quickly with energy and the negative bias 
of the pj balance is significant 3 . One should note also that the bias is proportional to the spread around 
the diagonal. Hence it decreases with pj. It also decreases if one requires the partem and the 7 to be 
back-to-back in azimuth within some A(j) limit, since the ISR spread is then reduced [14]. If we were 
able to select events according to (pT.7 + 7^T,parton) /2 (with lines perpendicular to the diagonal), then the 
symmetry would be restored. 

This can be seen in Fig. 6. The pj balance B[ for events with 101 < pj < 152 GeV is shown in the 

3 Throughout this note, we will genetically refer to such bias as "bin migration effect". 




Figure 6: Left: pj balance at particle level for events with 101 < pi < 152 GeV. The solid line shows 
the balance when the pj reference for binning is taken as the average pj of the photon and the jet; the 
triangles when it is taken as the photon pj. The circles show the balance when the photon pj is used and 
the photon and the jet are required to be back-to-back within 0.2. Right: the pj dependence of the most 
probable value of the particle level jet balance for these three cases. 



342 



Jets and Missing E t - Jet Energy Scale: In-situ Calibration Strategies 



left panel of the figure. The solid line shows the balance when the pj is taken as the average pj of the 
photon and the jet 4 and the triangles when it is taken as the photon pj. The difference in the shape is 
clearly visible with a more pronounced low tail for the pj r reference, because the cross-section favors 
the case when ISR has reduced the leading jet pj, as discussed above. The circles show the balance with 
the photon pj reference when the photon and the jet are required to be back-to-back within 0.2 to reduce 
ISR. The resulting distribution is indeed more symmetric. 

The right panel of the figure shows the most probable value of the particle level jet balance for these 
three cases. The effect of varying the quantity used to define the pi bins can be as large as 3% for low 
pj. The A(j) cut helps reducing the effect to the percent level. More details on this issue can be found 
in Ref. [14]. A disadvantage of using the average pj to define the energy bins is to introduce a coupling 
with the jet energy scale that one is trying to measure. Hence, we decided to use pj,y and £>t,z in the 
baseline study. The average p T will be used as a cross-check. 

The above studies show that the level of imbalance is determined in first order by "out-of-cone" losses 
related to the jet algorithm under consideration. In addition ISR/FSR and UE influence the measured bal- 
ance. ISR/FSR effects can be reduced by suitable cuts like a "back-to-back" requirement. UE can be 
estimated separately by looking at the energy outside jets and can be subtracted, if necessary. The im- 
balance, depending on the algorithm and on the choice of the reference scale, is up to 5 - 10% at 20 GeV 
and becomes smaller than 1% around 100 — 200 GeV. A careful tuning of Monte Carlo simulations will 
be necessary to reproduce and disentangle the effects that cause the residual imbalance. 

In the following, the specific issues of 7+ jet and Z+jet data analyses are discussed. In particular, 
backgrounds, event rates and the kinematic reach for each channel are considered. 

4.3 Analysis of y + jet events 

The data used in this section are 7+ jet events generated using PYTHIA. They are generated in intervals 
of pj of the hard scattered partons to provide adequate statistical coverage over a wide range of pj. The 
signal has been generated via the annihilation process qq —>■ gy (ISUB = 15) [5], and the QCD Compton 
process qg —>■ qy (ISUB = 30). The Compton process dominates over the whole pj range. 

The main background comes from QCD dijet events, where one of the jets is misidentified as a 
photon in the calorimeter. For background estimation, QCD jets are simulated with the same version 
of PYTHIA and similar intervals of pj. Table 1 gives the cross-sections for the various pj intervals for 
7+ jet and QCD dijet events. 

In this study, the leading jet pj balance method is used, which consists of first selecting the leading 
photon in the event and then selecting the leading jet in the opposite hemisphere. The balance is inves- 
tigated at the reconstruction level using full detector simulation. The cone jet algorithm with R = 0.7 is 

4 The jet pj is used as an approximation to the parton pj. 



Table 1: 7+ jet and QCD dijet cross-sections for the various pj intervals. The total inelastic 
cross-section is 8 x 10 11 pb. 



pi interval 


7+ jet a (pb) 


Dijet a (pb) 


a ratio (Dijet / y+ jet) 


17 < p T < 35 GeV 


2.61 x 10 5 


1.38 x 10 y 


5.29 x 10 3 


35 < p T < 70 GeV 


2.76 x 10 4 


9.33 x 10 7 


3.38 x 10 3 


70 < p T < 140 GeV 


2.59 x 10 3 


5.88 x 10 6 


2.27 x 10 3 


140 < p T < 280 GeV 


1.99 x 10 2 


3.08 x 10 5 


1.55 x 10 3 


280 < pt < 560 GeV 


1.17 x 10 1 


1.25 x 10 4 


1.07 x 10 3 


560 <p T < 1120 GeV 


4.90 x 10- 1 


3.60 x 10 2 


7.35 x 10 2 
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Figure 7: The most probable value of the balance at reconstruction level for cone jets with R = 0.7. Black 
and dots are for default and tight selection, respectively, and the points show the truth level balance. The 
back-to-back A0 cut is applied. 



used. Only jets with |tj| < 2.5 are considered. 

Two different photon selections have been considered. The tight photon selection adds isolation 
criteria in the calorimeter and tracker to provide further jet rejection, as discussed later in this section. A 
calorimeter isolation criterion is applied on the relative transverse isolation energy in a cone with half- 
opening angle 0.2, requiring that £r(cone)/jE>T,y < 0-05. The track isolation criterion requires that the 
number of tracks pointing to any jet around the photon is less than three. Here, track parameters are 
measured at the origin. The tight selection maintains a good efficiency of > 60% above 100 GeV while 
it is significantly reduced at low pj. These tight selection cuts have not been optimized. Their purpose 
in this note is to show that additional purity can be obtained, without too much loss of efficiency, at least 
for the higher pj range. The photon energy calibration, pj (reconstructed )/pj (truth), has been checked 
and is between 0.995 and 1 over most of the pj range. No photons impinging in the calorimeter crack 
regions with 1.37 < | r\ \ < 1.52 are used. 

The average response of jets over the various calorimeter regions was checked and a residual miscal- 
ibration of about 1 % was found for all pj values. Jets close to the crack regions are not well calibrated, 
therefore jets with 1.3 < |t] | < 1.8 are excluded from further analysis. Eventually we will rely on dijet 
balance to correct for such tj -dependent effects in the calibration (see Section 5). 

Figure 7 shows the measured pj balance. The photon pj is used as the reference and the standard 
back-to-back A0 cut is applied. Table 2 shows the fitted mean, the statistical uncertainty and the inte- 
grated luminosity for each pj interval for the tight photon selection. The pj balance above 80 GeV is 
flattening at the level of -0.02. At low pi the balance with the tight photon selection is a few percent 
below the balance measured with the default cuts. The photon reconstruction efficiency is low, and the 
stringent isolation cuts likely bias the sample composition rejecting events with strong ISR or underlying 
event. The match between the leading reconstructed jet and the leading truth jet is better than 98.5% 
for jets above 70 GeV; it degrades to 85% for jets above 17 GeV. The reconstruction level pj balance 
is one percent below the truth level pj balance. This is the result of the residual miscalibration of the 
jets in the particular release of the ATLAS event reconstruction software, as checked by comparing the 
reconstructed jet pj with the truth particle level jet pj. 

The PYTHIA QCD jet samples described in Table 1 were used to estimate the influence of the dijet 
background. The jet rejection factors for the standard photon selection and the tight photon selection were 
studied as a function of jet pj. With the default photon selection criteria, the probability of identifying 
a jet as the leading photon is of the order of 10~ 3 and has some pj dependence. The resulting signal to 
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Table 2: Fitted mean pj balance, error and integrated luminosity of the analysed samples for the 
various pi intervals for the tight photon selection. The last column shows the precision expected 
for 10pb~ ! obtained by scaling the error according to recorded integrated luminosity. 



;? T low ed£ 


e Bin width 


Fitted balance 


Integrated luminosity, pb 1 


Error for 10 pb 1 


20 GeV 


10 GeV 


-0.052 ±0.007 


0.67 


0.2% 


30 GeV 


15 GeV 


-0.042 ±0.005 


0.67 


0.2% 


45 GeV 


22.5 GeV 


-0.047 ±0.005 


9.1 


0.4% 


67.5 GeV 


33.5 GeV 


-0.027 ±0.003 


9.1 


0.4% 


101 GeV 


51 GeV 


-0.026 ±0.003 


47 


0.7% 


152 GeV 


76 GeV 


-0.018 ±0.002 


47 


0.4% 


228 GeV 


114 GeV 


-0.016 ±0.002 


535 


1.7% 


342 GeV 


171 GeV 


-0.021 ±0.005 


535 


4% 


513 GeV 


256 GeV 


-0.006 ±0.026 


535 


19% 



background ratio is estimated to be of the order of 1 above 100 GeV and degrades to about 0.1 for lower 
pi. The tight photon selection improves the signal to background reduced by a factor of three or more. 

Figure 8 left shows the pj balance distribution for QCD background events passing the default photon 
selection in the interval 140 < PT,y < 280 GeV. The back-to43ack A0 cut has been applied. The fitted 
most probable value of the distribution with the default photon selection and the A0 cut is 0. 13. Applying 
the tight selection reduces the number of background events, but a few events still remain in the tail. It 
will thus be desirable to maintain a good signal to background ratio to avoid a bias that would be difficult 
to control. For the tight photon selection, the mean of the Gaussian fit to the signal shown in Fig. 8 (right) 
for the interval 140 < pj^ 7 < 280 GeV changes by less than 1% when the background is added in. 

Thus, the background studies have shown that above ~ 80 GeV, the signal to background ratio is 
good enough to avoid bias. A more detailed and higher statistics study of the background rejection at 
lower pi should be carried out, to understand if the range of high precision could be extended to lower 
Pt- 

Table 2 shows that a good statistical precision is obtained for a relatively small integrated lu- 
minosity. For example, 100 pb -1 is sufficient to reach a precision of 1 —2% for jets in the range 
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Figure 8: Left: pi balance for the background sample of 140 < pj < 280 GeV for the default and 
tight photon selection. Right: pj balance for the signal and background sample in the interval 
96 < /?t,7 < 224 GeV for tight photon selection. 
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300 < Ej < 500 GeV. We expect that the threshold of the un-prescaled photon trigger will be 60 GeV 
for early running. Hence above that value the full statistics should be recorded while prescale factors 
will have to be applied for lower pi. 

4.4 Analysis of Z + jet events 

A sample of inclusive Z boson events corresponding to an integrated luminosity of 500 pb 1 has been 
produced using ALPGEN. The MLM matching cut is imposed at pj > 20 GeV and \r]\ < 6, and the 
factorization scale is set at m\ + p\ z where mz is the Z boson mass. The invariant mass is required 
to lie between 40 and 200 GeV. A generator-level filter requires one truth cone jet with R = 0.4, with 
Pt > 20 GeV and |tj | < 5.0 and two electrons/muons with p T > 10 GeV and \r}\ < 2.7 in the event. 

Another sample of inclusive Z events corresponding to 120 pb -1 , as well as the relevant background 
samples have been generated using PYTHIA. The signal has been generated via the Drell-Yan process 
(ISUB = 1) [5]. As for 7+ jet events, the qg Compton process dominates over the whole p T range. The 
Drell-Yan process contributes due to higher order QCD radiation which may reproduce the leading jet. 

The Z — > ee and W — > ev events are produced with a generator-level filter requiring one truth 
electron with pj > 10 GeV and \r] \ < 2.7. The Z — > TT events are generated with a filter requiring two 
electrons/muons with pj > 5 GeV and | T] | < 2.8. For Z — > TT and Z — > ee the dilepton mass is required 
to be larger than 60 GeV. For more details on the data samples see Ref. [16]. 

The fully simulated events are required to pass the isolated single-electron trigger with a p T threshold 
of 25 GeV or the isolated di-electron trigger with the pj threshold of 15 GeV at trigger level 1 and 2. 
Electrons are required to have pj > 25 GeV, |tj| < 2.5 and to be outside the calorimeter cracks defined 
as 1.37 < |t]| < 1.52. Jets are reconstructed with the cone tower algorithm with R = 0.7. They are 
required to have a distance of AR > 0.4 from a reconstructed electron and pj > 30 GeV. 

The Z bosons are selected by requiring two reconstructed electrons with invariant mass in the range 
mz ± 20 GeV. If more than two electrons are reconstructed, the pair with the invariant mass closest to mz 
is chosen. The combined distribution of the invariant mass for the signal and background events is shown 
in the left panel of Fig. 9 for ALPGEN Z+jet events where jets with p T > 40 GeV are reconstructed using 
the cone tower algorithm with R = 0.7. The main backgrounds are from QCD multijets and from top 
production, while Z — > TT and W — > ev events are an order of magnitude smaller. The background 
level is reasonably low. Requiring the jet and the Z to be back-to-back in azimuth within 0.2 reduces the 
background to a negligible level. 

The right panel of Fig. 9 shows the measured balance for cone jets with R = 0.7 as predicted by ALP- 
GEN for 500 pb -1 . The precision with which the pj balance can be measured experimentally depends 
on the efficiency for reconstructing the Z in its electron or muon decay channels, on the trigger efficiency 
and on the experimental width of the p T balance distribution. Table 3 shows the fitted most probable 
value of the p T balance and the statistical precision for an integrated luminosity of 500 pb -1 obtained 
with the Z — > ee sample. Below 200 GeV, the precision is of the order of 0.8%, while above 200 GeV, 
it gets worse. The Z —>■ jiji decay mode can also be used. A similar efficiency and background level 
can be achieved as shown in [16] doubling the available statistics. A statistical precision of 1% can be 
achieved below 200 GeV with an integrated luminosity of 100pb _1 , while above the precision is at the 
level of 2%. This assumes also that the complete tj range can be used for the measurement, relying on 
techniques like dijet balancing to ensure uniformity across 77 to the desired precision. 

The measurement is also affected by systematic uncertainties. The balance is sensitive to the correct 
modeling of higher order radiation. Comparing the predictions from different Monte Carlo simulation 
programs gives an indication of the size of such effects. This is illustrated in Fig. 10, in which ALPGEN 
and PYTHIA samples of Z+jet events corresponding to 120 pb -1 and 500 pb -1 (ALPGEN only) are 
compared after applying the A0 cut, for cone jets with R = 0.7. PYTHIA predicts a more negative average 
pj balance at low pj than ALPGEN. This effect is related to broader distributions and larger tails in the 
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Figure 9: Left: Distribution of the dielectron mass for Z — > ee+ > ljet events and the relevant background 
in a simulated event sample corresponding to an integrated luminosity of 200 pb -1 with cone jets with 
R = 0.7. Right: The pj balance for an integrated luminosity of 500 pb -1 of cone jets with R = 0.7 in 
events generated with ALPGEN in 5 bins of pr,z- The red dots are for reconstructed jets, solid triangles 
for truth jets and open triangles for truth in bins of average pj z and jet pj. 



Table 3: The fitted most probable value of the pj balance and the statistical precision for an 
integrated luminosity of 500 pb -1 obtained with the Z — > ee + jets sample for reconstructed cone 
jets with R = 0.7 and R = 0.4. 



pj interval 


cone 0.7 jet balance 




cone 0.4 jet balance 


error 


30 < p T < 40 GeV 


-0.032 


0.008 


-0.150 


0.007 


40 < p T < 60 GeV 


-0.059 


0.007 


-0.162 


0.008 


60 < p T < 100 GeV 


-0.032 


0.006 


-0.113 


0.007 


100 < p T < 200 GeV 


-0.027 


0.007 


-0.081 


0.008 


200 < pt < 400 GeV 


-0.026 


0.014 


-0.060 


0.016 
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Figure 10: The pj balance for an integrated luminosity of 120pb _1 and 500pb _1 in events generated with 
ALPGEN (dots and triangles, respectively) and for 120pb _1 in events generated with PYTHIA (squares) 
in bins of piz for cone jets with R = 0.7. 



balance distributions resulting from low pj activity in PYTHIA induced by higher order radiation. The 
differences between the two generators can be tested with ~ 100pb _1 of data for pj < 100 GeV. 



4.5 The missing Et projection method 

The missing Ej projection method is an alternative approach to test the jet energy scale and has been used 
successfully by the DO experiment to determine the response of their calorimeter to hadronic jets [17]. It 
is explained here for the example of 7+ jet events, but it can also be used for Z+jet events. Details about 
the study presented in this section can be found in Ref. [15]. 

At leading order, momentum conservation between the photon and the jet gives 

E T,y + PT, parton = 0 . (3) 

Neglecting parton showering and hadronization effects, this can be rewritten at particle level as 

^r + ^TJet-O, (4) 

where momentum has been replaced by energy, which is a good approximation for light quark jets. 

The systematic effects of parton showering and hadronization, including initial state radiation (ISR) 
and final state radiation (FSR), can be reduced by requiring that the leading jet and the photon are back- 
to-back within 0.2 in azimuth in a similar way to what was shown in Fig. 2. 

At the calorimeter level, the balance equation is modified to 

e£ T , y + 7'( £ jet)^T,jet = ~Ej lss . (5) 

where e, j are the electromagnetic and hadronic responses, -Ecaio/^particie, of the calorimeters, respectively. 
It is implicit in Eq. 5 that energy deposited in the calorimeters by the underlying event and pileup is 
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symmetric in 0. These assumptions have been tested and their effect is small. Anticipating that the 
electromagnetic scale will be well calibrated, for example by using the electrons from the decay of a Z 
boson, we can assume that e « 1. The quantities can then be projected in the direction of the photon, 
yielding 

£t, 7 + y(£jet)£'T,jet •hy=—hy £"™ ss , (6) 

where n y is the unit vector in the direction of the photon. Using the relation 

£™- = -£ T>r -£'£ T , (7) 

where £' indicates a sum over all activity in the calorimeter other than the 7 , the response can be further 
simplified as 

J=Z ~ = — „ > (°) 

where Eq. 4 was used in the last step. In this form it is clear that this method of measuring the response 
is independent of the underlying event, since the hadronic activity outside the 7+ jet system is approxi- 
mately 0 -symmetric and its contribution to the sum cancels out. It is also (mostly) independent of the jet 
algorithm. 

However, the jet response depends on the jet energy because the relative fraction of EM energy in the 
calorimeter level jet increases with increasing energy, hence a correction for non-compensation must be 
applied. Thus, the response has to be measured as a function of the jet energy. If we bin the response as a 
function of the measured jet energy (£j e t), it would be biased towards lower £x,y due to the bin migration 
described in Section 4.2. Instead, the jet energy is taken as E' = £x,y • cosh(% t ), which uses the much 
better measured energy of the photon and projects it in the jet direction; this procedure is based on the 
momentum balance of Eq. 4. The EM scale jet energy distribution is fitted with a Gaussian in each bin of 
reference energy E', and the jet energy scale is then calculated. The correspondence between E' and the 
measured jet energy is also determined. The resulting jet energy scale can then be plotted as a function 
of£ jet . 

The response function is then plotted for each algorithm as a function of the corresponding mean jet 
energy and parameterized by 

j(E) = b 0 + b 1 \n-^ + b 2 \n 2 -^- (9) 

£ scale £ scale 

with £"scaie set to 200 GeV and the b parameters fitted, as shown in Fig. 11. 

The fit function is used to correct the jet energy at the EM scale £"™ eas . The corrected energy 
£caiib _ E T eas /j(E) is then compared to the truth information E T C in the Monte Carlo simulation. Fig- 
ure 12 is a plot of the ratio E T c /Ej- hb as a function of jet energy, which shows a linear response within 
« 2% over the range 50 - 900 GeV. 

This procedure accounts for the fact that the ATLAS calorimeters are non-compensating and that 
the electromagnetic content of a hadronic shower is energy dependent. Further adjustments, such as 
out-of-cone corrections, must be made to obtain the final jet energy scale. It is only at this point that 
jet algorithm dependent corrections are made and is therefore a useful method as a cross-check of our 
understanding of the energy scale with different sensitivity to systematic effects. 

4.6 Conclusions of the studies of ylZ + jet events 

The use of the in-situ 7/Z+ jet processes allows us to propagate the knowledge of the electromagnetic 
scale characteristic of the 7 or the Z —>■ it decay to the hadronic recoil system. Typically the leading jet 
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Figure 11: The energy dependence of the jet response for cone jets with R = 0.4. The solid line corre- 
sponds to the fit using Eq. 9. 
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Figure 12: The ratios E^ c /E^ lib (triangles) and Ef c /E^ eas (squares) for jets reconstructed i 
cone algorithm with R = 0.4. See the text for an explanation of the symbols. 



ing the 



is required to be back-to-back in azimuth with the y or the Z boson, and the pj balance between them is 
used to connect the two scales. 

The balance is affected by various physics effects which systematically limit the precision of the 
in-situ validation procedure. These effects can be as large as 5 — 10% at 20 GeV and tend to decrease to 
the percent level at about 100 GeV. 

The 7+ jet process has the advantage of higher statistics compared to Z+ jet. However, at low pi it 
may be seriously affected by background from QCD jets. Hence it may turn out to be most useful above 
~50 — 100 GeV where the signal to background ratio is more favorable and where an inclusive single 
photon trigger is available. A statistical precision better than a percent is achieved already with 10pb _1 
at 100 GeV, and in the range 300 - 500 GeV the percent level is reached with about 100pb~ 1 . The Z + jet 
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Figure 13: The jet response /^(reconstructed) / ^(truth) at the EM scale versus the jet pseudorapidity rj 



process allows us to measure the absolute energy scale in the low pj range with a statistical precision 
of 1% with ~ 300pb _1 of data. It provides also the highest pi reach with percent level precision in 
the range 300 — 500 GeV with ~ 200pb _1 . Above ~ 500 GeV, other methods will be used as discussed 
further on. 

5 In-situ studies using QCD jet events 

5.1 Calorimeter intercalibration in QCD dijet samples 

The calorimeter response /?T(reconstructed)//?T(truth) for jets at the EM scale reveals significant vari- 
ations with pseudorapidity r\ , as shown in Fig. 13. The dips correspond to known cracks and dead 
material regions. The response may also be non-uniform in azimuth 0 for the same tj due to the varia- 
tions in the basic calorimeter response and due to the non-uniform distribution of the dead material. The 
response in (j) and rj should become flat at the hadronic scale. This has to be checked in-situ with high 
precision, and if necessary, an intercalibration of different regions of the calorimeter has to be performed. 
For this study we use PYTHIA QCD jet events, generated in intervals of pj of the hard scattered partons, 
as described in Section 4.3. 

5.1.1 Intercalibration in azimuth 

The intercalibration in 0 can be checked using the high rates of QCD jet events. As the scattering cross 
section is constant in (j), the jet rates above a fixed pj threshold have to be nearly equal in different 0 
sectors, as shown in Fig. 14 (left) for 64 (j) sectors in the tj range |rj| < 0.1 which corresponds to the 
segmentation of the ATLAS hadronic calorimeter in one calorimeter wheel. Strongly deviating rates in 
particular sectors would point to dead calorimeter cells, losses in dead material, or noise. 

For a perfectly calibrated calorimeter the spread of rates around the mean rate value N should be 
t/N following the Poissonian distribution. Thus, collecting on average e.g. ~1000 events per sector, one 
would expect the spread of ~30 events, i.e. 3% of the average rate. A significantly larger spread would 
point to a relative miscalibration of different sectors. The spread can be obtained by filling a histogram 
with the rates and fitting it with a gaussian. Due to the strongly falling jet pj distribution the rates are 
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Figure 14: Left: The jet rate as a function of 0 for jets with the transverse momentum above a certain 
threshold. Right: Integrated luminosity required to collect 1000 events with jets above the given pi 
thresholds in each of the 64 0 sectors in the region |tj | < 0.1. 



extremely sensitive to the jet energy scale: a 1.5 — 2% shift in the energy scale would result in ~6 — 9% 
change of the rate, which is a 2 - 3a effect for the above example with 1000 events per sector. Thus, the 
statistics of 1000 events per sector should allow us to control the jet energy scale to within 1.5 — 2%. The 
luminosity necessary to collect such statistics in 64 0 sectors within the tj interval \r\ \ < 0. 1 is shown for 
different pj thresholds in Fig. 14 (right). A study of the systematics for this method is on the way. 

The trigger menu for the initial running period at the luminosity of 10 31 cm _2 s _1 is currently under 
development, and the effect of trigger prescaling has not yet been taken into account in this plot. The 
highest pj threshold for the inclusive jet trigger is expected to be at 100 — 150 GeV, above which no 
prescaling will be applied. The prescale factors will steeply rise for lower thresholds, such that the 
recorded event rate should be roughly flat in pj. Scaling the shown luminosity values for the lowest two 
Pt bins down with the currently foreseen prescale factors shows that the rate of recorded events will be 
still sufficient to reach the statistical precision of 1.5 - 2% with ~ 10pb _1 of data. 

However, the rates may be affected by the calibration of the level 1 calorimeter trigger. To avoid a 
bias, the pj thresholds chosen for this analysis should be significantly higher than the respective trigger 
thresholds. This will lead to a loss of statistics, and thus a higher integrated luminosity will be necessary 
for the same statistical precision. 



5.1.2 Intercalibration in pseudorapidity 

The intercalibration in tj can be done based on the pj balance in QCD dijet events. The roughly three 
orders of magnitude higher cross section of dijet events, as compared to Z/y+jet events, allows an 
intercalibration of the calorimeter with higher granularity, higher precision and for higher energy ranges 
than can be achieved in Z/y+jet events for the same integrated luminosity. Applying the calibration 
based on the pj balance, one corrects for the relative difference in calorimeter response for jets, the noise 
and pile-up in different regions of pseudorapidity tj . The goal is to achieve a flat response in tj and to 
extend the previous calibrations for |tj| < 2.5 to higher pseudorapidities. 

The method is straightforward: one calorimeter region in pseudorapidity tj is chosen as the reference 
region and jets in other tj regions are calibrated relative to jets in this region. Normally, one calorimeter 
tj wheel in the central range, far from the crack regions, will be chosen as the reference region. Other 
considerations are the existence of dead or noisy cells in a particular part of the calorimeter, the unifor- 
mity of the response in 0 within one wheel etc. The width of the region will also depend on the available 
statistics. 
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Figure 15: Left: The asymmetry A as measured with both jets in the central region |tj| < 0.7 as defined in 
Eq. 10. Right: The mean asymmetry obtained from gaussian fits, plotted as a function of the half scalar 
sum of pj of both jets at the reconstruction level (closed circles) and at the truth particle level (stars). 



We choose to characterize the /^-balance in dijet events by the asymmetry A given by 

probe _ re f 

A= _Pj_ (1Q) 

( P r be +PT f )/2 

where pj f is the transverse momentum of the jet in the reference region, and pj 0be is that of the jet 
in the region to be calibrated. Using this form, the pj balance is symmetric, as shown in Fig. 15 (left) 
for reconstructed cone jets with R = 0.7. It thus provides better properties than the simple definition 
Pj° he / pj f which is intrinsically asymmetric. 

To limit ISR/FSR and thereby reduce the width of the asymmetry distribution, a cut on the azimuthal 
angle between the two jets, A(j) > 3, is applied. The fitted mean value of the asymmetry as a function of 
pj is displayed in the right panel of Fig. 15 and shows that for this definition there is very little bias both 
for truth particle jets and for reconstructed jets at the hadronic scale. 

Further cuts can be applied to obtain a cleaner dijet sample and thus reduce further the width of the 
pj balance distributions. For example, the total number of reconstructed jets with pj > 10 GeV can be 
limited to Nj et < 4 or even more strongly to Nj et = 2. However, these additional cuts reduce the statistics 
of the sample, as demonstrated in Fig. 16, which shows the integrated luminosity required to reach a 
statistical precision of 0.5% of the pj balance fit mean value in the probe range 0.7 < r\ < 0.8, where 
0 < |tj| < 0.7 is taken as the reference region. The luminosity values are shown for different sets of 
cuts. For the case of applying only the back-to-back cut on the angle between the two leading jets, 0.5% 
precision can be reached in this r\ region with 10pb~ ! of data for pj ~ 300 GeV, and with 100pb _1 for 
pr < 500 GeV. 

Similar plots were produced for the other X] regions, which show that an order of magnitude more 
luminosity is necessary for \r\ | > 2.5, compared to the central region. On the other hand, the size of the 
reference region can be increased sequentially, i.e. as soon as one r\ has been checked or recalibrated, it 
can be added to the reference region to study the next Y\ range. 

As in the luminosity estimation for the 0 intercalibration in Section 5.1.1, the trigger prescaling has 
not yet been taken into account in Fig. 16. As in Sect. 5.1.1, scaling the shown luminosity values for 
the lowest two pj bins down with the currently foreseen prescale factors shows that the rate of recorded 
events will be sufficient to reach 0.5% precision with ~ 10pb~ ! of data. 

The calibration method is further tested in the full r\ region for different pj ranges. For each v\ 
region / in each pi range j, the asymmetry A defined in Eq. 10 is fitted and the mean value A,y is used to 
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calculate a correction factor c,y via 

In the case of an imperfect original calibration, factors deviating from 1 are obtained. Each jet pj can 
then be multiplied by the appropriate factor to restore the balance of fully simulated jets. More details of 
this study can be found in Ref. [18]. 



5.2 High pj jet calibration 

The calibration of the jet energy scale for high pj jets (with pj ~ 500 GeV) is a challenge at hadron 
collider experiments. In this energy range, methods developed for lower pj jets, normally using well 
defined objects as reference, start to fail. This difficulty will be further increased at the early stage of 
data taking, where both our understanding of the detectors and the statistics of such high pj jets are 
usually limited. Our goal is to develop a calibration technique for jets with pj > 500 GeV based on 
data collected with an integrated luminosity of 0.1-1 fb _1 . The basic idea is to determine the energy 
scale of high pj jets from lower pj jets, for which the energy scale will be obtained by the techniques 
described earlier in this note. Two calibration techniques are discussed in the following sections: 1) 
the correlation between different pj jets in momentum balance in the transverse plane, and 2) the angle 
between particles in the jets. 



5.2.1 Multijet p T balance method 

This method is based on selecting events with multiple (> 2) jets and calibrating the energy scale of 
the highest pj jet by requiring a pj balance between the leading jet and the system of remaining lower 
pi jets. The absolute jet energy scale of the non-leading jets, which is expected to be obtained from 
photon+jets, Z+jets and/or W — > jet jet studies, is thus propagated to the JES of the higher pj jets using 
pj balance methods. The large QCD jet cross-section allows for a very high reach in pj. This method 
has been studied with samples generated with both ALPGEN and PYTHIA using the fast and full detector 
simulation. Two different analysis techniques were applied as described in the following. 
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Figure 16: Integrated luminosity required to reach 0.5% precision for various pj ranges in the region 
0.7 < 77 < 0.8 with different sets of selection cuts: all PYTHIA dijet events (circles), requiring A(j) > 3 
between the two leading jets (triangles), requiring in addition less than 4 reconstructed jets in an event 
(squares), requiring exactly two reconstructed jets (stars). The reference region is 0 < |tj| < 0.7. 
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Figure 17: Energy scale (left) and energy scale uncertainty (right) of high pj jets relative to lower pi 
remnant jets as a function of jet p T , obtained by multijet p T balance method at an integrated luminosity 
of 1 fb -1 . The error bars shown are statistical only. 

ALPGEN study 

For this study QCD multijet samples with 4, 5 and 6 partons were generated using ALPGEN interfaced 
with JIMMY. Four data samples were formed by requiring the leading partem pj to be larger than 400, 
600, 800, or 1000 GeV, while the pj of the next-to-leading partem was required to be smaller than 200, 
300, 400, or 500 GeV, respectively. The total number of partons (p T > 40 GeV) can be 4, 5 and 6 in each 
of the above samples. All the partons are required to be in the range \ r]\ < 3. These samples, referred to 
as PT400, PT600, PT800, and PT1000 hereafter, were processed through the full detector simulation. 

At least four cone jets with R = 0.7 with p T > 40 GeV must be reconstructed in each event. The 
leading jet p T in the samples PT400, 600, 800, and 1000 must be in the interval 400 < p 1 * 1 < 460 GeV, 
600 < pi* 1 < 700 GeV, 800 < ^ T etl < 920 GeV, and 1000 < ^ T etl < 1140 GeV, respectively, while the 
pi of the next-to-leading jet must correspondingly respect the bound p^ a < 190 GeV, p^ t2 < 280 GeV, 
p*j 2 < 370 GeV, and p 1 ^ 2 < 470 GeV. In addition the leading jet and the vector sum of the remaining 
lower pi jets with pi > 40 GeV (called remnant jets) are required to be back-to-back in azimuth within 
±20°. After all the cuts, about 3200, 310, 40, and 11 events remain in the PT400, 600, 800, and 1000 
samples, respectively, for an integrated luminosity of 1 fb -1 . 

The distribution of the pi balance, defined as 



L non - leading jets | £ pi | ' 

is plotted for each sample and fitted by a Gaussian. The mean values of the Gaussian fits with their 
uncertainties for 1 fb -1 are shown in Fig. 17 as a function of p^ tl . The figure indicates that B' z could be 
determined with a statistical accuracy of 2% or better for high pi jets with 400 < pi < 1 100 GeV and 
that it shows a constant offset of 2-3% over the p^ 1 range, as also observed in the PYTHIA study. 

Various potential sources of systematic effect have been studied. Low pi jets will not contribute to 
remnant jet system because of the 40 GeV pi threshold applied to the jets. If the threshold is varied by 
±20 GeV, B f z varies by ±1% at most. The effect is smaller, for the same range of thresholds, than in the 
PYTHIA analysis because of the back-to-back requirement imposed on the system. A sizable fraction of 
the soft radiation is still missed is the event. This is the main cause of the positive B' z offset. It could 
be reduced by requiring the overall event pi to be balanced, e.g. by requiring that Ej lss < 40 GeV. 
In that case, decreases by 1-2% and becomes close to 1. However, an Ej lss cut should be used 
with caution as fake £"™ ss could cause biases. The effect of the underlying event on B' z has also been 
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evaluated by subtracting the Ej measured in a cone (R = 0.7) placed randomly in a region of azimuth 
within 60° < A(j) < 120° from the leading jet and not overlapping with any of the remnant jets. After 
subtraction, B' z increases by 1-2%. 

The precision on the JES for high pj jets is affected by the JES uncertainty of the remnant jets. 
If we assume the uncertainty to be ±7% for low pi jets with \r\ \ < 3.2, which is the current linearity 
performance of the HI weight calibration method used in this analysis [1], vary the scale of the remnant 
jets by ±7%, and apply again all the event selection, B' z changes by ±7% independently of . This 
provides an estimate of the JES uncertainty for high p T jets. The uncertainty on B'^ is increased since 
more events fail the next-to-leading jet pj cut that is included in the overall JES uncertainty. 

Adding the statistical and above systematic uncertainties in quadrature, the JES uncertainty obtained 
from multijet balance method is estimated to be about 8% in the pj range of 400 < pj < 1100 GeV 
and is totally dominated by the absolute JES uncertainty for lower p T remnant jets. If the absolute JES 
uncertainty is not included in the systematics, the relative JES uncertainty is 3%. 

pythia study 

In this study, PYTHIA samples with full and fast detector simulation were used. The fully simulated 
sample was generated with kinematic cuts of 280 < pj < 560 GeV for the hard scattering process. It 
corresponds to an integrated luminosity of about 25 pb -1 . Cone jets with R = 0.7 were reconstructed 
based on calorimeter towers, topoclusters and on Monte Carlo truth particles. 

The fast simulation package ATLFAST was used for a high statistics study. A cut on the hard scat- 
tered parton at pi > 280 GeV was applied. The sample represents an integrated luminosity of about 
0.8 fb _1 . Cone jets with R = 0.4 were used for this study. 

Events are selected with a minimum of three reconstructed jets, the leading jet being above a fixed 
Pt threshold, up to which the JES is assumed to be calibrated. All non-leading jets are required to be 
below that value. A further event selection cut is |rj | < 2 for the leading jet. Generally non-leading jets 
have been reconstructed up to | r/ 1 < 5 and down to pj > 10 GeV. For specific studies of their systematic 
influences all these cuts have been varied. 

Figure 18 presents results of the method for the sample using full detector simulation. The event 
selection cut on non-leading jets is pj < 300 GeV. On the left, the pj balance B' z is shown for leading 
jets with 370 < pi < 380 GeV, together with a Gaussian fit. While non-Gaussian tails are visible, which 
could be partly reduced by a stricter event selection, they do not influence the fit and it was thus decided 
to use the maximum statistics. 

To reduce the bias of the p T balance due to migration effects, the division of the p T region into bins 
is done based on the average pj of the sum of the non-leading jets and the leading jet of each event. This 
reduces the bias to about 1% at truth level. This is similar to what is studied in the Z/y+ jet analyses 
(Section 4.2), where the average (px,y/z + Fr,jet)/2 is an option for the binning. 

On the right of Fig. 18 the JES was studied from 350 GeV to 550 GeV. The result is reasonably flat 
but shows a systematic offset of about 2%, as also observed in the ALPGEN study. There are various 
effects contributing to this imbalance, mostly originating from soft radiation affecting the low pj jets 
in the hemisphere opposite the leading jet. At truth jet level, changing the minimum jet pj from 5 
to 10/20/40 GeV introduces a negative bias of 0.2/1/3%, respectively. The pj threshold value for real 
data should be chosen taking the noise, pile-up conditions and jet finding efficiency at low pi into 
account. Out of cone losses also play a role: changing the cone size from 0.7 to 0.4 at truth level, affects 
the measured balance by ~ 1%. Furthermore, imperfect calibration of low pj jets at the level of few 
percent [1] may be a cause of bias. 

In a high statistics study, based on ATLFAST, the non-leading jets were restricted to a pj value below 
350 GeV during event selection. Figure 19 shows the result on the left. The pj balance distribution is 
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Figure 18: Left: the ratio of the absolute value of the vector sum of the non-leading jet pj to the leading 
jet pi for the pi bin 370-380 GeV fitted by a Gaussian. Right: this ratio as a function of jet pj. The 
mean and the error of the mean of the Gaussian fits are shown. The average of the leading jet pj and of 
the total pi of the non-leading jets is used for the binning. 
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Figure 19: Results using ATLFAST with cone jets with R = 0.4. Left: The fitted balance as a function of 
the average leading and non-leading jets pj. Right: Iterations of the method using the pj range checked 
by one iteration as the reference region for the next. 



lower than for cone jets with R = 0.7, as discussed above. Truth and ATLFAST jets give closer results in 
this case, since the latter are truth jets smeared for resolution effects. 

The reach in pj of the method can be further increased by iterating the procedure. Apart from the 
QCD jet cross-section, the statistics are mainly limited by the cut on the pj of the non-leading jets during 
event selection, essentially requiring that one partem in the hard scattering lost a significant part of its 
energy due to gluon radiation. For higher pj ranges this cut should be moved to higher values to obtain 
sufficient statistics. The procedure is thus, after applying the method once, up to a certain pi, to use 
the range up to this pj as reference region for a next iteration. Results using this approach based on 
the ATLFAST sample are shown in Fig. 19 on the right hand side. For the studied high pj region, the 
multijet statistics predicted by PYTHIA may be an underestimation, as the PYTHIA parton shower model 
is not perfectly tuned for this pj range. 

A crucial feature for this method, as observed in PYTHIA simulations, is that the imbalance value 
remains essentially constant for one iteration if the tested JES is correct. The balance can differ between 
the iterations. In general, it is slightly closer to 1 for higher pj ranges, as the system of non-leading jets 
becomes harder and thus less sensitive to low energy effects. To ensure that the JES is correct, we require 
that the beginning of the test region for a given iteration overlaps with the end of the region verified in 
the previous iteration, or at the very beginning verified by another method. 
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A complete understanding of the feature that the balance should remain constant for all pj within 
one iteration, is necessary in order to rely on it in data. In this case, in addition to the absolute JES 
uncertainty of the non-leading jets, the systematic uncertainty of this method is given by deviations of 
the balance from a constant value as a function of pj. 

To determine this additional systematic uncertainty, the deviations are split into a global slope of a 
linear fit and into local deviations around this fit. As possible sources of systematic effects, the absolute 
JES of the remnant system, the choice of the pj binning, the lower and upper pj thresholds for the 
non-leading jets, the effect of the underlying event and of the noise, and the effect of the varying jet 
multiplicity in the remnant system are studied. To estimate the deviations quantitatively, the absolute 
JES is varied by 10%; the lower jet pj threshold is varied between 10 and 50 GeV; the upper threshold 
for each iteration is varied by 100 GeV; the effect of the underlying event and of the noise is estimated by 
adding ±5 GeV to each jet in the event; and the analysis is performed separately for events with exactly 
4, 5 and 6 jets. 

The slope of the fit is compatible with zero within statistical uncertainties for all variations of the 
sources. The corresponding systematic uncertainties for each source is estimated to be ~ 0.1%. The 
estimation is limited by the available statistics. All local deviations from a linear fit are of the order 
of the statistical uncertainties. The width of their distribution is found to be less than 0.2% above the 
statistical expectation for each of the above influences. Adding all uncertainties quadratically results in 
a very small total systematic uncertainty of 0.5%. Thus, the main contribution to the JES uncertainty is 
expected to be the accuracy of the calibration in the reference region. The JES can then be calibrated up 
to px ~ 1-5 TeV, with the specific uncertainties introduced by this method being below 1% and requiring 
less than lfb 1 of integrated luminosity. 

5.2.2 Track angle method 

The idea behind this method is that the invariant mass of two particles in a jet is approximately constant 
(given by the scale of Aqcd), which leads to a pj 1 behavior of the r\ -0 distance between two particles 
in a jet AR, where pj is the transverse momentum of the jet. The dependence of AT? on p^ 1 of the jet 
could provide a means of determining the energy scale of high pj jets once the JES of lower pj jets is 
calibrated with some accuracy using other in-situ techniques. The procedure to determine the JES of 
high pj jets is as follows: 1) measure AR for low pj jets with the calibrated JES in both data and Monte 
Carlo samples, 2) calibrate the simulation so that AR in the simulation matches AR in data in the low pj 
range, then 3) measure the AR for a sample of given high pj jets in data, and look for the Monte Carlo 
jet pj scale corresponding to the measured AR value on the curve of AR(p^ 1 ). 

The studies were performed on PYTHIA dijet samples generated in intervals of pj. The jets selected 
have to have pj > 20 GeV and should not overlap with other physics objects within R < 0.4. Afterward 
all tracks are selected which fall in a cone of R = 0.4 around reconstructed jet axis of the jet with the 
highest pj. If fewer than two tracks are found for the jet, the event is rejected. The AR of two tracks is 
then calculated for all combinations from two up to the five highest pj tracks in the jet. Figure 20 shows 
the mean of the AR values (one entry per event), normalized to an integrated luminosity of lfb -1 , for 
leading jets with 140 < p T ath < 160 GeV (top-left) and 1120 < p T ath < 1280 GeV (top-right). The two 
histograms correspond to the AR obtained from leading two and five tracks. When more low pj tracks 
are involved, the AR distribution broadens as a result of jet fragmentation. 

The bottom plots in Fig. 20 show the mean (left) and most probable value (MPV) obtained from a 
Landau fit to the peak (right) of the AR distributions as a function of the leading jet truth pj. Both mean 
and MPV are well fit by a function of the form po/x + pi as shown by the curves. Figure 21 (top-left) 
shows the uncertainty of the jet pj scale obtained by evaluating the jet pj range on the curves covered 
by the statistical uncertainties of the data points in the bottom of Fig. 20 and dividing the pj range by 
the jet truth pj. The pj scale uncertainty varies significantly at high pj with different choices of AR 
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Figure 20: Top: Distributions of the mean of the AR values for the leading two (solid histogram) and 
five (dashed histogram) tracks in jets with 140 < p^ ath < 160 GeV (left) and 1120 < ^ rath < 1280 GeV 
(right) for an integrated luminosity of lfb 1 . Bottom: Mean (left) and most probable value obtained 
from a Landau fit to the peak (right) of the AR distributions as a function of the leading jet truth p T for 
the leading two (solid points) and five (open points) tracks. The curves represent fits with a function of 
the form po/x + p\. 
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values. In general, the MPV is more stable than the mean value as it suffers less from fluctuations in jet 
hadronization. The MPV-based scale uncertainty seems less dependent on the leading track multiplicity 
in PYTHlA-only comparisons, but the situation changes when PYTHIA is compared with HERWIG as in 
Fig. 21 (top-right). Apparently the choice of the leading two tracks is more robust against different 
jet fragmentation models, and therefore the AR obtained from the MPV of leading two tracks is used 
hereafter as a central value. 




Figure 21: Top-Left: Jet pj scale uncertainty (statistical uncertainty only) as a function of jet truth pj 
obtained for different choices of AR values and track multiplicities. Top-Right: The most probable AR 
of the leading two and five tracks as a function of the jet truth pi in PYTHIA (open points) and HERWIG 
(solid points). Bottom-Left: Default fit (solid curve) to the most probable AR of the leading two tracks as 
a function of the truth jet pj and the curves corresponding to ±5% JES variations at p 1 ^ 1 = 5 GeV (dashed 
and dotted). Bottom-Right: Total and individual systematic and statistical uncertainties as a function of 
the truth jet pj expected to be obtained from the track angle method for an integrated luminosity of 
lfb- 1 . 

The following sources of systematic uncertainties are considered: 1) the jet fragmentation model 
as discussed above, 2) the track selection cuts and reconstruction inefficiencies, 3) absolute JES at low 
energies, and 4) the uncertainty associated with the fit. The systematic uncertainty on the jet pi scale 
is evaluated by obtaining the AR versus jet pj distributions shifted by ±la (or the equivalent amount 
specified) of the source, and looking at the variation in the jet pj at the measured AR values. For 1), the 
difference between PYTHIA and HERWIG is assigned. For 2), the deviated distributions are obtained by 
applying different selection cuts on tracks and assuming an extra 5% reconstruction inefficiency for tracks 
in jets. The JES uncertainty for low pj jets is assumed to be ±5% at p*j = 100 GeV, and the deviated 
distributions are obtained by estimating the corresponding variations in AT? at pJj =100 GeV using the 
default fit function and transferring the variations to uncertainties on the parameters of the function. The 
JES deviated and default distributions are shown in Fig. 21 bottom-left. The total uncertainty obtained 
by adding all the systematic uncertainties and the statistical uncertainty in quadrature, and individual 
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systematic uncertainties as a function of the jet truth pj are shown in the bottom-right of the figure. With 
the assumption of ±5% JES uncertainty at $ = 100 GeV, the JES of 150, 300, 600, and 1200 GeV p T 
jets can be potentially measured with an accuracy of 8, 10, 13, and 21%, respectively, with an integrated 
luminosity of lfb 1 . 

5.3 Obtaining jet energy resolution in QCD dijet sample 

A precise measurement of the jet energy resolution is necessary to model and control the systematic 
effects associated with it on jet observables and the quantities used in physics measurements. A key 
issue in estimating the jet energy and momentum resolution is the need to disentangle detector effects 
from those associated with physics such as the underlying event, and initial and final state radiation. 

Two data-based techniques are presented which allow the jet energy resolution to be estimated, and 
they were compared where appropriate to the resolution obtained using particle jets: the dijet balance 
method, used by the D0 collaboration, and the kj balance technique, developed by UA2 and used by 
CDF. 

The QCD dijet events for this analysis were generated using PYTHIA in the pj range from 17 GeV 
up to 1120 GeV. Jets were reconstructed using the cone algorithm with R = 0.7. 
To select dijet events the following basic cuts are applied: 

• One primary vertex; 

• Exactly two back-to-back leading jets with |A0| < 0.3 and p T > 10 GeV; 

• Both jets in the same r\ region \r\ \ < 1.2. 

The A0 and minimum pi cuts are varied for some systematic studies. 

5.3.1 Dijet balance method 

The determination of the jet p T resolution in the dijet balance technique is based on energy conservation 
in the transverse plane. It assumes that there are only two jets in the event which have similar p T . If one 
defines the asymmetry distribution A for the two jets as: 

^ _ Fr,i — Pt,2 ^ (13) 
Pti + Pt,2 

then for jets in the same rapidity region, which therefore have the same resolution, the fractional trans- 
verse jet energy resolution can be expressed as a function of o A : 

^ = V2a A , (14) 

PT 

where p T = (pr,i + Pt,i)/2. 

The asymmetry distribution A for two representative pj bins obtained by using cone jets with R = 0.7 
are shown in Fig. 22. The distributions have been symmetrized by computing either pj i — /?t,2 or 
Pt,2 — Pt,i randomly for each event. The resolution function is well described by a Gaussian fit, al- 
though it is worth mentioning that previous studies done by the D0 Collaboration suggest using a double 
Gaussian fit for large pj bins, to absorb effects of non-Gaussian tails. 

The simple dijet balance method does not return the true fractional transverse energy resolution 
associated purely with detector effects, since the two leading jets of an event could be imbalanced due to 
undetected jets in the event with pj < 10 GeV such as from soft radiation. This radiation will broaden 
the resolution function and a correction may be derived by studying the effect of the cut on the pi of any 
third jet in the event as described in the next section. 
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40GeV<Jetp y < 60 GeV 300 GeV < Jet p T < 450 GeV 




Asymmetry Variable Asymmetry Variable 



Figure 22: Asymmetry distributions of two jets for two representative pj bins. Cone jets with R = 0.7 
in the pseudorapidity region |tj| < 1.2 are used. The distributions were fitted with a single Gaussian 
function. 



5.3.2 Soft radiation correction 

In our basic event selection, both the A(j) cut and the cut requiring exactly two jets with pi > 10 GeV 
are implemented in order to reduce the broadening of the pj balance distribution due to soft radiation 
effects. However, the effects are not fully suppressed, as further jets with pj < 10 GeV may remain in 
the selected events. To account for this effect, the jet resolution is computed by allowing a third jet with 
transverse momentum pj i, up to some threshold e, and varying the threshold values between e = 12.5 
and 27.5 GeV. As before, the resolution distributions in the jet pj bins are fitted using single Gaussian 
functions. For each jet pj bin, the set of resolutions obtained from the different /?t,3 thresholds are fitted 
with a straight line and extrapolated to e = 0. We define this value as: 




which would be the resolution that we would have measured from an ideal dijet sample with e = 0. 
Examples of the linear fits and the extrapolations for two pj bins are presented in Fig. 23. 

After fitting the resolution for the different third jet thresholds, we calculate a collection factor, 
K(pj), as follows: 



K(pr) = 




Finally, correcting by this factor, the unbiased fractional transverse energy resolution is obtained via 

fa \ e=10GeV 
= K(p T ) x ( — ) • (17) 

corrected V PT / uncorrected 

Since the soft radiation bias should be larger at small transverse energies, and negligible at high pi we 
parametrize K(pj), with the function: 

K(p T ) = l-exp(a-bp T ) . (18) 

This parametrization can be used to correct the resolution for each pj bin and determine the unbiased 
resolution associated purely with detector effects. The jet energy resolutions with and without the soft 
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Figure 23: Resolution versus the px,3 threshold cut for different pj bins. The line corresponds to the 
linear fit applied while the dashed-line shows the extrapolation to pj 3=0, which corresponds to an ideal 
dijet sample (e = 0). 



radiation correction are shown in Fig. 24. As we 
tained by other approaches we can establish an est 
resolution. 
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Figure 24: Jet energy resolution for cone jets with 
R = 0.7 in the pseudorapidity range |tj | < 1.2. The 
results are obtained by using dijet balance tech- 
niques with and without applying the soft radia- 
tion correction. 



see below, by comparing this result to that ob- 
e of the systematic uncertainty of the jet energy 
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Figure 25: Sketch of the kj balance technique. 
The T] axis corresponds to the azimuthal angular 
bisector of the dijet system while the y axis is de- 
fined as being orthogonal to the r\ axis. 



5.3.3 The k T balance technique 

The &t balance technique was developed by UA2 and studied at CDF. To extract the contributions of 
detector effects on to the jet energy resolution, the imbalance vector Kj = Pt.i + Pt,2 is projected onto 
two components (i/^tj) as shown in Fig. 25. 

The Tj axis corresponds to the azimuthal angular bisector of the dijet system while the y axis is 
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defined as being orthogonal to the T\ axis. The two components, Kj^ and Kj^, are sensitive to different 
effects. The calorimeter energy resolution represents the main source for oy, the spread of Kt, v . Gluon 
radiation effects are smaller but affect this component as well. On the other hand, cv,, the spread of Kj^, 
is significantly affected by gluon radiation. In addition, there are other smaller effects such as jet angular 
resolution, underlying event and out-of-cone fluctuations. In order to reduce the hard gluon radiation 
effects, events with pj 3 > 11 GeV are rejected. If we define the contributions from the calorimeter 
resolution, a res , and from soft radiation, Osr^ and (7sr ± , then we can write 

<*\ = °"re S + OsR|| 
= a S 2 R ± • 

Assuming 

tfs 2 R|l = oL x , (19) 
the soft radiation effects are removed by subtracting in quadrature from cy: 

a res = y/o$-o% . (20) 

The data sample was divided into six pj regions, and the distributions Kj^ and Kjy were fitted with 
single Gaussians. Since in this method the measured resolution comes from the convolution of the single 
jet resolution, to obtain the single jet energy resolution, a res must be scaled by a factor 

Figure 26 shows the resulting -^jOy and -j^Gr\ as a function of the square root of the average pj of 
both jets, cy has an approximately linear dependence with (pr l2 ), while o n has a flat dependence, 
especially at high pj, as expected. The effective jet energy resolution after removing soft radiation effects 
by subtracting in quadrature a n from cy is shown in Fig. 27. 
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Figure 26: ^jO'v' an ^ 72 a? 7 ^ n ^j et events as a function of the square root of the average pi of both jets. 



5.3.4 Comparison of dijet and kj balance techniques 

A comparison of the two methods, the dijet balance and the &t balance technique, allows an estimate of 
the systematic uncertainty associated with the determination of the resolution associated with detector 
response. By comparing the reconstructed jets with the jets from Monte Carlo truth we obtain informa- 
tion on its dependence on the jet algorithm and the jet matching. Each of the first two leading calorimeter 
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Figure 27: The effective jet energy resolution for 
cone 0.7 jets after removing soft radiation effects 
by subtracting in quadrature cv, from a ¥ . 
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Figure 28: True jet energy resolution for cone 0.7 
compared with those obtained using dijet and kj 
balance techniques. 



jets is matched to particle jets, which lie within AR < 0.1 w.r.t. the reconstructed jet. After matching, the 
response pf Xo / p l ™ th is determined, and the true resolution is obtained by fitting the response with single 
Gaussians. This is shown in Fig. 28 together with the results from the two data-driven estimates. 

The two data-driven approaches agree within uncertainties, though a hint that the kj method is un- 
derestimating the true detector resolution might be visible. The determination of the resolution using 
reconstructed jets matched to particle jets gives results in between the two data-driven approaches and 
together with them provides a reasonable basis from which to estimate the systematic uncertainties in the 
determination of the noise, stochastic and constant terms in the jet energy resolution. 

6 Summary and conclusions 

Precise reconstruction of the jet energy is required by many physics analyses. In-situ processes provide 
the platform against which our understanding, based on theory and detector simulation, must be tested 
in order to achieve maximum precision. From the analyses presented in this paper, we conclude that one 
can characterize three ranges of jet pj, in which the different features of detector response and underlying 
physics control the precision which can be achieved. 

From the reconstruction threshold of 10 GeV up to ~ 100 — 200 GeV, the combination of effects 
from jet clustering and physics effects like ISR/FSR and underlying event can be as large as 5 — 10% at 
low pj, slowly decreasing to the percent level around 100 — 200 GeV. With QCD dijet events, we will 
be able to control the uniformity of the response on the percent level with high granularity in 0 and X] 
already with ~ 10pb~ ! . In turn, Z+jet balance is an adequate process to measure the absolute energy 
scale with a statistical precision of 1% with approximately 300pb _1 of data. Hence the precision will 
rather quickly be dominated by the systematic uncertainty. Validation of the Monte Carlo simulation to 
ensure that both physics and detector effects are well modeled will be particularly important to control 
systematic uncertainties. Differences between currently available Monte Carlo generators are on the level 
of 5 — 10% in some of the observables, which sets the scale for the challenge in this region of jet pj. 

The medium pj region extends from 100 — 200 GeV to about 500 GeV. Physics effects are at the 
level of 1 — 2% in this region. The momenta of the particles resulting from jet fragmentation are within 
the range of the test beam measurement. It will be an important range in which to validate the Monte 
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Carlo description of the detector response. In addition the dijet balance is still statistically powerful, and 
1% precision for an 77 bin size of 0.1 can be achieved with ~ lOOpb -1 . In y+jet events the pj balance 
can be measured with negligible bias from QCD background. The statistical precision is better than 
one percent for lOpb -1 at 100 GeV, and in the range 300 - 500 GeV the percent level is reached with 
~ lOOpb- 1 . 

Above 500 GeV, one enters a regime which is beyond any existing jet measurement and extrapolation 
of the Monte Carlo modeling and comparison to in-situ data is crucial. A much larger integrated lumi- 
nosity is required for in-situ measurements of similar precision. In addition, for the highest momentum 
particles which may be produced in these jets, our simulation of the detector no longer has test beam 
data available as a basis. Techniques such as multijet balancing or extrapolation based on jet particle 
properties can be used and should allow us to control the scale at the level of a few percent with lfb 1 
for jets up to 1 TeV. 
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Abstract 

This note discusses the overall ATLAS detector performance for the recon- 
struction of the missing transverse energy, fa. Two reconstruction algorithms 
are discussed and their performance is evaluated for a variety of simulated 
physics processes which probe different topologies and different total trans- 
verse energy regimes. In addition, effects of fake fa resulting from instru- 
mental effects and from false reconstructions are investigated. Finally, studies 
with first data, corresponding to an integrated luminosity of lOOpb -1 , are sug- 
gested which can be used to assess and calibrate the fa performance at the 
startup of data taking. 

1 Introduction 

A very good measurement of the missing transverse energy, fa, is essential for many physics studies 
in ATLAS. Events with large fa are expected to be the key signature for new physics such as super- 
symmetry and extra dimensions. A good fa measurement in terms of linearity and resolution is also 
important for the reconstruction of the top-quark mass from it events with one top quark decaying semi- 
leptonically. Furthermore, it is crucial for the efficient and accurate reconstruction of the Higgs boson 
mass when the Higgs boson decays to a pair of T-leptons. 

This Note describes the overall performance of the fa measurement in ATLAS. The performance is 
checked using fully simulated Monte Carlo (MC) samples in different physics channels with differences 
in event topology and kinematics range. Events with no true fa as well as events with a true fa, $j ms , 
due to particles unseen in the detector as neutrinos or lightest supersymmetric particles, ranging from 
~ 20 to ~ 500 GeV are used. 

An important requirement on the measurement of fa is to minimize the impact of limited detector 
coverage, finite detector resolution, presence of dead regions and different sources of noise that produce 
fake fa, $j ake . The ATLAS calorimeter coverage extends to large pseudorapidity angles to minimize 
the impact of high energy particles escaping in the very forward direction. Even so, there are inactive 
transition regions between different calorimeters that produce ^"| ake . Dead and noisy readout channels 
in the running detector, if present, will also produce If,^^. Such ift^ sources can significantly enhance 
the background from QCD multi-jet events in supersymmetry searches or the background from Z — ► it 
events accompanied by high-^x jets in Higgs boson searches when the Higgs boson decays into two 
leptons and neutrinos. 

The calorimeter plays a crucial role in the fa measurement and an important first step of the fa 
measurement is the suppression of noise in the calorimeter. Section 2 describes the techniques used for 
noise suppression in calorimeters. The two fa reconstruction algorithms used in ATLAS, Cell-based 
and Object-based, are described in detail in Sections 2.2 and 2.3. The overall performance of the fa 
measurement in ATLAS is reported in Section 3. Any mis-measurement in the event, due to finite 
resolution or acceptance of the detector or due to instrumental effects related to dead or noisy channels, 
will degrade the fa measurement. Such sources of fa, that can lead to large values of fake fa, are 
studied in Section 4. Section 5 introduces the fa algorithm for the ATLAS triggers and describes its 
performance. Finally, Section 6 describes techniques in different physics channels that can be used with 
the very first ATLAS data to validate the fa measurement and determine the fa scale in-situ. 
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2 The algorithms for fj reconstruction in ATLAS 

The transverse missing energy in ATLAS is primarily reconstructed from energy deposits in the calorime- 
ter and reconstructed muon tracks. Apart from the hard scattering process of interest, many other sources, 
such as the underlying event, multiple interactions, pile-up and coherent electronics noise, lead to energy 
deposits and/or muon tracks. Classifying the energy deposits into various types (e.g. electrons or jets) 
and calibrating them accordingly is the essential key for an optimal ^measurement. In addition, the loss 
of energy in dead regions and readout channels make the measurement a real challenge. 

There are two algorithms for reconstruction in ATLAS that emphasize different aspects of energy 
classification and calibration. 

The Cell-based algorithm starts from the energy deposits in calorimeter cells that survive a noise 
suppression procedure. The cells can be calibrated using global calibration weights depending on their 
energy density. This procedure will be robust already at initial data taking because it does not rely on 
other reconstructed objects. In a subsequent step, the cells can be calibrated according to the recon- 
structed object they are assigned to. Corrections are applied for the muon energy and for the energy lost 
in the cryostats. 

The Object-based algorithm starts from the reconstructed, calibrated and classified objects in the 
event. The energy outside these objects is further classified as low pj deposit from charged and neutral 
pions and calibrated accordingly. 

The noise suppression in the calorimeter is common for the Cell- and Object-based algorithm and is 
described below, followed by a detailed description of the algorithms. 

2.1 Calorimeter noise suppression 

The electronics noise alone in the « 200k readout channels of the ATLAS calorimeter contributes about 
13 GeV to the width of the $t distribution. Especially in events that do not have large such as in 
Z — > TT used for an in-situ determination of the fa scale (Section 6.2), the noise suppression is of crucial 
importance. 

For the $j measurement, two noise suppression methods have been studied so far. Both require 
knowledge of the width of the noise distribution, (7 no i se , which can be either purely electronics noise or a 
combination of electronics and pile-up noise. 

Standard Noise Suppression Method. The first method is based on only using calorimeter cells 
with energies larger than a threshold, generally corresponding to a certain number of a no i se . The threshold 
is optimized for $j resolution, the scale of the total transverse energy in the calorimeters, and 
for the highest pj jet to be close to the case without noise simulation. Two cases are studied: a symmetric 
threshold (|£ C eu| > n x a no i se ) and an asymmetric one (E ce \\ > n x a no i se ). A symmetric threshold with 
n = 2 for all calorimeters is generally used. 

Noise Suppression using TopoClusters. The second method only uses cells in 3-dimensional topo- 
logical calorimeter clusters [1,2], hereafter called TopoClusters. A TopoCluster is reconstructed starting 
from a seed cell with an absolute energy value |£ ce ii| > 4a no i se to which neighbors with l-Eceiil > 2a no i se 
are added. Finally the cells at the boundary are required to have l-E^nl > 0a no j se . The cells that constitute 
the TopoCluster are hereafter called TopoCells. This set of thresholds, referred to as 4/2/0, is optimized 
to suppress electronics noise as well as pile-up from minimum bias events, while keeping the single pion 
efficiency as high as possible. 

As a result of the large energy density of electromagnetic showers, the n° reconstruction efficiency 
is high (close to 100% for energies > 4 GeV) for the 4/2/0 configuration. On the other hand, the 
reconstruction efficiency for charged pions is very sensitive to the parameters of the TopoClusters. For 
example, changing the cuts on neighbors from 4/2 to 6/3, the n ± efficiency significantly decreases for 
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TopoClusters with E < 4 GeV. This sensitivity highlights the importance of a good modeling of the noise 
level from first data. 

In Z — > vv events simulated with electronics noise, the $t resolution degrades by only 3 % for 
the 4/2/0 configuration as compared to the same events without noise added. Also, the TopoCluster 
algorithm performs better in terms of linearity and resolution of the #j measurement, compared to the 
standard noise suppression method. Therefore Cell- and Object-based |?t algorithms apply the noise 
suppression method based on TopoClusters with configuration 4/2/0. 

2.2 Cell-based $V reconstruction 

The Cell-based #j reconstruction includes contributions from transverse energy deposits in the calorime- 
ters, corrections for energy loss in the cryostat and measured muons: 

|f Final _ |?Calo _|_ ^Cryo _|_ ^.Muon ^ 

In the following, the three terms in the above equation, referred to as calorimeter, cryostat and muon 
terms, are described in some detail. 

2.2.1 The $t calorimeter term 

As described in the previous section, the first step is to select calorimeter cells that belong to reconstructed 
TopoClusters to minimize the impact of noise. 

The x and y components of the calorimeter fa term are calculated from the transverse energies 
measured in TopoCells: 

& C f = - I E XJ . (2) 

TopoCells 

The total transverse energy in the calorimeters, X$ T , is calculated from the scalar sum of E T of all 
TopoCells: 

ZE T Calo = £ E T . (3) 

TopoCells 

The straightforward result, obtained by using the electromagnetic calibration for all cells, gives a 
large shift in the fa scale of about 30% with respect to $j me (see Section 3). 

This result illustrates the necessity of developing a dedicated calibration scheme to reduce the sys- 
tematic shift of the # T scale and optimize its resolution. This goal is achieved in several steps according 
to the cell classification. The classification depends on whether the energy deposits in the calorimeter 
are electromagnetic or hadronic in nature and whether they are associated with high pj particles. 

To classify energy deposits, schemes to calibrate hadronic showers such as 'HI -like' calibration 
or 'Local-Hadronic' calibration [3] utilize the energy density in a cell. Electromagnetic showers tend 
to have higher energy densities as compared to hadronic showers. The 'Local-Hadronic' calibration 
scheme uses further information related to shape and depth of the calorimetric shower to classify a 
TopoCluster. The next step in the cell-based $j reconstruction is to globally calibrate all calorimeter 
cells using the 'Hl-like' or 'Local-Hadronic' calibration schemes. As can be seen in the performance 
section, this already gives a very good # T performance. The final refinement step of the calibration using 
the association of cells with reconstructed objects is described in Section 2.2.4. It improves the linearity 
and the resolution particularly for events containing electrons (Section 3). 
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2.2.2 The #j muon term 

The #j muon term is calculated from the momenta of muons measured in a large range of pseudorapidity, 
defined by \r]\ < 2.7: 

#Muon = _ £ (4) 
RecMuons 

In the region \r]\ < 2.5 only good-quality muons in the muon spectrometer with a matched track in 
the inner detector are considered. The matching requirement reduces considerably contributions from 
fake muons, sometimes created from high hit multiplicities in the muon spectrometer in events with very 
energetic jets. For higher values of the pseudorapidity (2.5 < |t]| < 2.7), outside the fiducial volume of 
the inner detector, there is no matched track required and the muon spectrometer is used alone. 

The muon momentum measured by the muon spectrometer is taken in the two cases. Energy lost 
in the calorimeter is already included in the calorimeter term. No pi threshold cut is applied to re- 
constructed muons. Apart from the loss of muons outside the acceptance of the muon spectrometer 
(|T7| > 2.7), there is a loss of muons in other regions (see Section 4.1) due to limited coverage of the 
muon spectrometer. The muons reconstructed from the inner detector and calorimeter energy deposits 
could be used to recover these events, but they are not yet used here. 

As can be seen in the performance section, the $j resolution is only marginally affected by the muon 
term, due to the good identification efficiency and resolution of the ATLAS muon system. However, 
unmeasured, badly measured or fake muons can be a source of large fake #j (see Section 4). 

2.2.3 # T cryostat term 

The thickness of the cryostat between the LAr barrel electromagnetic calorimeter and the tile barrel 
hadronic calorimeter is about half an interaction length where hadronic showers can lose energy. The 
$t reconstruction recovers this loss of energy in the cryostat using the correlation of energies between 
the last layer of the LAr calorimeter and the first layer of the hadronic calorimeter. A similar correction 
for the end-cap cryostats is applied. This correction is called the cryostat term when used for jet energy 
correction [3]. It is defined as follows: 

$r=- 1 £ ;<7°> ( 5 ) 

where all reconstructed jets are summed in the event, and 

Ejet c <y° = w Ci ?V£em 3 x£had, (6) 

where w Cryo is a calibration weight (determined together with the cell calibration weights in the HI -like 
calibration) and £"em3 and £"had are the jet energies in the third layer of the electromagnetic calorimeter 
and in the first layer of the hadronic calorimeter, respectively. The cryostat correction turns out to be 
non-negligible for high-pr jets. It contributes at the level of ~ 5% per jet with pj above 500 GeV. 

2.2.4 Refined calibration of the # T calorimeter term 

The final step is the refinement of the calibration of cells associated with each high-/? T object. Calorime- 
ter cells are associated with a parent reconstructed and identified high-/? T object, in a chosen order: 
electrons, photons, muons, hadronically decaying T-leptons, ft-jets and light jets. Refined calibration of 
the object is then used in $j to replace the initial global calibration cells. The calibration of these objects 
is known to higher accuracy than the global calibration, enabling to improve the f T reconstruction. 
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Figure 1 : Distribution of the difference between true and reconstructed #j for Z — > xx events (left) 
including and (right) excluding cells in Topoclusters not associated with reconstructed high-px objects. 

The calorimeter cells are associated with the reconstructed objects through the use of an association 
map. This map is filled starting from the reconstructed/identified objects in the chosen order, navigating 
back to their component clusters and back again to their cells. If a cell belongs to several kinds of 
reconstructed objects, only the first association is included in the map, i.e. the overlap removal is done 
at cell level. This avoids double counting of cells in the $j calculation. If a cell belongs to more than 
one object of the same kind, all associations are included in the map and the geometrical weight of the 
cells, accounting for the sharing of energy of cells owned by two different TopoClusters, is also included 
to avoid double counting. 

Attention has to be paid to the calibration of cells inside different objects. For example, for elec- 
trons/photons, the final cluster-level calibration (which can be propagated back to the cell-level) corrects 
for upstream material, longitudinal leakage and out-of-cone energy. The last correction should not be 
applied in the #j calculation because the contribution of cells outside objects already accounts for it. 
In a similar way, for x lepton decays and for jets, the overall scale factors which correct the energy for 
physics effects like final state radiation, fragmentation or the underlying event as well as for the effects 
due to the clustering algorithm are not applied in the calculation of $t, because they also contain the 
out-of-cluster correction. 

All TopoCells, even if not associated with any high-p T reconstructed object, are used in the # T 
calculation. They are calibrated using the global calibration scheme. The importance of the energy 
deposits of these low energy particles for the #j calculation is shown in Fig. 1. The shift in the absolute 
value of the reconstructed #j increases by about 1 GeV while the resolution is degraded by a factor 
-1.25. 

Once the cells are associated with categories of objects as described above, the contribution to |?x is 
calculated as follows: 

|fCalo _ ^.RefCalib _ _^RefEle _|_ ^RefTau _|_ ^Refbjets _|_ ^Refjets _|_ ^RefMuo _|_ ^RefOut^ ^ 

where each term is calculated from the negative of the sum of calibrated cells inside a specific object 
and |?^ f0ut is calculated from the cells in TopoClusters which are not included in the reconstructed 
objects. In the following the final $ T calculation obtained from Equation (1) with = #* e / Callb w iH 
be referred to as # RefFinal . 

2.3 Object-based $V reconstruction 

The motivation of the method is to reliably reconstruct $j for analyses that are sensitive to low pi 
deposits coming mostly from neutral and charged pions, from soft jets, from the underlying event and 
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from pile-up. This is important, for example, in reconstructing the invariant mass of the Standard Model 
Higgs boson, m#, in the H — »■ t + t~ final state for masses in the range 1 15 < tun < 140 GeV [4]. 
The object-based method comprises two main steps: 

• Establish a classification between two main types of objects: high pj (e/y,jU,T, jets) and low 
pj objects (n°,n ± , unclustered deposits) coming from underlying event, pile-up of multiple pp 
collisions, initial and final state radiation, and other soft QCD processes. 

• Apply the object-based calibration optimized for $j calculation. 

The x and y components of $ T and ££" T are calculated by adding the contributions from each type of 
components: 



where the indices 'High' and 'Low' correspond to the #j and ££t calculated from high pj and low pj 
objects, defined below. 

The object-based algorithm uses mostly the calorimeter to reconstruct # T - Some objects such as 
electrons and taus also use the inner detector tracking, while the muons use both, inner detector and 
muon spectrometer information. Tracking is also used for the low pj deposits of soft objects. 

The object-based method uses the TopoClusters 4/2/0 (Section 2.1) and first calculates all contribu- 
tions of high pj objects. Each TopoCluster is allowed to be included only once by the first object that is 
associated with it. The classification starts with the identification of electrons, photons, muons and t's. 
Once the clusters belonging to these objects are removed from the event record, hadronic jets above a 
certain threshold are identified. TopoCells not part of any of the above high pj objects are classified as 
low pj deposit. The next subsections discuss the calibration of these objects. 

2.3.1 Calorimeter objects: electrons, hadronic T-jets, jets 

Electrons: The electron objects are taken from the standard electron reconstruction and a matched 
track is always required. The default calibration of an electron is based on the 'sliding window' EM 
clusters [5]. To be consistent with the combined 4/2/0 TopoClustering used for the object-based recon- 
struction, electrons are reconstructed from TopoClusters and they are calibrated by weighting the energy 
deposits in the longitudinal calorimeter layers. The electron pj is required to be at least 8 GeV. 

Tau Leptons: The object-based method uses the calo-based reconstructed T-jets [6] and it calibrates 
them as jets. A cut of 20 GeV on the T-jet pj is applied and a x likelihood > 4 is required. 

Jets: A cone (AR = 0.7) jet algorithm running on the TopoClusters is used for calculating the fj 
contribution from jets. Jets not overlapping with the electron and tau jets are chosen. The jet energy 
calibration is based on the 'Hl-like' hadronic calibration [3], which corrects for the difference in response 
for e/y and hadrons, followed by a scale correction due to the non-uniformity of the reconstruction in X] . 
It is based on the $j projection method (used by CDF and DO) [7] using a dijet sample where forward 
jets are corrected with respect to well measured central jets (|t] | < 0.8). The jet pj is required to be at 
least 20 GeV . 




'-x,y = -* x ,f -* x ,y 
Ej = X^T lgh + I^T 



.£High _ £Low 



(8) 



(9) 
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Figure 2: Linearity for single charged % (left) and single 71° (right) as a function of r\ for the energy bin 
1< E < 10 GeV). 



2.3.2 Muons 

In general, the combined muons, reconstructed from the inner detector and the muon spectrometer, are 
used. In the region where the inner detector has no coverage (2.5 < |t] | < 2.7) muons reconstructed from 
the spectrometer only are used. To reduce the number of fake muons originating from punch through of 
high pj jets, strong quality requirements are imposed on the combined track from the inner detector and 
the muon spectrometer. Additionally, if a muon passes inside a jet and the ratio of measured momenta in 
the inner detector and in the muon spectrometer is below 0.2, the muon is rejected. If the muon is found 
inside a jet or the total pi at the EM scale of the TopoClusters within a cone of 0.2 around the muon is 
> 10 GeV, either the measured muon energy loss in the calorimeter is subtracted from the combined pj 
or the muon spectrometer pj is used. The combined muon pj is required to be at least 6 GeV. 

Inner detector tracks may be used to improve the fa reconstruction. They contain the muons not 
covered by the combination of inner detector and muon spectrometer, especially in the crack regions of 
the muon system. Only those tracks are retained which are isolated from others by a cone of size 0.3. 
Those tracks overlapping with electrons, muons or jets that are already used for fa are discarded. The 
tracks should have pj > 6 GeV and should satisfy a muon likelihood criterion based on E/p and the 
energy in the calorimeter sampling layers. In addition, isolated tracks (mostly pions) with little energy 
deposits in the calorimeter can be used to improve fa. These tracks are used only if pj less than 10 GeV, 
to avoid tracks with spurious high /?T,which deteriorates the fa performance. 



2.3.3 Low pj depositions: classification and calibration 

Only those TopoClusters that have not been assigned to high pi objects are considered for classification 
as low pj objects of either electromagnetic or hadronic nature. For this purpose, clusters are further sub- 
divided into so-called mini-jets. Mini-jets are reconstructed from TopoClusters using a cone algorithm 
of a relatively small radius of AR = 0.2 and a seed cluster of pj > 0.5 GeV. A mini-jet is required to 
have pj > 0.5 GeV. The mini-jets are next classified as charged and neutral pions. 

The separation between charged and neutral pions is done only in < 3.2. A mini -jet is defined 
to be a 7T° if the fractional energy in the hadronic compartments is less than 2 % and pj > 10 GeV. The 
low pi deposits are calibrated under single charged and neutral pion hypotheses in the full r\ range. A 
sampling method similar to that used to extract longitudinal weights as implemented for the electron 
and photon calibration is used. The calorimeter sampling weights are determined separately for neutral 
and charged pions in different energy and Tj regions. A good linearity is achieved for both neutral and 
charged pion hypotheses. Figure 2 shows the linearity for charged and neutral pions in the centtal region, 
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which roughly fluctuates within 5%. 

Mini-jets will not saturate all the low pj calorimeter deposits. There will remain a non-trivial amount 
of energy left in the calorimeter and clustered in TopoClusters from very low momentum pions, which 
will not form a mini-jet. These deposits are referred to as unassociated deposits. The energy calibration 
of these unassociated deposits has been estimated in three 77 regions, depending on the calorimeter 
region (barrel, end-cap, forward), and has been added to the #j calculation. 

3 Performance of the reconstructed $V 

In this section the performance of the $j reconstruction is discussed, focusing on the linearity and resolu- 
tion of the reconstructed #j as a function of the true missing transverse energy, $j me . The measurement 
of the $i direction and the dependence on topology are also discussed. Note that the performance of 
the two reconstruction methods is very similar, so it is not specified which method has been used to 
produce each performance plot. 

^ me is defined from the sum of all stable and non-interacting particles in the final state (neutrinos 
and the lightest supersymmetric particles). Comparisons between $t and $£ me are made for a number 
of physics processes with different topologies and final states. 

The $t performance in case of events with large |?£ ake , which contribute predominantly to the tails 
of the |?t distribution, is described in the next section. 

3.1 Linearity and resolution 

The $t linearity is defined by the following expression: 

Linearity = ($ me - $ T ) /^ rae , (10) 

where #j and $J me are reconstructed and true fa, respectively. This definition of linearity assumes an 
$j me value above a threshold and an ^ ake value that is small such that the $j angle is well measured. 

Figure 3 shows the reconstructed linearity as a function of $^ me for a number of physics processes. 
The following statements summarize the behavior of the linearity distributions: 

• The uncalibrated |? T corresponds to the use of cell energies at the electromagnetic scale and shows 
a large systematic bias of 30%. In W —>■ ev and W — > /J.V decays, the bias is smaller since the 
hadronic activity on average is smaller. 

• The reconstructed #j based on globally calibrated cell energies and reconstructed muons gives a 
linearity to within 5%. 

• The reconstructed Tf.i including the cryostat correction shows a linearity to within 1% for all pro- 
cesses except for W — >• ev . 

• The refined $j calibration, which optimizes the calibration with reconstructed object identity, 
recovers the linearity for W —> ev events to within 1%. The refined calibration also gives the best 
resolution when compared with the above steps of calibration (see also Section 6). 

The linearity for A -> tt with m A = 800 GeV is shown in Fig. 3 (right) as a function of ^ me . The 
bias of linearity at low $£ me is due to the finite resolution of the #j measurement. The reconstructed 
|?t is positive by definition, so the linearity is negative when the true #j is near to zero. Excluding the 
events with ^ rue < 40 GeV, which have a small statistics the observed linearity is found to be within 
2%. 
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The resolution is estimated from the width of the # x , y — ^ r " e distribution in bins of the total trans- 
verse energy deposited in the calorimeters (L$j). The core of each distribution is fitted with a Gaussian 
shape to estimate the width. Figure 4 shows the a of the fit plotted as a function of when refined 
calibration is applied. The $j resolution approximately follows a stochastic behaviour as a function of 
Lfj. Deviations from this simple behaviour are expected, and observed for low values of X$t where 
the contribution of noise is important and for very high values of L$j where the constant term in the 
resolution of the calorimetric energy measurement dominates. 

The # T resolution is fitted with a function a = a ■ y 7 !^ for values of between 20 and 2000 GeV. 
The parameter a, which quantifies the $j resolution, varies between 0.53 and 0.57 (see Fig. 4 left and 
right, respectively). Refined calibration yields the best results when compared to earlier stages of 
the calibration as described above. For W — ► ev decays the a parameter is reduced by 88% and for 
Z —> ee events it is reduced by 78% with respect to the global calibration with the cryostat correction 
applied. Figure 5 shows the resolution in the high £^t region for QCD jet samples. The jet samples used 
are generated in parton pj bins: Jl corresponds to 17 < pj < 35 GeV, J2 to 35 < pj < 70 GeV, J3 to 
70 < p T < 140 GeV, J4 to 140 < p T < 280 GeV, J5 to280 < p T < 560 GeV, J6 to 560 < p T < 1 120 GeV, 
J7 to 1 120 < pj < 2240 GeV. There is a clear degradation in the performance for the high pj jet samples 
(J6 and J7), where the linear term dominates. 

The effect of angular calorimeter coverage on the fa measurement is evaluated by comparing the res- 
olution with and without including the forward calorimeters (FCAL). In Z —> xx events the resolution 
is 7.8 and 10.1 GeV, respectively, showing that the ATLAS coverage minimises by design the effect of 
particles escaping at very large r\ and that the forward calorimeter is very important to guarantee that. 

Projections of #j along suitable axes can be used to check calibration problems and understand 
topology dependences. The quantity $l (longitudinal $t projection) is the #j projection onto the axis 
pointing in the direction of the genuine $t of the event. In events without genuine tfj, such as Z — ► It 
events this axis is reconstructed from the direction of flight of the Z and in dijet events the projection is 
done along the dijet thrust axis. The quantity |?p (perpendicular #j projection) is defined as a projection 
orthogonal to the $l direction. 

A bias of $l usually indicates mis-calibration of the object's energy scale (in most cases the hadronic 
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Figure 3: (left) Linearity of response for reconstructed $t as a function of the average true tfj for different 
physics processes covering a wide range of true $t and for the different steps of $t reconstruction (see 
text). The points at average true $j of 20 GeV are from Z — » xx events, those at 35 GeV are from W — > ev 
and W — ► }XV events, those at 68 GeV are from semi-leptonic it events, those at 124 GeV are from 
A — > xx events with m& = 800 GeV, and those at 280 GeV are from events containing supersymmetric 
particles at a mass scale of 1 TeV. (right) Linearity of response for reconstructed as a function of the 
true $x for A — > xx events with m A = 800 GeV. 



376 



Jets and Missing E t - Measurement of Missing Transverse Energy 




Figure 4: Resolution of the two $t components with refined calibration as a function of the total trans- 
verse energy, JlEj for low to medium values (left) and for higher values (right). The curves correspond to 
the best fits of a = 0.53y/LEj through the points from Z — > xx events (left) and a = 0.57a/T£t through 
the points from A — > xx events (right). The points from A — > xx events are for masses ranging from 
150 to 800 GeV and the points from QCD jets correspond to dijet events with 560 < p T < 1 120 GeV. 
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Figure 5: Resolution of the two $t components with refined calibration as a function of for QCD 
dijet samples (17 < pj < 2240 GeV). See text for the definition of samples J1-J7. The curve corresponds 
to a = 0.55 y^T^r (combined fit in the low and medium L^j regions). 
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Figure 6: Linearity of response for #l (left) and |?p (right) as a function of the average true fa for 
different physics processes. $l and |?p are defined in the text. 
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Figure 7: Resolution of fa(lefi) and ^p(right) as a function of Lfa for QCD jet samples. See text for the 
definition of samples J1-J7. 

scale). For events with significant values of $j me , fa is a measure of the fa angular resolution. For dijet 
events and other back-to-back topologies with similarly defined fa usually there is no bias in fa, but the 
resolution of fa is sensitive to soft radiation in the event. One possible source of bias in fa comes from a 
bias in the angular measurement of the objects in the event, which will cause a shift in the same direction 
on both sides of fa. 

Figure 6 shows the fa and fa linearity as a function of the true fa for different physics channels. 
The fa linearity is within 2% above 100 GeV and deviates by 5-10% at lower U™. The fa bias is 
consistent with zero throughout. Figure 7 shows the fa and fa resolution as a function of the true Lfa 
for the QCD jet samples. As the fa for jet events is defined as the fa projection onto the thrust axis of 
the two leading jets which are back-to-back, a much larger resolution in absolute terms is expected with 
respect to fa. The discontinuity in J4 (140 < p T < 280 GeV) and J5 (280 < pj < 560 GeV) fa resolution 
is a result of dividing the dijet events into samples based on the parton pj. Events with the same Lfa 
in J4 and J5 can have different energies in the perpendicular direction due to differences in underlying 
event, initial or final state radiation. This changes the fa resolution in the two samples. 

A good performance in terms of linearity and resolution may enhance the ability to reconstruct the 
mass of final states which involve neutrinos. Despite the presence of several neutrinos in the final state, 
the invariant mass of the x pair can also be reconstructed in Z — > xx and supersymmetric Higgs boson 




Figure 8: Distributions of the reconstructed invariant mass of T-lepton pairs with one T-lepton decaying 
to a lepton and the other one decaying to hadrons. The results are shown for Z —> xx decays (left) and 
for A — > xx decays with m A = 450 GeV (right). 
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Figure 9: (left) Accuracy of the measurement of the azimuth of the vector as a function of the true 
$t for three different physics processes: semi-leptonic tt events, Z — > tt and W — > <?v events, (right) 
^ ake as a function of the reconstructed 0($t) in tt events, simulated with extra material in 0. 



decays like A — > tt under simplifying assumptions [8, 9, 10]. Figure 8 shows reconstructed mass peaks 
of Z — > TT and supersymmetric Higgs boson decays A — ► tt with = 450 GeV. The reconstructed 
masses are correct to ~ 2% and the mass resolution is approximately 11%. Nevertheless, significant tails 
remain in the distributions because of the highly non-Gaussian effects induced by mis-measurements of 
$T and by the approximations used. 

3.2 Measurement of the $j direction 

Large energy fluctuations in the calorimeter or muon mis-measurements can produce large |?| ake . In 
general, for events with genuine missing transverse energy, the angular resolution will depend on the 
relative fraction of |?| ake and on the event topology. Figure 9 shows the #j azimuthal angular resolution 
as a function of the $£ me for three different physics processes. The measurement of the #j azimuth is 
clearly more accurate for W — ► ev events, which in general contain one high-pj electron and moderate 
hadronic activity in addition, compared to tt events. For values of $£ me below 40 GeV, the accuracy 
of the measurement of the direction degrades rapidly. In contrast, for high values of $j ms , azimuthal 
accuracies below 100 mrad are achieved. 

Detector inefficiencies may perturb the radial symmetry of the physics events. Thus, observations of 
0 asymmetries in reconstructed variables may be a hint of instrumental problems. Due to the increased 
material (of ~ 5% to 10%) in the upper half of the detector that was added artificially into the simulation, 
a (j) asymmetry is observed in $t as seen in Fig. 9 (right). This 0 asymmetry can also be observed in the 
|?t computed at the event filter trigger level. 

Similarly, problematic tj regions may be spotted by looking at fa correlations with jet pseudorapid- 
ity, in particular in QCD events. In fact, in this kind of events, #j is mainly due to jet mis-measurements 
which will affect more the #j component parallel to the dijet axis than that perpendicular to it. Detector 
failures or incorrect calibration sets may be revealed as unexpected peaks in such plots. 

4 Fake $ T 

The reconstructed ^ has two constituents - one that is produced by particles that interact weakly with 
the detector ($j me ) and the other one due to detector inefficiencies and resolution (^| ake ). Figure 10 
shows the rate of #£ ake and $^ me for the QCD sample generated with 560 < p T < 1120 GeV, where 
^ ake dominates at lower values and also has a larger tail. The same figure also shows these distributions 
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after excluding events with high pi jets within 17° of the reconstructed $t in the transverse plane. 
This considerably lowers the |?| ake rate compared to ^ rue . For an accurate measurement of $t it is 
important to have a good understanding of the sources of $j ake in data. Since the goal here is to study the 
performance and not the relative contributions of signal and background, comparisons between different 
physics samples are made for a common number of events rather than for a common luminosity. 

The first two subsections discuss the ^ ake from muons and from the calorimeter under the assump- 
tion that all detector readout channels are functional. The impact of dead-regions in the detector is 
examined in the subsequent subsections. 

4.1 Fake $1 from muons 

|?Fake f mm muons ca n be caused either by inefficiencies in reconstructing a high pj muon or by recon- 
structing a fake high pi muon. The latter could be present due to a combination of a lower pi muon 
and/or random hits from high pj jet punch-throughs from the calorimeter to the muon chambers. It can 
be argued that for reasonable muon identification efficiencies, |?| ake from missed muons will only be a 
small fraction of the $£ me from neutrinos. For example in QCD samples the neutrino to muon ratio is 
roughly two. It gets higher in other physics samples depending on the fraction of z candidates in the 
event. On the other hand fake muons that are reconstructed from random hits in the muon chambers 
can be arbitrarily hard and strongly contribute to |?| ake . However, the study here shows that |?| ake from 
muons is dominated by missed muons rather than fake muons. 

The total |?| ake in the MC sample can be defined as the vectorial difference of reconstructed |?t and 
$ m& , as follows: 

£Fake = y^Fake^^Fake^ whfire #F^ = ^ y _^True 

$ Xj y and are the x and y components of the final reconstructed and of the ^p ue defined in Section 
3.' 

The contribution to the total ^ ake from muon mis-measurements can be defined by 

^■FakeMuon _ ^Muon ^TrueMuon (12) 

where ^" on and ^J™ eMuon are calculated by summing the reconstructed and true x and y components 
from muons in the event. 
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Figure 10: The rates of ^| ake and $^ ue in the QCD sample with 560 < p T < 1120 GeV: (left) overall 
rates, (right) requiring a A(j) separation between $j and the leading high-/?T jet in the event. The ^| ake 
rates can be strongly reduced. It should be noted though that such cuts are very analysis dependent. 
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Table 1 : Number of events with ^ ake above various thresholds from muon (top) and calorimeter (bottom) 
mis-measurements. The J6 (QCD jets in 560 < pj < 1 120 GeV range), SU3, it , and Z — > /x/x samples 
are normalized to the same number of events (25k). 
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Figure 11 (left) shows the scatter plot of the two quantities defined in Equations (11) and (12) for 
the QCD sample with 560 < pj < 1120 GeV. In order to separate the |?£ ake from muons with respect 
to the calorimeter related #£ ake the following cuts can be used: (a) ^F akeMuon > #F ake /2 which selects 
events with fake coming predominantly from muon mis-measurements, and (b) |?F akeMu ° n < |?F ake /2 
which selects events with fake #t coming predominantly from other sources like jet mis-measurements 
in the calorimeter. Figure 1 1 also shows how these cuts separate events with ^ ake from muons and from 
calorimeter induced effects. 

Table 1 shows the number of events above various |?| ake thresholds for physics samples (QCD jets 
in the range 560 < pi < 1120 GeV, SU3, tt and Z — > jljl) that are predominantly from muons (top 
rows) or from the calorimeter (bottom rows). It can be seen that the relative rate of |?£ ake across samples 
depends on the muon and calorimeter activities. 

In Table 2 different categories of muon mis-measurements and their relative contribution to p^ ke 
are shown 1 . The first two rows are from missed muons and the last two rows are from fake muons 
reconstructed from random hits in the muon chambers and with a possible match to soft muon tracks in 
the inner detector. Table 2 shows a smaller contribution to |?| ake from fake muons compared to missed 
muons. Therefore the dominant contribution to Jf,^^ from muons is due to inefficiencies in the muon 

^ue to technical reasons a small fraction of muon events were not classified in any category. 




Figure 1 1 : (left) Total Ifij^ as a function of ^ ake from muon mis-measurements, (middle and right) 
Cuts defined in (b) and (a) (see text) are used to separate calorimeter/jet and muon ^ ake components. 
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Figure 12: (left) The X] distribution of true muons that were missed during reconstruction in a Z — > jx/x 
high pj > 100 GeV sample, (right) ^ ake in tt events in the electron (hatched) and muon channel. 

identification. 

Figure 12 (left) shows the T\ distribution of the true muons that were missed at the reconstruction 
level in the Z — ► jiji. sample with pj > 100 GeV. There are missed muons around tj =0, |tj |=1.2 and at 
high Tj (| 7] | >2.7) where there is no muon coverage. Muon tracks cannot be reconstructed by the muon 
system around r\ = 0 (—0.05 < r\ < 0.05), because of service holes required for cables and cryogenics 
passage to the inner detectors and calorimeters. In the region around |tj|=1 there is a loss of efficiency 
in muon reconstruction, due to the middle muon station missing for initial data taking. In these studies 
muons missed due to limitations of muon detector coverage or poor muon reconstruction have not been 
recovered. In the next software releases algorithms to recover some of these missed muons using energy 
deposits in the calorimeter and tracks in the inner detector will be used. 

Figure 12 (right) shows the |?| ake distributions for events in which the leptonically decaying W results 
in an electron and those resulting in a muon, respectively. The latter distribution clearly contains larger 
non-Gaussian tails. As discussed above, the sources of these large tails are either missed or fake muons. 

4.2 Fake $i from the calorimeter 

In this section it is assumed that all calorimeter readout channels are functional. |?| ake in the calorimeter 
is then produced by mis-measurements of hadronic jets, taus, electrons or photons. 

The calorimeter has cracks and gaps in the transition regions, which are also used for service out- 
lets. These regions have poorer resolution and are expected to have larger contributions to |^ ake com- 
pared to the rest of the calorimeter. There are two gap regions defined in the following Y\ ranges: 
(1.3 < \r\\ < 1.6) and (3.1 < |tj|< 3.3). Figure 13 shows the r\ distribution of the worst and the second 
worst measured jet (defined w.r.t. the closest true jet and their energy difference) in the calorimeter for the 



Table 2: The sources of mis-measured muons that contribute to ^ ake > 60 GeV. The columns are 
normalized to the same number of events (25k). 
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30 


18 


40 


18 
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28 
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Reco muon not close to MC muon ("fake muon' 


) 11 


23 
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25 
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QCD sample generated with 560 < pj < 1120 GeV. It shows that a large number of the worst measured 
jets have 77 pointing to \r] \ in 1.3-1.6. The 77 distribution of the second worst measured jet is more flat 
and peaks around \rj\ in 0.6-0.9, the transition region of the barrel tile calorimeter to the extended barrel 
tile calorimeter. 

The above correlation of worst measured jets and their 77 suggests a large correlation between the 
jet 77 and |?£ ake . However, the ^ ake distribution from full simulation samples suggests otherwise. 
Figure 14 shows the ^ ake distribution in the QCD samples generated with 560 < Pt < 1120 GeV and 
140 < pj < 280 GeV, when a jet points to the crack/gap region or not. The slope of the distributions 
suggests no significant correlation between jets pointing to cracks and ^ ake . 

This apparent contradiction between Fig. 13 and Fig. 14 can be understood as follows: even though 
the worst measured jet contributes strongly to the ^ ake , it is not the only source of |?| ake in the event. 
The worst measured jet contributes on average about 60% and the second worst measured jet contributes 
about 20% to ^ ake . But not all worst measured jets are along the crack region and each event has many 
jets. In lower #j ake regions there is a stronger correlation between ^| ake and jets pointing to cracks. For 
higher ^ ake ( > 50 GeV considered here ) there is more than one source contributing to jft^ 6 and the 
correlation of jets pointing to cracks is smeared out as can be seen in Fig. 14. 

4.3 Fake $1 from calorimeter leakage 

Jet leakage from the calorimeters or fluctuations in large jet energy deposits in non-instrumented regions 
such as the cryostat between the liquid argon and tile calorimeters can also be a source of The 
method used to detect events with potential jet leakage is to look for large energy deposits in the follow- 
ing regions: the outermost layers of the TileCal and the HEC, the outermost LAr barrel layer and the 
innermost TileCal barrel layer, and in the TileCal gap and crack scintillators. 

The following shows an example of selection cuts applied on different variables of the three leading 
px jets with p T > 100 GeV: E T iie2/E T otai > 0.05, E T iieio/E T otai > 0.7, Ec r yo/E T otai > 0.2, E G ap/E T otai > 0.2 
and E H EC3/ETotai > 0.5, where E Total is the total jet energy, E Tile2 is the jet energy in the outermost tile 
layer, Ejiieio is the jet energy in the first two innermost tile layer, Ecryo is the energy lost by the jet in the 
cryostat, Eo a p is the jet energy in the gap scintillators and Ehec3 is the jet energy in the outermost layer 
of the HEC calorimeters. If any of these cuts is satisfied the event is rejected. 

Furthermore, as the tracks found in the inner detector are not affected by the reconstruction, com- 




N hi 

Figure 13: The 77 distribution of the worst (left) and second- worst measured jet (right) in the calorimeter 
in QCD events generated with 560 < p T < 1 120 GeV. 
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Figure 14: The ^ ake rate for QCD sample in 560 < p T < 1120 GeV range (left) and QCD sample in 
140 < pi < 280 GeV (right) due to calorimeter mis-measurements. 




Figure 15: Fraction of events remaining after the cuts discussed in the text as a function of ^(left) and 
^ ake (right) for the QCD sample generated with 560 < p T < 1 120 GeV. 



plementary information on events with fake $j can be obtained using $j trk fr° m tracks which is the $j 
computed only from tracks. A cut on (#T trk — $t > 50 GeV) was chosen for the optimization of the 
signal significance. 

Figure 15 shows the percentage of events remaining after the cuts described above are applied on 
QCD sample generated with 560 < pj < 1120 GeV. The right plot shows suppression of large fake ^ 
generated due to high pj jet leakage. Larger fake #j are suppressed more sttongly. It can also be noticed 
from the left plot that these cuts, although removing a large fraction of the events dominated by fake fa, 
are not sensitive to the overall #j in the event and the fraction of remaining events is fairly constant over 
$t- The method is of course analysis dependent and the values of the cuts should be chosen taking into 
account the signal efficiency. 

4.4 Fake $i from instrumental effects 

In real data there will be sources of ^ ake which are not fully modeled in Monte Carlo simulations in- 
cluding, for example, mis-modeling of material distributions and instrumental failures. As the details 
of these |?| ake sources will be understood with time, increasingly refined analyses will be developed 
to minimize their associated backgrounds while maintaining high selection efficiencies for signals with 
genuine missing energy. While it is difficult to predict in advance the exact sources of mis-modeled 
|?£ ake , it is nevertheless possible to insert problems into the Monte Carlo that match potential hardware 
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Figure 16: Cell-killed QCD sample in 560 < p T < 1120 GeV range: the $ T distribution (left) and the 
EM fraction (right). The histograms are normalized by area. 

failures. These include trips in high-voltage channels or readout power supplies of the calorimeter or 
noise in calorimeter channels or regions. The initial studies presented here are based on samples with 
simulated dead regions of the calorimeter, so called 'cell-killed' samples: one dead front-end readout 
crate in the LAr electromagnetic barrel calorimeter and one dead front-end readout crate affecting LAr 
electromagnetic and endcap calorimeters. Based on the location of these hardware failures the calorime- 
ter is divided into three regions of $($t): 'Region 1' with EM endcap and hadronic endcap problems, 
'Region 2' with EM barrel problems, and 'Region 3' with no problems. 

These samples were used as references for the development of cuts that reduce the ^ ake background 
while maintaining high efficiencies for potential signal events. Since we will not know the precise lo- 
cation and nature of hardware problems in advance, the cuts are not tuned assuming that knowledge. In 
real data it will be possible to significantly improve the analysis performance by cutting harder when 
energy deposits are expected near regions with detector hardware problems, but that is not exploited in 
the studies so far. Future work will also include using these samples to develop data-driven techniques 
for predicting the $f ake tails. 

The high-£>T (560 < pj < 1120 GeV) y+jet MC sample was processed with the cell-killed configu- 
ration described above. Events with at least one back-to-back photon-jet pair were selected. Figure 16 
shows the |?| ake generated in the three regions. Large $t tails are seen in Regions 1 and 2 where the holes 
in calorimeter coverage have been introduced. The effect of dead regions can also be seen in Table 3. 
The columns show the number of events above various ^ ake thresholds. The first two rows show the 
number of events with no dead-regions and with the above dead-regions simulated. A very large increase 
in high ^ ake is seen. 

Several methods have been developed to suppress events with large fake fa: 

• EM Fraction Method: The EM fraction method starts by finding the closest calorimeter jet to the 
|?t direction vector using A0 between the calorimeter jet and fj. Figure 16 (right) shows the EM 
fraction distributions. Small EM fractions are due to a dead LAr EM calorimeter crates, whereas 
large EM fractions are due to a dead hadron calorimeter crates. The fake #j generated this way 
can be suppressed by requiring the EM fraction to be in a window from 0.40 to 0.96. The effect 
of this cut can be seen in the third row of Table 3 when compared to the increase in |?| ake due to 
dead-regions in row 2. A rejection of 1.5 to 4 is seen from low to high ^ ake bins. The selection 
efficiency can be defined as the ratio of the number of events after and before the cuts when events 
fall in Region 3 (Region 3 has small |^ ake ). For the EM fraction method the selection efficiency is 
~ 90%. 

• Track- Jet Methods: jets were reconstructed from inner detector tracks using the cone algorithm 
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Table 3: The number of events from y+jet samples above various tfj^ thresholds with no dead-regions, 
with simulated dead-regions and after applying various suppression techniques (see text). All numbers 
are normalized to 25k events sample size. 



fr Fake (GeV) 


> 100 


>200 


>300 


>400 


>500 


No dead regions 


555 


18 


3 


1 


0 


w/ dead regions 


2482 


1308 


864 
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199 


w/ EM fraction method 


1651 


572 


287 


122 


50 


w/ track-jet cluster method 


1786 


664 


313 


129 


49 


w/ comb, track-jet cluster method 


1402 


392 


150 


46 


14 



with AR = 0.4. A fiducial volume cut of \t]\ < 2.5 is applied due to the tracking coverage. Since 
track-jets use the inner tracking detectors, they provide a complementary identification of events 
with fake A number of different methods of exploiting the track-jets were studied. An effective 
method which only uses information present in the Analysis Object Data (AOD) is to sum the Ej 
of calorimeter topological clusters within the track-jet 7]-<j) cone. The distributions of the Ej ratio 
of the track-jet to the clusters has tails due to the dead calorimeter regions. A cut is applied 
requiring Ej ratios larger than 1.0; the value was chosen to maintain a significant efficiency for 
signal samples, and could be substantially tightened for different physics analyses. The fourth row 
of Table 3 shows the effectiveness of this cut. Rejections from tj = 1.4 to T7 = 4 are achieved from 
the low to high |?| ake regions, with efficiencies of ~ 93%. 

Since the EM fraction method and track-jet methods are largely uncorrelated, they can be combined 
for better suppression of large ^ ake . The last row in Table 3 shows the performance of the combined 
track-jet method. The selection efficiency for this combined method is ~ 84%. It must be emphasized 
that the results presented in this table represent the worst case since the cuts do not use the detailed infor- 
mation about detector problems that will be known from the detector control and data quality systems. 
In a final analysis, much harder cuts can be applied in regions with known detector problems. Of course, 
this will reduce the acceptance of the detector. 

5 The Trigger algorithm performance 

This section briefly describes the # T algorithms applied at the first level trigger (LI) and the higher- level 
trigger (HLT). The HLT is a combination of second level trigger (L2), and a third level trigger (or event 
filter, EF). The LI algorithm is based on hardware, while the HLT algorithms are software based. 

5.1 The #t at LI 

The |?t LI calorimeter triggers cover the region |j] | < 4.9, which is the limit of the forward calorimeters 
[11]. The basic units of LI $j and trigger algorithms are 'jet elements', formed by summing over 
trigger towers within windows of 0.2 x 0.2 in the (T7,0) plane. 2 They are processed by the Jet/Energy 
modules, which compute E x , E y and Z^t of each jet element. Four thresholds are available on Z|?t with 
values up to 2044 counts in steps of 4 (usually, 1 count = 1 GeV). Eight $j thresholds are available, 
up to a maximum threshold value of 504 counts. If an overflow occurs at any point in the #j or Z^t 
algorithm, all of the corresponding thresholds are set as passed. 

2 For 2.4 < |t]| < 3.2 the granularity is either A77 = 0.2 or A77 = 0.3, while FCAL jet elements extend from \rj\ = 3.2 to 
I r\ I = 4.9. The (j) granularity of the FCAL jet elements is A0 = 0.4. 
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Figure 17: LI, EF and offline results for the $t (left) and the (right) resolution as a function of the 
true in tt events. 



For each event, the LI E x , E y and are saved into one object of the RecEnergyRoI class, including 
overflow flags. If any threshold is passed, the JetEnergy Rol is also produced containing the bit pattern 
of the thresholds passed. 

5.2 $ T at HLT 

The offline algorithm described in Section 2.2 is too resource intensive (both in terms of memory access 
and calculations to be done) to be applied in the HLT. The following algorithms are ready for the first 
data taking 3 : 

• The L2 algorithm uses the calorimeter information in RecEnergyRoI provided by the LI trigger 
and applies a correction for L2 muon objects. 

• The default EF algorithm sums all calorimeter cells and applies a 0-th order hadronic calibration by 
multiplying #j and Lfij by a constant related to the hadronic/electromagnetic calorimeter energy 
fraction in a jet. Finally, the EF takes the muon contribution into account. 

Both at L2 and EF, the muon correction may be switched off independently for each trigger chain. 

The decision taken both at L2 and EF is carried out by the same software package. This hypothesis 
testing code can be configured to accept events based on $t> on L$j or on both. 

5.3 The $t and E^ T resolution at trigger level 

Figure 17 shows the #j and Ltfj resolution for the LI, EF and offline algorithms as a function of the true 
L$j for tt events. Since the L2 algorithm takes the LI result and applies relatively small corrections, the 
present L2 resolution is practically the same as for LI and therefore not shown. At true values of 
500 GeV, the LI resolution on the fa measurement is about 25 GeV which is a factor of two larger than 
what is achieved offline. The EF resolution, for both and Z^t lies in between the values for LI and 
offline resolution. 

3 At present, work is in progress to detect fake sources due to detector effects like uninstrumented regions or hardware 
failure, or physics environment, e.g. beam-halo or cosmic ray tracks. 
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6 $t in early data 

Validation of the fa reconstruction described in the previous sections will be performed with the first 
LHC data accumulated by ATLAS. For the very first data, the two main issues are controlling instrumen- 
tal failures and calibration of energy deposits in the calorimeter. At this stage, the overwhelming number 
of minimum bias events will be used to monitor and diagnose fa reconstruction problems. 

The development of algorithms for data quality checks is being actively pursued to minimize the im- 
pact of such failures and will be optimized when the first data is collected. The 'standard' fa calculation, 
using the calorimeter cells at the electromagnetic scale above a threshold will be always provided as a 
reference. The fa calibration strategy follows the steps described in Section 3. First, a simple global 
calibration for all the TopoCells will be used. As shown in Section 3 this already gives a good linearity 
behavior. As the event reconstruction becomes more robust, the event objects (e.g. electrons, jets, taus) 
will be used to obtain the best fa resolution. 

Figure 3 already shows how the fa linearity improves from a simple to a more refined fa calculation. 
Figure 1 8 shows how the fa resolution improves arriving in steps to the final refined calibration. 
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Figure 18: The fa resolution as a function of true Lfa from the uncalibrated to the refined calibration 
as indicated for the following channels: W — ► ;UV corresponding to true Ufa = 130 GeV, W — ► ev to 
165 GeV, Z — > TT to 210 GeV, it to 470 GeV, A/H — > TT to 843 GeV, J5 to 800 GeV and SUSY to 
906 GeV. 

Once data of the order of 100 pb _1 are collected, the fa validation focuses on physics channels with 
relatively large fa and/or Ufa. This section briefly describes a few studies that will be performed with 
the first data. 

The Z — > tt process, using the Z mass constraint, can be used to determine the fa scale in-situ 
to about 8% accuracy. The Z — >• li process with decays to electrons and muons does not have any 
significant $£ me from neutrinos. The small background to these events will help to test possible fa 
biases, expected to be zero, and the resolution in a straightforward manner. The copiously produced 
W —> ev and W — > /J.V events can be used to test the reconstructed fa in the (20 — 150) GeV range. Two 
methods to use these events are discussed. Semileptonic tt events also have genuine fa and allow a 
test of the fa reconstruction in an environment relevant for many physics analyses and searches, notably 
SUSY. 
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Figure 19: (left) The $ x , v resolution for different regions in minimum bias events, (right) The 
$T resolution in QCD dijet events (J0-J3: see Section 3.1 for definition) is shown together with the |?t 
resolution from minimum bias events (black filled circles) as a function of £$t- An integrated luminosity 
of the order of 10~ 5 pb _1 is used. 

6.1 Minimum bias events 

Minimum bias interactions at the LHC are dominated by soft collisions of the two interacting protons. 
These events are useful for $j commissioning, especially in the early stages of the experiment, due to 
their large statistics and their comparatively simple event selection. The main background in minimum 
bias events will originate from empty, beam gas and beam halo events, especially at the beginning of the 
experiment. Minimum bias events will be used to verify the fa reconstruction procedure and estimate 
the resolution for low Z|?t events. 

In the early stages of the experiment, minimum bias events will be selected by three types of triggers: 
randomly selected bunch crossings (MB1), randomly selected bunch crossings together with a Semicon- 
ductor Tracker space point trigger (MB2), and minimum bias trigger scintillator (MBTS2). The details 
of the triggers are described in Ref. [12]. 

For the study of |?t in minimum bias events, high signal efficiency and background rejection are 
required. The relative fraction of non-diffractive, single diffractive and double diffractive events in the 
sample is not a concern. The selection criteria require at least 20 semiconductor tracker space points to 
reject empty events and at least one good reconstructed track to reject beam gas and halo events. 

A Monte Carlo study predicts an overall trigger efficiency of 96.8%. The offline track selection 
efficiency (with respect to events passing the trigger) is 80.6% for a total selection efficiency of 78.0%. 

The $t in minimum bias events is fairly low with a mean of 4.3 GeV. Fake #j is caused mainly 
by calorimeter energy resolution (82%) and acceptance (18%). The true is 0.06 GeV on average, 
originating from K/n decays-in-flight and from the decay of charm and bottom particles. The true 
in non-diffractive minimum bias events is typically 64 GeV, while the reconstructed Lfij is on average 
49 GeV due to the loss of low energy particles which do not reach the calorimeters. Since minimum 
bias events are dominated by soft (1ow-/?t) interactions, jets are reconstructed with a rate depending on 
£|?t- When minimum bias events with L#j of 50 GeV are compared to events with X$t of 250 GeV, the 
average number of reconstructed jets with pj > 7 GeV increases from below one to about nine with an 
average jet energy of 10 and 13 GeV, respectively. 

The $r resolution in minimum bias events is expected to scale as y/LEj because the stochastic term 
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of the calorimeter resolution is dominant in L#j regions as shown in the left plot of Fig. 19. These 
distributions are well fitted by Gaussian functions with offsets of zero (in the case of no 0 asymmetry) 
and resolutions which scale with Z|?t- 

The right plot of Fig. 19 shows the comparison of the #j resolution evaluated in this study with the 
higher Z|?t region (X$t > 300 GeV). The #j resolution in QCD dijet events matches well the resolution 
obtained from minimum bias events. 

6.2 Determining the $i scale using Z — > tt events 

At the beginning of ATLAS operation, about 70k events of type Z — > xx with one leptonic and one 
hadronic T-decay will be produced in 100 pb _1 of data. Such events can be selected with a lepton 
trigger. The Z — > xx — > £h are produced with genuine # T of typically 20 GeV and of the order of 
200 GeV. In these events the peak position of the xx invariant mass distribution is sensitive to $ T and 
can be very useful in determining the $t scale [10]. 

The main backgrounds come from W —> £v+jets events, where one jet fakes a X decay, and from 
QCD events (mainly bb). The tt background has a much lower cross-section; the Z — > ee and Z — ► jiji 
backgrounds are also small and the WW background is negligible. Events with the final state lepton and 
T-jet of the same-sign are not expected to come from Z — > xx events which have opposite-sign. The 
backgrounds (apart from the tt background, which is anyway low) contribute in the same way to the 
opposite-sign and the same-sign samples. Hence, the effect of backgrounds will be minimized using 
same-sign events, subtracted from the opposite-sign events. 

For each reconstructed event, the leading and isolated lepton (electron or muon) with p f T > 15 GeV 
and 1 77 I < 2.5 is chosen and a set of basic cuts is applied: $t > 20 GeV (rejects QCD events), the 
transverse mass calculated from $j and the lepton < 50 GeV (suppresses events from semileptonic W 
decays), and Z$t < 400 GeV (suppresses QCD). In addition it is required to have no tagged b jets 
(suppresses tt and bb events). Then at least one identified T-jet with Pj ]et > 15 GeV, |T7 T_ j et | < 2.5, 
and a track multiplicity of one or three is required. The A0 between the isolated lepton and the T-jet is 
required to be in the range between 1 - 2.8, which reduces badly reconstructed events and further rejects 
backgrounds. 

With 100 pb _1 of data, 210 signal events (opposite-sign) are expected in the invariant mass range 
66 GeV < m xx < 116 GeV. A total background of 16 events is expected. Figure 20 (left) shows the 
reconstructed mass peak for Z —>■ xx events as well as the small total backgrounds after analysis cuts for 
opposite-sign and same-sign events. 

Figure 20 (right) shows a very good sensitivity of the measured Z mass reconstructed from T-pairs 
to the absolute $j scale. With an integrated luminosity of 100 pb _1 , the Z mass can be reconstructed 
with an uncertainty of ±0.8 GeV. Taking into account the statistical uncertainty only, the scale could 
be determined with a precision of ~ 3%. But systematic effects, such as the subtraction of same-sign 
events and the stability of the fit will affect the measurement of the reconstructed mass peak. Therefore, 
assuming a resolution of ±3(7 on the reconstructed Z mass, the scale can be determined to about 
±8%. 

6.3 Z^ll events 

This analysis uses inclusive Z — > ee and Z — > samples to investigate the scale and resolution of the 
$t reconstruction in the first data. In these samples the transverse momentum of the two leptons from 
the Z boson decay are balanced by the hadronic recoil and Lfj reaches values up to a few hundred GeV. 

Events are selected by requiring two well reconstructed, identified and isolated leptons with 
Pt > 25 GeV. They have to have equal or opposite charge and a reconstructed mass, m«, in the interval 
70 — 100 GeV. In a sample of 250pb _1 of data, about 400k events are expected. 
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Figure 20: (left) Reconstructed invariant mass of the pair of % leptons for Z — > xx decays and all back- 
grounds: opposite-sign background (dashed) and same-sign background (dotted), (right) Reconstructed 
invariant mass of the pair of x leptons for Z — > xx decays as a function of the scale. The horizontal 
lines correspond to ±la and to ±3(7 w.r.t. the Z peak position. The analysis is based on an integrated 
luminosity of 100 pb _1 of data. 

Backgrounds from Z — >• xx and W —> £v events are negligible. The background from QCD events 
in which two leptons are falsely identified is expected to be small but has to be carefully evaluated when 
data are available. In the present study, these backgrounds are not considered as they are expected to 
have negligible impact. 

In Section 3, projections of fa, called $l and were introduced. This analysis aims at optimizing 
the principle of using projections by resolving the missing transverse momentum along the so called 
'longitudinal axis' which is defined by the combined direction of flight of the two leptons. The perpen- 
dicular axis is also defined in the transverse plane which is orthogonal to the longitudinal axis. The axes 
as reconstructed from the measured angles of the leptons and their measured energies are thus not used 
at this point, which would fully exploit the good angular resolution of the ATLAS detector. In general, 
the longitudinal axis points in the direction of flight of the Z boson and away from the hadronic recoil. 

Figure 21 (left) shows, for Z —> ee events, the average |?t resolved along both axes as a function of 
the transverse momentum of the lepton system resolved along the longitudinal axis. 

The results for the longitudinal axis exhibit a negative offset of up to ~ 4 GeV at high values of pj 
of the lepton system, while the results for the perpendicular axis are consistent with zero. For Z — > jiji. 
events similar results are obtained. It has been verified that this offset is not caused by real neutrinos in 
the event. If the event topology is considered, it is clear that this is suggesting that the magnitude of the 
hadronic recoil pi is underestimated. The resolution of the #j projection on both axes as a function of 
the total scalar sum of the activity in the hadronic calorimeter, L-Et, cluster, is shown in Figure 21 (right) for 
Z^ee events. The fitted curve is of the form <j($t) = Po Ej. duster + -Pi and illustrates the stochastic 
behavior of the calorimetric energy measurement. 

These results on Z — > ee and Z — > fifx events demonstrate that potential problems of the $r recon- 
struction can be located with high accuracy using the first data. 

6.4 W — > £v events 

Events with W — ► ev and W —>■ }XV will be copiously produced at the LHC. Tens of thousands of events 
with an excellent signal-to-background ratio can be collected per pb _1 . With these events, a good 
understanding of the $t reconstruction can be achieved up to values of few hundred GeV. The 
average is of the order of 150 GeV. 

In order to isolate W — ► £v events, two basic selection cuts are required: existence of one high pj 
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Figure 21: For Z — > <?<? events: (left) |?t projected onto the longitudinal and perpendicular axes as 
explained in the text as function of the pj of the lepton system resolved along the longitudinal axis, 
(right) The width o(fa) as a function of £ £t, cluster- Both plots use a sample corresponding to 250 pb _1 
of data. 

charged lepton with |tj | < 2.5 and fa > 20 GeV. Possible backgrounds are from tt production, W — > TV, 
Z ££, and QCD events. 

Two methods have been investigated to check the fa reconstruction using these events. The first 
is based on the fact that the average pj of the charged lepton and the neutrino are the same, so the 
ratio R = Pj,v/pt,£ has been studied. This variable should be ~ 1, but its distribution is distorted by 
the kinematic and acceptance requirements on the charged leptons. The method and related systematics 
have been checked with the fast ATLAS simulation. It is expected to be sensitive to values of fa up to 
60 GeV even with lpb -1 of data. Full simulation studies are in progress. 

The second method, based on the shape of the reconstructed transverse mass of the W boson, is 
sensitive to both the fa resolution and the scale. The transverse mass, nij, is reconstructed under the 
hypothesis that fa is completely due to p\. An example of an distribution is shown in the next 
section for tt events, which have high values of Lfa of typically 500 GeV. The focus in this section 
however is on a dedicated analysis of the corresponding distribution for Drell-Yan events at lower Lfa 
as statistically required. 

The nij distribution (for Drell-Yan events) is fitted in a binned log-likelihood fit that uses template 
histograms. To minimize the dependence on the kinematics of the W boson, e.g. on its transverse mo- 
mentum, the fit is restricted to values of nij in the range of 65 to 90 GeV. The template histograms of the 
nij distributions are generated by convolving the true transverse mass distribution with the fa response: 
fa,y = (*Pj( x ,y) ©Gauss(0,a) where parameters a and a are the fa scale and resolution (in GeV), 
respectively. Since the fa resolution strongly depends on the activity in the calorimeter, the analysis is 
performed in several Lfa intervals. 

Figure 22 (left) shows the resolution for W — ► {XV events. The results of the fit agree well with the 
expectations using truth information labeled as 'pseudo-data'. In Fig. 22 (right) the result for the fa 
scale is shown. The scale is measured at the 1% level over a large range of Lfa, confirming the excellent 
performance of this technique. The template fitting performs well in the low Lfa region, while a small 
discrepancy is observed in the high Lfa region. 

This method described so far is applicable to jiv events, whereas in W — ► ev events it has to 
be modified because the electron is included in the Lfa calculation, while the muon is not. This leads 
to a significant correlation between Lfa and the shape of the transverse W mass distribution when the 
template is made including the Lfa dependence. Similar results for the fa scale and resolution are 
obtained, but systematic uncertainties have not yet been estimated. 

Also if the backgrounds, including the QCD background, cannot be efficiently suppressed by the 
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Figure 22: For jj,v events: resolution (left) and scale (right) as a function of ~L$j, using the 
second method described in the text. The circular dots represent the value calculated from pseudo-data 
and the triangular markers represent the estimation. 

selection cuts, their influence could be non-negligible. Work is in progress on that. 
6.5 Semileptonic tt events 

Semileptonic tt events have an interesting multi-jet topology. With genuine #j in the range from 20 GeV 
to 100 GeV and a total transverse energy of typically 500 GeV, they are representative of other physics 
channels such as SUSY. This section shows that the reconstructed transverse W mass as well as a kine- 
matic fit that exploits all mass constraints in tt events without requiring b jet tagging, will be useful to 
investigate possible problems of the $j measurement in early data. Both methods are sensitive to the 
scale of $t to the level of a few percent (statistically) when a sample with an integrated luminosity of 
200 pb" 1 is used. These methods are affected differently by jet energy scales and background and can 
provide complementary information. 

About 7k events survive the selection requirements, which are: at least 3 jets with pj > 40 GeV, at 
least one more jet with pi > 20 GeV, $r > 20 GeV, and one isolated lepton (e or with pj > 20 GeV. 
The requirements strongly suppress the background from QCD events, which is expected to have no 
effect and is ignored. The QCD background is expected to be < 10% in tt events, so, after the require- 
ments for the kinematic fit, this assumption should be safe. The background from W+jets events is at the 
level of 20% and is included in this study. 

In tt analyses the usual assumption is that the $t in an event can be assigned to the neutrino from the 
leptonically decaying W. With this assumption, the transverse mass mj can be reconstructed from the #j 
vector and the transverse momentum of the charged lepton. Figure 23 (left) shows that the shape of the 
m% distribution is distinctly different for various ranges of fake $j 4 , illustrating the power of these events 
to locate problems. To demonstrate that the transverse W mass distribution can be used to check the fa 
scale in early data, two additional event samples with the true $t scaled by 0.8 and 1.2, respectively have 
been produced and reconstructed. The samples are analysed by fitting a Gaussian shape to the core of the 
peak in the transverse mass distributions. The peak position shifts by -7 and +7 GeV for the sample with 
scale of 0.8 and 1.2 respectively, both with an statistical uncertainty of 0.5 GeV, indicating a sensitivity 

4 Here fake fti is defined as the scalar difference of reconstructed and true 



393 



Jets and Missing E t - Measurement of Missing Transverse Energy 



to the fa scale at the level of 2%. 

Note that backgrounds other than W+jets events are not considered in this study. Background from 
SUSY events can have a severe impact on the distribution by shifting the peak and thus mimicking a 
fa scale calibration offset. The existing knowledge of it events can be combined in a kinematic fit to 
improve the measured quantities and to investigate the scale of the fa measurement. The following mass 
constraints are available: m^ d = = 80.4 GeV, where m^ d is the reconstructed mass of two light jets 
of the hadronically decaying W and m|^ p is the reconstructed mass of the lepton and the neutrino of the 
leptonically decaying W. The reconstructed mass of the leptonically decaying top quark and that of the 
hadronically decaying top quark are assumed to be = m^ p = 175 GeV. The neutrino's transverse 
momentum is set equal to fa and its longitudinal momentum is analytically calculated from the 
constraint. 

The X 2 function of the fit is built using the energy of the four leading jets, with fit parameters to 
scale the corrected jet energies, a constraint on the product of the fit parameters to the a-priori known or 
assumed overall jet energy scale and with the implementation of the four mass constraints. All twelve 
possible permutations of assigning jets to the two top quarks and W bosons respectively are considered. 
Finally, only the permutation with the lowest % 2 is selected in each event. 

It is found that the fa, re-calculated after the fit, is not significantly improved. However, using a cut 
on x 2 improves the resolution on fa, so it is possible to use the x 2 of a kinematic fit to classify it events 
with a relatively good fa measurement without using b tagging. In the first data this classification helps 
to locate possible detector problems. 

The kinematic fitting procedure can be utilized to check the fa scale in early data. Background 
events are expected to be incompatible with the constraints used in the fit and thus be reduced by a cut 
on x 2 - Therefore, in contrast to the study using the transverse W mass as described above, this method 
suffers significantly less from backgrounds. 

After applying the fit to the events and requiring x 2 < 10 and in additional fa > 40 the background 
from W+jets event is reduced to the 1% level. A robust estimator of the fa scale is the measured 
transverse momentum difference of the (anti-) top quark mother of the leptonically and hadronically 
decaying W respectively: Apj = p l ^ p — p^f A where p l ^ p is the combined transverse momentum of the 
measured charged lepton, fa and one b jet, while p^f A is the momentum of the jets of the hadronically 
decaying W and the other b jet. Of course, the assignment of the jets to the correct top quark is not 
guaranteed. Nevertheless, this quantity is remarkably sensitive to the fa scale, as can be seen in Fig. 




Figure 23: In semileptonic it events, (left) the reconstructed transverse W mass in various ranges of fake 
fa, (right) the distribution of the measured momentum difference of the two top quarks, Apj for (true) 
fa scales of 0.8, 1.0 and 1.2 as indicated. The analysis is based on an integrated luminosity of 200 pb _1 
of data. 
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23(right). The mean values of the distributions are -16.4±0.9 GeV , -4.5±0.8 GeV, and4.5±0.9 GeV 
for scales of 0.8, 1.0, and 1.2, respectively. This implies a sensitivity on the |?t scale at the level of 2%. 
The systematic variation due to a shift of the top quark mass of 2.5 GeV is about 2%. 

7 Summary 

The $t in ATLAS is calculated from the energy in the calorimeter and from the reconstructed muons. 
The energy in the calorimeter is classified and calibrated according to the reconstructed objects to which 
it belongs. Two algorithms for reconstruction and calibration are presently implemented in the ATLAS 
software, one Cell-based, where the $t reconstruction and calibration is done starting from the energy 
deposited in calorimeter cells, and the other one Object-based, where the Iflj reconstruction is done from 
the reconstructed, classified and calibrated objects and from the energy outside of them. The performance 
of the two is similar. 

The $t performance has been checked on a large variety of events with physical |?t such as Z —>■ xx 
, A/H —>■ xx, SUSY, and tt, as well as events with no physical |?t like minimum bias events, events 
with QCD jets, and Z+jets processes. The resulting linearity of the response is within 5%, even for 
low true $t values of the order of 40 GeV. The $j resolution, o, follows an approximate stochastic 
behaviour over a wide range of values of the total transverse energy deposited in the calorimeters. A 
simple fit to a function a = a - y/LEj yields values between 0.53 and 0.57 for the parameter a, for LEj 
values between 20 and 2000 GeV. Deviations from this simple behaviour are expected and observed for 
low values of ZEj where noise is an important contribution, and for very high values of ZEj where 
the constant term in the jet energy resolution dominates. For values of the true $ T below 40 GeV, the 
accuracy of the measurement of the direction of the $t vector for small values of #j degrades rapidly. 
In contrast, for high values of the true fa, azimuthal accuracies better than lOOmrad can be achieved. 
This accuracy of the measurement of the direction allows an isolation cut on which can efficiently 
suppress events with a badly measured jet and the resulting fa pointing in the jet direction. 

A dedicated study of fake ffj shows that instrumental effects like hot/dead/noisy cells (regions) in 
calorimeters, as well as beam-gas scattering or other machine backgrounds, or displaced vertices, are 
very important, and that their understanding will be crucial in the first days of data taking. Different 
methods can be used to clean events and to correct/recover the $ T measurement. Mis-measurements in 
the detector itself, due to high-p T muons escaping from the fiducial acceptance or from large losses of 
deposited energy in cracks or inactive materials, might also effectively limit the performance of the fa 
reconstruction and have therefore been studied in detail. 

With the first 100pb _1 the algorithms for #j reconstruction and calibration can be checked studying 
the $t linearity and resolution in minimum bias events and in Standard Model processes like Z and W 
decays and in tt processes. Complementary methods for the determination of the $t scale in-situ have 
been studied and it has been shown that it will be possible to determine the tfj scale with a precision of 
at least 8%. 
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Abstract 

The ability to identify jets stemming from the fragmentation and hadroniza- 
tion of b quarks is important for the high-pj physics program of ATLAS: 
top physics, Higgs boson searches and studies, new phenomena. After an 
overview of the reconstruction of the key ingredients for Z>-tagging, the tag- 
ging techniques are described. The performance of ^-tagging algorithms is 
then detailed, as well as the impact on performance of several factors and new 
promising directions. Finally, expected performance in the first data and the 
anticipated uncertainty with which it can be measured are briefly discussed. 

1 Introduction 

This note discusses the identification of jets stemming from the hadronization of b quarks, or ^-tagging. 
The ability to identify jets containing ^-hadrons is important for the hi%h-pj physics program of a 
general -purpose experiment at the LHC such as ATLAS. This is in particular useful to select very pure 
top samples, to search and/or study Standard Model or supersymmetric (SUSY) Higgs bosons which 
couple preferably to heavy objects or are produced in association with heavy quarks, to veto the large 
dominant tt background for several physics channels and finally to search for new physics: SUSY decay 
chains, heavy gauge bosons, etc. 

The large majority of these studies requires good ^-tagging performance for jets with a transverse 
momentum ranging from 20 to 150 GeV. However, for super-symmetric processes, jets of pj as high 
as 500 GeV may have to be tagged [1], and for exotic phenomena Z?-jets of up to a few TeV can be 
produced. For top studies, the signal rates are very high at the LHC and therefore a moderate ^-tagging 
efficiency (> 50%) is acceptable, while a fraction of light jets mis-identified as Z>-jets below a few per 
mille suppresses most of the W+jets background (see for instance Ref. [2]). One of the most demanding 
channels for ^-tagging is the production of a light Standard Model Higgs boson in association with a 
top-antitop pair [3]: ttH(H — > bb). Four fr-jets have to be tagged with very high efficiency (Eb k, 70%) 
since the signal cross-section is low, and the mis-tagging rate must be kept below 1% to fight the large 
«+jets background. 

The identification of &-jets takes advantage of several of their properties which allow us to distinguish 
them from jets which contain only lighter quarks. First the fragmentation is hard and the Z?-hadron retains 
about 70% of the original b quark momentum. In addition, the mass of Z>-hadrons is relatively high (> 5 
GeV). Thus, their decay products may have a large transverse momentum with respect to the jet axis 
and the opening angle of the decay products is large enough to allow separation. The third and most 
important property is the relatively long lifetime of hadrons containing a b quark, of the order of 1 .5 ps 
(ct « 450/xm). A ^-hadron in a jet with pj = 50 GeV will therefore have a significant flight path length 
(/} = f}ycc, traveling on average about 3 mm in the transverse plane before decaying. Such displaced 
vertices can first be identified inclusively by measuring the impact parameters of the tracks from the 
ft-hadron decay products. The transverse impact parameter, do, is the distance of closest approach of 
the track to the primary vertex point, in the r — (p projection. The longitudinal impact parameter, zo, is 
the z coordinate of the track at the point of closest approach in r — (p. The tracks from Z>-hadron decay 
products tend to have rather large impact parameters which can be distinguished from tracks stemming 
from the primary vertex. The other more demanding option is to reconstruct explicitly the displaced 
vertices. These two approaches of using the impact parameters of tracks or reconstructing the secondary 
vertex will be referred to later on as spatial ^-tagging. Finally, the semi-leptonic decays of Z>-hadrons 
can be used by tagging the lepton in the jet. In addition, thanks to the hard fragmentation and high mass 
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of ^-hadrons, the lepton will have a relatively large transverse momentum and also a large momentum 
relative to the jet axis. This is the so-called soft lepton tagging (the lepton being soft compared to high-pr 
leptons from W or Z decays). 

The tagging methods relying on the impact parameter of tracks are detailed in this note. Only a 
summary and the main results of the other methods are given. The techniques employed to reconstruct 
either a single inclusive vertex or to attempt to resolve the complex topologies with a secondary b- 
hadron vertex and a tertiary c-hadron vertex are discussed in Ref. [4], as well as the reconstruction of the 
primary vertex. The tagging with soft muons or electrons from ^-hadron decays is detailed respectively 
in Ref. [5] and Ref. [6]. The expected performance of the ^-tagging algorithms in ATLAS, and the impact 
of several factors, are explained in detail in this note. However, the assessment of the impact of residual 
misalignments on the performance is just starting and first results are available in Ref. [7]. While a large 
effort is put into having a very accurate Monte Carlo simulation, the ^-tagging performance must be 
measured in data. Several studies aiming at measuring the ^-tagging efficiency in dijet events (Ref. [8]) 
or in tt events (Ref. [9]) have been performed. The studies to measure the mis-tagging rates are just 
starting and are not discussed. Finally, the high-level trigger of ATLAS has the capability to select Z?-jets. 
This is particularly interesting for channels with several b-jets where jet thresholds can be lowered at 
the first level thanks to the ^-tagging applied at the second and event-filter levels. The high-level trigger 
^-tagging performance and strategies are discussed in Ref. [10]. 

The layout of this note is as follows: in Section 2, the reconstruction of the key objects for b- 
tagging is briefly explained and the performance summarized. Since the definition of the flavour of a 
jet is not unambiguous in Monte Carlo, the estimators used to assess the performance are defined in 
Section 3. Section 4 is intended to be a pedagogical approach to the various tagging algorithms available 
and to the likelihood ratio formalism used by ATLAS. The ^-tagging performance for various physics 
processes is described in Section 5, relying on the current state-of-the-art ^-tagging production software. 
In Section 6, a few additional studies aiming at better understanding some critical aspects of the b- 
tagging are detailed, while in Section 7 three studies showing new directions to improve the ^-tagging 
performance are presented. In both cases the studies are described in a separate section because either 
they required specific datasets or they relied on software and/or cuts/optimizations which were different 
from the ones currently in use in the ATLAS software or they even required new software developments. 
In addition, the anticipated uncertainty with which the ^-tagging performance may be measured in data 
is discussed in Section 8. Finally in Section 9, the main findings and the expected performance in the 
first data are summarized. 

2 Reconstruction of the key objects 

The reconstruction of the various objects needed for ^-tagging and its performance are summarized in 
this section. 

2.1 Charged tracks 

The tracks reconstructed in the ATLAS Inner Detector [11] are the main ingredient for ^-tagging. On 
average a track consists of 3 pixel hits, 4 space-points in the silicon micro-strip detector and about 36 hits 
in the Transition Radiation Tracker (TRT). The innermost pixel layer (the so-called Mayer) is located at 
a radius of 5 cm, while the TRT extends up to a radius of 1 m. The tracker is immersed in a 2 T magnetic 
field generated by the central solenoid. The intrinsic measurement accuracy of the pixels is around 10 
jUm in r(j) and 1 15 /J,m in z. All these allow the tracker to measure efficiently and with good accuracy 
the tracks within \r\ \ < 2.5 and down to pr ~ 500 MeV. For a central track with pj = 5 GeV, which 
is typical for ^-tagging, the relative transverse momentum resolution is around 1.5% and the transverse 
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impact parameter resolution is about 35 jUm. Further details can be found in Refs. [11, 12]. 

Most of the results in this note are based on the default pattern-recognition and fitting algorithm, 
NewTracking. Its performance is described in Ref. [12]. When relevant, some comparisons are made 
with an alternate algorithm, iPatRec. 

2.1.1 Baseline track selection 

The track selection for ^-tagging is designed to select well-measured tracks and reject fake tracks and 
tracks from long-lived particles (K S ,A or other hyperon decays) and material interactions (photon con- 
versions or hadronic interactions). 

Two different quality levels are used. For the standard quality level, at least seven precision hits 
(pixel or micro-strip hits) are required. The transverse and longitudinal impact parameters at the perigee 
must fulfil \do\ < 2 mm and |zo — z pv \ sin 6 < 10 mm respectively, where z pv is the longitudinal location 
of the primary vertex. Only tracks with pj > 1 GeV are considered. For the Z>-tagging quality, the extra 
requirements are: at least two hits in the pixel detector of which one must be in the Mayer, as well as 
| do | < 1 mm and |zo — z pv \ sin 6 < 1.5 mm. This selection is used by all the tagging algorithms relying 
on the impact parameters of tracks, while slightly different selections are used by the secondary vertex 
algorithms as discussed in Ref. [4]. 

2.1.2 Tracking efficiency 

The Z?-tagging performance strongly depends upon the tracking efficiency. The tracking performance 
inside jets, where the track density may be high, is discussed in the following. The tracking performance 
for single tracks is discussed in Ref. [12]. 

Figure 1 shows the tracking efficiency and fake rate for tracks in it events as a function of the track 
pseudo-rapidity. For the efficiency denominator, only charged primary pions 1 produced well before the 
Mayer (\x — x pv \ < 10 mm, \y — y pv \ < 10 mm) and with pj > 1 GeV and |tj| < 2.5 are considered. The 
first level of the efficiency corresponds to the basic reconstruction efficiency, where a track matched to 
a Monte Carlo particle is found. The fake rate is defined as the fraction of reconstructed tracks which 
do not pass the matching criteria used for the efficiency, i.e. less than 80% of their hits are coming from 
the same Monte Carlo particle. At high pseudo-rapidities, the tracking performance deteriorates mostly 
because of increased material and more ambiguous measurements. 

Figure 2 shows the tracking efficiency and fake rate for tracks in tt events as a function of their 
distance AR = a/A0 2 + Atj 2 to the axis of the closest jet, for tracks fulfilling the ^-tagging quality cuts. 
The tracking performance degrades near the core of the jet where the track density is the highest and 
induces pattern-recognition problems. This is especially visible for high-pr (> 100 GeV) jets. 

Finally, in Figure 3 the tracking efficiency and fake rates obtained with the default algorithm and 
with iPatRec are compared. The first plot shows the comparison for several bins in the track pj for all 
jets, while the second plot is as a function of the distance to the jet axis for jets with Ej > 100 GeV 
It is interesting to note that the two algorithms have a different working point: the default algorithm 
maintains a low level of fakes at the price of losing in efficiency, while the complementary choice was 
taken for iPatRec. This difference in treatment will lead to different ^-tagging performance for jets with 
high momentum, as discussed in particular in Section 5.6. The features seen in these plots are specific to 
the pattern-recognition inside jets: for instance the decrease of the NewTracking efficiency at high track 
Pt is not visible for isolated tracks; it is here correlated with the local track density since high-/^ tracks 
are more likely to originate from denser high-pr jets. 

! For the tracking studies only pions were considered, but similar results are expected for charged kaons, protons, etc. which 
are all used for i-tagging. 
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Figure 1: Tracking efficiency (top plot) and fake 
rate (bottom plot) versus track pseudo-rapidity, for 
three levels of track selection: matching (blue tri- 
angles), standard quality cuts (red squares) and b- 
tagging quality cuts (black circles), in tt events. 



Figure 2: Tracking efficiency (top plot) and fake 
rate (bottom) versus distance to jet axis, for tracks 
fulfilling the Z?-tagging quality cuts and associated 
to low-pr jets (black symbols) or bigh-pr jets 
(green symbols), in tt events. 
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Figure 3: Tracking efficiency (top plots) and fake rate (bottom plots) in tt events after the ^-tagging qual- 
ity cuts, for two tracking algorithms: default NewTracking (black symbols) and iPatRec (red symbols). 
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Figure 4: Fraction of tracks with shared hits ver- 
sus distance to the jet axis. Tracks fulfilling the b- 
tagging quality cuts, and with at least one shared 
hit in the silicon systems are shown. The standard 
definition of shared hits (see text) is shown as well. 



Figure 5: Transverse impact parameter signifi- 
cance do/Od 0 for tracks in light jets. Two cate- 
gories of tracks are used: regular ones (red plain 
curve) and tracks with shared hits (blue dashed). 
Both distributions are normalized to unity. 



2.1.3 Tracks with shared hits 

Tracks originating from the same point and passing the track selection cuts will not necessarily have the 
same impact parameter distributions. First of all, even using the track parameters normalized to their 
error will not compensate for all resolution effects, such as non-Gaussian tails. In addition, the pattern- 
recognition process itself can produce tracks of variable quality depending on their hit contents. Those 
tracks require a special treatment to be flagged appropriately. The most significant subset of such tracks 
is formed by the tracks which are sharing some of their hits with other tracks. 

Figure 4 shows the fraction of tracks which are sharing at least one hit with another reconstructed 
track versus the distance of the track to the jet axis, for jets originating from tt events. Currently for 
^-tagging puiposes, a track is defined as a track with shared hits if it has at least one shared hit in the 
pixels or two shared hits in the strips. As expected, the fraction of tracks with shared hits increases with 
the local track density, and is therefore higher for high-pr jets and in the core of the jets. In tt events 
the average pj for taggable {i.e. pj > 15 GeV and \t] \ < 2.5) b-jets and light jets are respectively 74 
and 55 GeV. The fraction of tracks with shared hits is about 2%. For jets with a transverse momentum 
of about 140 GeV (WH events with m//=400 GeV, see below), this fraction is twice as high. In both 
cases the fraction is roughly similar for NewTracking and iPatRec. In an extreme case, for Z' —>■ bb 
events with m z > = 2 TeV, a majority of tracks have shared hits and the fraction depends significantly on 
the reconstruction algorithm (cf. Section 5.6). Even when the overall level of shared hits is relatively 
low, it has been demonstrated that those tracks should be treated appropriately since their impact on the 
^-tagging performance is significant. Indeed, the impact parameter significances, defined as the ratios 
do/(Jd 0 and z.q/o zo of the impact parameters to their measured error, for tracks in light jets exhibit a very 
different behavior depending on whether the track is a regular one or a track with shared hits, as shown 
in Figure 5. It is clear that tracks with shared hits can mimic lifetime more easily. 
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Figure 6: Track impact parameter resolution versus track pj, for several bins in the track pseudo-rapidity. 
2.1.4 Impact parameter resolution 

The resolution of the track impact parameter is a crucial ingredient to be able to discriminate tracks 
coming from long-lived hadrons and prompt tracks. To estimate it, all the reconstructed tracks in it events 
fulfilling the ^-tagging quality cuts and matched to a good Monte-Carlo track as defined in Section 2.1.2 
were used. The difference between the reconstructed and the true impact parameter within a bin was 
fitted with a single gaussian, whose a is reported on Figures 6(a) and 6(b), for respectively the transverse 
and longitudinal impact parameters. For a central track with pj = 5 GeV, which is typical for Z>-tagging, 
the transverse impact parameter resolution is about 35 }Xm. 

2.2 Primary vertex finding 

Another key ingredient for Z?-tagging is the primary vertex of the event. The impact parameters of tracks 
are recomputed with respect to its position and tracks compatible with the primary vertex are excluded 
from the secondary vertex searches. At LHC the beam-spot size will be = 15 fxm and a, = 5.6 cm: 
therefore the primary vertex is especially important for the z direction, while in the transverse plane only 
the beam-line could be used. The strategies to find the primary vertex and their performance are explained 
in Ref. [4]. The efficiency to find the primary vertex is very high in the high-;?7- events of interest, and the 
resolution on its position is around 12 jxm in each transverse direction and 50 jltm along z. With pile-up, 
the presence of additional minimum bias vertices makes the choice of the primary vertex less trivial: at 
a luminosity of 2 x 10 33 cm _2 s _1 (on average 4.6 minimum bias events per bunch-crossing) a wrong 
vertex can be picked up as the primary vertex in about 10% of the cases [4], thus causing a deterioration 
in the ft-jet tagging efficiency. 

2.3 Jet algorithms 

The baseline jet algorithm for the studies in this note is a seeded cone algorithm using the calorimeter 
towers with a cone size of AR = 0.4, and where the cells were calibrated using the HI method (see 
Ref. [13] for details). The impact on ^-tagging performance of using other jet algorithms is discussed in 
Section 6.4. 

For ^-tagging puiposes, only the jet direction is relevant. In the first place, this direction is used 
to define which tracks should be associated with the jets. The actual tagging is done on this subset of 
tracks. Currently tracks within a distance AR < 0.4 of the jet axis are associated to the jet. A given track 
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is associated to only one jet (the closest in AR). This is the case for actually any jet collections, regardless 
of the cone size of the jet. The jet direction is also used to sign the impact parameters of the tracks in the 
jet as explained in Section 4. 1.2. 

Except when stated otherwise, there was no attempt to remove from the reconstructed jet collection 
the jets which are composed of only electrons. In the tt sample (semi-leptonic and di-leptonic channels), 
about 5% of the taggable reconstructed jets are electrons. There is no dedicated treatment for muons. 
Isolated muons are very unlikely to fake jets, at least for the common processes under consideration 
in this note where pt{h) < 100 GeV. Muons in jets, stemming from ZVc-hadron semi-leptonic decays 
and measured in the muon spectrometer, deposit on average about 3 GeV in the calorimeter but their 
momentum as measured in the inner detector and the muon system is not used to refine the kinematics 
of the jet, which remain purely calorimeter-based. 

Only jets fulfilling pj > 15 GeV and |t]| < 2.5 are deemed taggable and considered in the perfor- 
mance studies. 

2.4 Soft lepton reconstruction 

Leptons arising from semi-leptonic decays of Z?-hadrons or subsequent ohadrons can be used to tag 
ft-jets. 

Soft muons are reconstructed [5] using two complementary reconstruction algorithms. A combined 
muon corresponds to a track fully reconstructed in the muon spectrometer that matches a track in the 
inner detector. Low-momentum muons (below p ~ 5 GeV) which cannot reach the muon middle and 
outer stations are identified by matching an inner detector track with a segment in the muon spectrometer 
inner stations. Muons satisfying some basic requirements (pr > 3 GeV, \do\ < 4 mm) are associated to 
the closest jet provided that AR < 0.5. Finally, the kinematic properties of the jet-muon system are used 
in order to reject the background caused by punch-throughs and decays-in-flight in light jets. 

Reconstructing soft electrons [6] in the calorimeter inside a jet is more difficult. This is achieved 
by matching an inner detector track to an electromagnetic cluster. For a given track, only the energy 
contained in a small window around the track extrapolation is used. The contribution of neighbouring 
hadronic showers is therefore reduced. The identification procedure takes full advantage of the tracking 
capabilities of the inner detector as well as of the granularity of the electromagnetic calorimeter: a 
likelihood ratio combines inner detector information such as transition radiation hits with shower shape 
variables from the calorimeter. The performance is, however, highly dependent on the track density in 
jets as well as the quantity of matter in front of the electromagnetic calorimeter. 

3 Performance estimators 
3.1 Labelling 

To define ^-tagging performance, the Monte Carlo event history is used to know the type of parton 
from which a jet originates. This labelling procedure is not unambiguous and is not strictly identical 
for different Monte Carlo generators. For the results presented here, a quark labelling has been used: a 
jet is labelled as a Z?-jet if a b quark with pj > 5 GeV is found in a cone of size AR = 0.3 around the 
jet direction. The various labelling hypotheses are tried in this order: b quark, c quark and % lepton. 
When no heavy flavour quark nor % lepton satisfies these requirements, the jet is labelled as a light-jet. 
No attempt is made to distinguish between u, d, s quarks and gluon since such a label is even more 
ambiguous. 
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3.2 Efficiency and rejection 

For performance studies, only jets fulfilling pj > 15 GeV and |t] | < 2.5 are considered and refered to as 
taggable jets. In the following, jets for which no track passed the Z?-tagging quality cuts are still counted 
in the performance estimators. However, events where the primary vertex could not be reconstructed 
are ignored. In addition, ft-jets were not categorized according to the nature of the Z>-hadron decay: b- 
jets with semi-leptonic decays behave quite differently from jets with hadronic decays, even when using 
purely spatial methods, but in the following no distinction was made. 

The tagging efficiency is naturally defined as the fraction of taggable jets labelled as &-jets (see 
previous section) which are actually tagged as Z>-jets by the tagging algorithm under study. The mis- 
tagging rate is the fraction of taggable jets not labelled as b which are actually tagged as b-jets,. For 
historical reasons the jet rejection is used instead: this is simply the inverse of the mis-tagging rate. 

3.3 Purification 

A difficulty arises as soon as the jet multiplicity is high and various jet flavours are present in a single 
event: a jet with A/?(jet — b) = 0.31 is labelled as a light jet, although tracks from ft-hadron decay with 
high lifetime content are likely to be associated to it. 

This leads to a decrease of the estimated performance, not related to the ^-tagging algorithm itself 
but to the labelling procedure which strongly depends on the activity of the event. In order to obtain 
a more reliable estimation of ^-tagging performance, a purification procedure has been devised: light 
jets for which a b quark, a c quark or a t lepton is found within a cone of size AR = 0.8 around the jet 
direction are not used to compute the rejection. 

The performance estimated after purification represents the intrinsic power of the ^-tagging algo- 
rithms and should be similar for different kinds of hard event, whereas results obtained for the complete 
light jet sample are more dependent on the event type. On the other hand, the latter is more representative 
of the actual ^-tagging power for a given physics analysis. This is illustrated in Figure 7: the light jet 
rejection in simple WH events is similar without or with purification (left plot), while for busier tt events 
(right plot) the two curves differ in the region where lifetime content as opposed to resolution effects 
dominates (i.e. for < 80%). In the following, jets fulfilling the purification procedure will be referred 
to as purified or pure jets, the ones failing this procedure will be called non-pure jets, while all the jets 
will be called raw jets. 

4 ^-tagging algorithms 

In this section the various algorithms used in ATLAS to tag Z?-jets are explained. The spatial algorithms, 
built on tracks and subsequently vertices, are the most powerful ones. Most of them are based on a 
likelihood ratio approach, but simpler and more robust tagging algorithms are also available. Soft lepton 
tagging algorithms are also very important, in particular since the correlation with the previous ones is 
minimal. Their performance is summarized in section 4.3. 

4.1 Spatial algorithms based on likelihood ratio 

All tracks in the jet fulfilling the ^-tagging quality cuts described in 2.1.1 are considered for the spatial 
Z>-tagging algorithms. In typical tt events, the average number of those tracks per light (£»-) jet is 3.7 (5.5) 
and their average pj is 6.6 (6.3) GeV, respectively. 
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Figure 7: Rejection of light jets and c-jets with and without purification versus b-jet efficiency for WH 
(ma =120 GeV) and tt events, using the tagging algorithm based on 3D impact parameter and secondary 
vertex. 

4.1.1 V° and secondary interactions rejection 

The preselection cuts on impact parameters reject a large fraction of long-lived particles and secondary 
interactions. Among the remaining tracks, the ones identified by the secondary vertex search (sec- 
tion 4.1.3) as likely to come from V° decays are rejected (they amount to between 1% and 3% of the 
tracks in light and Z?-jets respectively). To do so, the search starts by building all two-track pairs that 
form a good vertex. The mass of the vertex is used to reject the tracks which are likely to come from 
K S ,A decays and photon conversions. The radius of the vertex is compared to a crude description of 
the innermost pixel layers to reject secondary interactions in material. The cuts and performance of this 
selection are described in Ref. [4]. 

4.1.2 Impact parameter tagging algorithms 

For the tagging itself, the impact parameters of tracks are computed with respect to the primary vertex (cf. 
section 2.2). On the basis that the decay point of the Z>-hadron must lie along its flight path, the impact 
parameter is signed to further discriminate the tracks from Z>-hadron decay from tracks originating from 
the primary vertex. The sign is defined using the jet direction Pj as measured by the calorimeters (cf. 
section 2.3), the direction P, and the position X t of the track at the point of closest approach to the primary 
vertex and the position X pv of the primary vertex: 

sign(Jo) = (Pj x ? t ) ■ (?t x {X pv -%)) 

The experimental resolution generates a random sign for the tracks originating from the primary vertex, 
while tracks from the blc hadron decay tend to have a positive sign. The sign of the longitudinal impact 
parameter z,q is given by the sign of — r\ t ) x z,o t where again the t subscript refers to quantities defined 
at the point of closest approach to the primary vertex. 

The distribution of the signed transverse impact parameter do is shown on Figure 8, left plot, for 
tracks coming from ft-jets, c-jets and light jets. The right plot shows the significance distribution do/a^ 
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Figure 8: Signed transverse impact parameter do distribution (left) and signed transverse impact param- 
eter significance do/Od 0 distribution (right) for b-jets, c-jets and light jets. 



which gives more weight to precisely measured tracks. Combining the impact parameter significances of 
all the tracks in the jet is the basis of the first method to tag fr-jets. Three tagging algorithms are defined 
in this way: IP1D relies on the longitudinal impact parameter, IP2D on the transverse impact parameter 
and finally IP3D which uses two-dimensional histograms of the longitudinal versus transverse impact 
parameters, taking advantage of their correlations. 



4.1.3 Secondary vertex tagging algorithms 

To further increase the discrimination between b-jets and light jets, the inclusive vertex formed by the 
decay products of the bottom hadron, including the products of the eventual subsequent charm hadron 
decay, can be sought. The reader is referred to Ref. [4] for all details. The search starts by build- 
ing all two-track pairs that form a good vertex, using only tracks far enough from the primary vertex 
(L^d I (Jl 3d > 2 where L^o = \\X pv — X t || is the three dimensional distance between the primary vertex and 
the point of closest approach of the track to this vertex). Vertices compatible with a V° or material inter- 
action are rejected. All tracks from the remaining two-track vertices are combined into a single inclusive 
vertex, using an iterative procedure to remove the worst track until the x 2 of the vertex fit is good. Three 
of the vertex properties are exploited: the invariant mass of all tracks associated to the vertex, the ratio 
of the sum of the energies of the tracks participating to the vertex to the sum of the energies of all tracks 
in the jet and the number of two-track vertices. These properties are illustrated in Figure 9 for Z?-jets 
and light jets. The so-called SV tagging algorithms make different use of these properties: SV1 relies 
on a 2D-distribution of the two first variables and a ID-distribution of the number of two-track vertices, 
while SV2 is based on a 3D-histogram of the three properties which requires quite some statistics. The 
secondary vertex finding efficiency depends in particular on the event topology, but the typical efficiency 
E% v is higher than 60% in b-jets. The SV taggers require an a priori knowledge of ejj v and e^ v . 

A completely new algorithm, JetFitter, is also available, which exploits the topological structure of 
weak b- and c-hadron decays inside the jet. A Kalman filter is used to find a common line on which the 
primary vertex and the beauty and charm vertices lie, as well as their position on this line approximating 
the ft-hadron flight path. With this approach, the b- and c-hadron vertices are not merged, even when 
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Figure 9: Secondary vertex variables: invariant mass of all tracks in vertex (left), energy fraction ver- 
tex/jet (center) and number of two-track vertices (right) for ft-jets and light jets. 

only a single track is attached to each of them. The discrimination between b-, c- and light jets is based 
on a likelihood using similar variables to the SV tagging algorithm above, and additional variables such 
as the flight length significances of the vertices. This algorithm and its performance are also described in 
detail in Ref. [4]. 

4.1.4 Formalism of likelihood ratio 

For both the impact parameter tagging and the secondary vertex tagging, a likelihood ratio method is 
used: the measured value S, of a discriminating variable is compared to pre-defined smoothed and nor- 
malized distributions for both the b- and light jet hypotheses, b(Sj) and u(Sj). Two- and three-dimensional 
probability density functions are used as well for some tagging algorithms. The ratio of the probabilities 
b(Sj)/u(Sj) defines the track or vertex weight, which can be combined into a jet weight Wj et as the sum 
of the logarithms of the Nj individual track weights W,-: 



N T N t 



W/* = £lnW, = £ln^ (1) 

The distribution of such a weight is shown in Figure 10 for b-, c- and light jets for two different tagging 
algorithms: IP2D and the sum of the weights from IP3D and SV1. When no vertex is found, the SV 
taggers return a weight of In \_ e Jsv - To select Z?-jets, a cut value on Wj et must be chosen, corresponding 
to a given efficiency. The relation between the cut value and the efficiency depends on the jet transverse 
momentum and rapidity, and therefore is different for different samples. 

4.1.5 Likelihood ratio and track categories 

As seen already, tracks may exhibit different behavior even after the track selection, such as the tracks 
with shared hits (Figure 4). One idea to take advantage of the different properties of tracks is to arrange 
all tracks into various categories and use dedicated probability density functions for each category. The 
likelihood ratio formalism permits to incorporate such categories in a straightforward way. After the 
division of the tracks into disjoint categories j, where every category has its own set of reference his- 
tograms bj and iij, the jet weight can simply be written as the sum over all tracks in each category Nj 
and all categories Nc' 
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Figure 10: Jet ^-tagging weight distribution for b-jets, c-jets and purified light jets. The left plot is for 
the IP2D tagging algorithm. The right plot corresponds to the IP3D+S V 1 tagging algorithm. 

Currently in the Z>-tagging software, two track categories are used: the Shared tracks (tracks with 
shared hits), and the complementary subset of tracks called Good tracks. These track categories are only 
used for the time being for the IP1D, IP2D and IP3D tagging algorithms. 

4.2 Other spatial algorithms 

The spatial algorithms based on likelihood ratios require an a-priori knowledge of the properties of both 
b-jets and light jets. Methods to measure them in data are being devised for the b-jets [8, 9] but will 
require at least about 100 pb _1 . In addition, there is no clear way to extract a pure enough sample of 
light jets, and Monte Carlo simulation will probably have to be used once a thorough validation against 
data has been performed. A few other spatial tagging algorithms, less powerful, are therefore developed, 
which have less reliance on Monte Carlo and are expected to be easier and faster to commission with the 
first real data. 

The simplest approach that could be used, at least at the beginning, is the counting of tracks with 
large impact parameter or large impact parameter significance. Requiring a few of these tracks provides 
a sample enriched in ft-jets. The performance of such a tagging algorithm is not discussed in this note 
because it is not yet fully implemented in ATLAS. Such a simple tagger may also be very useful at the 
trigger level. 

Another approach is to combine the impact parameter of all the tracks in the jet. JetProb is an imple- 
mentation of the ALEPH tagging algorithm [14], used extensively at LEP and later at the Tevatron. The 
signed impact parameter significance d® / o c i Q of each selected track in the jet is compared to a resolution 
function for prompt tracks, in order to measure the probability that the track i originates from the 
primary vertex (Figure 11(a)): 

r-\4/°L\ 

@>i= £g(x)dx (3) 

The resolution function can be measured in data using the negative side of the signed impact param- 
eter distribution (cf. section 6.5.1), assuming there is no contribution from heavy-flavour particles which 



409 



b- Tagging - 6-Tagging Performance 



is not strictly true. 

The individual probability of each of the N tracks associated to the jet are then combined to obtain a 
jet probability j et which discriminates ft-jets against light jets (Figure 1 1(b)): 

7=0 J" 

where 

n ( & = if 4 > 0 




(a) Individual track probability x Sign(rfg) (b) Jet probability &j et 



Figure 1 1 : Distributions of the probability of compatibility with the primary vertex for individual tracks 
(left plot) and for all tracks in the jet (right plot) as defined for JetProb. The cases of b-jets (red plain), 
c-jets (green dashed) and light jets (blue dotted line) are shown. 

4.3 Soft lepton algorithms 

Soft lepton tagging relies on the semi-leptonic decays of bottom and charm hadrons. Therefore it is in- 
trinsically limited by the branching ratios to leptons: at most 21% [15] of fr-jets will contain a soft lepton 
of a given flavour, including cascade decays of bottom to charm hadrons. However, tagging algorithms 
based on soft leptons exhibit very high purity and low correlations with the track-based tagging algo- 
rithms, which is very important for checking and cross-calibrating performance in data (see for instance 
Ref. [8]). 

4.3.1 Soft muons 

Once a reconstructed muon is associated to a jet as explained briefly in Section 2.4, a likelihood permits 
to discriminate light jets from fc-jets. The algorithm and its performance are detailed in Ref. [5] and will 
not be discussed further in this note. To summarize, a light jet rejection of about 300 can be achieved for 
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a Z>-tagging efficiency of 10%. Those numbers include the semi-leptonic branching ratio, the detector ac- 
ceptance, the reconstruction efficiency as well as the jet-muon association efficiency. This was estimated 
in tt events, including a simulation of the cavern background (low-energy neutrons and photons stem- 
ming from the interaction of forward particles with the shielding) which reduces the rejection level by 
about 30%. The performance is relatively steady in the jet pj range 15-100 GeV and in pseudo-rapidity. 

4.3.2 Soft electrons 

A likelihood ratio is also used for soft electrons. The algorithm and its performance are detailed in 
Ref. [6] and will not be discussed further in this note. A light jet rejection of about 100 can be achieved 
for a ^-tagging efficiency of 7%. The efficiency of the soft electron identification is high, since two-thirds 
of the true b-jets containing a real soft electron are tagged by the soft electron algorithm. However, about 
25% of light jets are mis-tagged by real electrons from photon conversions and Dalitz decays. This 
was estimated in WH (ra# = 120 GeV) events without pile-up. Based on previous study [16], a further 
degradation by 10% (30%) is expected when on average 4.6 (23) minimum-bias events are added. While 
the performance is constant in jet pj in the range 15-100 GeV, it degrades quickly with the jet pseudo- 
rapidity because of the higher amount of dead material, the poor performance in the transition region 
between the barrel and end-cap cryostats of the electromagnetic calorimeter (1.37 < \r\ \ < 1.52) and the 
absence of the TRT beyond | r\ | > 2. 

4.4 Combining tagging algorithms 

Currently only the likelihood-based tagging algorithms have been combined, since the formalism is 
easy in this case: the weights of the individual tagging algorithms are simply summed up. The most 
commonly used tagging algorithm, IP3D+SV1, is actually such a combination. It should be noted that 
the SV tagging algorithms have been optimized to work in conjunction with the IP ones. Another one 
combines IP3D and JetFitter. Multivariate approaches to combine all tagging algorithms, including the 
soft lepton ones, have not received much attention so far. There are, however, some new studies and the 
use for instance of boosted decision trees is discussed in Section 7.3. 

4.5 Calibration of tagging algorithms 

The likelihood-based tagging algorithms require knowledge of the probability density functions of the 
discriminating variables for both the b- and light jet hypotheses: this is called the calibration of the 
tagging algorithms, or their reference histograms. In the following, those functions have been derived 
from a large sample of jets coming from tt and ttjj events. Several issues about the calibration and its 
impact on ^-tagging performance are discussed in Section 6.5. 

5 Performance for various physics processes 

In this section, the ^-tagging performance is reviewed for several physics channels of interest. Several 
spatial tagging algorithms are considered: JetProb and IP2D which are best suited for the initial period, 
IP3D and then IP3D+SV1 for regular operations once the secondary vertexing is understood and finally 
IP3D+ JetFitter for the ultimate performance. 

5.1 Dependence on jet transverse momentum and pseudo-rapidity 

The spatial ^-tagging performance depends strongly on the jet momentum and rapidity: the pj and r\ 
dependencies of the ^-tagging efficiency and light jet rejection for a given cut on the Z>-tagging weight 
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Figure 12: Z>-tagging efficiency and purified light jet rejection obtained with the IP3D+SV1 tagging 
algorithm operating at a fixed cut of 4 on the ^-tagging weight, for tt events. 
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(a) All jet p T (b) Jets with 30 < p T < 45 GeV (c) Jets with 60 < p T < 100 GeV 



Figure 13: Rejection of light jets with purification versus jet tj for the IP3D+SV1 tagging algorithm and 
for two different physics channels: jets from tt events and from WH (mn = 120 GeV) events, for a fixed 
60% tagging efficiency in each bin. 



are shown in Figure 12. At high pj or at high |tj|, the Z>-jet tagging performance is poor, regardless of 
which tagging algorithm is used. At low pj, maintaining a reasonable b-jet efficiency is possible only by 
loosening the cut on the weight, at the price of a very low rejection of light jets. The strong dependence, 
especially in pj, makes the extraction of the ft-jet efficiency from data complicated and means that more 
integrated luminosity will be required, since several bins are needed. 

Because of these strong pj and X] dependencies, and since various samples have very different spec- 
tra, it is not straightforward to compare between channels the integrated rejection numbers shown in 
the following. It is worth noting that this dependence is really a two-dimensional one, thus Figure 12 
is useful for illustrative purposes but the (pr, \f]\) spectrum of the jets in the sample considered is not 
properly factorized out. This is important for instance to parametrize the ^-tagging performance. This is 
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Figure 14: pj and \r\\ spectra of b (upper plots) and light (lower plots) jets for the various channels 
considered in this section. 



further illustrated in Figure 13: the rejections achieved in two different samples become more similar as 
a function of r\ when looking in bins of pj. The remaining differences are mostly because the binning in 
Pt is still too large, but also because of other minor differences between the samples: for example they 
have been generated with different Monte Carlo generators (cf. section 6.6). 

For reference, the pj and \r\ \ spectra of b and light jets in the various samples used in the following 
are shown on Figure 14. They affect the integrated rejections for the various channels. 

5.2 Simple topologies: WH channels 

This first class of events illustrates the performance obtained on simple event topologies where the jet 
multiplicity is very low. As discussed in Section 3.3, purification is not an issue in this case. 

Events from Higgs boson production in association with a W boson are interesting in this respect 
and are a benchmark for ^-tagging performance, even though the channel itself is no longer thought 
to be very promising at the LHC (see however Ref. [17] for a recent re-investigation). The W decays 
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leptonically, and there are only two jets coming from the hard process, originating from the H decay. To 
study the ^-tagging efficiency the decay H — > bb is simulated while here for the rejection of charmed and 
light jets the Higgs boson is forced to decay to cc or to the unlikely uu channel respectively. 

The ^-tagging performance obtained on this kind of events and for m# = 120 GeV is shown in 
Table 1 for the tagging algorithms considered. Two typical ^-tagging efficiencies were considered: 50% 
and 60%. For each tagging algorithm, the cuts on the weight required to achieve these efficiencies were 
determined over the whole sample, and then applied to estimate the rejections. 

To study more energetic jets, similar physics processes have been considered for a different Higgs 
boson mass, m//=400 GeV (again such a choice is unphysical since a 400 GeV Higgs boson would not 
decay to bb but is useful for these studies). The results are shown in Table 1 for the light jet rejection 
and in Table 2 for the c-jet rejection. The differences in performance between the two mass cases are the 
result of the different pj and Tj spectra: the jets for m# = 400 GeV are more energetic and explain most 
of the discrepancy, this effect being only slightly balanced by the fact that jets for m# = 120 GeV are 
more forward. For this channel, the gain obtained with JetFitter is more visible. 

5.3 Multi-jets channels: the top case 

The jet multiplicity in pair-produced top quark events is much higher. In the following, the channels 
where at least one of the W bosons decays to leptons are considered. For the dominant lepton+jets chan- 
nel, there are usually at least four jets from the hard process and extra jets from radiation. Several flavours 
of jets are present at the same time in the event: two b-jets from the top quarks, light jet(s) and often a 
c-jet from the W decaying hadronically. This increases the likelihood of having light jets contaminated 
with heavy flavour and also makes the labelling of jets even more ambiguous as discussed previously. 
The benchmark curves of jet rejection versus ^-tagging efficiency are shown on Figures 15(a) and 15(b), 
for light jets and for several tagging algorithms. The jets of the various flavours were taken from the 
same sample in this case, unlike for events of the WH channels. Table 1 shows the light jet rejection 
achieved in tt events and in ttjj events, both samples being generated with MC@NLO+HERWIG. The 
latter events are tt events which were filtered in order to have at least six jets, of which four are taggable. 
Since the performance in those two samples is similar, they have been merged. For light jets, both the 
raw (without purification) and purified rejections are shown. For c-jets and T-jets the purification does 
not make any significant difference. The rejection power of c-jets (Table 2 and Figure 15(c)) is naturally 
very limited because of the lifetime of c-hadrons and is almost independent of the physics process. With- 
out any optimization, the ^-tagging algorithms also prove to be useful for the identification of T-jets, as 
shown on Figure 15(d). The small discontinuities in the curves on Figure 15, visible notably for the IP3D 
tagger, are due to the conjunction of a coarser binning of the underlying probability density functions 
for this tagger and the presence of single-track jets (notably electrons faking jets). This effect is more 
pronounced for the T-jets (Figure 15(d)) where single-prong decays are abundant. 

The impact on the light jet rejection of electrons faking jets can be seen in the fourth block of 
Table 1 . Electron jets are seldom mis-tagged as Z?-jets, since they have usually a single prompt high- 
er track which is well-measured. In this sample, the hi%h-pj electrons are coming from W —> ev or 
indirectly from W — > TV. In the fourth part of Table 1, a jet j is considered as an electron faking a 
jet, and therefore discarded, if it matches with a reconstructed electron candidate e: AR(eJ) < 0.1 and 
E T {e)/E T {j) > 0.75. 

It is interesting to notice that, despite the more complex topology of these tt events, the integrated 
light-jet rejection achieved is higher than for the WH (m//=120 GeV) case. This is mostly because jets 
in tt events are more central than the ones in WH (m#=120 GeV) production (cf. Figure 14), and the 
Z>-tagging performance degrades quickly at large pseudo-rapidities as seen already. 

Table 1 shows the light jet rejection achieved in even more complex topologies with at least six jets. 
Those channels are relevant for the Higgs discovery channel ttH(bb) which requires a high ^-tagging 
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Table 1: Integrated rejection of light jets (with and without purification when it applies), for various 
event types and for several tagging algorithms. For each case, the cut on the Z?-tagging weight is chosen 
to lead to the quoted average Z?-tagging efficiency E b over the sample considered. The quoted errors are 
statistical only. 





JetProb 


IP2D 


IP3D 


IP3D+SV1 


IP3D+JetFitter 


WH (m H = 120 GeV) events 


e b = 50% 


83±1 


116±2 


190±3 


458±13 


555±17 


e b = 60% 


30±0 


42±0 


59±1 


117±2 


134±2 


WH (m H = 400 GeV) events 


e b = 50% 


73±1 


163±3 


179±3 


298±7 


396±11 


e b = 60% 


27±0 


56±1 


58±1 


96±1 


123 ±2 


tt and tt jj events 


Raw, e b = 50% 


91±0 


146±1 


232±2 


456±4 


635±7 


Purified, e b = 50% 


97 ±0 


186±1 


310±3 


789±10 


924±13 


Raw, e b = 60% 


28±0 


46±0 


67±0 


154±1 


189±1 


Purified, e b = 60% 


28 ±0 


51±0 


76±0 


206±1 


224±2 


tt and tt jj events, once electrons fakiri 


g jets are removed 


Raw, e b = 50% 


92±0 


142±1 


219±1 


423±4 


593±6 


Purified, e b = 50% 


99±0 


181±1 


293±2 


732±10 


863±12 


Raw, £ h = 60% 


31±0 


49±0 


67±0 


144±1 


180±1 


Purified, e b = 60% 


33±0 


56±0 


76±0 


194±1 


213±2 


ttH events 


Raw, £ b = 60% 


23 ±0 


35±0 


49±1 


90±2 


113±2 


Purified, E b = 60% 


25±0 


48±1 


72±1 


188±5 


188±5 


Raw,£ h = 70% 


10±0 


14±0 


18±0 


32±0 


31±0 


Purified, e b = 70% 


11±0 


17±0 


22±0 


46±1 


37±1 


ttbb events 


Raw, £ b = 60% 


23 ±0 


34±0 


50±1 


100±2 


123 ±2 


Purified, e b = 60% 


24±0 


41 ±0 


64±1 


156±4 


166±4 


Raw, E h = 70% 


10±0 


13±0 


18±0 


32±0 


28±0 


Purified, £ h = 70% 


10±0 


15±0 


20±0 


40±0 


31±0 


SUSY SU3 events 


Raw, £ b = 50% 


66±1 


140±4 


162±5 


246±9 


328±14 


Purified, £ b = 50% 


68±1 


161±5 


183±6 


290±13 


375±19 


Raw, £ h = 60% 


24±0 


50±1 


55±1 


89±2 


110±3 


Purified, £ h = 60% 


25±0 


53±1 


58±1 


99±3 


117±3 
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Table 2: Integrated rejection of c- and T-jets, for various event types and for several tagging algorithms. 
For each case, the cut on the ^-tagging weight is chosen to lead to the quoted average ^-tagging efficiency 
Eb over the sample considered. 





JetProb 


IP2D 


IP3D 


IP3D+SV1 


IP3D+JetFitter 


c-jet rejection for WH = 400 GeV) events 


E b = 50% 
£ b = 60% 


7.9±0.1 
4.7±0.0 


9.7±0.1 
5.7±0.1 


10.7±0.2 
6.1 ±0.1 


12.4±0.2 
6.8±0.1 


12.7±0.2 
7.3±0.1 


c-jet rejection for ti and tt jj events 


E b = 50% 
£ h = 60% 


8.4±0.0 
5.1±0.0 


9.5±0.0 
5.8±0.0 


10.6±0.0 
6.5±0.0 


12.4±0.1 
7.4±0.0 


12.3±0.1 
7.4±0.0 




T-jet rejection 


for tt and tt jj events 




£ b = 50% 
£ b = 60% 


10.2±0.1 
5.1±0.0 


13.9±0.1 
6.4±0.0 


20.3±0.2 
8.0±0.1 


45.2±0.8 
24.6±0.3 


36.9±0.6 
19.3±0.2 



efficiency since four jets are ^-tagged and the cross section is low: therefore the more typical working 
points of Eb around 60-70% are shown. As shown in Figure 14, the pj spectrum for the fc-jets in these 
samples is harder than for b-jets in tt events, explaining partly the differences in performance. In addition 
the high b-jet multiplicity in ttH events leads to some lifetime contamination in the few light jets available 
in this sample. All these samples are based on PYTHIA Monte Carlo, unlike the ttjj sample which is 
a background for this channel as well but is based on MC@NLO+HERWIG Monte Carlo and was kept 
separate for this reason. 

5.4 High-p r jets: SUSY 

Events from the SUSY bulk region (SU3 point, see Ref. [1]) were considered. In these events, the average 
taggable jet multiplicity is about 5.3 and a large number of T-leptons are produced in the decay chain of 
charginos and neutralinos. There are on average 0.6 fc-jets per event, with a relatively hard pj spectrum 
as shown on Fig. 14: the average pj is 144 GeV. On average about 0.6 taggable jets per event are labelled 
as T, compared to 0.2 in the semi-leptonic tt channel. They are not considered as light jets. The results 
are shown in Tables 1 and 2: because the pj of the jets is quite high, the light jet rejection is similar to 
that achieved for WH (ra# = 400 GeV) events but significantly worse than for the other channels. 

5.5 Degradation of performance at low and high pj 

At low pj, performance is degraded mostly because multiple scattering is increased. This also holds 
for the high | X] | region, where the amount of material in the tracking region increases very significantly, 
inducing more secondary interactions. There is currently no rejection of secondary interactions found in 
the pixel disks, unlike in the barrel (cf. Section 4. 1.3). More importantly, the increase of the extrapolation 
distance from the Mayer to the primary vertex at large pseudo-rapidities significantly degrades the zo 
resolution as seen in Figure 6(b). 

Several effects conspire to reduce the ^-tagging performance as the jet pj increases above 120 GeV. 
First of all, the fraction of fragmentation tracks increases with the parton transverse momentum, as shown 
in Figure 16, while the jet is collimated into a narrower cone: since a fixed-size cone is currently used 
to associate tracks to the jet this leads to a dilution of the discriminating power in ^-jets. The density 
of tracks in the core of energetic jets challenges the pattern-recognition ability of the software and of 
the inner detector itself, leading to either a reduced tracking efficiency or a high level of fakes as shown 
in Figure 3 for jets with Ej > 100 GeV. Finally, for very energetic ft-hadrons the Lorentz boost leads 



416 



b- Tagging - 6-Tagging Performance 




b-jet efficiency b-jet efficiency 

(c) ojets (d) T-jets 

Figure 15: Rejection of light jets, c- and T-jets versus b-jet efficiency for tt and ttjj events and for all 
tagging algorithms: JetProb, IP2D, IP3D, IP3D+SV1, IP3D+JetFitter. 
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to a much enhanced decay length. The typical ct (~ 450/im) of Z>-hadrons is thus scaled by a factor 
7 ~ \PB\/mB which can be large. For high pj jets, the Z>-hadron can decay at a rather large radius Rb, as 
illustrated in Table 3: close to the inner radius of the pixel detector, leading to more tracking ambiguities 
in the first detection layers, or even after the first pixel layer. In the latter case, the current requirement 
(for the IPnD and JetProb tagging algorithms) of a hit on the Mayer actually kills the signal. At very 
high pj (above 500 GeV), these effects become so critical that a dedicated strategy has to be devised, as 
discussed in the next section. In the current simulation, the ft-hadrons decaying at a radius larger than the 
beam-pipe or the ft-layer radii do not interact with these objects, while in real events this will even more 
reduce the performance. 




Figure 16: Fraction of selected tracks which are 
not from B/D decays versus jet pj, in &-jets from 
WH (m H = 400 GeV) events. 



Table 3: Fraction of £-jets in WH (m H = 400 
GeV) events for which the Z;-hadron decays be- 
yond the beam-pipe vacuum (first column) or be- 
yond the Mayer (second column). 





R B > 2.9 cm 


R B > 5.1 cm 


all£ r 

E T > 100 GeV 
E T > 200 GeV 


9.0% 
12.2% 
21.1% 


2.8% 
3.9% 
7.9% 



5.6 The case of very high-p r jets: exotic physics 

The very high pj range is defined as jets exceeding a transverse energy of 500 GeV. Identification of 
such very high pj &-jets is required for the search for heavy resonances with (predominantly) hadronic 
decays. A large number of exotic physics models presents signatures with very high pj Z?-jets, up to a 
few TeV An example is the decay Zh — > Zh in the little Higgs model [18], where Z — ► e + e~ and h — ► bb. 

In the following, three samples corresponding to the process Z' — ► qq, where q denotes u, b and c 
quarks respectively were used. The Z' mass is chosen to be 2 TeV, so that the primary partons have 
transverse momenta in the range from 300 GeV to 1 TeV. 

The ^-tagging algorithms rely particularly on the determination of the jet axis as an approximation of 
the b- and c-hadron flight direction. The difference between the jet pseudo-rapidity and the true ft-hadron 
direction exhibits a narrow Gaussian core. The widths of the core range are a n ps 0.025 and « 0.010 
mrad, with a moderate dependence on jet Ej and pseudo-rapidity. Non-gaussian tails give rise to large 
RMS: RMS ^,0 ~ 0.050. The particles from the decay of the highly boosted ft-hadron are emitted at very 
small angles. Up to half of the tracks from the ^-hadron decay lie within the azimuthal angle between the 
ft-hadron and the jet. In these cases the sign of the impact parameter cannot be determined accurately. 

The reconstruction of tracks in high pj jets presents a series of specific challenges, as already ex- 
plained in Section 5.5. Particles are associated to reconstructed jets using a AR < 0.4 criterion. To 
highlight reconstruction effects only true pions reaching the outer radius of the tracker are taken into ac- 
count. Particles are moreover required to originate from a well-defined vertex: either the primary vertex 
of the event or the ZVc-hadron decay vertex. Only the first level of the efficiency, i.e. the matching to 
hits (Section 2.1.2), is considered. With this definition the efficiency on a reference sample of low pj 
jets is close to 100% and essentially independent of the jet energy and of the distance to the jet axis. In 
Figure 17 the track efficiency in the Z' sample is plotted versus the jet transverse energy, for the default 
track reconstruction - NewTracking - and for iPatRec. 

The algorithmic reconstruction efficiency for prompt tracks is only slightly degraded. Even in the 
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Figure 17: Algorithmic tracking efficiency for 
long-lived pions as a function of the jet trans- 
verse energy, for prompt tracks or tracks from blc- 
hadron decays, and for two reconstruction algo- 
rithms: NewTracking and iPatRec. 



Figure 18: Raw light jet rejection for jets from 
Z' — > qq with mz> = 2 TeV, versus jet transverse 
energy, for the IP3D+SV1 tagging algorithm (with 
tracks from iPatRec). and for three fr-tagging effi- 
ciencies. 



very harsh environment of a 1 TeV jet, the efficiency is approximately 90%. The degradation is most 
pronounced in the core of the jet ( AR < 0.1 ). For pions originating in ft/c-hadron decays a much more 
significant degradation of the efficiency towards high jet Ej is observed. For 1 TeV jets the efficiency is 
approximately 50%. The efficiency shows a strong dependence on the decay vertex radius. It is worth 
mentioning that NewTracking and iPatRec assign very different errors to the positions defined by large 
pixel or SCT clusters arising in the inner layers with such dense jets: the former assigns a very small 
eiTor assuming only one particle was involved in the cluster, while the latter is assuming the opposite and 
assigns the maximal error (cluster width/ VT2). 

In very high pj jets the probability that two or more tracks share a hit is very high (> 10%), unlike 
what was seen for low and moderate pr jets (cf. Figure 4). The number of shared hits per track - eval- 
uated for individual sub-detectors, or for the complete silicon tracker - is a factor 2-3 larger in iPatRec. 
For particles from very displaced ZVc-hadron decay vertices reconstructed with iPatRec, tracks with a 
shared hit in the Mayer actually outnumber the tracks with an unambiguous assignment. Again the two 
tracking algorithms have made opposite choices for their working point: NewTracking considers shared 
hits are stemming from pattern-recognition errors and tries to assign them to the best track, which is 
not necessarily meaningful when the cluster is really originating from several near-by particles, while 
iPatRec does not try to resolve the ambiguity. 

The high multiplicity of fragmentation tracks, the degraded tracking efficiency, the ambiguities in hit 
assignment particularly in the innermost layer and the uncertainty in the impact parameter sign, all render 
high pj jets a harsh environment. This may be improved though by the use of dedicated reconstruction 
algorithms. A priori, the current simulation should also be updated to transport properly the ^-hadrons 
through the material. While studies have started, they are beyond the scope of this note. For the time 
being, a re-optimization of the default tagging algorithm parameters has been performed with the aim 
of improving the ^-tagging performance over a large jet Ej range (from 200 GeV to 1 TeV). The cone 
size for the jet-to-track association was reduced to 0.2 (see Section 7.2 for a possible improvement of the 
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current treatment), the pj cut on tracks raised to 5 GeV and the use of tracks without a hit in the Mayer 
was allowed (see also Section 7.1). 

The tagging performance on iPatRec tracks is found to be on average 60% better than for the default 
algorithm. The choice of iPatRec to maintain high tracking efficiency inside dense jets (cf. Figure 3), 
even at a price of higher fake rates, seems to be instrumental in achieving better performance here. The 
resulting ^-tagging performance is presented in Figure 18. Given the modest level of rejection achieved, 
a tagging efficiency of 40% is considered here. Without any tuning, the rejection level of the IP3D+SV1 
tagging algorithm would be about three times worse for the same ^-tagging efficiency. 

In this study, the standard (low pj) reference histograms were used for the tagging algorithms. So 
far, none of the methods developed to extract the calibration histograms from data has been shown to 
work for these very high pj jets. Reference histograms for very high pj jets may be extracted from 
Monte Carlo simulation, provided it reliably describes the data. Doing so, a modest improvement of 
the performance (up to 50% higher light jet rejection for the same b-tagging efficiency compared to the 
results shown here) can be achieved. 

To conclude, ^-tagging for very high pj jets faces a series of specific difficulties. This study demon- 
strated that a rejection between 10 and 70 for jets with pj > 500 GeV can be achieved by tuning the cur- 
rent algorithms. Further improvements require dedicated treatments at the clustering level (with probably 
a second-pass approach to break down large clusters coming from near-by particles) and at the pattern- 
recognition stage of the track reconstruction. On small preselected datasets, retracking with a specially 
optimized pattern-recognition algorithm should be possible. 

6 Specific studies to characterize ^-tagging performance 

In this section, a few additional studies aimed at better understanding some critical aspects of the b- 
tagging performance are detailed. Those studies are described in a separate section because either they 
required specific datasets or they rely on software and/or cuts/optimizations which are different from the 
ones currently in use in the ATLAS software. 

6.1 Impact of residual misalignments 

All the studies in this note do not take into account the effect of residual misalignments. While all 
the samples studied were simulated with misalignments, they were reconstructed assuming a perfect 
knowledge of those misalignments. However, detailed studies, discussed in Ref. [7], are in progress on 
this subject. Two different approaches were used: residual misalignment sets and actual realignment. 
In the former, the events simulated with misalignments are reconstructed using the knowledge of the 
misalignments, but the true detector elements positions are shifted and/or rotated slightly from their 
actual position to mimic residual misalignments. The individual pixel modules were shifted by about 
10 }Xm in x and 30 }im in y and z, and rotated by about 0.3 mrad. The pixel layers, disks and the 
whole detector were displaced by slightly smaller amounts. In this case, the light jet rejection drops by a 
factor 2 for the same Z>-tagging efficiency. For residual misalignments about half as big, the drop in light 
jet rejection is degraded by 40% compared to the ideal case. The reduction of residual misalignments 
relies on the actual alignment procedure, which is performed to obtain the new positions of the detector 
elements. This is the most realistic case considered so far, and includes many (but not all) systematic 
deformations caused by the alignment procedure itself. In this case, the light jet rejection is at most 25% 
lower for the same ^-tagging efficiency. 
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6.2 Impact of the tracker material on performance 

A major effort has been invested in describing accurately the material in the tracking volume of ATLAS. 
However, some underestimation is possible. To assess the impact of extra material on the Magging per- 
formance, results with different geometries were compared. Extra material was added, mostly beyond 
the Mayer, increasing the thickness in radiation lengths by about 8% (15%) at |t]| 0(1). The first 
noticeable effect is the degradation of the impact parameter resolution. The other effect is an increased 
fraction of particles undergoing interactions in the matter of the detector and producing secondary par- 
ticles which can, directly or through pattern-recognition problems, fake non-prompt tracks. At a 60% 
Magging efficiency, the extra material decreases by 10% the light jet rejection power. About 60% of 
the loss of rejection is explained by the worsening of the impact parameter resolution, and about 40% by 
extra secondaries. 

6.3 Impact of the pixel detector conditions 

The pixel detector and notably the innermost Mayer are critical for achieving good Magging perfor- 
mance. The detector efficiency clearly affects the tracking performance but is also explicitly a key in- 
gredient for Magging since the Magging quality cuts require that each track have at least two pixel hits 
of which one is in the Mayer. These pixel hit requirements are made in order to maintain the highest 
resolution on the impact parameter of tracks. 

A single pixel inefficiency of 5% has been used to simulate the events. Measurements on the pixel 
staves before the detector integration gave a single pixel inefficiency below ~ 0.3% (and below ~ 0.1% 
for the Mayer for which the highest quality components were used). The effect of this inefficiency is 
especially relevant at small | Tj | where half of the pixel clusters contain only one pixel. The impact of 
varying the fraction of randomly distributed dead pixels was studied for three tagging algorithms on 
a large statistics (600k events) tt sample and the results are shown in Table 4. When decreasing the 
fraction of dead pixels from 5% to 1%, the tracking efficiency for tracks fulfilling the Magging quality 
cuts improves by up to 2.5% absolute (around T7 ~ 0), leading to a relative gain in rejection of about 
10%. 



Table 4: Reference light jet rejection for several tagging algorithms and the relative change with vari- 
ous configurations of the pixel system (see text) for a 60% Magging efficiency in tt events. The used 
reference histograms were produced with the respective samples. 





IP2D 


IP3D 


IP3D+SV1 


Reference rejection (5% of dead pixels) 


54±1 


77±2 


229 ±10 


Relative change with 1 % of dead pixels 


+9% 


+10% 


+17% 


Relative change with a dead half-stave on Mayer 
Relative change with a dead bi-stave on Mayer 
Relative change with a dead half-stave on external pixel layer 
Relative change with a dead bi-stave on external pixel layer 


-8% 
-34% 
< 1% 

-1% 


-8% 
-34% 
< 1% 

-1% 


-10% 
-28% 
-1% 
-4% 



More global problems, such as chip and module inefficiencies were not considered in the studies. 
Their effect has been studied in detail in Ref. [19] and can be very important. However, the latest 
measurements made right before the detector integration indicate that only one module (in the middle 
layer) out of 1744 is dead and that fewer than 5 chips are dead out of 27904. Besides single module 
failures, more dramatic failures might happen. During the pixel operation, it is thought that the two 
most likely sources of potential failures could be an opto-board failure, leading to half a stave (at most 
7 modules) not functioning and a cooling problem implying that a whole bi-stave (26 modules) could 
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not be used. To study those scenarios, the pixel digitization was modified to disable the corresponding 
modules, in either the Mayer or the external pixel layer. The impact on Z>-tagging performance of these 
two scenarios is shown in Table 4. In the case of well-identified module failures, it is clear that some 
recovery strategies can be used, either directly in the tracking code or at least in the ^-tagging algorithm, 
for instance by not requiring a hit on a dead module. 

6.4 Jet algorithms 

For ^-tagging purposes, an accurate knowledge of the jet direction is relevant. In the first place, this 
direction is used to define which tracks should be associated to the jets. Then it is used to sign the impact 
parameter of tracks. 

As mentioned earlier, all ^-tagging results are given for jets reconstructed with a cone algorithm of 
size AR = 0.4. Since a given physics analysis may opt for a different jet algorithm, the impact of this 
choice on the Z>-tagging performance is tested in this section. In all cases, as it is the default in the b- 
tagging software, only the tracks within a distance AR < 0.4 of the jet axis were used for the tagging, 
even for jet algorithms which could benefit from a less geometric track-jet association such as the kj 
algorithm. 

Several cone sizes AR and size parameters R were studied for the cone algorithm and the kj algo- 
rithm: from 0.2 to 0.8 in steps of 0.1. Finally, the ^-tagging performance of the mid-point algorithm [20], 
an alternate jet algorithm addressing the infrared sensitivity of cone algorithms, was also checked, for 
two different cone size of AR = 0.4 and 0.7. In this study, electrons faking jets were removed. 

Figure 19 shows the rejection of light jets versus the ^-tagging efficiency obtained with the IP3D+SV1 
tagging algorithm, for several jet algorithms run on tt events. Results with and without purification are 
very different. With purification (Figure 19(b)) there is no significant difference in performance in the 
relevant range of ^-tagging efficiency (40% <£b < 75%). 




Figure 19: Rejection of light jets versus ^-tagging efficiency for the IP3D+SV1 tagging algorithm applied 
on jets reconstructed with different algorithms: cone algorithm with size AR = 0.4,0.7 or kj algorithm 
with parameter R = 0.4,0.6. See text for the last plot. 

This stability is the anticipated behavior, since only the jet direction is meaningful for ^-tagging 
purposes and it does vary with the jet algorithm but not drastically for moderate jet pj\ the mean of the 
distance AR(b,jet) for a 50 GeV jet is 0.081 for a jet of size AT? = 0.4 and 0.095 for a jet of size AR = 0.7 
(see Ref. [9] for more details). For completeness, it should be mentioned that no differences were found 
between tower-based and topological cluster-based jets. 

Without the purification procedure (Figure 19(a)), the results are different for different jet defini- 
tions. However, the interpretation is not straightforward. In principle broader jets could be more easily 
contaminated by neighbouring tracks originating from distinct partons whose showers could not be re- 
solved: light jets for instance could be contaminated by heavy-flavour decay products. However, this 
effect should be marginal since the maximum track-jet distance for association is kept to AR = 0.4 in all 
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cases. 

A better explanation is linked to the ambiguity and arbitrariness of the labelling procedure: with 
fewer, broader jets the assignment of partons to the jets is more ambiguous and more likely to have 
AR(b,jet) > 0.3, and thus more jets wrongly labelled as light jets. Therefore, for example, the rejection 
at Eb = 50% with a cone radius AR = 0.7 appears to be three times less than with a cone of radius 0.4, by 
relabelling the jets with a AR cut of 0.4 instead of 0.3 this difference can be virtually eliminated as seen 
in Figure 19(c). Thus although it appears to be more difficult to define the true flavour of a broader jet 
this does not necessarily exclude the choice of a cone size of 0.7 for a given analysis. 

6.5 Sensitivity to the calibration of tagging algorithms 

Most of the ATLAS tagging algorithms make use of an a priori knowledge to discriminate fr-jets from 
light jets, which comes in various forms. The simplest of these ingredients is the transverse impact 
parameter resolution function used by the JetProb tagging algorithm to measure the compatibility of 
tracks with the primary vertex. The likelihood ratio tagging algorithms rely on several such distributions, 
with the further complication that they must be known for both the light and the b- hypothesis. In 
this section, the way this knowledge may affect the performance is studied. It is not currently possible 
to know if these settings are a good representation, both in nature and amplitude, of the differences 
data/Monte Carlo that will be observed, but at least they give information about the robustness of the 
tagging algorithms. All the results are given for the tt sample. 

6.5.1 JetProb tagging algorithm 

JetProb is expected to be one of the first tagging algorithms to be commissioned in ATLAS. To perform 
well, the resolution function (cf. Section 4.2) must be measured in data to avoid possible short-comings 
of the Monte Carlo simulation (resolutions and non-gaussian tails mostly). One of the major advantages 
of this tagging algorithm is that a priori any track from any physics process, e.g. the tracks from the first 
minimum-bias events, could be used to calibrate the resolution function, provided the contamination of 
non-prompt tracks can be kept at a very low level. 

The sensitivity to this last point was studied by checking two different scenarios to select tracks 
in order to build the resolution functions. In all cases, only reconstructed tracks with negative impact 
parameter significance and fulfilling the Z;-tag quality cuts are used. This was done on tt events but 
similar or better (because of less heavy flavour contamination) results are expected for minimum-bias 
events. In the ideal case, the tracks are required to match to a true particle whose true origin is at the 
primary vertex of the event. In the realistic case, this requirement was not enforced. The distributions of 
the negative impact parameter significance do / d f / 0 of tracks obtained in the two cases exhibit significant 
differences in the tails: in the ideal case, 0.6% of the tracks have \do\ > 5(Jd 0 (the RMS of the distribution 
is 1.3) while this fraction is 3.2% for the realistic case (RMS is 2.1). These distributions are then used 
as resolution functions to measure the ^-tagging performance on the same events: at a 50% ^-tagging 
efficiency, the light jet rejection with the realistic scenario is 15% lower than for the ideal case. 

6.5.2 Likelihood-based tagging algorithms 

For the likelihood-based tagging algorithms, the probability density functions for all the variables (cf. 
Section 4.1) are built for the b-jet and light jet hypotheses using Monte Carlo. The tt channel can be 
used in data to isolate a sample of pure ft-jets from which the various distributions can be derived, using 
the methods described in [9]. However, more than a few hundreds of pb _1 of data are needed. For the 
light jets, it seems very difficult to isolate a pure enough sample in data. In this section, the sensitivity 
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to the calibration is estimated using reference histograms obtained from Monte Carlo with very different 
settings. 

First the impact of using a different tracking algorithm for the calibration (iPatRec) and for the 
performance measurement (NewTracking) was assessed and led to a very small variation of the rejection 
power, below 10%. Another study consisted of using different detector descriptions for calibrating and 
testing: the two geometries compared were relatively similar, with a relative difference in the amount 
of material in the tracking volume of 8% (15%) at rj ~ 0 (1). At most a 5% change in rejection power 
is seen in this case. Another issue is the sample composition of the reference histograms: using only tt 
events or a mix of tt, WH {inn = 120,400 GeV) and SUSY events does not change significantly (< 5% 
relative change on rejection power) the ^-tagging performance on a tt sample. However, larger effects 
are expected when ^-tagging is run on a very different sample from the one used for the calibration. 

A possible bias when building the calibration distributions and looking at performance on the same 
event sample was studied by dividing the sample into two. The bias on the resulting rejection factors was 
found to be usually negligible, and in all cases below 10%. 

Finally, it was checked what statistics are needed to define the underlying histograms for the various 
tagging algorithms in order for results to be stable. This was checked on semi-leptonic tt events by 
halving a 600k event sample, building calibrations on 10k, 50k, 100k and 300k events from the first half- 
sample and checking the performance on the other half. To obtain a rejection level stable within 3%, 50k 
events are needed for all the IP and SV tagging algorithms when used in a regime where Eb > 50%. 

6.6 Sensitivity to the Monte Carlo modelling 

In the Monte Carlo modelling, several parameters can affect the ability to tag ft-jets. Any effect that 
can change the lifetimes of the produced particles, the multiplicity of the charged tracks or the momenta 
of these tracks can potentially change the tagging efficiency. This modelling is not necessarily a good 
description of data, and in addition it is also performed differently across generators. 

6.6.1 Fragmentation 

First of all, various fragmentation models, describing the non-perturbative process in which quarks 
hadronize into colorless hadronic states, are implemented in the Monte Carlo generators. For heavy- 
flavour quarks, two options are available in the PYTHIA generator: the Lund-Bowler model and the 
Peterson fragmentation model. While the former has been found to give reasonable agreement with 
experimental data at LEP, SLC and HERA, the latter is currently the default in ATLAS PYTHIA produc- 
tions. The impact of these various fragmentation models was investigated with six different tt samples. 
For the Peterson fragmentation, three samples were produced with different values for the Eb parameter: 
0.003, 0.006 (default), 0.012. For the Lund-Bowler model, the r Q parameter was varied: 0.50, 0.75, 
1.0 (default). The maximum relative discrepancy in the ^-tagging efficiency was found to be around 
6%, comparing the Peterson model with Eb = 0.012 to the Lund-Bowler model for rg = 0.5. However, 
this choice of parameters is a bit extreme: the difference between the default Peterson model and the 
Lund-Bowler model with rg = 0.75 (which was found to fit best the OPAL and SLD data) leads to an 
uncertainty on the ^-tagging efficiency of 1 . 1 % for a fixed fr-tagging cut leading to a ^-tagging efficiency 
of 72%. 

6.6.2 Heavy flavour production 

The production fraction of various ZVc-hadron species can also lead to different ^-tagging efficiencies 
since they have different lifetimes and decay modes. The measured Z?-hadron fractions [15] and their 
values in the generators are shown in Table 5. For HERWIG, the defaults have been changed following 
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the CDF tuning [21] by setting the CLPOW parameter to 1.2, in order to obtain a Z>-baryon fraction in 
agreement with the PDG and with PYTHIA. Another ingredient is the production of excited bottom and 
charm states which can give rise to soft charged pions or kaons, affecting the topology of the events: this 
has not been studied yet. 

Table 5: Fraction (in %) of £-hadron species from the PDG (assuming f{B d ) = /(S ± )) and in PYTHIA, 
the default HE RWIG and HERWIG tuned for ATLAS (CLPOW=1.2). 





B d 


B± 


B s 


Baryons 


PDG 


39.8 ±1.0 


39.8 ±1.0 


10.4 ±1.4 


9.9±1.7 


PYTHIA (ATLAS) 


39.7 


39.2 


12.1 


9.1 


HERWIG 


44.3 


44.8 


10.8 


0.0 


HERWIG (tuned) 


39.4 


39.9 


10.4 


10.3 



The various production fractions of each type of the ft-mesons were varied according to the mea- 
sured errors from PDG 2006 [15], and the impact on a simulated PYTHIA tt sample was studied by 
a re-weighting technique. This source of systematics can be safely neglected since the net effect is an 
uncertainty below the per mil level on the tagging efficiency. 

6.6.3 ft-hadron lifetimes and decays 

The uncertainty on the lifetime of the various b hadrons was also studied and found to give rise to an 
uncertainty of 0.3% for a fr-tagging efficiency of 72%. 

The uncertainty in the charged track multiplicity of b hadron decays was estimated by comparing 
PYTHIA with measurements from LEP [22]. The resulting uncertainty on the ^-tagging efficiency was 
found to be 0.9%. 

6.6.4 Heavy flavour decays with EvtGen 

The two event generators used in ATLAS to fragment and decay particles, PYTHIA and HERWIG, 
implement different algorithms to simulate the decays of generated particles, using their own decay 
tables to specify decay modes and branching fractions. The sophistication of the decay simulation and 
the scope of decay tables vary considerably between generators. For B meson decays, arguably the most 
detailed simulation is currently provided by EvtGen [23]. 

Since the ^-tagging performance on Monte-Carlo samples may depend on details of simulated parti- 
cle decays, such as the charged particle multiplicity or the spatial distribution of secondary decay vertices 
in B decays, the impact of using EvtGen as a decayer instead of PYTHIA was studied on tt events. For 
this study, a decay file for inclusive decays was assembled based on the latest (as of summer 2005) 
version of the decay files used by the BaBar and CDF experiments. For decay channels where experi- 
mental data is available, branching fractions were taken from PDG [15], while the remaining decays are 
simulated generically with JETSET. 

For this study, two specific samples were generated: the first tt sample was generated by PYTHIA 
and the decays were handled by PYTHIA. The second one was generated in the same way except that 
particle decays were simulated by EvtGen. As expected, changing the particle decay simulation leads to 
small differences in some distributions of generator-level quantities. For example, the mean multiplicity 
of charged pions in decays of B* and B° mesons, not including decays of long-lived weakly decaying 
strange particles such as K% and A, is 3.89 (RMS 2.19) with PYTHIA and 3.62 (RMS 2.13) with EvtGen. 
The average multiplicity obtained by EvtGen agrees well with the experimental value of 3.58 ±0.07 [15]. 
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The ^-tagging weight distributions obtained with the two generators are thus slightly different. For 
a fixed ^-tagging weight cut, the b-jet efficiency varies by about 1%. Tuning the cut to keep the same 
^-tagging efficiency in both samples leads to non-negligible changes in the light jet rejection: it de- 
creases by about 5% to 15% when EvtGen is used, depending on the tagging algorithm and chosen b-jet 
efficiency. 

7 Specific studies for improving ^-tagging performance 

In this section, three studies aiming at improving the ^-tagging performance are presented. Most of them 
rely on specific software developments. 

In the first part, new track categories are defined to make a better use of the slight differences in e.g. 
impact parameter resolution that such tracks may exhibit. A second study evaluates the potential gain 
by varying the way tracks are associated to jets. The last study shows the improvement obtained when 
combining several tagging algorithms in a multivariate approach. Those studies were done independently 
and no attempt was made yet to combine them. It is worth noting that some approaches advocated in the 
first two studies are expected to be highly correlated. 

7.1 Improving performance with track categories 

Grouping the tracks used for Z>-tagging in several categories has been discussed in Section 4.1.5. Ded- 
icated treatment for Shared tracks (cf. 2.1.3) is already implemented and used in the current ^-tagging 
software. Using dedicated probability density functions for the Shared tracks improves the light jet re- 
jection by 23% (7%) for a b-jet tagging efficiency of 50% (60% respectively) in tt events. This is the 
default treatment in the software. This effect is sample-dependent and is more important for samples 
with high jet multiplicities and energetic jets, which tend to be more collimated. 

Using additional track categories to improve the ^-tagging performance is being further investigated. 
A possible interest of the track categories is to try to loosen the track quality cuts and therefore gain in 
efficiency without diluting the discrimination power of the good tracks. As discussed in Section 2.1.2, 
requiring each track to have a hit on the Mayer leads to an absolute loss in efficiency of about 2.5%, 
which is not negligible for ^-tagging purposes where few tracks are available. Thus one attempt consisted 
of trying to keep tracks with no Mayer hit in a special category. About 4% of the tracks in jets from tt 
events which pass the rest of the ^-tagging selection fall in this category. 

The categories could also be used to deal with the non-Gaussian resolution tails and the imperfect 
treatment of the matter in the tracking error estimation process. A priori some fraction of these effects 
would be better treated from first principles directly in the tracking, but the experience with previous 
experiments shows that this is difficult in practice and therefore ad-hoc treatments may be justified. The 
natural variables to partition tracks are pj (actually p for multiple-scattering) and pseudo-rapidity (since 
the material is very non-uniform in tj). 

Finally, another potential use of track categories was studied: in Wets, the fragmentation tracks 
accompanying the Z?-hadron decay products are prompt and should therefore have different distributions 
of the discriminating variables. Tracks with pj (track) jpj (jet) < 0.04 were defined as fragmentation 
tracks since those are in principle softer and were put in a special category. In typical ft-jets from the tt 
sample, this cut selects 13% of the tracks. Some correlation with the treatment in pj bins (previous case 
above) is expected. 

The use of these categories brings some improvement to the light jet rejection, as high as 60% for 
the binning in track pj which is the most powerful way of pardoning tracks. The improvement for some 
categories depends on the sample: the dedicated category for fragmentation tracks for example is more 
helpful for the WH (nin =400 GeV) sample where the actual decay products of the ft-hadron are very 
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collimated and the fixed-size cone for associating tracks to the jets brings in a larger fraction of prompt 
tracks. 

Various ways to combine all the new categories were investigated. The gain in rejection brought by 
the best combinations, using 1 1 partitions formed with the aforementioned categories, ranges from 20% 
to 70% depending on the sample and the tagging algorithm, for a 60% Z>-tagging efficiency. 

7.2 Optimizing the track-to-jet association 

As the jet transverse momentum increases, its particles are collimated into a narrower cone. But currently 
all tracks within AR < 0.4 of the jet axis are associated with the jet, regardless of its momentum. For a 
300 GeV b-jet, only 30% of the tracks associated to the jet comes from the Z>-hadron decay products, as 
shown in Figure 16. Therefore at high pj the ^-tagging discriminating power is diluted since a larger 
fraction of the tracks in the jet may be picked up from environmental contamination: underlying event, 
pile-up or neighbouring jets in busy events. An alternative track-to-jet association has been studied, with 
a AR cut varying with the jet pj\ AR < f{pr)- Based on the distribution of AR (jet, track) for tracks 
originating from fr-hadron decays in b-jets, a functional form f(pr) has been chosen which ensures that 
95% of the tracks from Z?-hadron decays in these events are associated to the jet for any jet pr in the 
range [15,500] GeV. 

The impact on ^-tagging performance of using this association instead of the standard one is checked 
on tt events. The relative improvement on the overall raw light jet rejection is 46% (7%) for respectively 
a ^-tagging efficiency of 50% (60%). This improved treatment actually affects only the non-pure jets 
(22% of the light jets in this sample), for which the rejection triples for e b = 50% and doubles for 
£ b = 60%. Obviously the fraction of non-isolated jets and therefore the possible gain with this method 
are sample-dependent. 

7.3 Combining tagging algorithms with boosted decision trees 

Several multi-variant techniques exist that can combine different ^-tagging algorithms into a single clas- 
sifier for discriminating Z?-jets from light jets. We investigated boosted decision trees (BDT). 

BDT can be applied to any classification problem and their use for combining several ^-tagging 
algorithms into a single classifier for discriminating b-jets from light jets was investigated [24]. In this 
study, a BDT classifier was optimized on a training sample containing b-jet and light jet patterns extracted 
from WH (ra# = 120 GeV, H —> bb or H — > uu) and it samples. The following input variables were used: 
the weight from the IP3D tagging algorithm, the three variables on which the SV tagging algorithms 
are based (cf. Figure 9), the number of tracks associated with the secondary vertex, the weights of the 
soft muon and soft electron tagging algorithms, the largest transverse and longitudinal impact parameter 
significances and transverse momentum of the tracks in the jet, the jet transverse momentum and the 
number of tracks in the jet. For a fair comparison with IP3D+SV1, a BDT classifier with only the first 
four variables was also studied. The predictive power of the classifier was estimated using a distinct 
sample of b-jet and «-jet patterns (test sample). 



Table 6: Rejection of light jets given by IP3D+SV1 and the boosted decision tree for WH (with ra#=120 
GeV) and tt events, for fixed ^-tagging efficiencies of 50% and 60% in each sample. 





IP3D+SV1 

e b = 50% e b = 60% 


BDT 4 variables 
e b = 50% e b = 60% 


BDT 12 variables 
e b = 50% e b = 60% 


WH (m H =l20 GeV) 


529±40 155±6 


682±79 189±12 


762±93 201±13 


tt 


393±16 143±3 


484 ±34 161 ±6 


563 ±42 187 ±8 


tt after purification 


720±42 205±6 


808±73 226±11 


1021±103 278±15 
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Table 6 compares the rejection of light jets given by the two BDT configurations and the likelihood 
ratio weight IP3D+SV1 for WH events and tt events. It shows that with the same variables the BDT 
outperforms IP3D+SV1 by 10 to 30%. When using additional information, including the soft lepton tag- 
ging, the light jet rejection on both event topologies increases by about 50% with respect to IP3D+SV1. 
For these results, the training and test samples were based on similar events (WH or tt). When train- 
ing the BDT on WH events and using tt events for the test sample, the gain in rejection compared to 
IP3D+SV1 is lower but still interesting (> 20%). 

8 Measuring ^-tagging performance in data 

For analyses using ^-tagging, the estimation of the backgrounds from well-known Standard Model pro- 
cesses requires knowledge of the tagging and mis-tagging efficiency for the various flavours of jets with 
high accuracy. The quality of Monte Carlo simulation of these properties is unknown and therefore 
strategies must be developed to measure the tagging and mis-tagging efficiencies directly in data. 

8.1 Z?-tagging efficiency 

Several strategies for measuring the Z>-tagging efficiency directly in data are investigated in detail. The 
relative precision they permit has been estimated for a typical ^-tagging efficiency of 60%. 

The first approach, described in Ref. [8], relies on a sample of the abundantly produced dijet events, 
in which one of the jets contains a muon. A muon+jet trigger has been conceived and proposed for this 
purpose. Two methods, also employed at the Tevatron, are used to estimate the fr-content of the dijet 
sample. The pj rd method is based on templates of the muon pj relative to the jet axis. The templates 
are derived from Monte Carlo events for the three types of jets: bottom, charm and light (the latter will 
eventually be derived from data). The so-called System 8 method uses two samples of differing bottom 
quark content and two uncorrelated tagging algorithms, typically the soft muon one and the tagging 
algorithm to be calibrated, to form a system of equations from which the Z>-tagging efficiency can be 
extracted. Using 50 pb _1 of data, a detailed pj- or r\ -dependent calibration curve could be derived with 
the pr rel method and with System 8. Since it is expected that the systematic uncertainties will dominate 
rapidly the total error for these methods, a careful study of systematics errors has to be done which was 
not fully completed for this note: the systematic errors studied so far indicate that it should be possible to 
control the absolute error on Eb to 6%. Currently the two methods are proven to work well for jets below 
a p T of 80 GeV. 

The second approach, discussed in Ref. [9], makes use of tt events and is complementary: a little 
more data is needed but the tagging efficiency of higher pj jets can be measured. Two distinct ways are 
described: by counting the number of selected tt events with one, two or more ^-tagged jets, or by study- 
ing directly the output distributions of ^-tagging algorithms on samples of Z?-jets pre-selected by several 
methods (topological, kinematic or likelihood selection). The counting method allows measurement of 
the integrated ^-tagging efficiency with a relative precision of ±2.2(stat.)±3.5(syst.)% in the lepton+jets 
channel and ±3.7(stat.)±2.7(syst.)% in the di-lepton channel for 100 pb _1 of data. For the topological 
selection, 200 pb _1 of data are needed and allow a relative precision of ±7.7(stat.)±3.2(syst.)%. 

8.2 Mis-tagging rates 

Measuring the mis-tagging rate in data is more difficult and under study. The main approach is to use 
the negative tags (either in impact parameter or decay length) which describe the effects of a limited 
resolution on a priori prompt tracks and then to correct for long-lived particles (K s , A, etc), material 
interactions and heavy flavour jets which are negatively tagged. So far no study in ATLAS has estimated 
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the accuracy with which fake rates can be measured. However, based on the Tevatron experience, it 
seems that a 10% relative error could be achievable with 100 pb -1 . 

8.3 Extracting reference distributions from data 

For the likelihood tagging algorithms, reference distributions for light and ft-jets are needed. In the 
case of Z?-jets, they could in principle [25] be measured in data from a pure sample of b-jets using the 
techniques developed for the ^-tagging efficiency estimation in tt events. As shown in Ref. [9], the 
various distributions can be checked in data with a few hundred pb _1 . However, much more data is 
needed to extract multi-dimensional likelihoods. For light jets, it seems difficult to extract from data a 
sample with sufficient purity. In any case, a Monte Carlo accurately describing the data is also needed 
to extrapolate those reference distributions to ranges where they certainly can not be measured, the very 
high-pr regime for instance. 

9 Conclusions and outlook: realistic performance in first data 

The first tagging algorithm to be commissioned with real data is expected to be (besides the track count- 
ing method) JetProb, using the tracks from any kind of events to define its resolution function. For a 
Z>-tagging efficiency of 60%, a light jet rejection of around 30 could be achieved with this tagging algo- 
rithm but further improvements are expected. The soft lepton tagging algorithms will be commissioned 
at the same time, leading to higher rejection levels when considering semi-leptonic fr-jets. Once the qual- 
ity of the Monte Carlo simulation is checked and better understood with data, a tagging algorithm relying 
on Monte Carlo templates for b and light jet hypotheses such as IP3D can be used, perhaps doubling the 
rejection power. The commissioning of the tagging algorithms relying on secondary vertexing may take 
more time, but tagging algorithms like SV1 should quadruple the initial rejection level, bringing it above 
100. Finally the combination of the ultimate JetFitter tagging algorithm and the various improvements 
described in this note should permit a rejection of 200, or more interestingly to maintain a rejection of 
100 at a higher ^-tagging efficiency, around 70%. 

Those estimates do not take into account the impact of residual misalignments in the tracker. How- 
ever, a first study has been performed in which the actual alignment procedure was run. This is the most 
realistic study so far, and includes many systematic deformations caused by the alignment procedure 
itself. It concludes that the mis-tagging rate is at most 30% lower for the same ^-tagging efficiency with 
a realistic early detector alignment. 

The impact of several other effects on the ^-tagging performance has been studied in this note. In 
the simulation used, the fraction of dead pixels is overestimated. Using 1% instead of 5% dead pixels 
improves the light jet rejection by about 10%. Relative improvement in the light jet rejection, from 
10% in typical tt events to 50% for high-/?/- samples, could be achieved with a tuning of the tracking in 
jets. The sensitivity to the accuracy of the passive material description in the simulation could be quite 
dramatic. However, a large effort has been made in describing accurately the material in the tracking 
volume. If an 8% to 15% discrepancy between Monte Carlo and data would remain, the impact on the 
^-tagging performance is a 10% relative change in light jet rejection power. 

New ideas to improve the performance have been studied: the generalization of the track categories 
could bring a 10% to 60% improvement, optimizing the track-to-jet association could help significantly 
in busy events and finally the use of multivariate techniques such as BDT could lead to gains in the range 
10-50%. However, all these potential gains only apply to some tagging algorithms, some pj region, etc. 
Furthermore some correlations are expected among them. Therefore it will be interesting to assess the 
net impact of these improvements when they are all available, used simultaneously and optimized. 
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Finally, detailed studies have shown that the ^-tagging efficiency can be measured directly in data 
using dijet or it events. With 100 pb -1 , a relative precision of about 5% can be achieved for Z>-jet effi- 
ciency The accuracy with which the mis-tagging rates can be measured deserves more study, however, 
a 10% precision seems feasible based on the Tevatron experience. 
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Abstract 

Tagging of b-quark jets, "^-tagging", is an important ingredient for Standard 
Model analyses as well as for searches for new physics. A property of b-quark 
jets exploited by ^-tagging algorithms is the presence of secondary b- and c- 
hadron decay vertices. In this note, methods for the explicit reconstruction 
of secondary and/or tertiary decay vertices of b- and ohadron decays are pre- 
sented. The performance of the secondary vertex based ^-tagging algorithms 
and the dependences on the event topology and jet kinematics are studied. The 
efficient reconstruction of the primary interaction vertex is also crucial for b- 
tagging, especially in the presence of pile-up interaction vertices at LHC lumi- 
nosity. The ATLAS primary vertex reconstruction strategies and performances 
are presented in this note as well. 

1 Introduction 

Identification of b-quark jets relies on the properties of the production and weak decay of fe-hadrons. 
The most important one is their relatively large lifetime of about 1.5 ps (cT ps 450/xm). The resulting 
^-hadron flight path < I >= /3ycT leads to a signature of one or more displaced secondary vertices (e.g. 
inside a jet originating from a b-quark with transverse momentum of 50 GeV ^-hadrons travel on average 
about 3 mm in the transverse plane before their decay). 

Numerous methods can be used for the identification of &-quark jets (usually called b-jet tagging or 
just b-tagging), based e.g. on the presence of leptons inside jets due to semileptonic ^-hadron decays, 
kinematical properties of jets or explicit b- or c-hadron reconstruction. The most efficient tagging meth- 
ods are based on the presence of displaced vertices inside jets. Such vertices can be detected either by 
the presence of tracks incompatible with the primary event vertex or by explicit reconstruction of those 
secondary vertices. Many track based and vertex based ^-tagging methods are known together with 
combined methods using both track impact parameters and vertex reconstruction simultaneously. 

The reconstruction of secondary ft-hadron decay vertices inside jets is challenging for several reasons. 
The multiplicity of charged particle tracks belonging to the vertex is, contrary to the reconstruction of ex- 
clusive decay modes, a-priori not known. In cases of less than two reconstructed charged particles, a well 
defined secondary vertex cannot be reconstructed. This can happen either because of the charged particle 
multiplicity produced in the decay or limitations in the track reconstruction efficiency, as imposed by the 
geometrical acceptance of the inner tracking detectors or interactions in the detector material. Secondary 
vertex reconstruction efficiencies thus show an upper limit. In addition, weak ft-hadron decays almost 
always lead to one or more charm hadrons which subsequently decay through weak interaction. Since 
c-hadrons also have significant lifetimes, the resulting topology is a set of charged particle tracks either 
stemming from the primary event vertex, the secondary ^-hadron or tertiary c-hadron decay vertices. The 
resolution of the ATLAS tracking system does not resolve this decay topology in all cases. Vertex recon- 
struction inside jets for b-]et tagging thus has to be done in an inclusive way. It should be targeted towards 
highest efficiency to detect secondary vertices and identify its topology as well as possible. Kinematical 
features of reconstructed vertices like e.g. the invariant mass may be used in combination with spatial 
features (track-vertex and vertex-vertex distances, etc.) to increase the &-jet identification power of the 
algorithms. 

Another important ingredient for tagging b-quark jets is the reconstruction of the primary event 
vertex. The size of the beamspot in the transverse plane (about 15 ;Um) is sufficiently small to allow 
the application of ^-tagging algorithms if only information as defined in the transverse plane is used, 
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once the position of the beam is known with high precision. Explicit reconstruction on event by event 
basis does not improve significantly the resolution of the primary event vertex in the transverse plane. 
In the longitudinal direction along the beam, however, the a-priori knowledge of the interaction point is 
only poor (several cm). Explicit reconstruction is thus mandatory if ^-tagging algorithms are not only 
based on track and vertex information in the transverse plane. Furthermore, at nominal running of the 
LHC, additional minimum bias events produce additional so-called pile-up vertices which have to be 
distinguished from the hard primary interaction. The main task of the primary vertex reconstruction for 
^-tagging is thus to reconstruct and identify the signal event vertex out of all interaction vertices along 
the beam. 

At the LHC, ^-tagging has to be applied to jets covering a wide kinematic range both in transverse 
momentum and pseudorapidity. ^-tagging has to be applied over the full acceptance of the tracking de- 
tectors of about 1 7] | < 2.5. The track reconstruction performance, and thus also the vertex reconstruction 
and ^-tagging performance, varies strongly with transverse momenta and pseudorapidities of charged 
particle tracks. 

This note focuses on aspects of primary and secondary vertex reconstruction relevant for ^-tagging. 
Other aspects, also more technical ones, and performance issues are addressed in notes dedicated to 
primary [1] and secondary vertex reconstruction [2,3]. The note is structured as follows. In Section 2 
some information on definitions, physics input objects and data sets used in this note is given. Section 
3 describes the reconstruction of the primary event vertex, its performance and impact on ^-tagging. 
Algorithms designed for the inclusive reconstruction of secondary decay vertices inside jets based on 
two different methods are described in Section 4. The performance of these algorithms in terms of 
secondary vertex related quantities is discussed in Section 5. Their combination with the pure track 
impact parameter based ^-tagging algorithms is described in Section 6. The ^-tagging performance of 
the algorithms is presented in Section 7. Section 8 gives a summary of the note together with an outlook 
towards possible future improvements. 

2 Definitions, Reconstruction Details and Datasets 

This section gives definitions and technical details of data reconstruction and analysis procedures used 
in this note. More detailed information can be found in [4]. 

The efficiency to tag a jet of flavour q as b-]et, e q , is defined as: 

^ Number of jets of real flavour q tagged as b ^ 
q Number of jets of real flavour q 

Usually Eb is called tagging efficiency and z udsc mistagging rate. The inverse of the mistagging rate 
r u dsc = l/ £ «rf«c is called b-tagging rejection power or simply rejection. The assignment of a certain 
flavour to a jet in the Monte Carlo simulation is not unambiguously defined. The following definition 
has been introduced: the jet flavour is the flavour of the heaviest quark after gluon radiation and splitting 
within a cone of some size around the jet direction. The default cone size used in ATLAS and in the 
current note is AR < 0.3. To test the pure algorithmic performance of the algorithms, a procedure called 
"purification" may be applied. Here, a light quark jet is only considered if there is no heavy parton (b- or 
c-quark) or t lepton within a cone of 0.8 around the jet direction. 

Several jet reconstruction algorithms are available in ATLAS [5]. The ^-tagging performance may 
depend on the jet algorithm, but the studies of this dependence are outside the scope of this note (see [4] 
for such studies). For the studies in this note, an iterative cone algorithm with a cone size of AR = 0.4 
using combined calorimeter towers as input, has been used. Charged particle tracks have been recon- 
structed with a Kalman filter based algorithm [6]. 
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Offline ATLAS jet reconstruction and track reconstruction are independent. An assignment of tracks 
to jets is thus necessary. The current ATLAS method used also in this note is also based on a geometrical 
cone around the jet direction. All charged tracks within a certain cone (the default value is AR < 0.4) 
around the jet axis are assigned to a jet. 

The following fully simulated Monte Carlo samples have been used for the performance studies 
described in this note: 

• Higgs boson production in association with a W boson. The W boson was forced to decay into a 
muon and its anti-neutrino, W — > jUV^, the Higgs boson was forced to decay into pairs of b-, c- or 
M-quarks: H —>■ bb, H —>■ cc, H —>■ utT. This ensures that the jets of different quark flavour have very 
similar kinematics. To cover a wide range of jet transverse momenta, samples with Higgs boson 
masses of mn = 120 GeV and m# = 400 GeV have been produced. 

• Events with pairs of top quarks, it, where one or both W bosons decayed leptonically. 

• Events with pairs of top quarks and at least two additional jets: ttjj. These events show a large 
hadronic activity with a high probability of overlapping jets. 

During the first few years the LHC will operate at a luminosity of 2 • 10 33 cm _2 s _1 (low luminosity 
mode), reaching 10 34 cm~ 2 s _1 (high luminosity mode) at later stages. Each signal event triggered and 
reconstructed in the ATLAS detector will be overlay ed by several low-pj proton-proton interactions, 
commonly denoted as minimum bias interactions. The average number of minimum bias interactions 
per bunch crossing is 4.6 and 23 for the low and high luminosity modes, respectively. To study the 
influence of these additional so-called Pile-Up interactions, fully simulated Monte-Carlo samples for the 
low luminosity mode have also been generated for the WH (m# = 120 GeV) data set. 

To parametrize the trajectory of a charged particle in a magnetic field, so-called "perigee" parameters 
are used [6]. The ATLAS track based ^-tagging algorithms use do and zo, the transverse and longitudinal 
impact parameters at the point of closest approach of the trajectory to the primary vertex in the transverse 
plane. To reduce the dependence on the track parameter resolutions, the corresponding variables are 
divided by their errors. Each impact parameter significance also gets a sign based on the track and jet 
directions. The sign is positive if the track crosses the jet axis in front of the primary vertex and negative 
if behind. Most of the tracks produced in ^-hadron decays have a positive sign whereas tracks originating 
from the primary vertex have both signs with equal probabilities [4]. The distance between the primary 
and secondary vertices may also have a sign based on the jet direction. 

3 Primary Vertex Reconstruction 

In this section different strategies for the reconstruction of primary vertices currently implemented in 
the ATLAS reconstruction software are described. The evaluation of the algorithms is performed in the 
single collision and low luminosity pile-up modes. 

The reconstruction of primary vertices can generally be subdivided into two problems: vertex finding 
and vertex fitting. The primary vertex finding deals with the association of reconstructed tracks to a par- 
ticular vertex candidate. The task of the vertex fitting is to reconstruct the position of the primary vertex 
and its covariance matrix, recalculate the parameters of the incident tracks using the vertex constraint 
and provide a measure of goodness of the fit, such as for example the x 2 of the fit [7]. In addition, the 
X 2 of the refit of track parameters with the knowledge of the reconstructed vertex is a criterion of track 
to vertex compatibility. 

A brief description of the mathematical properties of the algorithms for primary vertex reconstruction 
and evaluation of their performance with respect to ^-tagging are presented below. For a more detailed 
description of the algorithms the reader is referred to [1]. 
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3.1 Algorithms for Vertex Fitting 

Several strategies for vertex fitting are currently implemented in the ATLAS reconstruction software. 
All implemented vertex fitters are based on the minimization of a x 2 function with respect to a position 
of the vertex and parameters of incident tracks at this position. In addition, all fitters are currently 
using the exact analytical solution for the dependence of the track parameters on the vertex position and 
track parameters at the vertex. Indeed, the full solution with respect to the ATLAS version of perigee 
parametrization has a simple form, which is also computationally economical [1]. The approach to the 
minimization of the % 2 , treatment of components of track momenta in the vicinity of the vertex candidate 
and reweighting of tracks during the iterative process, however, differs from algorithm to algorithm. 

3.1.1 The Billoir Full and Fast Vertex Fitting Algorithms 

Two algorithms, the Billoir Full and the Billoir Fast Vertex Fitter were implemented, following the 
schema presented in [8]. Here, the inversion of the full (3n + 3) x (3n + 3) matrix 1 of parameters of 
the fit, where n is the number of tracks, is replaced with n inversions of (3 x 3) matrices. Compared to 
the global % 2 minimization, this approach leads to a significant gain in CPU time due to the inversion 
of smaller matrices. In addition to the reconstruction of the vertex position and its covariance matrix, 
the Billoir Full Vertex Fitter also refits the parameters of the incident tracks with the knowledge of the 
vertex. 

The Billoir Fast Vertex Fitter is a simplified version of the vertex fit, where the trajectories of charged 
particles are approximated with straight lines in the vicinity of the vertex and their momentum compo- 
nents p = (6, (j) v ,q/p) are considered to be constant. While the precision of this approach is only a little 
smaller than the one of the Full Billoir Vertex Fitter, the fit itself is significantly faster due to the reduc- 
tion of the size of covariance matrices to be inverted. It should also be noted, that the Fast Vertex Fit does 
not refit the momenta of the incident tracks. Due to its reasonable resolution and high CPU performance, 
this is the default vertex fitter to be used with InDetPriVxFinder, as explained in Section 3.2.1. 

3.1.2 The Sequential and Adaptive Vertex Fitting Algorithms 

The Sequential Vertex Fitter implements the conventional Kalman Filter for vertex fitting as described 
in [7]. The vertex estimate is updated iteratively using the information from a single track at a time. The 
SequentialVertexSmoother allows the refit of parameters of incident trajectories with the knowledge of 
the reconstructed vertex position. In addition, the smoother allows for a calculation of the smoothed % 2 
of a track, which is a good criterion of compatibility of a track to a vertex. 

A robust version of the above algorithm is the AdaptiveVertexFitter. It is an iterative reweighted 
Kalman Vertex Fitter, where each track is down-weighted according to its compatibility to the actual ver- 
tex position [9]. The dependence of the weighting factor on the iteration number of the fit is determined 
by a thermodynamic annealing procedure. The assignment of tracks to a vertex candidate thus becomes 
stronger with iterations and the outliers are efficiently discarded. 

3.1.3 The TrkVKalVrtFitter Algorithm 

The TrkVKalVrtFitter is a universal vertex fitter with constraints. It uses the Billoir method [8] to estimate 
the local vertex position and calculate refit track parameters with the requirement of passing through the 
vertex position. The possibility of applying a beam spot constraint when reconstructing the primary 
vertex is also provided. Neutral and charged particles can be used simultaneously in the fit. 

'in the vicinity of the vertex, the polar angle 6 and the momentum p of the trajectories are considered to be constant. 
The number of parameters of the fit therefore reduces to 3n + 3: 3 remaining parameters lor each of n tracks and 3 vertex 
coordinates. 
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In the presence of badly measured tracks the precision and stability of the fit may be improved by the 
optional use of robust functionals. In each iteration step the error matrix for each track is recalculated 
based on an eigenvector decomposition and downweighting of badly measured directions in the track 
parameter space. Several functionals for downweighting known as M-type estimators are implemented 
[2]. 

3.2 Vertex Finding Strategies 

Three different strategies for primary vertex finding are currently implemented. These are the InDet- 
PriVxFinder, the InDetAdaptiveMultiPriVxFinder and the VKalVrtPrim. In the InDetPriVxFinder the 
process of primary vertex finding is decoupled from the vertex fitting and the maximal number of re- 
constructed vertices is therefore defined at the seeding step. The latter two algorithms exhibit a finding 
through fitting approach, where the number of vertex candidates changes according to the results of the 
previous iteration of the fit. 

3.2.1 The InDetPriVxFinder Algorithm 

The reconstruction strategy of the Inner Detector Primary Vertex Finder consists of three steps. It starts 
with the selection of good quality tracks originating from the beam crossing area (the detailed selection 
cuts can be found in [1]). 

After this initial track preselection, clusters in z are searched for in the resulting set of tracks. A 
simple sliding window algorithm is used for this purpose. First, the tracks are ordered in ascending order 
of their zo impact parameter. The full range of zq impact parameters is then scanned and clusters of fixed 
length are formed. In the low luminosity scenario, the default maximal cluster length was chosen to 
be 3 mm. The resulting clusters of tracks are considered as independent primary vertex candidates and 
corresponding vertices are reconstructed with the provided vertex fitting algorithm. 

After the reconstruction of the initial vertex candidates, the tracks least compatible with a given 
vertex candidate are rejected and the vertex candidate is refitted (the Billoir Fast Vertex Fitter described 
in 3.1.1 is used by default) using one of the following procedures: 

• All tracks, with a % 2 contribution greater than a predefined value (by default, the threshold of 
X 2 = 5 per track is chosen) are discarded and the vertex candidate is reconstructed again using 
remaining tracks. 

• The least compatible tracks are removed from the vertex candidate iteratively one by one until no 
track with a % 2 contribution greater than a predefined value is left or the number of remaining 
tracks is too little to continue. The vertex candidate is refitted at each iteration. 

At present, the first algorithm is used as the default one. At the last stage of the primary vertex reconstruc- 
tion, the obtained vertex candidates are sorted in descending order according to the sum of transverse 
momenta of their tracks. The vertex with the highest Y*Pt of the tracks is tagged as the signal one. 
It should be noted that the maximum number of reconstructed primary vertices is fully defined by the 
output of the cluster finding in the beginning of the algorithm. The iterative refit of the vertex candidates 
followed by the rejection of incompatible tracks serves for the refinement of the knowledge of the vertex 
positions only. The minimum number of tracks to form a vertex candidate is one in the case when the 
beam position is used as constraint, and two otherwise. 

3.2.2 The InDetAdaptiveMultiPriVxFinder Algorithm 

The InDetAdaptiveMultiPriVxFinder is an example of a finding-through-fitting approach to the primary 
vertex reconstruction. The reconstruction starts with the preselection of tracks originating from the 
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beam crossing region (the detailed selection cuts can be found in [1]). A single vertex seed is created 
out of the preselected set of tracks. A vertex candidate is then reconstructed using the Adaptive Multi 
Vertex Fitter [10], as described in 3.1.2. The tracks which were considered to be outliers during the 
first fit are used to create a new vertex seed. At the next iteration a simultaneous fit of two vertices 
is performed. Each track is down-weighed with respect to the two vertices. The number of vertex 
candidates is growing with iterations and the vertices are competing with each other in order to attain 
more tracks. An annealing procedure is applied to this process: the assignment of tracks to vertices is 
becoming harder with iterations. The result of the fit is a set of reconstructed vertices with an almost 
solid track assignment. It has to be noted that for the adaptive fitters the starting point of the fit is of great 
importance. 

3.2.3 The VKalVrtPrim Algorithm 

In this algorithm, primary vertex reconstruction starts with the selection of a subset of tracks close to the 
beamspot position. A sliding window type algorithm (with window size « 4.5 mm) selects a position 
along the beam which maximizes the vertex candidate quality value. This position becomes a seed for 
the vertex fitter. It accepts all tracks close to the found vertex candidate position and tries to fit a single 
vertex. If the quality of the vertex is not satisfactory, one or several tracks with the biggest % 2 contribution 
are rejected and the fit is repeated. The algorithm works until a vertex with good quality is obtained. Best 
results may be achieved with the combination of a robust fitting functional and the rejection of tracks 
during the fit. After a successful fit, the participating tracks are removed from the set of selected tracks 
and the sliding window algorithm starts a search of the next vertex candidate for the remaining tracks. 
VKalVrtPrim runs until no more vertex candidates can be obtained in the remaining track sample. 

The vertex candidate quality value used in the search is a sum of the track transverse momenta, 
the transverse mass and p T nonuniformity in the transverse plane. The vertex fitter uses the beam spot 
position as constraint in the fit. 

3.3 Performance of Primary Vertex Reconstruction 

In this section, the performance of different primary vertex reconstruction algorithms is evaluated. The 
studies are performed in single collision and low luminosity pile-up modes. 

As mentioned above, the main tasks of primary vertex finding algorithms are: 

• To find all primary vertices in a bunch crossing and reconstruct their positions with highest possible 
accuracy. 

• To identify the hard scatter signal primary vertex among the set of reconstructed vertices with 
highest possible efficiency. 

These two aspects will be investigated in the following. In this section the performance of five strate- 
gies for primary vertex reconstruction are compared: VKalVrtPrim, InDetAdaptiveMultiPriVxFinder and 
InDetPriVxFinder used with SequentialVertexFitter, Billoir Fast and Billoir Full Vertex Fitters. Monte 
Carlo samples of WH{\20) —> \ivbb and WH(\2Q) — »■ \ivuu events with and without low luminosity 
pile-up are used for this study. 

3.3.1 Efficiency of Hard Scatter Signal Primary Vertex Identification 

As mentioned above, an average of 4.6 minimum bias events are expected to overlay the signal event 
during low luminosity running of the LHC. The number of simulated primary vertices per bunch crossing, 
corresponding to the low luminosity mode of the LHC is shown in Figure 1 (left), together with the 
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number of vertices Purity P 



Figure 1: Left: Number of simulated (black line) and reconstructed vertices per event for different 
reconstruction algorithms (dotted lines); right: distribution for the InDetAdaptiveMultiPriVxFinder of 
the primary vertex purity as defined in Eq. (2). The WH{mH = 120GeV) — > \ivbb event sample has been 
used for these studies. 



number of primary vertices reconstructed with different algorithms. All primary vertex finders return 
containers of reconstructed vertices ordered according to the probability of a given vertex to be produced 
in a signal collision. The identification of a signal vertex (first in the list) is performed in different ways 
depending on the algorithm. The InDetPriVxFinder orders the vertices according to the sum of transverse 
momenta of incident tracks, while the output of the InDetAdaptiveMultiPriVxFinder is sorted according 
to the sum of squares of transverse momenta. In all cases, the first vertex in the list is considered to be the 
primary signal vertex and is used by the ^-tagging algorithms. In order to estimate the efficiency to find 
the correct signal vertex from the list of primary vertices two methods are used. In the first method the 
signal vertex is considered as reconstructed correctly if it is closer than 500 jim in the z direction from 
the truth: \z rec —Ztmth\ < 500 jum. The second method is based on the purity of reconstructed vertices. 
The definition of the primary vertex purity is based on the association of tracks fitted to a primary vertex 
to truth particles: 



I> S i gna i +Y,W p Ueup + 'L w noCorres 

where Y, w signal is the sum of weights of all tracks associated with the signal primary vertex, pile-up is 
the sum of weights of all tracks associated with pile-up events and £ w m) c or res is the sum of weights of all 
tracks which were not matched to truth particles. It can be noted that for vertex algorithms which do not 
assign lower weights to tracks with respect to the vertex candidate during the fit, the sum of the weights of 
all tracks fitted to the vertex is equal to the track multiplicity. If the value of the purity for a reconstructed 
primary vertex candidate is above 0.5, it is assumed that the main contribution to the reconstructed vertex 
stems from signal tracks and therefore the signal primary vertex is correctly identified. 

In Figure 1 (right) the distribution of the purity of the first vertex in the output list of the InDetAdap- 
tiveMultiPriVxFinder for WH (120) — > jivbb events is shown. It can be noted that in the majority of 
cases the purity of a tagged signal primary vertex is close to unity. 

In Table 1 the fraction of events with correctly identified vertices for both distance based and pu- 
rity based definitions are shown for different reconstruction algorithms. The WH (120) — > \l\bb and 
WH (120) — > \l\uu Monte Carlo samples with low luminosity pile-up were used for this study. It can be 
noted that the best performance is shown by the InDetAdaptiveMultiPriVxFinder and by VKalVrtPrim. 
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Table 1 : Fractions of correctly identified primary vertices for different primary vertex reconstruction al- 
gorithms. The fractions are calculated according to the purity based (1) and distance based (2) definitions 
of the efficiency. The fraction of events where the signal vertex is in the list of reconstructed vertices 
(column denoted in list) is calculated according to the distance based definition. 





WH -> \ivbb 


WH -> \ivuu 


Algorithm 


(1) [%] 


(2) [%] 




(1) [%] 


(2) [%] 




InDetAdaptiveMultiPriVxFinder 


93.7±0.1 


93.8±0.1 


99.1±0.1 


96.1 ±0.1 


96.2±0.1 


99.3 ±0.1 


VKalVrtPrim 


94.4 ±0.1 


94.5 ±0.1 


99.1 ±0.1 


95.1 ±0.1 


95.2±0.1 


99.4 ±0.1 


InDetPriVxFinder (Billoir Full) 


89.8±0.2 


89.3 ±0.2 


97.6 ±0.1 


94.0 ±0.2 


93.8±0.2 


98.2±0.1 


InDetPriVxFinder (Billoir Fast) 


89.8±0.2 


89.3 ±0.2 


97.6 ±0.1 


94.0 ±0.2 


93.8±0.1 


98.2±0.1 



Also no significant difference is observed between purity based and distance based definitions of the 
efficiency. In addition, the observation is made that in the single-collision mode, the rate of wrongly 
identified vertices is essentially zero for all algorithms. 

The misidentification of the signal vertex leads to a wrong estimate of the longitudinal impact pa- 
rameters of tracks. This wrong estimate may lead to an incorrect identification of b- and light quark jets 
and a decrease of the ^-tagging performance. A dependence of the ^-tagging performance on the type of 
the primary vertex reconstruction algorithm is therefore expected. 

3.3.2 Pointing Lepton 

In the events containing isolated leptons (electrons or muons), or even a lepton inside a jet coming 
possibly from semileptonic decays of heavy hadrons, the properties of these leptons can be used to 
further increase the efficiency to identify correctly the primary vertex. The trajectory of a reconstructed 
lepton can be extrapolated to the interaction region. Each reconstructed vertex, for which the lepton 
has a longitudinal impact parameter z.o < 3 mm, gets an additional weight and can then be tagged as the 
signal vertex after reordering the list of reconstructed vertices. In Table 2 the fractions of events with 
correctly identified primary vertices after reordering according to the pointing lepton information are 
shown for different primary vertex finding algorithms. To estimate these fractions, the distance based 
definition of efficiency is used. Comparing to Table 1, it can be noted that the use of information from 
a reconstructed lepton increases the efficiency of correctly identifying the signal primary vertex in case 
of the Billoir fitters, which use the sum of squares of transverse momenta of tracks as sorting criteria. In 
most cases of misidentified vertices, the direction of the isolated lepton from the W boson decay is not in 
the kinematical acceptance of the inner detector, e.g. for the InDetAdaptiveMultiPriVxFinder (73 ±1)% 
of the outliers have the isolated muon with |t]^| > 2.5. Events with a well reconstructed isolated lepton, 
as it is the case in many analyses, thus show a much lower fraction of misidentified primary vertices. 

Table 2: Fraction of correctly identified primary vertices for different algorithms using the distance based 
definition with reordering due to a lepton matched to the primary vertex. 





WH -► nvbb 


WH — ► jivuu 


InDetAdaptiveMultiPriVxFinder 


94.7 ±0.1 


96.0 ±0.1 


VKalVrtPrim 


94.6 ±0.1 


95.2 ±0.1 


InDetPriVxFinder (Billoir Full) 


93.7±0.1 


95.0 ±0.1 


InDetPriVxFinder (Billoir Fast) 


93.7±0.1 


95.0 ±0.1 
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3.4 Primary Vertex Resolution 

In presence of the pile-up vertices spatial resolutions of the reconstructed primary vertex may degrade 
due to the partial acceptance of tracks from the superposed events. Tables 3 and 4 show the resolutions 
for the x and z coordinates of reconstructed primary vertices for different reconstruction algorithms. 



Table 3: Resolutions on the x coordinate of primary vertices reconstructed with different strategies in 
WH — > jivbb and WH — > jivuu signal only and pile-up scenarios. The numbers shown are the widths of 
a fit with a single Gaussian to the corresponding distributions of residuals. 





o x [jum] in WH — ► \ivbb 


o x [jum] in WH — >• \ivuu 


Algorithm 


signal only 


pile-up 


signal only 


pile-up 


InDetAdaptiveMultiPriVxFinder 


11.46 ±0.05 


11.66 ±0.05 


10. 13 ±0.05 


10.34 ±0.05 


VKalVrtPrim 


11.44 ±0.05 


11.59 ±0.05 


10.04 ±0.05 


10.25 ±0.06 


InDetPriVxFinder (Billoir Full) 


12.22 ±0.05 


12.51 ±0.05 


11.01 ±0.06 


11.17±0.05 


InDetPriVxFinder (Billoir Fast) 


12.23 ±0.05 


12.50 ±0.05 


11.01 ±0.06 


11.17±0.06 



Table 4: Resolutions on the z coordinate of primary vertices reconstructed with different strategies in 
WH — ► fivbb and WH — > jxvuu signal only and pile-up scenarios. The numbers shown are the widths 
of a fit with two Gaussians to the corresponding distribution of residuals (a z ^ for the narrow and o z j 
for the broad component, respectively). Core Fraction is the fraction of events contained in the narrow 
Gaussian. 



Algorithm 




WH -> [lvbb 


WH — > \ivuu 


a z , N , am 


Oz,t, [Mm] 


Core 
Fraction [%] 


°z,N, [Mm] 


o&, [Mm] 


Core 
Fraction [%] 


InDetAdaptiveMulti- 
PriVxFinder 


signal only 


41.3±0.5 


98±2 


76.3 


35.3±0.5 


79±2 


75.8 


40.9 ±0.5 


95±2 


75.1 


34.9 ±0.5 


77±2 


75.5 


VKalVrtPrim 


signal only 


40.2 ±0.5 


92±2 


71.1 


36.0±0.5 


82±2 


78.1 


41.2±0.5 


98±2 


74.1 


34.3 ±0.5 


76±2 


73.2 


InDetPriVxFinder 
(Billoir Full) 


signal only 


49.3 ±0.5 


132±3 


76.3 


40.4±0.6 


99±3 


76.4 


48.6±0.5 


127 ±3 


76.7 


41.0±0.5 


103 ±3 


79.1 


InDetPriVxFinder 
(Billoir Fast) 


signal only 
pile-up 


49.3 ±0.5 


132±3 


76.2 


40.3 ±0.6 


98±3 


76.0 


48.6±0.5 


127 ±3 


76.5 


40.9 ±0.5 


103 ±3 


79.0 



The numbers shown are the widths of a fit with a single Gaussian for the x coordinate and two Gaussians 
for the z coordinate, respectively, to the corresponding distribution of the residuals. In addition the ratio 
of the narrow Gaussian and the sum of the two Gaussians is given for the z coordinate. It can be noted 
that the best results are obtained with the VKalVrtPrim and InDetAdaptiveMultiPriVxFinder and that 
the values of coordinate resolutions for all algorithms are very close to those obtained for the non-pile- 
up case. It can therefore be concluded, that in the low-luminosity conditions the degradation of spatial 
resolution appears to be very small. 

3.5 Impact on ^-Tagging Performance 

The beam interaction region in ATLAS has an approximately gaussian transverse profile with a = 15/im. 
This size is smaller than the typical transverse impact parameter (do) [6] resolution of charged particle 
tracks, ^-tagging algorithms based on transverse impact parameter significances only (IP2D method [4]) 
are thus only slightly sensitive to the details of the primary vertex reconstruction and the presence of 
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Table 5: Light jet rejection for WH(m# = 120 GeV) events (signal only and pile-up scenarios) using 
different primary vertex reconstruction algorithms. Only jets with at least one track associated are con- 
sidered in these performance studies. IP2D,IP3D and IP3D + SV\ ^-tagging methods are described in 
d etails in [4] 



algorithm 




5 ° % 


=== 


IP3 


===== 


° 


° 


InDetAdaptiveMulti- 
PriVxFinder 


signal only 
pile-up 
ratio [%] 


133 ± 6 




226± " 14 




438 ± 36 


119 ±5 


135 ±6 


45 ±1 


160 ±8 


52 ±1 


230 ±14 


82±3 


102 ±6 


98±3 


71 ±8 


78±4 


53 ±10 


69±6 


VKalVrtPrim 


signal only 
pile-up 
ratio [%] 


131 ±6 


46±1 


234 ±14 


69±2 


486 ±42 


129 ±6 


141 ±6 


47 ±1 


136 ±6 


50±1 


220 ±12 


80±3 


108 ±6 


102 ±2 


58±7 


72±4 


45 ±10 


68±6 


InDetPriVxFinder 
(Billoir Fast) 


signal only 
pile-up 
ratio [%] 


123 ±5 


46±1 


225 ±13 


68±2 


449 ± 38 


113 ±5 


124 ±5 


41 ± 1 


140 ±6 


48 ±1 


220 ±12 


71±2 


101 ±6 


89±3 


62±7 


71 ±3 


49 ±10 


62±3 


InDetPri VxFinder 
(Billoir Full) 


signal only 
pile-up 
ratio [%] 


123 ±5 


46±1 


225 ±13 


68±2 


450 ±38 


114 ±5 


124 ±5 


41 ±1 


140 ±6 


48 ±1 


221 ± 12 


71±2 


101 ±6 


89±3 


62±7 


71 ±3 


49 ±10 


62±3 



pile-up vertices. This is demonstrated in the first column of Table 5. Since the vertex reconstruction 
accuracy in the longitudinal direction is also better than the typical longitudinal impact parameter res- 
olution (zo [6]), the ^-tagging performance is not influenced significantly by the choice of the primary 
vertex algorithm in the case where only the signal vertex is present. This can be seen by comparing the 
rows labelled signal only in Table 5 for the different primary vertex reconstruction algorithms. 

The presence of pile-up vertices, however, changes the situation for the longitudinal coordinate con- 
siderably. The z position resolution is not affected significantly by the presence of additional vertices as 
is shown in Table 4, but the longitudinal size of the beam interaction region is much bigger (a z « 5.6 cm) 
than the transverse one. Any misidentification of the primary interaction vertex thus produces artificially 
large zo track impact parameters. Consequently, most of the tracks are rejected by the ^-tagging track 
selection procedure in these cases and the jets to be tagged do not have any tracks associated. Such jets 
can not be tagged as b-quark jets anymore. Since the vertex misidentification probability is not negligible 
in some event topologies, as can be seen from Table 1 , the performance of ^-tagging algorithms using 
information from the longitudinal coordinate is appreciably affected. The degradation of the ^-tagging 
performance in the presence of pile-up vertices for these algorithms is shown in the second and third 
columns of Table 5 for the IP 3D and IP3D+SV1 algorithms which use this information. 

An additional complication for ^-tagging with pile-up originates from the rather large amount of soft 
jets produced in pile-up interactions. These jets do not have reconstructed tracks inside at all in many 
cases and are thus considered as light quark jets by the ^-tagging algorithms. This, however, causes 
an unphysical change of the ^-tagging performance not related to the performance of the algorithms 
themselves, because a different set of jets is used to estimate the performance. To minimize this effect, 
the results in Table 5 are obtained for jets with at least one associated track. This approach is different 
from the one normally used for ^-tagging performance studies. The numbers in Table 5 thus have to be 
taken with care when comparing with other ^-tagging results. The observed degradation of the ^-tagging 
performance confirms the results obtained in a previous study [11], where a more dramatic decrease of 
the ^-tagging performance was demonstrated for the case of the nominal LHC luminosity (10 34 cm~ 2 s _1 ) 
when each beam crossing produces 24 pile-up events on average. 

There are several possibilities to improve the 6-tagging performance in pile-up scenarios. The pres- 
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ence of jets without any tracks associated to them (especially if the requirement of a small longitudinal 
impact parameter is not fulfilled for most of the tracks) is an indication that the correct primary vertex 
has not been reconstructed in this particular event. In such cases, one could either try to use the second 
vertex from the list of primary vertex candidates (according to the ordering of vertices as applied by 
the primary vertex reconstruction algorithm) or drop the information in the longitudinal direction, thus 
falling back to a ^-tagging algorithm working in the transverse plane only. The reconstruction of the 
primary event vertex, as currently implemented, is a universal procedure not depending on the topology 
of the event. Adding information specific to a particular analysis could help in some cases to increase the 
fraction of correctly identified primary vertices, e.g. reconstructing the primary vertex only from prompt 
isolated high pj leptons for channels where these are present. These options have to be studied in the 
future. 

4 Inclusive Secondary Vertex Reconstruction in Jets 

A £>-jet originates from a &-quark, which produces a &-hadron in the fragmentation. The 6-hadron then 
decays due to electroweak interactions, which cause the transition of the b-quark preferably into a c- 
quark (|V c /b| 2 S> |V M fc| 2 ), which then subsequently also undergoes a weak decay. As a result, the typical 
topology of the particles in a b-]et seen in the detector is a decay chain with two vertices, one stemming 
from the &-hadron decay and at least one from c-hadron decays. The eventual intermediate presence 
of excited b- or c-hadron states does not change this picture because their strong or electromagnetic 
decays do not cause measurable lifetimes. A unique feature of jets originating from the fragmentation of 
^-quarks is thus the presence of b- and c-hadron decay vertices. 

An exclusive reconstruction of these decays cannot be done with high efficiency. The large ^-hadron 
masses lead to a huge number of possible decay modes with very small branching ratios, many of them 
involving neutral particles (which cannot be used for vertex reconstruction). The reconstruction of sec- 
ondary b- and c-hadron decay vertices in jets thus has to be done in an inclusive way, where the number 
of charged particle tracks originating from b- and c-hadron decays is not known a-priori. 

Trying to resolve the b- and c-hadron vertices of the decay cascade separately is very difficult for the 
following reasons: 

• The probability to have at least two reconstructed charged particle tracks both from the b- and c- 
hadron decays is much less than 100%. This is both because of the charged particle multiplicities 
involved in these decays as well as the limited track reconstruction efficiency, mainly because of 
material interactions in the detector (as discussed in Section 5). 

• The resolutions of the relevant track parameters, especially at low transverse momenta, are not 
sufficient to separate the two vertices efficiently. 

Two different approaches for the inclusive reconstruction of secondary vertices are presented in Sec- 
tions 4.1 and 4.2. The first one is based on a classical approach of fitting a single geometrical vertex. 
As explained, this is strictly speaking not the correct hypothesis, however, for the reasons given above, 
this is an approximation that works well for a large fraction of cases. The second algorithm is based on 
a kinematical approach, assuming that the primary event vertex and the b- and c-hadron decay vertices 
lie approximately on the same line, the flight path of the ^-hadron. These underlying assumptions are 
discussed in more detail in the corresponding sections. 

The most important issues in inclusive vertex reconstruction are to reconstruct the decay vertices 
with high efficiency and to associate efficiently to the vertices the charged particle tracks coming from 
the corresponding decays of b- and c-hadrons. 

Another important issue is the detection and removal of sources of tracks with large impact parame- 
ters which do not have any relation to the &-quark content of a jet. K° and A 0 decays, /-conversions and 
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hadronic interactions in the detector material must be removed as efficiently as possible before applying 
^-tagging. 

Combining the information from secondary vertices and track impact parameters allows the opti- 
mization of the ^-tagging performance and is decribed in Section 6. 

4.1 The BTagVrtSec Algorithm 

The main purpose of the BTagVrtSec algorithm is efficient b-jet tagging based on the detection of a 
secondary vertex inside a jet. It reconstructs secondary vertices due to b- and/or c-hadron decays inside 
a jet with high efficiency and calculates a jet weight, a discriminating variable which may be combined 
with similar variables from other tagging algorithms (see Section 6). This algorithm is based on the 
VKalVrt [2] vertex reconstruction package. 

4.1.1 BTagVrtSec Vertex Reconstruction 

As stated above, to separate ft-quark jets from jets produced by light quarks and gluons, it is sufficient 
to detect unambiguously b- and c-hadron decay vertices inside the jet. The idea of the BTagVrtSec algo- 
rithm is to maximize the &/c-hadron vertex detection efficiency, keeping at the same time the probability 
to find a fake vertex inside light jets low. The default version of the algorithm constructs a single sec- 
ondary vertex from the b- and c-hadron decay products. As justification of such an approach it should be 
noted, that the ability to reconstruct b- and c-hadron decay vertices separately is quite limited as already 
explained previously. Moreover, the precise reconstruction of the decay chain is less important for b- 
tagging than the detection of the secondary decay itself. The single vertex approximation allows a high 
secondary vertex detection efficiency, necessary for powerful 6-tagging at the price of an imprecise kine- 
matical reconstruction in cases when the distance between the b-and c-hadron vertices is big. Another 
advantage of the single vertex approximation is a more simple algorithm that might be easier to tune and 
calibrate on data. 

The BTagVrtSec algorithm starts with a selection of tracks inside a jet. Tracks are selected with the 
same quality cuts as for the primary vertex reconstruction algorithm VKalVrtPrim, except for a relaxed 
cut on the transverse track impact parameter (do < 3.5 mm) and no requirement of the presence of a hit 
in the first layer (b-layer) of the pixel detector. This is done in order to maximize the efficiency to re- 
construct V° decays and material interactions with subsequent removal of the corresponding tracks from 
the ^-tagging procedure. Tracks are selected in a cone around the jet axis. The size of the cone is a tun- 
able parameter (the default value is AR = 0.4) which currently does not depend on the jet reconstruction 
algorithm. 

The vertex search itself starts with a determination of all track pairs which form good (x 2 < 4.5) two- 
track vertices inside the jet. In addition, each track of the pair must have a three dimensional distance 
from the primary vertex 2 divided by its error higher than 2.0 and the sum of these two significances must 
be higher than 6.0. In order to decrease the fake rate an additional requirement (V 2tr - V primar y,Pj et ) > 0 
is used. 

Some of the reconstructed two-track vertices stem from K® and A 0 decays, y — ► conversions 
and hadronic interactions in the detector material. The corresponding distributions are shown in Figure 2. 
Figures 2 a) and b) show the invariant % + %~ and pn~ mass spectra for accepted two particle vertices 
with peaks due to and A 0 decays. Figure 2 c) shows the distance between the primary and secondary 
vertices in the transverse plane with peaks due to interactions in the material of the beam pipe and 
pixel detector layers. Charged particle tracks coming from such vertices are marked as bad and do not 
participate further in the following ^-tagging procedure for the given jet. 

2 The distance between the primary vertex and the point of closest approach of the track to this vertex. 
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Figure 2: Some distributions for reconstructed two track vertices: 

a) the 7C + n~ invariant mass spectrum with a peak of K° decays; b) the pn invariant mass spectrum with 
a peak of A 0 decays; c) the distance in the transverse plane between the primary and secondary vertices 
with the peaks due to interactions in the beam pipe (two walls at R«30 mm) walls and pixel layers 
(R=50.5 mm and R=88.5 mm). 



In the next step of the algorithm all tracks inside the jet from accepted two-track vertices except for 
marked V° decays and material interactions are combined into one secondary track list and the vertex 
fitting procedure from the VKalVrt package tries to fit a single secondary vertex out of all these tracks. 
If the resulting vertex has an unacceptable % 2 , the track with the highest contribution to the vertex % 2 is 
deleted from the secondary track list and the vertex fit is redone. This procedure iterates until a good x 2 
of the vertex fit is obtained or all tracks from the secondary track list have been removed. 

Secondary vertices in light quark jets mainly stem from tracks coming from the primary vertex, 
typically with a bad measurement of their track parameters and errors. 

Some ^-tagging efficiency and rejection calibration algorithms require negative tail vertices. For the 
use in those algorithms the requirement for two-track vertices (V^r — Vprim,Pjet) > 0 may be dropped. In 
this condition in some cases BTagVrtSec reconstructs a final single secondary vertex behind the primary 
vertex: (V sec — V pr i m ,Pj et ) < 0. Such secondary vertices are mostly fake vertices but give the necessary 
reference for calibration. 

4.1.2 ^-Tagging with BTagVrtSec 

Each ^-tagging algorithm is based on variables which show significantly different behaviour for b-jets 
and light jets. For secondary vertex based algorithms the first variable is the presence of a reconstructed 
vertex in the jet itself, whose probability is big for ft-quark jets and small for light quark jets. The vertex 
reconstruction procedure can, however, provide much more information which also may be used in a 
^-tagging algorithm to increase the efficiency. In order not to have a too complex procedure which is still 
easy to calibrate and to control, only the most sensitive variables should be taken into account. 

Although BTagVrtSec can be used as a standalone 6-tagging algorithm, it was developed primarily for 
working in combination with track impact parameter based algorithms [4]. The distance between primary 
and secondary vertices is thus not used because this kind of lifetime information is already contained in 
the track impact parameters. Three additional variables have been chosen for the BTagVrtSec tagging 
algorithm: 

1. The mass of the reconstructed secondary vertex: M. 
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2. The ratio of the energy of charged particle tracks included in the secondary vertex and the total 
energy of all charged particle tracks in the jet: R; 

3. The number of good (excluding identified V° decays or material interactions) two-track vertices in 
the jet: N. 

The distributions of the first two variables are shown in Figure 9 in Section 5 for light, charm and b-jets. 
The distributions of the number of two-track vertices can be found in [4]. In standalone mode (with- 
out combination with track impact parameter based tagging algorithms) the three dimensional distance 
between the primary and secondary vertices may also be used. 

To be reliable any ^-tagging algorithm must be calibrated on data. To facilitate the calibration and to 
reduce the necessary amount of data the chosen variables have been transformed (for details see [12]): 

M 

• Invariant mass: M = ; 

M+l 

• Energy ratio: R 1 = R° J ; 

• Number of good two-track secondary vertices: N' = logN. 

Due to the efficiency to reconstruct a secondary vertex inside a jet not reaching 100%, the probability 
density functions (PDF) of the vertex based variable have to contain a <5-f unction [12]. 

PDF = (1 - e) • 8(M',F',N') + e-ASH(M',F',N') 

with e being the efficiency to reconstruct a secondary vertex inside a jet. The continuous probability 
density function of the vertex variables is constructed from multidimensional calibration histograms 
using the ASH smoothing method [13]. 

Two slightly different taggers based on the BTagVrtSec algorithms are available in ATLAS, denoted 
SV1 and SV2. They use exactly the same variables but handle them in a different way. SV 1 treats M' 
and R' jointly and adds N' as independent variable (2+1 decomposition), whereas SV2 uses joint three- 
dimensional probability density functions. 

The only criterion for the selection of variables and tuning of the BTagVrtSec algorithm was b- 
tagging performance. Other quantities like the purity of the reconstructed secondary vertices were not 
considered. Although the achieved quality of the ^-hadron decay vertex is quite good (see Section 5), for 
applications other than ^-tagging BTagVrtSec may require a different tuning. 

4.2 The JetFitter Algorithm 

A new inclusive secondary vertexing algorithm which exploits the topological structure of weak b- and 
c-hadron decays inside a jet was recently developed. 

4.2.1 Reconstruction of the Decay Chain Topology 

As already stated, the fragmentation of a &-quark results in a decay chain with two vertices, one stemming 
from the ^-hadron decay and at least one from c-hadron decays. 

As described in Section 4.1, the BTagVrtSec algorithm relies on a vertexing algorithm in which 
displaced tracks are selected and an inclusive single vertex is obtained using a Kalman based x 2 fit (as 
sketched in Figure 3). 

The algorithm described here, called JetFitter, is based on a different hypothesis. It assumes that the 
b- and c-hadron decay vertices lie on the same line defined through the ^-hadron flight path. All charged 
particle tracks stemming from either the b- or c-hadron decay thus intersect this &-hadron flight axis. 
There are several advantages to this method: 
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Figure 3: BTagVrtSec fits all displaced Figure 4: JetFitter performs a multi-vertex fit using the 
tracks to an inclusive vertex. £>-hadron flight direction as constraint. 

• Incomplete topologies can also be reconstructed (in principle even the topology with a single track 
from the ft-hadron decay and a single track from the c-hadron decay is accessible). 

• The fit evaluates the compatibility of the given set of tracks with a ft-c-hadron like cascade topol- 
ogy, increasing the discrimination power against light quark jets. 

• Constraining the tracks to lie on the ft-hadron flight axis reduces the degrees of freedom of the fit, 
increasing the chance to separate the ^/c-hadron vertices. 

From the physics point of view this hypothesis is justified through the kinematics of the particles 
involved as defined through the hard &-quark fragmentation function and the masses of b- and c-hadrons. 
The lateral displacement of the c-hadron decay vertex with respect to the ft-hadron flight path is small 
enough not to violate significantly the basic assumption within the typical resolutions of the tracking 
detector (see Figure 4). 

This hypothesis, extensively used in the JetFitter algorithm, was explored for the first time in the 
ghost track algorithm developed by the SLD Collaboration [14], where the already defined ft-hadron 
flight axis is substituted by a ghost track and where a numerical global % 2 minimisation procedure was 
used to perform the multi-vertex fit. 

4.2.2 The JetFitter Vertex Reconstruction Algorithm 

In JetFitter the vertexing task is mathematically implemented as an extension of the Kalman Filter for- 
malism for vertex reconstruction [7] and the decay chain is described through the determination of the 
following variables: 

d = (x PV ,yp V ,zpv,<t>,0,di,d 2 ,.-.,d N ), (3) 

with: 

• (xpv,ypv,Zpv)' the primary vertex position. 

• (((>,d): the azimuthal and polar directions of the ft-hadron flight axis. 

• (d\,d2, ■■■idx): the distances of the fitted vertices, defined as the intersections of one or more tracks 
and the ^-hadron flight axis, to the primary vertex position along the flight axis (N representing the 
number of vertices). 

Before starting the fit, the variables are initialised with their prior knowledge: 
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• The primary vertex position (with covariance matrix), as provided by the primary vertex rinding 
algorithm. 

• The ^-hadron flight direction, approximated by the direction of the jet axis, the error being provided 
by the convolution of the jet direction resolution with the average displacement of the jet axis 
relative to the ^-hadron flight axis, as determined from Monte Carlo simulations. 

The fit is then performed, resulting in the minimization of the % 2 containing the weighted residuals of 
all tracks with respect to their vertices on the ft-hadron flight axis. The charged particle tracks to be used 
in the determination of the decay chain are selected according to the AR matching criterion explained in 
Section 2 and a further track selection is applied, in order to reduce the amount of fake tracks. 

After the primary vertex and the fe-hadron flight axis have been initialised, a first fit is performed 
under the hypothesis that each track represents a single vertex along the ^-hadron flight axis, until % 2 
convergence is reached, obtaining a first set of fitted variables (<j>,6,di,d2,---,d^). 

A clustering procedure is then performed, where all combinations of two vertices (picked up among 
the vertices lying on the ^-hadron flight axis plus the primary vertex) are taken into consideration, filling 
a table of probabilities. After the table of probabilities is filled, the vertices with the highest compatibility 
are merged, a new complete fit is performed and a new table of probabilities is filled. This procedure is 
iterated until no pairs of vertices with a probability above a certain threshold remain. 

The result of this clustering procedure is a decay topology with a well defined association of tracks 
to vertices along the ^-hadron flight axis, with at least one track for each vertex. 

More details about the JetFitter vertex reconstruction algorithm can be found in [3]. 

4.2.3 The JetFitter based ^-Tagging Algorithm 

The 6-tagging algorithm implemented as a first application of JetFitter is based on separating ft-jets from 
c- and light-quark (u,d,s) jets by means of the definition of a likelihood function. 
The decay topology is described by the following discrete variables: 

1. Number of vertices with at least two tracks. 

2. Total number of tracks at these vertices. 

3. Number of additional single track vertices on the ^-hadron flight axis, 
while the vertex information is condensed in the following variables: 

1. Mass: the invariant mass of all charged particle tracks attached to the decay chain. 

2. Energy Fraction: the energy of these charged particles divided by the sum of the energies of all 
charged particles associated to the jet. 

3. Flight length significance the weighted average vertex position divided by its error. 
The use of these variables allows the definition of a likelihood function of the form: 

Z>'' c (x) = Y^coeff (cat) ■ PDF cat {mass) ■ PDF cat (energyFr action) PDF cat (^y) > ( 4 ) 

which has to be parametrised separately for each of the three different flavours. 

The information about the decay topology of the jet as reconstructed by JetFitter is represented by 
the category (the coefficient coeff(cat) representing how probable it is to find a certain topology for a 
given flavour), while the vertex information is contained in the probability distribution functions (PDFs). 
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Figure 5: 13 different topologies are defined, combining the three discrete variables in such way as to 
reduce their correlations. For the case of one single vertex with at least two tracks (1), the discrete PDFs 
for both the other two variables, total tracks at vertices (2) and single additional tracks (3), are used, but 
they are then considered as uncorrelated, so that their corresponding coefficients are just multiplied. 



The discrete variables describing the decay topology are combined according to the scheme of Fig- 
ure 5 into 13 category coefficients. 

In order to reduce the correlations between the decay topology and vertex related variables and to 
increase the discrimination power, the PDFs are made category dependent. This splitting of PDFs is 
done only when strictly needed. 

The templates for the vertex variables are shown in Table 6 for all three flavours. Each PDF was 
split independently into the categories it was found to be most correlated with, in order to maintain the 
number of split PDFs to be determined on Monte Carlo simulated events as low as possible. They were 
determined on the WH(m« = 120 GeV) Monte Carlo events with H — > bb,H — > cc and H — > uu and on 
it Monte Carlo events. 

The JetFitter based ^-tagging algorithm can be either used as a stand-alone algorithm or in combina- 
tion with pure impact parameter based algorithms (see Section 6). 

5 Secondary Vertex Reconstruction Performance 

In this section, the performance of the inclusive secondary vertex reconstruction algorithms as described 
in Sections 4.1 and 4.2 is discussed. 

Table 6 shows the rate of reconstructed secondary vertices in ft-quark and light quark jets in bins 
of jet transverse momentum and pseudorapidity for both secondary vertex reconstruction algorithms, as 
obtained on the it and ttjj samples. 

Above a certain transverse momentum, the efficiency to identify the secondary vertex in a b-jet 
for both algorithms approaches approximately a constant value around 75 — 80%, while the number of 
vertices reconstructed in light-jets consistently increases with higher jet transverse momentum. 

Both effects have to do mainly with the ability to separate real or fake secondary tracks from the 
primary vertex: tracks from secondaries are selected according to the impact parameter significance of 
a track with respect to the primary vertex or to the three dimensional flight length significance of the 
secondary vertex. Therefore, these quantities are strictly related and, above a certain threshold, they 
are nearly invariant regardless of the boost and thus the flight length of the displaced b- or c-hadron 
produced. For low transverse momenta, the charged particle tracks from ft-hadron decays suffer from 
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Figure 6: The PDFs for the mass, the energy fraction and the flight length significance are shown, 
separately for the three different jet-flavours and split according to the decay chain topology found by 
JetFitter. 

larger multiple scattering, thus showing less significant displacements from the primary event vertex and 
reducing the secondary vertex reconstruction efficiency. 

The rising amount of fake vertices in light quark jets with increasing jet transverse momenta is partly 
due to the nature of the fragmentation process in light quark jets, which produces a larger number of 
tracks coming from the primary interaction vertex the larger the transverse jet momentum is. Assuming 
the probability for a reconstructed track to represent an outlying measurement is approximately constant 
to first approximation, the probability of having tracks which fake secondaries increases roughly linearly 
with the number of primary tracks. The real secondary vertices produced in light quark jets, like photon 
conversions and V° decays represent an additional source of vertices not related to heavy hadron decays. 
Furthermore, the pattern recognition during the track reconstruction phase is more difficult in dense jets 
as it is the case for higher jet transverse momenta (see [4]). 
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Table 6: The fraction of jets with at least one reconstructed secondary vertex passing the selection criteria 
to be used by the ^-tagging algorithms in b-, c- and light quark jets for BTagVrtSec (top) and JetFitter 
(bottom). These numbers, given in percent, have been obtained on the it and ttjj samples, applying the 
purification procedure. 
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The dependence of the vertex reconstruction efficiency on the jet pseudorapidity is mainly a conse- 
quence of the different resolutions which can be achieved in different regions of the Inner Detector: for 
increasing rapidities, the track resolutions close to the Interaction Point get worse, the track reconstruc- 
tion efficiency starts to decrease and the number of fake tracks rises (a detailed quantitative explanation 
of these effects can be found in [6] and [4]). 

It can also be concluded that the BTagVrtSec algorithm is slightly more efficient in reconstructing 
secondary vertices inside b quark jets, at the cost of a slightly higher rate of fake vertices produced in 
light quark jets compared to the JetFitter algorithm, at least in the barrel region of the Inner Detector. 

Table 7 shows the population of some reconstructed topologies for ft-quark, c-quark and light quark 
jets for JetFitter, as a function of the jet transverse momentum and pseudorapidity. As stated in Sec- 
Table 7: Population of the different topologies of the vertices reconstructed by JetFitter in b-,c- and light 
quark jets , shown as a function of the jet transverse momentum pj (top) and the jet pseudorapidity 
r\. These numbers, given in percent, have been obtained on the it sample, applying the purification 
procedure. 
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Figure 7: Residuals of the reconstructed three dimensional (left) and transverse (right) flight length of 
the inclusive secondary vertex with respect to the true ^-hadron position for both vertexing algorithms. 



tion 4.2, the key feature of JetFitter is the ability to distinguish several decay chain topologies, in addition 
to recognizing the presence of a single inclusive decay vertex. The efficiencies and mistagging rates given 
above for the different categories show that most of the achievable gain is due to the 1 Vertex + 1 Track 
and 2 Vertices categories, where a gain in rejection against light quark jets of a factor ~ 4 and ~ 16 can 
be obtained on approximately ~ 10% and ~ 5% of the reconstructed b-jets, respectively. The dependence 
on pr and X] follows the same pattern as already described for the inclusive vertex reconstruction. 

Figure 7 shows the resolution achieved on the inclusively reconstructed ^-hadron decay vertex with 
respect to the true ^-hadron position. A core can be seen, corresponding to the cases where most of 
the reconstructed tracks really stem from the ^-hadron vertex, approaching the intrinsic resolution of 
the vertex reconstruction, while a very large tail to higher flight lengths can be observed, due to tracks 
coming from the decay of the charmed hadron of the b — »■ c-hadron cascade. 

Another criterion to estimate the algorithmic performance of the secondary vertex finders is the 
fraction of charged particles arising from the decays of b- or c-hadrons and reconstructed as tracks in 
the Inner Detector that are correctly associated to a displaced vertex (efficiency) and -at the same time- 
which fraction of the tracks assigned by the secondary vertex finders to displaced vertices really stem 
from real b- or c-hadron decays (purity). 

The average charged particle multiplicities at the secondary and tertiary decay vertices as obtained 
from the Monte Carlo generator PYTHIA was analyzed, where a minimum transverse momentum of 500 
MeV was required for the charged particles. The decay products of a strong or electromagnetic ^-hadron 
decay (e.g. pions from B** decays) are not counted as coming from the secondary vertex, because of 
the lifetime of these states the decay particles emerge from the primary vertex. The decay products 
of a strongly or electromagnetically decaying c-hadron (e.g. D* or D**) are considered as stemming 
from the ^-hadron vertex, while only the decay products of weakly decaying c-hadrons are considered 
as originating from the c-hadron vertex. 

The average charged particle multiplicity of the inclusive b / c-hadron vertex is ~ 4.7, as shown in 
Figure 8, distributed in ~ 2.3 charged particles stemming from the ^-hadron vertex and ~ 2.4 charged 
particles originating from the c-hadron vertex. In the same figure the decay multiplicity at generator level 
(left) is compared with the number of tracks coming from the b- and c-hadron vertices which have been 
reconstructed as tracks in the Inner Detector (right). Around 80% of their charged decay products are 
correctly reconstructed as tracks and pass the standard track quality selection cuts. In order to strongly 
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Figure 8: Average charged particle decay multiplicity of the b- and/or c-hadrons at generator level 
(left), compared with the number of charged particles coming from the b- and/or c-hadron vertices 
reconstructed as tracks in the Inner Detector and passing some standard quality criteria (right). The 
WH(ff — > bb) sample has been used. 



Table 8: Efficiency (fraction of reconstructed tracks from the decays of b- or c-hadrons that are associated 
to the secondary vertices) and purity (fraction of tracks fitted to the secondary vertices that stem from 
real b- or c-hadron decays) for the two secondary vertex finders, separately for different topologies in the 
case of JetFitter. The WH(// — ► bb) sample has been used. 



Algorithm 


Topology 


Track efficiency 


Track purity 


BTagVrtSec 


1 inclusive B/D vertex 


69% 


92% 




1 vertex 


74% 


91 % 


JetFitter 


1 vertex + 1 track 


80% 


85 % 




2 vertices 


85% 


89% 



reduce the contamination of reconstructed secondary vertices by other charged particles originating from 
the primary interaction point, tracks originating from the b- or c-hadron vertices, but not distinguishable 
from primary tracks, are also suppressed to some extent by the track selection cuts. This and other 
selection criteria further reduce the b- and c-hadron decay products reconstruction efficiency, resulting 
in a compromise between the highest possible efficiency and a reasonable purity. In Table 8 the track 
association efficiencies and purities of the displaced tracks stemming from the b- or c- hadrons for the 
BTagVrtSec and JetFitter algorithms are shown, the efficiencies being normalized to the number of tracks 
from &/c-hadron decays which are reconstructed by the tracking detector. The fit of an inclusive b/c- 
hadron decay vertex, as performed by BTagVrtSec, allows to obtain a very high purity, but at the cost of 
starting to loose some tracks when the distance between the b- and c-hadron vertices starts to be relevant. 
JetFitter is able to recover a good part of this inefficiency, thanks to the ability of reconstructing more 
complex decay chain topologies, at the cost of a slightly lower purity. 

Several topological and kinematical variables related to the reconstructed secondary vertices are used 
by the ^-tagging algorithms, as described in sections 4.1 and 4.2. The invariant mass of charged particles 
associated to the vertex and the fraction of charged energy at the vertex relative to that of the jet, are used 
by both algorithms. The distributions of these variables are shown in Figure 9 for different regions of 
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fractional energy at sec. vertex fractional energy at sec. vertex fractional energy at sec. vertex 




flight distance significance flight distance significance flight distance significance 



Figure 9: Variables related to the properties of reconstructed secondary vertices as computed by the 
BTagVrtSec algorithm for different jet transverse momenta and pseudorapidities for ft-quark jets (solid), 
c-quark jets (dotted) and light quark jets (dashed). Left: 0 < |t]| < 0.5, 15 < p T < 30 GeV; middle: 
0 < |tj| < 0.5, 80 < p T < 120 GeV; right: 2 < \r] \ < 2.5, 80 < p T < 120 GeV. The top row shows the 
invariant mass of charged particle tracks associated to the reconstructed secondary vertices, the middle 
row the energy of charged particle tracks associated to the reconstructed secondary vertices divided by 
the energy of all charged particles in the jet, and the bottom row the flight distance significance as defined 
in the text. 



jet transverse momenta and pseudorapidities for the BTagVertSec algorithm. The flight distance signif- 
icance, defined as the distance between the primary and secondary vertices divided by its error, is also 
shown there. The observed dependence of the secondary vertex reconstruction performance on the jet 
kinematics will directly impact the performance of the ^-tagging algorithms. This will be discussed in 
Section 7. 

Essentially all ^-tagging algorithms use the direction of the jet, as reconstructed from calorimeter 
towers, to assign e.g. the lifetime sign to the impact parameters of charged particle tracks. The jet di- 
rection is a reasonably good approximation of the ft-quark direction, however, for ^-tagging it is mainly 
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the direction of the ^-hadron that matters. The direction of the ^-hadron flight path can be estimated 
using information from reconstructed primary and secondary vertices, e.g. for BTagVrtSec the line join- 
ing the primary and secondary vertices can be used as the ft-hadron direction if a secondary vertex is 
present in the jet. The more sophisticated Jet Fitter algorithm delivers by construction an estimate for the 
^-hadron flight direction using information from both the calorimeters and vertices of charged particle 
tracks. Figure 10 shows a comparison of the resolutions achieved for the azimuth angle 0 for the different 
approaches. Using an improved &-hadron direction may decrease the contribution of charged particles 
from the decays of ft-hadrons to the negative tail of the impact parameter distribution. This contribution 
mainly comes from a wrong assignment of the lifetime sign if the ^-hadron direction has not been recon- 
structed sufficiently precisely. This will facilitate the calibration and improve the ^-tagging performance. 
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Figure 10: Angular resolution of the ^-hadron flight direction in the azimuth angle (j) as reconstructed 
with the JetFitter and BTagVrtSec algorithms (for the latter, the line joining the primary and secondary 
vertices has been used in the case where a secondary vertex is present), compared with the corresponding 
resolution as obtained from calorimeter jets. The WH (H — >• bb; m H =\20 GeV) sample was used for this 
study. 



6 Combination with Impact Parameter based ^-tagging Algorithms 

The ^-tagging algorithms presented in the previous sections provide all information needed to be used 
as stand-alone algorithms. Secondary vertex based ^-tagging algorithms are, however, limited by the 
efficiency to reconstruct a secondary vertex inside a jet. To obtain maximum performance, the secondary 
vertex based algorithms can be combined with other algorithms, ^-tagging algorithms purely based on 
the impact parameter significances of charged particle tracks do not have this limitation. They do not 
offer, however, the amount of topological and kinematical information as the secondary vertex based 
algorithms. 

In ATLAS, the impact parameter information can be used either in two (r<f> plane only) or three 
dimensions. The corresponding algorithms and their performance are described in detail in [4]. They 
calculate a likelihood ratio of the probability density functions for observed track impact parameter 
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significances S for tracks coming from &-quark jets and light quark jets: w tra ck = b(S)/u(S). The jet 
likelihood ratio (called jet weight) is the sum of track ratios: w 1 ^ = £^Tf* ^ n w trac kj- Assuming that the 
tracks are independent, the jet likelihood ratio is an optimal variable for making a decision about the jet 
origin. 

Secondary vertex algorithms were designed in a way facilitating combination with track only based 
algorithms. They provide likelihood ratios of various parameters for ft-jets and light jets, which may 
be simply summed with track likelihood ratios. To preserve an optimal performance, secondary vertex 
variables have been chosen to be maximally independent on track impact parameters (i.e. the distance 
between primary and secondary vertex is strongly correlated with track impact parameters and then 
should be used with care). The combined jet weight is then computed as sum of the likelihood ratios of 
the different algorithms: Wj e ° t mbined = W?J t + W%. 

The likelihood ratio approach used in ATLAS makes ^-tagging an easily scalable procedure. Any 
new algorithm or new variable can be added in the same way to already existing tagging information. 
Choice of independent variables (or at least weakly correlated ones) guarantees an optimality of com- 
bined procedure. 

7 ^-Tagging Performance 

In this section, the performance of the algorithms described in this note is discussed. The data samples 
described in Section 2 have been used for these studies. Jets had to be within the acceptance of the 
tracking detectors (|tj | < 2.5) and their reconstructed transverse momenta had to exceed 15 GeV: pj ' > 
15 GeV. The performance is shown in terms of ^-tagging efficiency, £b, and light quark rejection, r M)C as 
defined in Section 2. A certain working point is chosen by placing a cut on the weight as computed by 
the algorithm. 

The performance was studied both using only the information as delivered by the secondary vertex 
based algorithms (denoted BTagVrtSec and JetFitter as in the previous sections) and after combination 
with a tagging algorithm based on a combination of transverse and longitudinal impact parameter signif- 
icances as described in Section 6. The ^-tagging algorithm purely based on the information from impact 

Table 9: The rejection against light quark jets for fixed ^-tagging efficiencies of 50% and 60% for 
different data samples without and with applying the purification procedure, denoted as raw and purified, 
respectively. 



Sample 


£b 


BTagVrtSec 


JetFitter 


BTagVrtSec+IP3D 


JetFitter+IP3D 


WH(120) 


50% raw 


97 ± 1 


156 ±3 


454 ± 13 


545 ± 17 




60% raw 


37 ±0 


52 ±0 


116 ±2 


133 ±2 


WH(120) 


50% purified 


98 ± 1 


156 ±3 


462 ± 13 


554 ± 17 




60% purified 


38 ±0 


52 ± 1 


118 ±2 


134 ±2 


WH(400) 


50% raw 


55 ± 1 


101 ± 1 


285 ±6 


379 ± 10 




60% raw 


25 ±0 


45 ±0 


93 ± 1 


121 ±2 


WH(400) 


50% purified 


55 ± 1 


101 ± 1 


296 ±7 


390 ± 11 




60% purified 


25 ±0 


46 ±0 


95 ± 1 


123 ±2 


tt+ttjj 


50% raw 


110 ± 1 


176 ± 1 


456 ±4 


633 ±7 




60% raw 


48 ±0 


66 ±0 


154 ± 1 


190 ± 1 


tt+ttjj 


50% purified 


130 ± 1 


189 ± 1 


791 ± 11 


924 ± 14 




60% purified 


53 ±0 


68 ±0 


206 ± 1 


226 ±2 
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Table 10: The rejection against charm quark jets for fixed ^-tagging efficiencies of 50% and 60% for 
different data samples. The purification procedure has not been applied here. 



Sample 


Sb 


BTagVrtSec 


JetFitter 


BTagVrtSec+IP3D 


JetFitter+IP3D 


WH(400) 


50% raw 
60% raw 


8.1 ±0.1 
4.8 ±0.0 


9.8 ± 0.2 
6.1 ±0.1 


12.4 ± 0.2 
6.8 ±0.1 


12.6 ± 0.2 
7.3 ±0.1 


tt+ttjj 


50% raw 
60% raw 


9.9 ± 0.0 
5.9 ± 0.0 


10.3 ± 0.0 
6.2 ± 0.0 


12.4 ±0.1 
7.4 ± 0.0 


12.3 ±0.1 
7.4 ± 0.0 



parameter significances of charged particle tracks showing the best performance is called IP3D and is 
used in the following studies. Details about this algorithm can be found in [4]. The results from the 
combined algorithms are denoted BTagVrtSec+IP3D and JetFitter+IP3D, respectively. Tables 9 and 10 
show the rejection rates against light quark and charm quark jets for different tagging algorithms and 
data samples. 

Figure 11 shows the rejection of light quark (u,d,s) and gluon jets, r u , versus the ^-tagging efficiency 
£b for the pure secondary vertex based algorithms and a comparison with the most performant tagging 
algorithm purely based on three dimensional impact parameter information, IP 3D. It can be seen that 
the maximum achievable ^-tagging efficiency of the vertex based algorithms is limited by the efficiency 
to reconstruct a displaced vertex or b-h&Amn decay topology. It is higher in the case of JetFitter which 
imposes only very soft requirements on the topology. The performance of the impact parameter based 
tagging algorithm IP3D is better over almost the full range of 6-tagging efficiencies. There is also a 
significant difference between BTagVrtSec and JetFitter, the latter one showing better performance. It 
has to be noted, however, that JetFitter explicitely uses lifetime information (the flight distance signifi- 
cance), which is not used by BTagVrtSec to decorrelate the algorithm better from IP 3D. The right plot 
of Figure 11 exhibits some steep structures at ^-tagging efficiencies of about 50% and 70%. These can 
be related to sudden drops in the rejection rate of the IP 3D algorithm for these ^-tagging efficiencies, 
as can be seen in the left part of Figure 11. This behaviour can be explained mainly by the presence of 
jets with only a single track associated to them and the finite binning of the reference distributions for 
the probability density functions of the impact parameter significances as used by the IP3D ^-tagging 
algorithm (see [4] for details). 

Figure 12 shows the rejection against charm quark jets. The rejection against charm quark jets is 
significantly lower compared to light quark jets because of the real lifetime of charm hadrons and thus 
similar topology. Here, the vertex based algorithms show a performance that is much closer to the impact 
parameter based algorithm. 

Figures 13 and 14 show the rejections against light quark jets and charm quark jets for the secondary 
vertex based tagging algorithms after combination with the IP3D algorithm, again compared with IP3D. 
This combination results in a significant increase in rejection power, especially for light quark jets. 

The dependence of the performance on the jet kinematics is of particular importance. Figures 15 
and 16 show the rejection against light quark jets for a fixed ^-tagging efficiency of 50% versus the jet 
transverse momentum and jet pseudo rapidity, respectively. Table 1 1 shows the rejection against light 
quark jets for a fixed ^-tagging efficiency of 50% for several regions of jet transverse momenta and 
pseudorapidities. 

The reasons for the observed behaviour are manifold. For larger pseudorapidities, the particles pass 
through more material and thus the detector resolution is worse. Another reason for degradation of 
the longitudinal impact parameter at high pseudorapidity is the dramatic increase of the extrapolation 
distance from b-layer hit to the vertex. The degradation for high jet transverse momenta is caused by 
an increased fragmentation multiplicity, resulting in an increased combinatorics in the secondary vertex 
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Figure 11: Left: Light jet rejection versus ^-tagging efficiency for BTagVrtSec (triangles, green) and 
JetFitter (full circles, red). The pure impact parameter based algorithm IP3D is also shown for compar- 
ison (open circles, blue); right: The ratio with respect to IP3D for BTagVrtSec (dashed line, green) and 
JetFitter (full line, red). These results have been obtained on the it and ttjj samples. No purification of 
light quark jets (see Section 2) has been applied. 
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Figure 12: Left: Charm jet rejection versus ^-tagging efficiency for BTagVrtSec (triangles, green) and 
JetFitter (full circles, red). The pure impact parameter based algorithm IP3D is also shown for compar- 
ison (open circles, blue); right: The ratio with respect to IP3D for BTagVrtSec (dashed line, green) and 
JetFitter (full line, red). These results have been obtained on the it and ttjj samples. 
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Figure 13: Left: Light jet rejection versus ^-tagging efficiency for BTagVrtSec (triangles, green) and Jet- 
Fitter (full circles, red) after combination with IP3D. The pure impact parameter based algorithm IP3D 
is also shown for comparison (open circles, blue); right: The ratio with respect to IP3D for BTagVrtSec 
(dashed line, green) and JetFitter (full line, red) combined. These results have been obtained on the it 
and ttjj samples. No purification has been applied. 




b-jet efficiency b-jet efficiency 



Figure 14: Left: Charm jet rejection versus ^-tagging efficiency for BTagVrtSec (triangles, green) and 
JetFitter (full circles, red) after combination with IP3D. The pure impact parameter based algorithm 
IP 3D is also shown for comparison (open circles, blue); right: The ratio with respect to IP 3D for 
BTagVrtSec (dashed line, green) and JetFitter (full line, red) combined. These results have been ob- 
tained on the it and ttjj samples. 
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Table 11: The rejection against light quark jets (with purification) for a fixed ^-tagging efficiency of 50% 
for the ft and ft jj samples in regions of jet transverse momenta and pseudorapidities for the secondary 
vertex based ^-tagging algorithms (top: BTagVrtSec; bottom: JetFitter ) after combination with the pure 
impact parameter based tagging algorithm, IP 3D. 





0< |tj| <0.5 


0.5< |tj| <1.0 


1.0< |tj| <1.5 


1.5< |t?| <2.0 


2.0< |r/| <2.5 


15 GeV< p T OOGeV 


177 ±4 


180 ±4 


158 ±4 


74 ± 1 


28 ±0 




200 ±5 


183 ±4 


149 ±3 


74 ± 1 


25 ±0 


30 GeV< p T <50GeV 


1170 ±79 


1 140 ± 79 


782 ± 48 


375 ± 18 


92 ±2 




1269 ± 90 


1286 ±93 


957 ± 65 


409 ± 21 


111 ±3 


50 GeV< p T <80GeV 


1534 ± 132 


2613 ± 306 


1380 ± 127 


536 ± 36 


149 ±6 




2354 ±251 


2415 ± 272 


1678 ± 170 


677 ±51 


195 ±9 


80 GeV</? r <120GeV 


1698 ± 203 


2050 ±281 


1293 ± 152 


559 ± 50 


143 ±8 




2286 ±317 


3293 ± 573 


1311 ± 156 


715 ±73 


196 ± 12 


120GeV< p T <200GeV 


1016 ± 128 


1116 ± 152 


736 ± 86 


235 ± 17 


63 ±3 




1231 ± 171 


1370 ±206 


1 194 ± 178 


299 ± 25 


67 ±3 



finding stage when trying to find the tracks stemming from the ft-hadron decay. Furthermore, the pattern 
recognition in the track reconstruction becomes more difficult in the very dense environment of jets 
with very large transverse momenta. The steep fall for low jet transverse momenta is mainly due to the 
strongly enhanced multiple scattering of low momentum charged particle tracks, leading to significantly 
degraded impact parameter resolutions. The observed dependence of the ^-tagging performance on the 
jet kinematics can be related to the peformance of the secondary vertex reconstruction as discussed in 
Section 5. It can be seen there, that the secondary vertex reconstruction efficiency in &-quark jets drops 
in the same kinematical regions as the resulting ^-tagging performance, with an increased rate of (fake) 
vertices in light quark jets. 

Two of the most discriminating variables, for both algorithms, are the invariant mass of charged 
particle tracks associated to the secondary vertex and the fraction of the charged energy at the secondary 
vertex divided by the total charged energy in the jet. Figure 9 in Section 5 shows these variables for b- 
quark, c-quark and light quark jets for various jet transverse momenta and pseusorapidities. The loss of 
discrimination power in the regions where the degradation of the ^-tagging performance is observed, is 
clearly visible. Apart from the kinematic dependence, the ^-tagging performance also depends critically 
on the contamination of the light quark jets by displaced tracks stemming from nearby b- or c-quark 
jets, as can already be concluded comparing the ^-tagging performance before and after application of 
the purification procedure. It is however worth looking at this dependence in more detail, analyzing the 
^-tagging performance as a function of the distance AR of the light quark jets to the nearest b- or c-quark 
or t lepton. 

Table 12 shows, both for the pure secondary vertex based algorithms and after combination with the 
IP3D algorithm, that the more the light quark jets are contaminated by b- or c-hadron decay products 
(smaller values of AR), the stronger the degradation of the ^-tagging performance is. At very small 
angles of AR the light quark-jets can be barely distinguished from the nearby heavy flavour jet. A small 
kinematical bias (slightly different pj and tj distribution of the jets for different AR intervals) should 
also be taken into account when interpreting these results. 

There is a noticeable difference in the behaviour of JetFitter compared to BTagVrtSec, the second 
being less robust against building up vertices which catch up contributions from nearby heavy flavour 
jets. As stated in Section 4.2.2, JetFitter uses the jet direction as reconstructed by the calorimeter as a 
constraint in the fit of the hypothetical ^-hadron flight axis, thus being more efficient in discarding tracks 
coming from nearby jets. 
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Figure 15: Left: Light jet rejection for a fixed ^-tagging efficiency of 50% versus the jet transverse mo- 
mentum for BTagVrtSec (triangles, green) and JetFitter (full circles, red) after combination with IP3D. 
The pure impact parameter based algorithm IP3D is also shown for comparison (open circles, blue); 
right: The ratio with respect to IP3D for BTagVrtSec (triangles, green) and JetFitter (full circles, red) 
combined. The plots in the top (bottom) row show the performance without (with) applying the purifica- 
tion procedure. These results have been obtained on the it and ttjj sample. 



8 Summary and Outlook 

In this note, an overview of vertex reconstruction algorithms used in ATLAS, both for the reconstruction 
of the primary event vertex and the inclusive reconstruction of secondary decay vertices inside jets, has 
been given. The focus has been put on applications to the tagging of Z?-quark jets. 

Several primary vertex reconstruction algorithms are available in ATLAS. The performance of these 
algorithms has been studied both for the case where only the hard signal interaction is present and for 
a luminosity of 2 x 10 33 cm _2 s _1 , when 4.6 additional pile-up vertices are present on average. It was 
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Figure 16: Left: Light jet rejection for a fixed ^-tagging efficiency of 50% versus the jet pseudorapidity 
for BTagVrtSec (triangles, green) and JetFitter (full circles, red) after combination with IP3D. The pure 
impact parameter based algorithm IP 3D is also shown for comparison (open circles, blue); right: The ra- 
tio with respect to IP3D for BTagVrtSec (triangles, green) and JetFitter (full circles, red) combined. The 
plots in the top (bottom) row show the performance without (with) applying the purification procedure. 
These results have been obtained on the it and it jj sample. 



demonstrated that the presence of additional vertices does not degrade the primary vertex reconstruc- 
tion precision significantly. A more severe problem is the misidentification of a pile-up vertex as the 
signal vertex. For the luminosity of 2 x 10 33 cm _2 s _1 and the event topologies studied in this note, this 
misidentification rate can be as high as 5%. This misidentification leads to a significant degradation 
of the performance of the ^-tagging algorithms that use information of charged particle tracks in the 
longitudinal direction. Possible procedures to recover a large part of this performance loss have been 
discussed and will be studied an implemented in the future. 
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Table 12: The rejection against light quark jets for different ^-tagging efficiencies (e^) as function of the 
difference in AR of the light quark jets to the nearest b-, c- or T-jet (top row: BTagVrtSec; bottom row: 
JetFitter), for the secondary vertex based ^-tagging algorithms used stand-alone (upper table) and after 
combination with the impact parameter based tagging algorithm IP 3D (lower table). 



0.3 <AR< 0.35 



0.35 < AR < 0.4 



0.4 < AR < 0.45 



0.45 <AR< 0.5 



0.5 < AR < 0.6 



0.6 < AR < 0.7 



12.4 ± 0.3 
29 ± 1 



177 ± 14 

363 ± 43 



231 ± 11 
345 ± 20 



244 ± 10 
327 ± 16 



7.3 ±0.1 
17.3 ±0.4 



14 ± 0.3 

43 ±2 



91 ±3 
128 ±5 



91 ±3 
119 ±4 



0.3 <Afl< 0.35 



0.35 < AR < 0.4 



0.4 < AR < 0.45 



0.45 < AR < 0.5 



0.5 < AR < 0.6 



0.6<Afl<0.7 



18.4 ± 0.5 
31 ± 1 



51 ±2 
141 ± 12 



179 ± 17 
501 ± 80 



467 ± 62 
793 ± 138 



939 ± 92 
1133 ± 121 



951 ±79 
1077 ± 95 



10.1 ±0.2 
17.0 ± 0.4 



23 ±1 
50 ±2 



66 ±4 
158 ± 14 



143 ± 11 

222 ± 20 



265 ± 14 
309 ± 17 



266 ± 12 
302 ± 14 



Many interesting physics events at the LHC have a lepton with large transverse momentum in the 
final state, so one may try to decrease the misidentification rate by selecting the reconstructed interac- 
tion vertex closest to this lepton. This strategy, however, does not provide a significant improvement 
in comparison with the standard primary vertex selection algorithm since the presence of a well recon- 
structed high p T lepton increases the weight of the corresponding vertex and then is taken into account 
automatically. 

Two algorithms are available in ATLAS for the inclusive reconstruction of secondary decay ver- 
tices in jets, following different approaches. The first algorithm, BTagVrtSec, reconstructs explicitly 
geometrical secondary vertices inside the jet. The second algorithm, JetFitter, is based on the specific 
kinematics of the b- and c-hadrons decay chain. The performance of the algorithms has been investigated 
thoroughly. The ^-tagging performance of both algorithms has been studied using only the information 
from the secondary vertex algorithms as well as after combination with ^-tagging algorithms based on 
charged track impact parameters. A strong dependence of the vertex reconstruction and thus ^-tagging 
performance on the jet pseudorapidity and transverse momentum was demonstrated. As explained in the 
text, this is caused by physics or instrumental effects. 

Several new developments and further improvements are planned for the future. A dedicated recon- 
struction of V° decays and material interactions will also be available in JetFitter soon. Together with 
other improvements, like a modified seeding procedure and the application of multivariate techniques, a 
significant improvement can be expected for the JetFitter based ^-tagging algorithm in the near future. 
An extension of the BTagVrtSec algorithm is able to reconstruct several vertices in a jet. The definition 
of variables related to the additional information to be used for ^-tagging will be done in the near future 
and increase the ^-tagging performance. The implementation of another inclusive vertex reconstruction 
algorithm, the so-called Topological Vertex Finder is in progress. It is based on an algorithm originally 
developed by the SLD collaboration [15]. 

In the inclusive secondary vertex reconstruction algorithms presented in this note, different cuts are 
applied at different stages during vertex finding and fitting. Currently, these cuts do not depend on the jet 
parameters. Due to the strong dependence of the ^-tagging performance on them, however, it seems more 
appropriate to optimize these parameters depending on the kinematics of the jet under consideration. 
One quantity worth investigating is the size of the cone around the jet axis within which charged particle 
tracks are associated to the jet and thus used by the vertex reconstruction and ^-tagging algorithms. 
For large jet transverse momenta, the cone that contains most of the particles from the decays of heavy 
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Figure 17: Ratios of the tuned rejections to the default rejection as a function of the jet transverse momentum for a fe-tagging 
efficiency of 60%. The left hand side plot shows this ratio for light quark jet rejection, while the right hand side plot shows the 
ratio for charm jet rejection. 



b- and c-hadrons as well as from fragmentation, will be significantly smaller than for lower transverse 
momenta. It is thus desirable to choose the size of this track association cone depending on the transverse 
momentum of the jet. Since this is not restricted to secondary vertex based ^-tagging algorithms, this 
point is addressed in detail in [4]. To give an idea of what can be expected from such a parameter tuning, 
one parameter of the BTagVrtSec algorithm has been optimized in bins of the jet transverse momentum. 
This parameter is a cut on the significance of the displacement of vertex candidates built from pairs of 
tracks from the primary event vertex. Table 13 shows the values chosen after the optimization procedure. 
It can be seen, that the optimal value varies strongly with the jet transverse momentum. Figure 17 

Table 13: Optimal values for the significance cut on the displacement of two track vertex candidates from 
the primary event vertex for the different jet pj bins. 



Jet p T [GeV] 


15-30 


30-50 


50-80 


80-120 


120-200 


200-400 


400-1000 


Optimal Cut 


3.5 


4.5 


5.0 


6.0 


6.5 


7.0 


7.0 



shows the gain in ^-tagging performance that is achieved by the parameter tuning. It can be seen that the 
rejection against light quark jets improves by a factor of about two in the region of large jet transverse 
momenta, whereas the gain in charm jet rejection is more moderate. This parameter tuning will be 
continued in the future and will be included in the 6-tagging algorithms. 
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Effects of Misalignment on ^-Tagging 

Abstract 

This note investigates the effects of misalignment on ^-tagging performance 
using Monte Carlo simulations. Four different alignment sets were consid- 
ered, two with known random misalignments, one produced with the ATLAS 
alignment procedures and one with a perfectly aligned detector. Error tun- 
ing was investigated to compensate for the larger effective hit errors caused 
by misalignment. In addition the effects of misalignment on the tracking and 
vertexing performance were evaluated. 

1 Introduction 

The ATLAS detector has been built to provide high precision tracking and vertexing which are essen- 
tial for good ^-tagging performance. Misalignment of the detector will degrade the tracking resolution 
and consequently the performance of the ^-tagging is expected to be sensitive to the alignment of the 
detector. As well as random misalignments of modules which give an effective smearing of each hit, 
systematic distortions introduced by the structure of the detector and by the alignment algorithms can 
have unexpected consequences. 

The effects of misalignment on ^-tagging have been studied with Monte Carlo simulations using a 
number of different alignment sets. These include a set that perfectly aligns the detector, two hand-made 
sets with known levels of random misalignment and an alignment set produced with the actual alignment 
algorithms to be used to align the ATLAS detector. 

The assignment of correct errors for the hits is important for proper track and vertex reconstruction. 
Because module misalignments add to the intrinsic error of the module, the errors assigned to the hits 
need to be adjusted depending on the level of misalignment. Samples were investigated with and without 
this hit error adjustment. 

The note is organized as follows: Section 2 describes the four alignment sets. The error tuning 
procedure and resulting scale factors used to adjust the hit errors are presented in Section 3. Studies were 
made with both it and WH(niH = 120 GeV) samples and a brief description of these samples is given 
in Section 4. Section 5 describes the effects of misalignment on the tracking and vertexing performance. 
The effects of misalignment on the ^-tagging performance, as measured by comparing the b-]et efficiency 
versus light jet rejection for the different alignment sets, are presented and discussed in Section 6. 

2 Residual misalignment sets 

In order to study the effects of misalignment, a number of different alignment scenarios were consid- 
ered. The Monte Carlo simulation used in this investigation includes misalignments introduced at the 
simulation stage. The level of misplacement is representative of the amount of misalignment expected 
before any attempt to align the detector. The misalignments are of the order of 10-100 ;lim at the level 
of individual modules and assembly structures such as layers and disks and misalignments of the order 
of a few mm at the whole subsystem level. The level of these misalignment was based on known fab- 
rication precisions and survey measurements. [1]. The misalignments introduced are too large to allow 
for reasonable reconstruction. What is desired is to reconstruct the resulting data sets with alignment 
corrections that are typical of what is expected in the real detector, after which only small misalignments 
should remain. Four alignment sets were used in this study: 

• Perfect: This is the ideal case where the same set of alignments used in the simulation are used in 
the reconstruction and so one does not see any misalignment. 
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• Aligned: This uses an alignment set produced using the actual track based alignment algorithms 
developed for the ATLAS detector. It is expected to include any systematic deformations that 
the alignment procedure itself causes. While some systematic effects were included in the mis- 
alignments introduced in the simulation, such as clocking effects where each subsequent layer was 
rotated by increasing amounts, it does not contain all the systematic deformations which are ex- 
pected. In particular large scale structures such as layers and discs were treated as rigid objects 
without any internal deformations such as a twist. Also pixel stave bows which are known to occur 
were not introduced. So it is possible that this set is still optimistic. This set is a first attempt at the 
full scale alignment of the inner detector and so should not be considered the final word on what 
will be seen in the real detector. However, it is considered to be the most realistic case studied 
here. 

• RandomlO: This is a hand-made alignment set that takes the misalignment set used in simulation 
and randomly shifts the module positions by small amounts. These residual misalignments were 
introduced at different levels in the hierarchy. Random shifts and rotations were made to individual 
modules, and whole layers and disks. A small shift and rotation was also made to the whole pixel 
structure. Since the degradation of the ^-tagging performance is expected to be dominated by the 
alignment of the pixel system, only pixel residual misalignments were introduced, The SCT and 
TRT were corrected perfectly as in the perfect alignment case. Due to movements of higher level 
structures in this set, some systematic effects may exist. The levels of misalignment are given in 
Table 1 . The axis definitions for the module level uses a local frame where x and y are the r0 and r\ 
measurement directions respectively and z is out of the plane. For higher levels they correspond to 
the global frame with z-axis along the beam direction. RotX, RotY, RotZ are rotations around the 
corresponding axes. The module level shifts in the r0 measurement direction are around 10 jJ,m. 
The set attempts to emulate the level of misalignments expected during the early running period. It 
is not well known what levels of misalignments are expected after certain running periods so this 
is just an indication rather than being a firm prediction of what is expected at start up. Comparison 
with the real alignments ("Aligned" set) shows this to be a rather pessimistic scenario. 

• Random5: As with "RandomlO", but with levels of misalignment better by about a factor of 1.5 to 
2. This is an estimate of what might be expected after several years of running. Like "RandomlO", 
this set introduces misalignments at the three levels of hierarchy with levels of misalignment given 
in Table 2. 



Table 1: Residual misalignment for "RandomlO". Random misalignments were generated with a Gaus- 



sian distribution with o as tabulated. Shifts are i 



;Um and rotations are in mrad. 



Level 


X 


y 




RotX 


RotY 


RotZ 


Module 


10 


30 


30 


0.3 


0.5 


0.2 


Layer 


10 


10 


15 


0.05 


0.05 


0.1 


Disk 


10 


10 


30 


0.2 


0.2 


0.1 


Whole pixel 


10 


10 


15 


0.1 


0.1 


0.1 
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Table 2: Residual misalignment for "Random5". Random misalignments were generated with a Gaussian 
distribution with o as tabulated. Shifts are in jxm and rotations are in mrad. 



Level 


X 


y 




RotX 


RotY 


RotZ 


Module 


5 


15 


15 


0.15 


0.3 


0.1 


Layer 


7 


7 


10 


0.02 


0.02 


0.05 


Disk 


7 


7 


20 


0.1 


0.1 


0.05 


Whole pixel 


7 


7 


10 


0.05 


0.05 


0.05 



3 Error scaling 

3.1 Error scaling procedure 

The intrinsic error of a hit will depend on a number of factors such as the cluster width and track direc- 
tion. These factors are taken into account when calculating the intrinsic error of the hit. In the case of a 
perfectly aligned detector, if these intrinsic errors are properly determined one expects the pull distribu- 
tion (the distribution of the hit residuals divided by the calculated intrinsic error) to have a width close to 
one. 

The differences between the real positions of individual hits and those recorded by a misaligned 
detector lead to an additional error term that must be added in quadrature to the intrinsic error of the hits. 

The errors on the hits directly affect whether a hit is associated to a track, the track propagation and 
track parameter errors and the objects that use tracks as input, such as vertices. Of particular importance 
to ^-tagging is the precision of the impact parameter and the vertexing performance. It is therefore 
necessary to have accurately assigned hit errors. 

In this section hits will refer to clusters in the silicon detectors (pixel and SCT) and drift circles in the 
TRT To correct the hit errors the diagonal elements of the error matrix are modified using two parameters 
a and c: 

o' 2 = a 2 -o 2 + c 2 (1) 

where: 

• o is the original error assigned to the hit which is a function of the cluster size and track angle. 
This should normally be close to the intrinsic resolution if properly determined, 

• a is a multiplicative factor on the error, which is meant to compensate for inaccuracies in the 
intrinsic error determination, 

• c is a constant added in quadrature to the error. This is meant to correct effects attributed purely to 
residual misalignments. 

Since each detector component can have significantly different behaviour, the granularity of each 
detector component has to be taken into account, and therefore different sets of {a, c) have to be computed 
separately for the barrel and endcap regions for each detector technology, as well as for the different r(j) 
and T] measurement directions in the case of the pixel detector. 

For the derivation of the (a, c) pairs, the distributions of hit residuals and their pull distributions are 
analyzed, and in particular the deviations of the pull widths from the ideal value of 1 are investigated. 

Since the scale factor a is intended to correct the intrinsic resolutions, this is most easily obtained 
with a perfectly aligned geometry. Naturally, this is not possible with real data, where more in depth 
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studies will be needed to determine if the assigned intrinsic errors are appropriate. Currently the factor 
a is needed as the errors used in the reconstruction do not match those observed in the simulation. 
It is assumed, however, that the best knowledge from test-beam and simulation will be put into the 
determination of the intrinsic error such that a will be close to 1 and any remaining differences would be 
absorbed into the parameter c. 

The widths of the resulting pull distributions can be used directly as the scaling factors a. This is 
iterated a few times, applying the correction, rerunning reconstruction and then determining new values 
of a. The iterations are necessary due to correlations between detector components. The factor c is set to 
zero when determining the a factor. 

The resulting factors a are then kept constant when used for the misaligned detector. Several itera- 
tions (apply {a, c) factors, reconstruct sample, analyze pulls) are performed using a misaligned detector, 
in order to determine the c factor. It is computed using the formula: 

c! = (p 2 o bs -^ 2 oi + p 2 obs cl 1 (2) 

where c,- and q_i are the values of the c factor obtained in the iteration i and i — 1, respectively, p 0 b s is 
the hit residual pull width observed at step i, and Gq is the average intrinsic detector resolution. 

The determination of c does not rely on any information about the actual detector positions and 
the procedure can be applied to real data. For this study a sample of high energy single muons was 
used, while in practice one would need to study the feasibility of extracting the error tuning with a more 
realistic event sample and track selection. 

3.2 Error scaling parameters 

The resulting parameters after the tuning are shown in Table 3. The values of a are seen to be well below 
one for the SCT and TRT This is due to an overestimate of the intrinsic errors. This is being improved. 
The value for c gives some indication of the level of residual misalignment. For the "Random5" and 
"RandomlO" sets, the values of c are higher than what was input for the module shifts. This is possibly 
due to a larger error being needed to compensate for the effects of the layer and disc movements. It can 
be seen that the real alignment results in small values of c compared to the hand-made sets. In the pixel 
r(j) measurement direction one gets 3 jim and in the r\ measurement direction one gets around 15 \im for 
the "Aligned" set. 

Table 3: Error scaling parameters for the different alignment scenarios. The parameter a was tuned using 
the "Perfect" case and used for all alignment scenarios. 





All 


Perfect 


RandomlO 


Random5 


Aligned 






c(Aim) 


c(jUm) 


c(jUm) 


c(jim) 


Pixel barrel r0 


1.03 


0 


31 


13 


3 


Pixel barrel r\ 


0.97 


0 


71 


34 


13 


Pixel endcap r(j) 


1.05 


0 


30 


14 


3 


Pixel endcap r\ 


1.08 


0 


43 


11 


15 


SCT barrel 


0.78 


0 


0 


2 


7 


SCT endcap 


0.86 


0 


6 


5 


8 


TRT barrel 


0.82 


0 


11 


3 


37 


TRT endcap 


0.77 


0 


11 


10 


19 
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Table 4: Efficiency of track reconstruction. 



Setup 


efficiency (%) 


Perfect 
Random 10 

Random 10 + error scaling 
Aligned 

Aligned + error scaling 


97.09 ±0.02 
95.50 ±0.03 
97.22 ±0.02 
97.07 ±0.02 
97.04 ±0.02 



4 Monte Carlo Samples 

The performance of ^-tagging was investigated with tt and WH(niH = 120 GeV) samples which are 
standard samples used in ^-tagging performance studies in ATLAS [2]. The main difference between 
these two samples is that the WH events have lower jet multiplicities. 

The tt sample includes semi-leptonic and di-lepton channels and this sample is used for measuring 
both b-]et and light jet efficiencies. The WH{\2Q) sample contains two sub samples. WH{\20) — > 
jivbb is used to measure b-]et efficiencies and WH{\20) — »■ jivuu for light jet efficiencies. The samples 
contained no pile-up. 

5 Effects of misalignment on tracking and vertexing 

The tracking and vertexing performance was studied with the W//(120) — > \xvbb sample, although the 
other samples could equally have been chosen. A sample size of 27,000 events was used. This sample 
was reconstructed using three of the alignment sets described in Section 2: "Perfect", "RandomlO" and 
"Aligned". All other settings were kept the same. For the "RandomlO" and "Aligned" sets, samples were 
investigated with and without error scaling. 

5.1 Tracking performance 

The track reconstruction efficiency was computed for each scenario by comparing true tracks to cor- 
responding reconstructed tracks. Tracks with pj > 1 GeV and \r\ \ < 2.5 were selected. A true track 
was considered to match a reconstructed track if the true track was the source of at least 50% of the 
hits associated to the reconstructed track. Next, the efficiency was computed as the ratio of the number 
of matched tracks to the number of all true tracks. The results for the efficiency for each of the three 
scenarios are shown in Table 4. The presence of residual misalignment in the "RandomlO" set causes a 
loss of about 2% in the efficiency, while the introduction of error scaling completely recovers the loss of 
performance. The "Aligned" set shows no significant change with respect to the "Perfect" case, with or 
without error scaling. 

The number of fake tracks was also investigated in a similar manner to the track reconstruction 
efficiency calculation. A track was labeled as "fake" if it had fewer than 50% of its hits from a single true 
track. The percentage of fake tracks from the total accepted tracks is shown in Table 5 for the different 
alignment scenarios. The misalignments result in more fakes, and as with the efficiency, this is recovered 
with the introduction of error scaling. 

5.2 Vertexing performance 

The performance of the primary vertex finding algorithm was also investigated for the same five scenarios 
of alignment and error scaling. The efficiency for the primary vertex finding was computed as the ratio 
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Table 5 : Ratio of fake tracks to the total number of accepted tracks. 



Setup 


fake tracks (%) 


Perfect 
RandomlO 

RandomlO + error scaling 
Aligned 

Aligned + error scaling 


2.33 ±0.02 
2.46 ±0.02 
2.29 ±0.02 

2.34 ±0.02 
2.27 ±0.02 



between the total number of reconstructed vertices to the total number of true vertices. It was found that 
this remains constant, at a value of 99.68 ±0.04%, irrespective of the misalignment scenario considered. 

The primary vertex resolution was evaluated by looking at the difference between the reconstructed 
and the true vertex position. The resulting distributions for x and z directions are displayed in Fig. 1. 
The results for the y direction were similar to that in the x direction. The introduction of residual mis- 
alignment causes the distributions to become wider as would be expected with a degradation of the hit 
resolutions. The shift for the hand-made sets is consistent with the shift of the entire pixel detector that 
was introduced. For the "Aligned" set a shift in z of about 90 jUm is apparent. The alignment procedures 
do not fully constrain the six degrees of freedom of the whole detector and no attempt was made to cor- 
rect to the average primary vertex position in the z direction. Because of this, the alignment procedure 
can easily result in such a shift when comparing with truth information. 

The values for the resolution are computed as the width of a Gaussian fit to the distributions in 
Fig. 1 and are shown in Table 6. The resolution is degraded by residual misalignment, for both x and z 
directions. Error scaling helps to partially recover the loss of performance for the "RandomlO" scenario. 




(a) x-direction (b) z-direction 



Figure 1 : Primary vertex resolution, shown for the x direction (a) and z direction (b) for the various 
misalignment scenarios. ES denotes error scaling. 

The number of primary vertex outliers was also investigated. Since the shift observed when looking 
at the resolution should not affect reconstruction, the true vertex position must be corrected by this shift. 
In the following, a primary vertex was flagged as "outlier" if the distance between the reconstructed 
vertex and the corrected true vertex position was greater than three sigma (30 jltm in the x direction and 
150 jUm in the z direction). 

The percentage of outliers is shown in Table 7, which shows that residual misalignment introduces 
additional outliers, and therefore indicates a degradation in the primary vertex finding. The number of 
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Table 6: Primary vertex resolution. 



Setup 


res. in x (jUm) 


shift in x (jJ.m) 


res. in z (M m ) 


shift in z (jUm) 


Perfect 


11.4±0.1 


-0.13 ±0.07 


51.1±0.4 


-8.2±0.3 


Random 10 


15.1 ±0.1 


4.2 ±0.1 


63.0 ±0.4 


1.4 ±0.4 


Random 10 + error scaling 


13.2±0.1 


2.6 ±0.1 


56.6 ±0.4 


2.3 ±0.4 


Aligned 


13.9±0.1 


-0.18 ±0.09 


53.7 ±0.4 


-91.5±0.4 


Aligned + error scaling 


13.8±0.1 


-0.15 ±0.09 


55.4 ±0.4 


-91.6±0.4 



Table 7: Fraction of primary vertex outliers. 



Setup 


outliers in x (%) 


outliers in z (%) 


Perfect 


1.7±0.1 


4.1±0.1 


Random 10 


5.5±0.2 


8.3±0.2 


Random 10 + error scaling 


2.8±0.1 


6.1 ±0.2 


Aligned 


3.2±0.1 


8.2 ±0.2 


Aligned + error scaling 


3.2±0.1 


8.0 ±0.2 



outliers is however partially diminished by the application of error scaling for "RandomlO". For the 
"Aligned" scenario the corresponding scaling factors are much smaller than for "RandomlO" and the 
effect of error scaling on the primary vertex performance is negligible. 

6 Effects of misalignment on ^-tagging performance 

For the study of the impact of the residual misalignment on the ^-tagging performance, several data sets 
were produced with WH(mn = 120 GeV) and tt events. Eight cases were considered corresponding 
to each specific scenario of residual misalignment ("Perfect", "Aligned", "RandomlO" and "Random5") 
with and without error scaling as described in Sections 2 and 3. The performance of the ^-tagging has 
been assessed by looking at the rejection rate of light quarks at £»-jet efficiencies of 50% and 60% using 
various tagging algorithms: IP2D, IP3D, SV1 and the combined tagger IP3D+SV1. A description of the 
different taggers can be found in Ref. [2]. 

For each of the scenarios using WH(\20) samples, 45,000 WH (120) -»• \ivbb events and 175,000 
WH (120) — >• jXVuu events were used. The tt samples contained 50,000 events each with the exception of 
the "Perfect" scenario without error scaling which had 570,000 events and the "Aligned" scenario with 
error scaling which had 500,000 events. 

6.1 Results 

Figure 2 shows the b-]&t efficiency versus light jet rejection for tt and WH(\20) samples for the four 
misalignment sets with and without error scaling. Rejections for the IP3D and IP3D+SV1 tagger at b-tag 
efficiencies of 50% and 60% are tabulated in Table 8 for WH (120) and in Table 9 for tt. 

The results for the IP3D+SV1 are also summarized in Fig. 3. As expected, the larger the misalign- 
ment, the greater the degradation of the ^-tagging performance. In the case of "RandomlO", which 
represents the highest amount of misalignment (shifts of 10 }Xm in the pixel r0 measurement direction), 
there is almost a factor 2 drop in performance. For "Random5", where the level of misalignment is lower 
(shifts of the order of 5 lira), the decrease of the light jet rejection rates is lower than in the previous 
case at around 30% degradation. For the "Aligned" set the loss in performance is around 10 to 20% 



471 



b- Tagging - Effects of Misalignment on b- Tagging 



Table 8: Light jet rejection rates computed for ft-jet efficiencies of 50% and 60% for the IP3D and 
IP3D+SV1 taggers for the various misalignment scenarios with and without error scaling (ES) for 
WH (1 20) events. 





Rejection rate 


Setup 


IP3D (50%) 


IP3D (60%) 


IP3D+SV1 (50%) 


IP3D+SV1 (60%) 


Perfect 


211 ±4 


67 ±1 


399 ±11 


104 ±2 


Perfect + ES 


215±5 


67 ±1 


372 ±11 


98±2 


Random 10 


51 ±1 


23 ±1 


49±1 


21 ±1 


RandomlO + ES 


80±1 


29±1 


166 ±3 


49±1 


Random5 


144±3 


49±1 


165 ±3 


53 ±1 


Random5 + ES 


182±7 


53 ±1 


311 ± 16 


80±2 


Aligned 


193 ±4 


62±1 


300 ±8 


84±1 


Aligned + ES 


190 ±4 


62±1 


306 ±8 


87±1 



Table 9: Standard light jet rejection rates computed for b-jet efficiencies of 50% and 60% for the IP3D 
and IP3D+SV1 taggers for the various misalignment scenarios with and without error scaling (ES) for it 
events , 





Rejection rate 


Setup 


IP3D (50%) 


IP3D (60%) 


IP3D+SV1 (50%) 


IP3D+SV1 (60%) 


Perfect 


238±11 


68±2 


480 ±30 


166 ±6 


Perfect + ES 


244 ±11 


70±2 


474 ± 30 


161 ±6 


RandomlO 


86±2 


32±1 


95±3 


38 ±1 


RandomlO + ES 


71±2 


25±0 


242 ±11 


77±2 


Random5 


192 ±7 


56±1 


290 ±14 


95±3 


Random5 + ES 


133±4 


46±1 


360 ±20 


116±4 


Aligned 


234 ±10 


67±2 


442 ±27 


143 ±5 


Aligned + ES 


206 ±8 


62±1 


428 ±24 


138±5 



Table 10: Purified light jet rejection rates computed for &-jet efficiencies of 50% and 60% for the IP3D 
and IP3D+SV1 taggers for the various misalignment scenarios with and without error scaling (ES) for it 
events . 





Rejection rate 


Setup 


IP3D (50%) 


IP3D (60%) 


IP3D+SV1 (50%) 


IP3D+SV1 (60%) 


Perfect 


331 ±19 


80±2 


914 ±86 


243 ± 12 


Perfect + ES 


332 ±19 


80±2 


872 ±79 


234 ±11 


RandomlO 


97±3 


34±0 


106 ±3 


41 ±1 


RandomlO + ES 


76±2 


26±0 


316±17 


89±3 


Random5 


250 ±12 


62±2 


387 ±23 


113±4 


Random5 + ES 


154 ±6 


50±1 


558±41 


148 ±6 


Aligned 


321 ±18 


77±2 


714±59 


190 ±8 


Aligned + ES 


273 ± 14 


70±2 


706 ±56 


180±7 
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Figure 2: Light jet rejection versus ^-tagging efficiency for the four different alignment sets for 
IP3D+SV1 for tt (left) and W//(120) (right). ES denotes error scaling. 



and lies somewhere between the "Perfect" alignment and the "Random5" set. This is consistent with the 
level of misalignment suggested by the parameter c in the error tuning which is 3 jltm in the pixel r0 
measurement direction. 




Figure 3: Light jet rejections using IP3D+SV1 tagger for the four misalignment scenarios at ^-tagging 
efficiency working points of 50% (left) and 60% (right). Results are shown before and after error scaling 
(ES). 

The rejections for WH (120) are systematically lower than that for tt as observed in other studies [2], 
however, in general they both show similar trends with misalignment. Some differences are observed 
with the effects of error scaling which are discussed below. Results are shown mainly for tt, although 
similar conclusions are reached for both samples. 

Figure 4 shows the ft-tag weight for IP3D+SV1 tagger for the different alignment scenarios. The 
differences between the "Aligned" and "Perfect" sets are difficult to see in such plots but for the larger 
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misalignments ("Random 10" and "Random5") it is seen that the light jets have slightly larger weights 
while the Z?-jets have lower weights resulting in the loss of discrimination. 
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Figure 4: Jet weight distributions for the IP3D+SV1 tagger for the different alignment scenarios with 
error scaling for tt (a) and WH (120) (b). 



6.2 Effects of error scaling 

It is observed in Fig. 3 that for the larger misalignment scenarios ("Random 10" and "Random5") the 
error scaling gives a significant improvement, while for the "Aligned" and "Perfect" case the impact of 
error scaling is small. 

Figure 5 compares the b-tag weights with and without error scaling. Only the "RandomlO" results 
are shown. For the other scenarios the differences were less pronounced. The tt events show some 
differences in behaviour for the error scaling as compared with the WH events. For tt, the error scaling 
results in only a small difference for the light jets while for the fe-jets the differences are more evident. 
This is in contrast with the WH events where the light jets show more differences and the &-jets are less 
affected. 




(a) "RandomlO", ft 



(b) "RandomlO", WH{\2Q) 



Figure 5: Jet weight distributions for the IP3D+SV1 tagger comparing with and without error scaling 
(ES) for "RandomlO" scenario for tt (a) and WH (120) (b). 
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The differences are thought to be associated with the pj spectra of light jets and ft-jets which are 
different from each other and different for the two event types. The effectiveness of the error scaling 
is expected to have some pj dependence since for lower pj, the multiple scattering will dominate and 
differences in hit errors will be less important. There was insufficient statistics in the samples however 
to verify if this was indeed the case. 

In some cases the use of error scaling results in worse performance. This is discussed further in 
Section 6.4. 

6.3 Purified jets 

For comparison with other studies [2] purified jets were also studied. Purification excludes labelling jets 
as light jets when there is a &-quark within a cone of 0.8. This gives a more physics independent measure 
of the performance (although differences will still be seen between samples because of the different pj 
and T\ distributions of the jets contained in the samples). Table 10 shows the results for purified jets for 
tt. For WH events only standard jets were considered as other studies [2] show similar results with and 
without purification. Figure 6 compares standard and purified jets for tt events and it can be seen that 
the rejections are higher for purified jets but the trends are similar for the different alignment scenarios. 
The degradation of the "Aligned" case with respect to the "Perfect" alignment is more pronounced after 
purification (19 - 23% degradation) than for the standard jets (10 - 14% degradation). The effects of 
error scaling showed similar behaviour for both standard and purified jets. 




\andom10 Randoms Perfect Aligned RandomlO Random5 Perfect Aligned 



Alignment set Alignment set 



Figure 6: Comparison of light jet rejections using IP3D+SV1 tagger for standard and purified jets in tt 
events. Left plot: 50% b-tag efficiency. Right plot: 60% ft-tag efficiency. 



6.4 Comparison of the different taggers 

Figure 7 shows the rejections for different taggers for standard jets. The impact parameter based taggers 
are the most affected by misalignment with the "Aligned" set showing 10 - 20% lower rejections than 
the "Perfect" case and up to a factor 3 degradation for the largest misalignment. After error scaling, the 
SV1 tagger shows rather uniform performance for all scenarios considered, with the "Aligned" set giving 
10% degraded performance with respect to the "Perfect" case. 
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Without error scaling (see Fig. 8) the SV1 tagger shows significant differences for the different align- 
ment scenarios. The ratio between rejections with error scaling to those without error scaling is shown 
in Fig. 9. It can be seen that the error scaling has the most beneficial effect with the larger misalignments 
("RandomlO" and "Random5") for the SV1 performance. For the "Aligned" scenario the error scaling 
has little effect while for the "Perfect" case it actually degrades the performance slightly. For the impact 
parameter based taggers the error scaling has a smaller effect on the ^-tagging performance and even 
degrades the performance in it events. 




Figure 7: Comparison of the light jet rejections for the different taggers, IP2D, IP3D, SV1 and the 
combined tagger IP3D+SV1. Left plot: 50% &-tag efficiency. Right plot: 60% ft-tag efficiency. Results 
are with error scaling using it events. 

Due to the beneficial effect of the error scaling on the secondary vertexing, the overall performance 
of the combined tagger (IP3D+SV1) is also improved with error scaling in the case of the "RandomlO" 
and "Random5" sets. The error scaling parameter c is zero for the "Perfect" alignment and small for the 
"Aligned" set and consequently for both cases the effect of error scaling on the performance of ^-tagging 
for the combined tagger is also small. For the "Aligned" case, while the relative difference is smaller 
than the hand-made sets, the error scaling degrades the performance slightly. 

The reason for loss of performance with error scaling for some cases can be explained by the fol- 
lowing: since the error scaling will generally increase the errors, it will reduce the impact parameter 
significance. This is desirable for light jets as it will make them more compatible with zero impact pa- 
rameter. For Z?-jets, however, it also reduces the significance and so will reduce the ^-tagging efficiency 
for a given weight cut, or in other words, one needs a lower weight cut to obtain the same efficiency and 
hence results in lower rejection for light jets for a given b-]et efficiency. The overall effect depends on 
these two competing effects and so can potentially lead to a loss in performance. While a decrease in 
performance with error scaling was observed in it events for the IP2D and IP3D taggers, the opposite 
was observed for WH events. As was already discussed above for the b-tag weight in Fig. 5, the error 
scaling affects the light jets and ft-jets in the two physics samples differently and this is thought to lead 
to the differences in the error scaling behaviour seen here. 

The effect of error scaling on the secondary vertex tagger will be to recover some secondary vertices 
which would have otherwise failed quality cuts due to an underestimated error. One will lose some true 
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Figure 8: As in Fig. 7 but without error scaling. 




Figure 9: Ratio of rejections with error scaling to rejections without error scaling. Left plot: 50% b-tag 
efficiency. Right plot: 60% ft-tag efficiency. 
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secondary vertices that are close to the primary vertex as the larger error will make them compatible with 
the primary vertex, however for similar reasons it will result in fewer fake secondary vertices close to the 
primary vertex. 



6.5 Effects of recalibration 

The taggers require probability distribution functions for light jets and ft-jets as described in Ref. [2] 
and the process of obtaining the set of these reference distributions is known as calibration. The results 
presented here use the same set of calibrations as used in Ref. [2]. Since misalignments will alter these 
distributions, it is possible that one can obtain some more discriminating power by recalibrating using 
the misaligned sample. Methods for obtaining these calibrations from real data are explored in Ref. [3]. 

To investigate whether recalibrating results in any gain in performance, a new set of reference dis- 
tributions was obtained for each sample with and without error scaling. In practice one should use 
independent samples to calibrate and to test the performance, however, here the reference distributions 
were obtained with the same or a subset of the sample used to measure the performance. 

As seen in Fig. 10 recalibration gives better performance, although after error scaling the difference 
between the fixed calibration and recalibration is only marginal. 

One might expect that recalibration may compensate any miscalculation of the errors in a similar 
way to the error scaling. This is only partially the case as can be seen in Fig. 11, where the relative 
improvement with error scaling is reduced when recalibrating. 




Random! 0 Randoms Perfect Aligned RandomlO Random5 Perfect Aligned 



Alignment set Alignment set 



Figure 10: Comparison of the light jet rejection obtained for the IP3D+SV1 tagger using a fixed calibra- 
tion or recalibrating for each separate sample. Left plot: 50% ft-tag efficiency. Right plot: 60% ft-tag 
efficiency. 



7 Conclusion 

The effect of misalignment on the tracking performance, as measured by tracking efficiency and fake 
rates, was small, and error scaling recovered the performance almost to the level that was seen with 
the perfect alignment. The degradation of the primary vertex was more significant, with resolutions 
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Figure 11: Ratio of rejections with error scaling to rejections without error scaling. Results are shown 
for the IP3D+SV1 tagger. Left plot: 50% b-tag efficiency. Right plot: 60% b-tag efficiency. Compares 
using a fixed calibrations and recalibrating for each separate sample. 

about 2.5 /im degraded and an increase in the number of outliers. The "Aligned" set showed similar 
performance to the "RandomlO" despite better ^-tagging performance. 

The performance of ^-tagging was clearly degraded with misalignment and the amount of degrada- 
tion was found to be roughly proportional to the amount of random displacement of modules. Systematic 
effects are expected to also play an important role, however, the random displacement was the only as- 
pect that was quantified in this study by the parameter c obtained in the error scaling procedure. In 
order to disentangle the contributions from random and systematic effects it would be necessary to create 
dedicated residual misalignment sets with known systematic distortions. 

The impact parameter based taggers were observed to be the most affected by misalignment and 
the introduction of error scaling brought little or even negative benefit to the ^-tagging performance in 
the case of tt events. Error scaling was important for the performance of the secondary vertex find- 
ing, and without it, for larger misalignments the degradation for the secondary vertex based tagger was 
significant. With error scaling most of the degradation was recovered and the secondary vertex tagger 
showed uniform performance for all alignment scenarios considered. The behaviour of the combined 
tagger, IP3D+SV1, follows what one might conclude from the behaviour of the separate taggers, that is, 
it benefits from error scaling but shows a degradation with misalignment even after error scaling. 

The "Aligned" set was the most realistic misalignment scenario studied here and the results were 
encouraging with rather moderate degradation in the ^-tagging performance. In purified jets from tt, 
the degradation was more evident with 19% loss of rejection at 50% ^-tagging efficiency. However, in a 
more realistic environment, as seen by looking at standard jets, the amount of degradation was only 10%. 
At 60% ^-tagging efficiency, the loss of rejection was slightly larger with 23% degradation for purified 
jets and 14% degradation for standard jets. For WH events the loss of rejection was similar with around 
18% degradation at 50% ^-tagging efficiency and 11% degradation at 60% ^-tagging efficiency. 

The amount of residual misalignment remaining after applying the actual alignment procedures re- 
sulted in only a small loss of performance and so misalignments are not expected to cause a major 
problem for doing ^-tagging in ATLAS. 
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Abstract 

£>-jets can be identified by taking advantage of the presence of a muon coming 
from the semi-leptonic decay of the b hadrons. This note describes a soft muon 
tagging algorithm and its performance when applied to different Monte Carlo 
physics samples in ATLAS. A b-]et tagging efficiency of about 10% (includ- 
ing the inclusive b — >• }ivX branching ratio of « 20%) is achieved for a light 
jet rejection of better than 300. The effect of pile-up events and cavern back- 
ground on the performance is also considered and is found to be significant but 
manageable. 

1 Introduction 

Soft-lepton tagging relies on the semi-leptonic decays of b and c hadrons. Indeed the presence of a 
muon is enhanced in Z?-jets thanks to the significant semi-leptonic decay branching ratio of b hadrons 
(BR(b — > flvX) « 11%), and of c hadrons produced by the b hadron decay (sequential semi-leptonic 
decay, BR(b — > c — > jXvX) 10%). A soft muon tagger, while intrinsically limited by the small semi- 
leptonic branching ratio, offers a good alternative or complement to the more performant lifetime taggers. 
Thanks to its good purity and low correlation with lifetime taggers, it can be used to do cross-calibrations 
of the two types of tagger [1], or to enhance the lifetime tagger performance by combining the two. 
Finally, it permits derivation of specific jet energy corrections [2]. 

The properties of the soft muons and of the various backgrounds are shown in Section 2. The al- 
gorithm is then described in detail in Section 3 while its performance applied to different Monte Carlo 
samples is discussed in Section 4. 

2 Properties of muons from semi-leptonic decays and their backgrounds 

There are three sources of background from particles within light jets: muons coming from the decay 
of light hadrons (mostly pions), hadrons managing to go through the calorimeter and reaching the muon 
spectrometer ("punch-through"), and hits caused by the neutron gas that will be surrounding the detector 
during data taking, producing fake tracks in the muon spectrometer ("cavern background"), c-jets are 
also a source of background as far as ^-tagging is concerned. Figures 1, 2, 3, and 4 show, respectively, 
the distance to the closest jet AR = y / T] 2 + ^) 2 , the muon transverse momentum, the impact parameter, 
and the transverse momentum relative to the closest jet axis p r f , for muons from (direct) b decays, 
sequential b — >• c — >• }lvX decays, c decay, light hadron decays, and fake muons, in jets of E T > 15GeV 
and |t]| < 2.5 in tt events. p r j l is defined as the muon momentum in the plane orthogonal to the jet axis, 
where the jet axis is corrected for the presence of a muon by adding the muon momentum to the jet 
momentum. Contributions from true muons are estimated at the generator level while fake muons are 
estimated with the full simulation and defined as a reconstructed muon associated with a jet while no 
muon of momentum larger than 2 GeV was found at the generator level within AR = 0.6 of that same jet. 
Note that the distinction between fake muons and muons from light hadron decays is somewhat arbitrary 
since the latter can also be produced in the calorimeter as part of the hadronic shower; such muons are 
present in the full simulation but are not distinguished from fake muons because long-lived particles and 
showers are taken care of by GEANT and the full GEANT information was not available in the present 
study. Muons produced by direct b hadron decays have a significantly larger transverse momentum than 
the various backgrounds, especially muons from light hadron decays; additionally, since light hadrons 
such as charged pions and charged kaons have a long lifetime, those muons tend to have a large impact 
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parameter. Those two properties will be used to reject this important background efficiently. Thanks to 
the b hadrons high mass, muons from direct b decays tend to be more boosted in a plane transverse to the 
jet axis, i.e. have a larger p r j than the backgrounds. This property is used in the tagger to further separate 
ft-jets from light jets through a likelihood technique. It is important to note that muons from sequential b 
decay are much more difficult to separate from background since they have a much softer spectrum both 
in pj and pj 1 than those from direct b decay. Finally, Fig. 5 shows that the p™ 1 distribution in tt and WH 
events is very similar. Indeed the p™ 1 variable is not very correlated with the jet Ej or pseudo-rapidity. 
Thus it is not very dependent on the process producing the &-jets. 



<20.14 




Figure 1 : Distribution of AR = a/Atj 2 + A(j) 2 between closest jet and muons from b hadron decays, c 
hadron decays, light hadron decays, and fakes, in jets of Ej > 15GeV and |tj | < 2.5 in it events. The red 
line shows the cut applied for the basic selection described in Section 3. All histograms in Fig. 1 to 5 are 
normalized to unity. 



3 Description of the algorithm 

The algorithm works in three steps, described in the following sections: 

• The standard muon reconstruction algorithms are used to identify muons. 

• Muons satisfying some basic requirements are then associated to jets. 

• Finally, a 1 -dimensional likelihood ratio using the p r j l variable discriminates further signal from 
background. 

3.1 Muon reconstruction 

Two complementary muon reconstruction algorithms are used by the soft-muon tagger: so-called "com- 
bined" muons, which correspond to a track fully reconstructed in the muon spectrometer and matched 
with a track in the inner detector; and "tagged" muons, which cannot reach the muon middle and outer 
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Figure 2: Transverse momentum distribution of muons from b hadron decays, c hadron decays, light 
hadron decays, and fakes, in jets of Ej > 15GeV and \r\\ < 2.5 in tt events. The last bin includes 
overflows. The red line shows the cut applied for the basic selection described in Section 3. 
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Figure 3: Impact parameter distribution of muons from b hadron decays, c hadron decays, light hadron 
decays, and fakes, in jets of Ej > 15GeV and |tj| < 2.5 in it events. The last bin includes overflows. The 
red line shows the cut applied for the basic selection described in Section 3. 
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Figure 4: Distribution of the transverse momentum relative to the jet axis of muons from b hadron decays, 
c hadron decays, light hadron decays, and fakes, in jets of Ej > 15GeV and \r\ \ < 2.5 in it events. The 
last bin includes overflows. 
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Figure 5: Distribution of the transverse momentum relative to the jet axis of muons from b hadron decays 
and from background sources, in jets of Ej > 15GeV and \r]\ < 2.5 for tt events and WH events. The 
last bin includes overflows. 
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stations and are identified by matching an inner detector track with a segment in the muon inner sta- 
tions only. The ATLAS muon spectrometer is able to fully reconstruct muons of momentum larger than 
about 5 GeV, providing a precise measurement of its momentum, allowing - in combination with the 
inner detector track - a very good rejection of the combinatoric background in the dense environment of 
jets. Muon spectrometer tracks that fail to be combined with an inner detector track are not used by the 
soft-muon tagger because of the contamination from fakes and muons from light hadron decays. Tagged 
muons are less pure than combined muons but allow one to recover some efficiency in the crucial low 
momentum range. Thus both combined muons and tagged muons are used by the soft-muon tagger. For 
more details of muon reconstruction and identification, see Ref. [3]. 

3.2 Jet-muon association and basic selection 

Muons are associated with the closest jet (in T7 x 0) in the event (only jets with E T > 15 GeV and 
I r\ I < 2.5 are considered) and the jet-muon pair is required to satisfy A/?(jet-muon)< 0.5. Additionally, 
muons are required to satisfy the following requirements: 

• Impact parameter with regard to the primary vertex \do\ < 4 mm 

• p T > 4 GeV 

• Matching between the muon spectrometer and inner detector tracks of % 2 jdof < 10 

3.3 Likelihood ratio 

Muons from semi-leptonic ^-decays are further separable because of their particular kinematics. As 
shown in Section 1, the muon transverse momentum relative to the jet (p r j l ) is a good discriminant vari- 
able, and it is used in a likelihood ratio to discriminate between the b and light jet hypotheses. No attempt 
was made to specifically discriminate between b- and c-jets. The p r j l probability density functions were 
estimated on fully simulated tt and WH — > jivbblWH —>■ jivuu samples of several hundred thousand 
events and are shown in Fig. 6 and 7 for tagged and for combined muons separately. Here pile-up and 
cavern background are omitted. The normalized variable Pj l /{pj l + 0.5GeV) is used instead of p r j l in 
order to facilitate the smoothing of the probability density functions. The likelihood is found to be less 
discriminating for tagged muons than for combined muons because muons at low momentum tend to 
be produced by b —>■ c — > /J.X sequential decays, which also tend to yield a smaller p r j l as shown in the 
previous section. This is visible in the probability density function as a second peak at a value of « 0.5 
(compared to « 0.7 for direct b — > /xX decays). Each likelihood is normalized with the probability 
(estimated on the same fully simulated samples) that a reconstructed muon be associated with a jet of a 
given flavor. For each type of reconstructed muon (tagged or combined), the likelihood ratio Q can be 
written as follows: 

e°xL(pf\b) 
e?xL(p?\l) 

where (£*') is the fraction of b-jets (light jets) that contain a muon satisfying the basic selection and 
L(p r j l \b) (L(p r f\l)) is the probability density function for a b-jet (light jet). It was found that e^/ef rs 6.6 
for tagged muons and e^/ef ~ 44.4 for combined muons, reflecting the fact that combined muons are 
purer and contribute to a large fraction of the efficiency. 

Several muons can be associated with a jet. In such a case, the muon with highest transverse momen- 
tum is considered. At this point, no attempt has been made to use the information given by the presence 
of an additional muon in the jet. 
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Figure 6: Probability density function used in the algorithm likelihood for the p r j variable, for the tagged 
muons. 
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Figure 7: Probability density function used in the algorithm likelihood for the p r j l variable, for the 
combined muons. 
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4 Performance 

The algorithm performance was estimated on tt and WH — > iivbblWH — > /ivmm Monte Carlo samples 
of several hundred thousand events. The jet flavor is determined at the generator level by matching 
reconstructed jets with quarks considered after final state radiation (for details, see Ref. [4]). Figures 8 
and 9 show the ^-tagging efficiency and the light jet tagging rate, respectively, with and without a cut 
on the likelihood ratio, as a function of jet Ej and pseudorapidity in tt events. An average ^-tagging 
efficiency of 10% is reached for a cut on the likelihood ratio of InQ > 3.05, which corresponds to 
rejecting all tagged muons and selecting combined muons with p™ 1 > 360 MeV. No requirement on 
the b decay is made, so that the ^-tagging efficiency includes the semi-leptonic branching ratios as well 
as the jet-muon association efficiency, the detector acceptance, and the muon reconstruction efficiency 
and selection. Likewise, the light jet tagging rate includes both light hadron decay muons and fake 
muons. The Z?-tagging efficiency tends to decrease as the jet Ej increases, mostly because of a drop 
in the tracking efficiency of the inner detector in highly collimated jets. Not surprisingly, the light jet 
tagging rate increases significantly with the jet Ej. After the basic selection stage, light hadron decays 
account for 8% (1.6%) of combined muons (tagged muons) present in tagged light jets in tt events. 
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Figure 8: Probability for a Z?-jet to be tagged by the soft-muon tagger as a function of jet pseudorapidity 
(left) and transverse energy (right: the last bin includes overflows) without and with a requirement on the 
likelihood ratio corresponding to an average ^-tagging efficiency of 10%. 
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Figure 9: Probability for a light jet to be tagged by the soft-muon tagger as a function of jet pseudora- 
pidity (left) and transverse energy (right, the last bin includes overflows) without and with a requirement 
on the likelihood ratio corresponding to an average ^-tagging efficiency of 10%. 
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Figures 10 and 11 show the ^-tagging efficiency vs light jet rejection (the inverse of the light jet 
tagging rate) for different cuts on the likelihood ratio. Figure 10 compares the performance in tt and WH 
events. The rejection for a given efficiency is slightly better in tt events than in WH events, due to the 
fact that jets in tt events are more central, while the tagger performance is degraded in the forward region 
of the detector. The soft-muon tagger reaches a rejection of about 300 in WH events and 380 in tt events 
for an efficiency of 10%. 

Figure 1 1 shows the effect of pile-up (the superposition of several events occuring within the same 
bunch crossing) and cavern background on the tagger performance. Here, only jets matched to the hard- 
scatter process quarks are considered in order to make a fair comparison of the algorithm performance 
with and without cavern background and pile-up. Indeed, soft jets from pile-up events tend to have a 
lower light jet tagging rate and would bias the comparison. For a luminosity of 2 x 10 33 cm _2 -s _1 , pile- 
up and cavern background decrease the rejection by about 15% (for a given efficiency of 10%), which is 
significant but not dramatic. 
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Figure 10: ^-tagging efficiency vs light jet rejection estimated in tt and WH (without pile-up/cavern 
background). 

Performance will have to be evaluated in data, using techniques similar to those developed at the 
Tevatron [5]. The muon reconstruction efficiency can be measured using a tag-and-probe method ap- 
plied to J/yr and Z samples. The light jet tagging rate may be measured using jet events, although 
disentangling background from heavy-flavor muons is a complex issue. Jet samples may also be used to 
test the likelihood probability distribution functions against data. Those issues have not been studied in 
the context of soft muon tagging in ATLAS yet. 

5 Conclusion 

The soft-muon tagger developped for the ATLAS detector shows excellent performance across a large jet 
Ej spectrum, with a light jet rejection of more than 300 for a ^-tagging efficiency of 10%. The event pile- 
up and the cavern background appear to have a significant but not dramatic effect on the performance, 
with a decrease of about 15% on the light jet rejection at a luminosity of 2 x 10 33 cm _2 -s _1 . 
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b-jet efficiency 

Figure 1 1 : ^-tagging efficiency vs light jet rejection estimated on a tt sample with and without pile- 
up/cavern background. 
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Abstract 

The presence of an electron coming from the semi-leptonic decay of a B hadron 
can be used to tag b-jets. This note describes a soft electron tagging algorithm 
and its performance when applied to ATLAS Monte Carlo simulation data. Jets 
are built from the energy reconstructed in the electromagnetic and hadronic 
calorimeters with a cone algorithm. Electrons, reconstructed by a track-seeded 
algorithm, are sought among charged tracks associated to the jets. The tagging 
of jets originating from b quarks is based on kinematics and on the identi- 
fication capabilities of the inner tracker and the electromagnetic calorimeter. 
The performance of the electron identification and 6-tagging procedure are 
presented. A b-jet tagging efficiency of about 7% (including the inclusive 
b — ► evX branching ratio of ~ 20%) is achieved for a light jet rejection of 
better than 100. 

1 Introduction 

A variety of interesting physics processes at LHC, such as the H — > bb decay for an intermediate Higgs 
boson mass range, top physics or searches for new physics require efficient identification of ^-quarks. 
The performance of ^-tagging algorithms in ATLAS are studied in Ref. [1]. The semi-leptonic decays 
of heavy quarks provide a clean signature used to identify the flavour composition of jets. The semi- 
leptonic decay modes are b —>■ £, c —>■ t and the cascade decay b —>■ c — > I. Electrons produced in b 
decays (through direct and cascade decays) can be detected using the electromagnetic calorimeter and 
the inner detector. Since these electrons are non-isolated and with low transverse momentum, excellent 
electron/hadron separation capability is required. As a ^-flavour tag, the lepton tag is not competitive 
with the lifetime tag because the branching ratio is low for such decays (about 20% for B-meson decays 
to leptons, including cascade decays, per lepton family) and is limited by the efficiency to reconstruct 
and identify electrons within jets. Still this method can be used with the vertex algorithms to provide a 
complement to the overall 6-tagging performance and for cross-checks and calibration. 

The note is organised as follows. Monte Carlo data samples used in this analysis are described in 
Section 2. The jet and track selections are described in Section 3. The electron identification algorithm 
is described in Section 4. The ^-tagging algorithm and its performance are described in Section 5. 

2 Monte Carlo samples 

The samples used in this study contain events with electrons in jets from Higgs boson associated pro- 
duction WH, with W —>■ jiv. We take the mass of the Higgs boson to be m# = 120 GeV. The signal 
sample consists of H — > bb events and the background sample of H —>■ uu events. Another sample of 
H —> bb events is enriched in true electrons with a filter at the generation level. This filter is defined as 
follows: both b quarks (before final state radiation, FSR) are required to have a transverse momentum 
Pj > 15 GeV and a pseudorapidity \r\b\ < 2.5; at least one true electron with p e T > 1 GeV is required to 
be found in a cone AR < 0.4 around each b-quark direction (before FSR). This sample will be used only 
in Section 4 to model signal electrons with enough statistics. 

All samples have been generated using the PYTHIA 6.403 [2] Monte Carlo event generator. More 
details on the Monte Carlo generators used can be found in [3]. Data have been passed through a full 
detector simulation based on GEANT4. No pile-up has been included but its effect on performance, 
based on previous studies, is mentioned below where relevant. 
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The signal electrons come from direct (b — > e) and cascade (b — > c — > e) semi-leptonic decays of 
5-hadrons. Apart from the previous sources, there are also some other sources 1 , mainly b — > T — > e and 
b — > tyO — *■ e+<? ~- The background electrons arise from 7T° Dalitz decays, /-conversions occurring 
in the inner detector and decays of light hadrons. The fraction of electrons from photon conversions 
becomes substantial at large pseudo-rapidities and large transverse momenta. 

Distributions of true transverse momentum, pj, and pseudo-rapidity, r\, for electrons and pions are 
shown in Fig. 1 for the samples considered. Table 1 shows the mean values of the true transverse mo- 




P T (GeV) 



Figure 1 : Normalized distributions of true transverse momentum pj (left) and pseudo-rapidity r\ (right) 
are shown for signal electrons (hatched histograms), electrons from conversions (dotted line histograms) 
and pions (plain histograms). 

mentum distributions for signal and background electrons and pions in the two samples. 



Table 1 : The mean true pj (in GeV) for electrons and pions in various data samples. 



sample 


electrons 


pions 


B hadrons 


D hadrons 


/-conversions and 71° Dalitz 


other sources 


H^bb 


10.8 


11.9 


4.9 


7.3 


5.9 


H -^uu 






5.2 


4.2 


7.1 



3 Jet and track selection 

The reconstruction of the various objects needed for ^-tagging and the estimation of its performance are 
summarized in this section. The reconstruction of soft electrons will be detailed in the next section. 

We follow the default jet reconstruction and labelling procedure described in Ref. [1]. Jets are re- 
constructed in the calorimeters using a standard cone algorithm with a size of AR = 0.4. Jets with 
pj > 15 GeV and |t]| < 2.5 are considered for ^-tagging. To assess quantitatively the ^-tagging perfor- 
mance, Monte-Carlo information is used to determine the type of parton (a quark-based labelling) from 
which a jet originates. Jets are labelled as Z?-jets if a b quark with pj > 5 GeV (after FSR) is found in a 
cone AT? = 0.3 around the jet direction. The labelling for c-jets (and T-jets) is done in the same way. By 
light jets we denote all other reconstructed jets. 

'The corresponding branching ratios [4] are Br(b -> E~) = (10.71 ±0.22)%, Br(b -^c^£+) = (8.01 ±0.18)%, Br(b 
c^r) = (1.62+°;^)%,Br(6-> T ^ e) = (0.419±0.055)% and Br(b ^ (J/y, i/) -> e + e~) = (0.072 ±0.006)%. 
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number of tracks in jets ~ ~" p T ofjet(GeV) 



Figure 2: Track multiplicity in jets (left) and jet transverse momentum (right) for b jets (hatched his- 
tograms) and light jets (solid line). Only jets having at least one good quality track with pj > 2 GeV are 
considered. 

For the soft electron ^-tagging algorithm, a track-based electron reconstruction algorithm is used and 
will be described in the next section. In order to be able to identify track candidates as electrons, strict 
track selection criteria are applied and called hereafter good track quality cuts. To reduce the number of 
fake candidates in jets, only tracks with pj > 2 GeV are considered. The inner detector coverage extends 
to pseudo-rapidity values of ±2.5, except for the transition radiation tracker (TRT) which extends up to 
±2. This sub-detector is crucial in the identification procedure, so the track selection requires |tj | < 2. 
Tracks are required to have at least nine precision hits (pixels and SCT), at least two hits in the pixel 
detector, at least one hit in the vertexing layer (the so-called Mayer) and an unsigned transverse impact 
parameter at the perigee smaller than 1 mm. The TRT may record either of two types of hits, 'low- 
energy' hits which are used for track reconstruction and 'high-energy' hits which are used for electron 
identification. Selection criteria thus require at least 20 low-energy hits and at least one high-energy hit 
in the TRT detector along the track. After this selection, only about 50% of initial tracks remain. About 
50% of actual £»-jets and 60% of light jets, in the WH sample, have no good quality track associated. 
Figure 2 shows the multiplicity of good quality track inside jets and the jet transverse momenta. For jets 
having at least one good quality track the mean multiplicity is 3.8 (3.3) for &-jets (light jets). Light jets, 
which include ISR, FSR and underlying events, have a softer transverse momentum than b jets, which 
are originated from the H decay, with a mean pj of 5 1 GeV against 57 GeV, but with a large spread, an 
RMS of ~35 GeV. 

The fraction of jets containing electrons with a good quality track is given in Table 2 for signal and 
background samples. 

Table 2: Fraction of jets (in %) with an electron at the generator level that also have a good quality track, 
for the signal and background samples. 



Sample 


/-conversions and 7r° Dalitz 


B hadrons 


D hadrons 


Other sources 


H^bb 


1.6 


3.4 


2.2 


0.3 


H -^uu 


1.3 


< 10~ 4 


< 10~ 4 


0.1 
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4 Electron reconstruction and identification 

The standard electron reconstruction procedure [5] is based on calorimeter clusters, with a subsequent as- 
sociation to tracks. While this method is efficient for high-energy isolated electrons, such as those arising 
from W or Z decays, it is not effective for electrons inside hadronic jets, such as those from semi-leptonic 
decays. Indeed hadron and electron showers tend to overlap in collimated jets so that electron cluster 
characteristics are obscured. An alternative procedure takes full advantage of the tracking capabilities 
of the inner detector as well as the granularity of the electromagnetic calorimeter. The method relies 
on the extrapolation of reconstructed charged particle trajectories into the electromagnetic calorimeter. 
The most common background processes for producing electron-like showers in the calorimeters were 
described in Section 2. Because the development of showers is different for electrons and hadrons, the 
electron identification algorithm incorporates variables that describe the shower shapes, quality of the 
match between the track and its corresponding cluster, and the fraction of high-energy hits in the tran- 
sition radiation tracker. This algorithm is used also for the reconstruction of low pj electrons in J/y/ 
events [6]. 

4.1 Electron reconstruction 

All the tracks that pass the good track quality cuts, described in the previous section, are extrapolated to 
the second (also known as middle) sampling layer of the electromagnetic calorimeter. Around this posi- 
tion a cluster of size Atj x A(j> = 0.125 x 0.125 (5x5 cells in this sampling layer) is built. The cell with 
the maximum energy is sought within a small r\ and 0 window (0.075 x 0.075) around the extrapolation 
point. Shower shapes are estimated with respect to this cell. The contribution of neighbouring hadronic 
showers is therefore reduced. 

A set of preselection criteria are applied to decrease the number of fake candidates per jet: 

- The ratio of energy reconstructed in the core of the shower in the first sampling layer to the to- 
tal shower energy reconstructed in the core of the cluster, must fulfil £'i(core)/£'(core) >0.03. 
£"i(core) is the energy reconstructed in a window of size At] x A(j) = 0.009375 x 0.1 in the first 
sampling layer (3x1 cells). £(core), the core energy in the cluster, is computed in the follow- 
ing windows: 0.075 x 0.3 (3 x 3 cells) in the presampler, 0.046875 x 0.2 (15 x 2 cells) in the 
strips, 0.125 x 0.125 (5x5 cells) in the middle, 0.15 x 0.125 (3x5 cells) in the back. This ratio 
tends to be larger for electrons than for hadrons due to the different development of hadronic and 
electromagnetic showers. This quantity is discussed in detail in the following section. 

- Similarly, the fraction of energy reconstructed in the core of the shower in the third sampling layer 
to the energy reconstructed in the core of the cluster, £3 (core) /£ (core), must be smaller than 0.5. 
£3 (core) is the energy reconstructed in a window of size At] x A0 = 0.15 x 0.075 in the third 
sampling layer (3x3 cells). This ratio is larger for hadrons than it is for low-energy electrons 
which almost never reach this layer and is discussed in the following section. 

- The ratio of the energy E reconstructed in the electromagnetic calorimeter over the momentum p 
of the track reconstructed in the inner detector is required to fulfil E/p >0.7. 

These preselection criteria cut out less than 5% of signal electrons. Finally, candidates which are also 
reconstructed as originating from a conversion are vetoed, corresponding to a loss of 9% of all signal and 
background tracks. 

Position and energy corrections are applied in the precise reconstruction of the electromagnetic clus- 
ter and are described in [7]. These corrections have been tuned for high-energy clusters and are not 
optimal for low-energy electrons. In Fig. 3 the ratio between the reconstructed and the true electron 
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Figure 3: Ratio between reconstructed and true energy as a function of the electron pseudo-rapidity |tj| 
(left) and ratio of the reconstructed to true momentum for electrons (right). 

energies is shown as a function of | r\ | for signal electrons from the H —> bb sample. It can be seen that 
the corrections over-estimate the electron energy. Moreover, as electrons are embedded in jets, there is 
a contamination from other hadrons in the energy determination which is not visible with the isolated 
electrons from a J/y sample [6]. Work is on going to improve the energy reconstruction at low energy. 

By default the four-momentum of an electron is defined as the energy reconstructed in the calorime- 
ter, whereas the direction is taken from the associated track. Since in this note the main physics processes 
lead to electron transverse momenta of less than 15 GeV, the momentum measured in the tracker is used 
instead of the energy unless stated otherwise. The reconstructed momentum of the electron is shown in 
Fig. 3. The track reconstructs the momentum of the electron close to its true value for most candidates. 
The small downward shift of the peak from unity and the tail towards lower values are due to photon 
bremsstrahlung. Future developments in ATLAS will ensure an optimal combination of calorimeter and 
tracker measurements in the energy definition. 

4.2 Electron identification variables 

The hadronic calorimeter has a granularity of At] x Atj) = 0.1 x 0.1 which is too large to disentangle the 
energy deposit of an electron inside a jet. In the electromagnetic calorimeter, electrons are narrow objects 
while jets tend to have a broader profile, allowing a discrimination. For this purpose, we use the finer 
granularity (At? x A0 = 0.05 x 0.025) of the third sampling layer of the electromagnetic calorimeter. The 
ratio ^(core) /E{com), defined above, is larger for pions than for electrons and is clearly discriminating 
as can be seen in Fig. 4. 

Electromagnetic showers deposit most of their energy in the second sampling layer of the electro- 
magnetic calorimeter. The following variables are used: 

- The lateral shower shape (cf. Fig. 5 on left) is given by the ratio of the energy reconstructed in a 
window of size 0.075 x 0. 175 (3x7 cells of the middle sampling layer) to the energy reconstructed 
in 0.175 x 0.175 (7 x 7 cells). 

- The lateral width G) 7) 2 = ^/ ^Eg^ ~ ( ^^e^ ^ ^ on r i§ nt ) * s calculated in a window of 
size 0.075 x 0.125 (3 x 5 cells of the middle sampling layer) using the energy weighted sum over 
all cells, which depends on the particle impact point inside the cell. 

The first layer, with its very fine granularity in pseudo-rapidity, can be used to detect sub-structures 
within a shower and thus isolated 71° 's and y's can be discriminated against efficiently. For all variables 
computed in the first sampling layer, two cells in (j) are summed. 
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Figure 4: Ratio £'3(core)/£'(core) (see text) for electrons in the H — > bb sample (hatched histogram) and 
for charged pions in the H — ► uu sample (solid line). The distributions are normalized to unit area. 




Figure 5: Lateral shower shape R-q (left) and lateral width (0^2 (right) in the second layer of the elec- 
tromagnetic calorimeter (see text for details). The distributions are shown for electrons in the H — ► bb 
sample (hatched histograms) and for charged pions in the H — >• uu sample (solid lines). The distributions 
are normalized to unit area. 



The ratio £i(core) /is(core), defined above, is larger for electrons than for hadrons as can be seen in 
Fig 6. 

The lateral shower shape in the strips is now exploited. 

- The total shower width in strips co stot is determined in a window AT7 = 0.0625 (corresponding to 20 
strips in the barrel for instance). It is calculated from: co stot = \J x (/ — i max ) 2 /L-Ei, where i 
is the strip number and i max the strip number of the first local maximum. This width is shown for 
electrons and pions in Fig. 7. 

- The shower width using three strips around the one with the maximal energy deposit is shown in 
Fig. 7 on the right. It is given by the following formula: 

ft) 4 3 = yYJZi x (i — i max ) 2 fLEj, where i is the number of the strip and i max the strip number of the 
most energetic one. 

The pion and jet rejection can be significantly improved by ensuring consistency between the elec- 
tromagnetic calorimeter and the inner detector information. 
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Figure 6: Ratio £'i(core)/£'(core) (see text) for electrons in the H — > bb sample (hatched histogram) and 
for pions in the H — ► uu sample (solid line). The distributions are normalized to unit area. 




Figure 7: Total shower width co stot (left) and shower width in three strips a s j (right) in the first layer 
of the electromagnetic calorimeter. The distributions are shown for electrons in the H —> bb sample 
(hatched histograms) and pions in the H — »■ uu sample (solid lines). The distributions are normalized to 
unit area. 



First, the angular matching between the track and the electromagnetic cluster is checked (cf. Fig. 8): 

i=; m +7 i=i m +7 

|At]| = £ £,-x (i — iffi) I £ Ei, which gives the difference between the track and the shower 

!=/ m -7 !=i'm-7 

positions measured in units of distance between the strips, where i m is the impact cell for the track 
reconstructed in the inner detector and E\ is the energy reconstructed in the i-th cell in the X] direction, at 
constant 0 given by the track parameters. 

Subsequently, the energy E measured in the electromagnetic calorimeter is compared to the momen- 
tum p measured in the inner detector (cf. Fig. 9 on the left). In the case of an electron, the momentum 
should match the energy. The large tails at high values of the ratio in the signal distribution are due to 
soft bremsstrahlung. 

A further reduction of the charged hadron contamination is obtained by rejecting tracks having a low 
fraction of high-energy hits in the TRT Figure 9 (right) shows the ratio A^HTR/A^straw between the number 
of high threshold hits Ahtr and the total number of TRT hits Af stra w 
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|#Sum Ar|| (1 st sampling) 

Figure 8: Angular matching between charged tracks extrapolated to the electromagnetic calorimeter 
and electromagnetic clusters in pseudo-rapidity (|At]|). The distributions are shown for electrons in the 
H —* bb sample (hatched histogram) and for pions in the H — » uu sample (solid line). The distributions 
are normalized to unit area. 




Figure 9: Ratio E/p between the energy of the electromagnetic clusters and the momentum of recon- 
structed charged tracks (left) and fraction A^HTR/A^straw of high-energy hits in the TRT (right). The distri- 
butions are shown for electrons in the H — > bb sample (hatched histograms) and for pions in the H — ► uu 
sample (solid lines). The distributions are normalized to unit area. 



4.3 Rejection of electrons from y-conversions and Dalitz decays 

A significant source of low-pr electron tracks in jets are photon conversions and Dalitz decays. Such 
tracks might be identified by the algorithm as signal electron tracks. The good quality track cuts help in 
part to suppress that type of background. In case of y-conversions and Dalitz decays, e + e~ pairs with 
small invariant mass are observed in the detector. To find them the conversion finding algorithms can 
be used. Ref. [8] details the last developments on the reconstruction of such electrons. Unfortunately, 
these were not available for the electron reconstruction algorithms at the time of this study. As detailed in 
Section 4.1, about 9% of signal electrons are mis-identified as conversions. Based on previous studies [9], 
for an electron efficiency e e = 80%, the rejection factor of electrons from conversions and Dalitz decays 
is ~ 3 with the conversion package and ~ 2 without. 
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5 ^-tagging algorithm 

The variables described in the previous section have been shown to be efficient in distinguishing electron 
tracks from non-electron tracks or electron tracks from /-conversions and Dalitz decays. 

In order to construct a ^-tagging algorithm we combine these variables with additional variables to 
exploit the specific features of b-jets. First we take advantage of the fact that the electron is coming 
from a b quark and thus can have a significant transverse impact parameter do- Figure 10 shows the 
corresponding distributions. In addition, because of the B hadron's high mass, electrons from direct 




d 0 (mm) 



Figure 10: Transverse impact parameter for electrons in the H — > bb sample (hatched histogram) and for 
pions in the H — > uu sample (solid line). The distributions are normalized to unit area. 

bottom decays tend to be more boosted in a plane transverse to the jet axis, i.e. have a larger p r j l than 
the backgrounds, as shown in Fig. 11. p™ 1 is defined as the electron momentum in the plane orthogonal 
to the jet axis. 




Figure 1 1 : Distribution of the track transverse momentum p r j l relative to the jet axis for signal electron 
tracks in jets (hatched histogram) and for pion tracks in light jets (solid line). 

The combination of all variables is performed in two steps; first for variables independent of the jet 
and second including a jet dependent variable as explained in Section 5.1. Finally the performance of the 
algorithm will be detailed in Section 5.2. 
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5.1 b-tagging procedure 

A likelihood-ratio method is used to combine the variables described in the previous section along with 
the electron track impact parameter do- This likelihood ratio uses information only on the characteristics 
of the electron and is independent of the parameters of the jet. The discriminating variables are compared 
to pre-defined normalized distributions (probability density functions) for both the electron and the pion 
hypotheses. We use the signal sample of H — > bb filtered for electrons and part of the H uu background 
samples. The variables show a significant dependence on pseudo-rapidity and a less pronounced one 
on transverse momentum. In rj the changes correspond to varying granularities, lead thickness and 
material in front of the electromagnetic calorimeter. The separation between the distributions obtained 
for electrons and pions can vary also with r\ . Therefore probability density functions are defined in five 
|t]| bins: (0-0.8), (0.8-1.37), (1.37-1.52), (1.52-1.8), (1.8-2). 
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Figure 12: Discriminating function D trac k for electrons in the H — > bb sample (hatched histogram) and 
for pions in the H —>■ uu sample (solid line). The distributions are normalized to unit area. 



For each good quality track the discriminating function Dtrack is calculated as: 

where X; denotes the value of the i'-th variable for a given track, P e (xj) is the probability obtained from 
the single variable x, that the track originates from a signal electron, Pf,(xi) is the probability that the 
track originates from a hadron, and i runs through all the variables used by the algorithm. Distributions 
of Dtrack obtained for signal electrons and background pions are shown in Fig. 12. D trac k tends to 1 for 
electrons and tends to 0 for pions. The identification of a candidate track as originating from a signal 
electron track is based on the value of D trac k. Those tracks for which D trac k is below a given threshold are 
rejected. Typically, for an electron identification efficiency of 80%, a pion rejection factor of about 200 
can be achieved in H —> uu events. 

For each good quality track in the jet the value of the discriminating function for electron identifi- 
cation, D trac k, defined previously in Eq. 1, has been calculated. For each jet, the track with the highest 
value max(D trac k) is chosen and this single track will now be used to estimate the discriminating function 
Djet for the jet. In addition to the variables used for electron identification based on the inner detector 
and electromagnetic calorimeter information, the track transverse momentum p™ 1 relative to the jet axis, 
shown in Fig. 1 1 is included. For each jet the discriminating function Dj et is calculated using the selected 
track as: 

P iv rd \ 

D jet = max(D track ) x — — ' — =- , (2) 
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where P e {p r f) is the probability obtained from the single variable p'j 1 that the track originates from a 
ft-jet and Ph{p r j) is the probability that the track originates from a light jet. Distributions of Dj et obtained 
for b— and light jets are shown in Fig. 13. For a given threshold D* t r , a jet with Dj et > Djj£ is tagged as a 
6-jet. 



ATLAS 
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Figure 13: Discriminating function Dj et for Z?-jets in the H — > bb sample (hatched histogram) and light 
jets in the H — > uu sample (solid line). The distributions are normalized to unit area. 



5.2 Performance of the ^-tagging algorithm 

The ^-tagging efficiency is defined as Eb = N[/Nb where N[ is number of the tagged ^-labelled jets and 
Nb is the total number of jets labelled as &-jets. The definition includes the semi-leptonic branching 
ratios as well as the detector acceptance and the electron reconstruction and identification efficiencies. 
The jet rejection factor is calculated as /?ughtjet = where Nj is the number of light jets and M is 

the number of light jets tagged by mistake as b-jets. 

Figure 14 shows the rejection of light jets as a function of the ^-tagging efficiency £},. The rejection 
factors achieved against light jets are presented in Table 3. For a ^-tagging efficiency £j, = 7% on 



io 2 r 




b-jet efficiency 



Figure 14: Rejection factor of light jets /?ughtjet versus ^-tagging efficiency Sj,. 

inclusive b-jets, the light jet rejection factor is about 110. This operating point corresponds to a 61% 
^-tagging efficiency on semi-electronic b-jets. 
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Table 3: Light jet rejection factors /?u g htjet for various 6-tagging efficiencies on inclusive jets. Errors 
are statistical only. 



s h (%) 


^lightjet 


8 


80±1 


7 


110±2 


6 


160 ±3 



This study has been performed on Monte Carlo simulated data which do not include pile-up effects. 
Based on previous study [10], a further degradation of the light jets rejection factor by 10% (30%) is 
expected when on average 4.6 (23) minimum-bias events per beam-crossing are added. 

Figure 15 shows the light jet rejection as a function of the jet pj and \r\ | for a total ^-tagging effi- 
ciency Eb = l%. Performance are estimated in five bins in |tj | and in seven bins in pj. The rejection of 
light jets does not depend significantly on jet pj over the range 30-80 GeV. For lower pj, jets are wider 
and an electron can escape outside the cone of AR = 0.4 and this leads to lower rejections for the same 
^-tagging efficiency. For higher jet energies, the jet is narrower and showers coming from the particles in 
the jet overlap more and thus it is more difficult to separate the electrons, leading to a drop in rejection. A 
loss in the jet rejection factor can also be observed in the region 1.37 < \r} \ < 1.52 corresponding to the 
crack between the barrel and the end-caps of the electromagnetic calorimeter. Even though soft electrons 
are reconstructed only within |tj | < 2, some sensitivity is retained for jet axes just beyond the cut-off as 
shown in Fig. 15. 




Figure 15: Light jet rejection factor as a function of jet pj (left) and jet |t]| (right) for a ^-tagging 
efficiency Sb = 7%. 

Table 4 shows the fraction of jets tagged of a given type of track for £/, = 7%. Most Z^-jets are tagged 
by true electron tracks; in particular by signal electrons in ~65% of cases. Light jets are tagged in ~25% 
of cases by electrons from y-conversions and Dalitz decays. Light jets are also tagged in ~60% of cases 
by pions. 

6 Conclusion 

Soft electron tagging relies on the semi-leptonic decays of bottom and charm hadrons. It is therefore 
intrinsically limited by the branching ratios to electrons: about 20% of ft-jets will contain a soft electron, 
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Table 4: Fraction of jets (%) in the WH sample tagged by a specified type of track for e& = 7%. The "e 
from * " column corresponds to electrons from conversions and Dalitz decays. Statistical errors on these 
numbers are negligible. 



Jet type 


Fraction of jets tagged by a specified type of track [%] 


alle 


e from b 


e from c 


e from * 


other e 


n 


others 


b 


67.6 


42.0 


19.4 


3.6 


2.6 


23.9 


8.5 


light 


27.2 


0.2 


0.4 


25.5 


1.1 


58.7 


14.1 



including cascade decays of bottom to charm hadrons. However, when a signal electron is present the 
algorithm is quite efficient. In addition, tagging algorithms based on soft leptons have low correlations 
with the track-based ^-tagging algorithms, which is very important for checking and cross-calibrating 
performance with data. 

The study presented here was based on a sample of WH events which have two high pj and well 
separated jets. A &-jet tagging efficiency of about 7% (including the inclusive b —> evX branching 
ratio of ~ 20%) is achieved for a light jet rejection better than 100. The efficiency of the soft electron 
identification is high, since two-thirds of the Z^-jets are tagged by the true electron. Work is ongoing to 
prepare similar analyses with other Monte Carlo simulated samples with different topologies as well as 
to prepare the measurement with real ATLAS data. 
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Abstract 

This note describes various studies of b-]et reconstruction and ^-tagging in 
simulated ATLAS tt events. The performance of several jet algorithms on 
tt event &-jets is studied. Techniques for measuring the ^-tagging efficiency 
directly from data tt events, using both tag counting and explicit b-]et selec- 
tion, are described and compared. Finally, the optimisation of ^-tagging for tt 
events, using both Monte Carlo and data samples, is discussed. Both semilep- 
tonic and dilepton tt decay channels are considered, for an initial integrated 
luminosity of lOOpb -1 . 

1 Introduction 

The identification of &-jets is of crucial importance to many physics analyses at the LHC, including 
top physics and the search for new particles including Higgs bosons. The performance of ^-tagging 
algorithms will have to be understood using real data, as Monte Carlo will only give an approximate 
description of the tracking and vertexing performance of the detector, at least initially. Light jet rejection 
factors can be measured in inclusive jet samples (where the heavy flavour content is small), but pure 
samples of ft-jets are required to measure the ft-jet tagging efficiency. One solution, already widely used 
at the Tevatron, exploits dijet events in which one of the jets is ^-tagged using a soft lepton tag [1]. 

At the LHC, the large tt production cross-section offers an alternative source of £>-jets, in a distinctive 
topology which is relatively easy to trigger on and isolate, providing at least one of the W -bosons from 
the top decays leptonically. Providing the top quark is assumed to have Standard Model properties (in 
particular that Br(f — > Wb) = 1), each tt event has two &-jets, together with at least one high-£y lepton, 
missing energy and additional high-£y jets from hadronically decaying W -bosons. The environment 
(high jet multiplicity, high-£y b-jets) is also much closer to that which ^-tagging efficiency measurements 
are typically needed than in dijet events. 

The ^-tagging efficiency can be extracted from tt events in two ways — either by counting events 
with different numbers of tagged jets, or by reconstructing the tt decay topology in order to identify a 
pure sample of &-jets. Both techniques are explored in this note, which is organised as follows. Sec- 
tion 2 briefly describes the Monte Carlo simulation samples and common event selection which is used 
throughout, and Section 3 discusses reconstruction of &-jets in tt events, as an essential preliminary to 
studies of b-]et tagging. The counting and b-]et selection methods are discussed in Sections 4 and 5. 
Finally, Section 6 discusses the possibilities for optimising ^-tagging performance specifically for tt 
events, using both Monte Carlo and data. 

2 Datasets and event selection 

All studies were performed using ATLAS Monte Carlo production samples as discussed in more detail 
in [2] and [3]. Semileptonic and dileptonic tt decays were generated with MC@NLO with Herwig 
hadronisation. AcerMC with Pythia hadronisation was used as an alternative. The background in the 
semileptonic channel is dominated by W+multijet production, which was simulated with ALPGEN (in- 
cluding Wbb and Wcc), and single top production, simulated with AcerMC. In the dilepton channel (used 
only for the counting method ^-tagging efficiency determination), additional backgrounds from Z+jets 
(simulated with ALPGEN) and diboson production (simulated with Herwig) are also important. Back- 
ground coming from QCD multijet events has not been studied, as this background is difficult to simulate 
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Electron channel 


Muon channel 




Dilepton channel 


tt (except all hadronic) 


45232 


45232 




45232 


Trigger 


10430 


12116 




21417 


Lepton ID 


8350 (80.1 %) 


11268(93.0%) 






Isolation 


8063 (96.6 %) 


9909 (87.9 %) 


Lepton pair selection 


1549 (7.23 %) 


Missing E T > 20 GeV 


6362 (78.9 %) 


7834 (79.1%) 


Missing Ej cut 


1355 (87.5 %) 


>4jets£y >20GeV 


3651 (57.4%) 


4555 (58.1%) 


> 2 jets E T > 20 GeV 


1160 (85.6%) 


>4jets£ r > 30 GeV 


2329 (63.8 %) 


2927 (64.3 %) 


> 2 jets E T > 30 GeV 


1000 (86.2 %) 


W mass window cut 


1958 (84.1 %) 


2487 (85.0%) 






top mass window cut 


1378 (70.4%) 


1773 (71.3%) 







Table 1: Numbers of events from tt production expected with 100pb _1 in the lepton+jets and dilepton 
channels. The cut efficiency w.r.t. the previous line is shown in parenthesis. The fully hadronic decay tt 
contribution is not included but is expected to be small. Contributions from non-ff background are not 
included. 

and samples were not available. The normalisation of such background (if significant) will eventually 
have to be extracted from data. 

Systematic errors have been evaluated following the prescriptions discussed in detail in [2] and [4] 
wherever possible. The sensitivity to incorrect ^-tagging of non-b jets was assessed by doubling and 
setting to zero the corresponding jet tagging efficiencies, separately for light quark and charm jets. The 
jet energy scale was varied by ±5 %. The Monte Carlo event generation sensitivity was assessed by using 
AcerMC instead of MC@NLO, including samples with modified ISR, FSR and partem shower cutoff Q 2 . 
The backgrounds from W+multijet events and single top production were independently doubled and set 
to zero. In all cases, the effect of these variations was computed without adjusting parameters of the 
analysis {e.g. acceptance factors and background estimates) to compensate for changes made to the input 
Monte Carlo samples. 

The analyses make use of isolated high Ej electrons and muons (with Ej > 20 GeV) and jets (with 
Ej > 20 GeV or higher depending on the particular analysis), selected according to standard ATLAS 
object definitions and fiducial cuts [2]. In the semileptonic channel, events are required to have exactly 
one high E T lepton, at least four jets and missing transverse energy of at least 20 GeV. In the dilepton 
channel, two leptons of opposite charge and a missing transverse energy of at least 20 GeV are required; 
in the e + e~ and jJ. + jJ.^ channels, the Z+jets background was further reduced by vetoing events with 
dilepton mass between 81 and 101 GeV and by requiring a missing transverse energy of at least 35 GeV. 
In both cases, at least one lepton in the event was required to pass the ATLAS level- 1, level-2 and event 
filter triggers, using trigger signatures appropriate for the 20 GeV offline lepton Ej cuts. For electrons, 
the signatures e25i and e60 were used (the e60 trigger improves the efficiency at high lepton Ej), and for 
muons, the signature mu20i was used. Further studies of the ATLAS trigger performance on tt events 
can be found in [5]. 

The resulting estimated event yields at various stages of the event selection are shown in Table 1 for 
100pb _1 . Non-« backgrounds are not included. The individual analyses apply further specific selec- 
tion cuts as discussed below. The effects of the tighter selections used only by the tag counting analysis 
described in Section 4 are also shown in Table 1. The hadronic W mass window cut was applied by se- 
lecting the pair of jets with invariant mass closest to the W mass and requiring this to be within ±20 GeV 
of the nominal value; a top mass cut was applied similarly by adding the jet giving a reconstructed top 
mass closest to the nominal, and requiring it to be within ±30 GeV of the top mass. 

All studies were performed with the default ATLAS ^-tagging algorithm (IP3D+SV1), which uses 



505 



b- Tagging - 6-Tagging Calibration with tt Events 



a likelihood weight w constructed from the results of the IP3D impact parameter and SV1 secondary 
vertex-based taggers [6]. This likelihood weight is large for &-jets and small for light and oquark jets. 
For jets from top decay with Ej > 20 GeV, a cut w > 4 gives a b-jet tagging efficiency of around 70 % 
per jet and a light jet rejection of 50 while a w > 7 cut gives an efficiency of 60 % and a rejection of 200, 
though these values are somewhat dependent on jet Ej and r\. Detailed studies of the expected ^-tagging 
performance can be found in [6] (e.g. in Table 1 therein). For the purposes of comparison, ^-tagging 
efficiency measurements have been done at reference efficiencies of 50, 60 and 70 %. Various other cut 
values are used internally by the selections in the individual analysis methods. When considering Monte 
Carlo truth information, jets have been labelled b, c and light using the standard ATLAS jet matching 
and labelling procedures [6]. 

3 b jet reconstruction in tt events 

The choice of jet reconstruction algorithm is an important issue for top physics analysis in ATLAS, where 
good resolution for the reconstruction of the original quark energy and direction is mandatory. These 
questions are explored below, using various jet algorithms and parameter choices. The analysis uses a 
sample of 250k semileptonic tt events generated with MC@NLO and passed through the standard top 
selection as discussed in Section 2. The choice of jet algorithm also affects the ^-tagging performance, 
as studied in detail elsewhere [6]. 

The standard reconstruction outputs jet collections made with the Cone (with AR = 0.4 and AR = 0.7) 
and k T (with D = 0.4 and D = 0.6) jet algorithms, each based on either Precluster Projective Towers 
('Tower') or Topological Cell Clusters ('Topo') (see [7] for more information on jet reconstruction in 
ATLAS). For this study, these standard jet reconstruction algorithms and parameter choices have been 
complemented by additional Cone and kj jets with a larger variety of AR and D parameters. To compare 
all these jet algorithm and parameter choices in a controlled environment, re-reconstruction of jets was 
performed at the AOD level, using 'Topological Cell Clusters' calibrated by 'Local Hadron Calibration' 
(see [8] and [9] for details about the different strategies used to exploit and calibrate calorimeter infor- 
mation). All parameters except for the D parameter (kj) or AR (Cone) were left unchanged with respect 
to the standard jet algorithms. 

It should be noted that these re-reconstructed jets do not contain the calorimeter energy corrections 
applied in the standard calibration procedure described in [7] and [8]. Therefore a comparison of the 
overall performance between the different jet algorithms with different parameters is valid, but no direct 
comparison between corrected and non-corrected jets should be made. 

3.1 Jet energy and angle resolution 

Reconstructed jets are matched to quarks from the tt decay following the standard procedure [6]. The 
results are shown in Fig. 1. 

The top plots of Fig. 1 show the width of the distributions (E quar k — Ej et ) /E quar k (i.e. the energy 
resolution) as a function of E quar k, for both kj and Cone. To calculate the width, a Gaussian is fitted 
to the individual distributions. The distributions show the expected behaviour. The smallest width is 
obtained for the jet sizes 0.4, 0.5 and 0.6, for both Cone and kj- However, the differences between the 
different jet algorithms are small, so no strong conclusions can be drawn from them. 

The two lower plots of Fig. 1 show an investigation of the angular resolution. The mean distance 
between the initial quark and the resulting jet in AR is plotted as function of E quar i for Cone and kj. The 
expected behaviour is that for bigger jet sizes the deviations between the jet axis and the flight direction 
of the initial quark should increase; this behaviour is observed in Fig. 1. It can also be seen that for larger 
jets and higher quark energies, the k T jet algorithm performs better than Cone. 
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The choice of the Cone algorithm with AR = 0.4 that was driven by the need to reconstruct properly 
W hadronic decays [10] has been found to be an appropriate choice for the tt signal. Background studies 
that will be performed in the near future will complete the picture. 

4 ^-tagging efficiency measurement via tag counting 

Conceptually, in the absence of background, the simplest way to determine the ^-tagging efficiency in 
tt events is to count the number of events with different numbers of ^-tagged jets; this allows both the 
^-tagging efficiency and the tt production cross-section to be measured simultaneously. This method is 
discussed in detail below, with an emphasis on the b- (and c-) tagging efficiency measurements. More 
information on tt cross-section measurement can be found elsewhere [4]. 

In the following analysis, the event selection has been slightly tightened from that used elsewhere, in 
order to reduce the background. Jets are required to have Ej > 30 GeV. In the lepton+jets channel, the 
single top background is significantly reduced by applying a cut on the W and top reconstructed masses, 
as described in Section 2. 

4.1 Method 

Top pair-production events give rise to two b-jets in the final state. Assuming for pedagogical purposes 
that every selected event contains two fe-jets in the detector acceptance and that only ft-jets can be tagged, 



507 



b- Tagging - 6-Tagging Calibration with tt Events 



then it is clear that the expected number of events with one ^-tagged jet is proportional to N2£t,{ \ — Eb) 
while the expected number of events with two ^-tagged jets is proportional to Ne^, where: 

• £b is the ^-tagging efficiency, i.e. the probability to tag a b-]et 

• N is the number of selected tt events prior to any ^-tagging requirement, which is proportional to 
the tt production cross-section. 

It can be seen that it is possible to estimate both the ^-tagging efficiency and the tt production cross- 
section from the observed number of events with one and two ^-tagged jets. 

In reality, c-jets and light jets (either from the hadronic W decay or from ISR/FSR) are present 
and contribute to the number of tagged jets in the event. Moreover not all £>-jets coming from the top 
decays end up being selected, whilst a small number of b-jets are produced through gluon radiation. To 
take these effects into account, the event flavour content is estimated from Monte Carlo with a large 
simulation tt sample. The factors F^k are defined as the fractions of selected events (prior to any b- 
tagging requirement) with i b-jets, j c-jets, and k light jets (i,j,k = 0, 1,2,3, ...). The expected number 
of events with n tagged jets <N n > can be written as the sum over all possible combinations of i b-jets, 
j c-jets, and k light jets, as a function of b-, c-, and light jet tagging efficiency: 

<N n >= (L- o tr A pre _ tag ) • £ F ijk £ A\ -el- (I- £b)'~'' -A f j ■ 4 • (1 - e c y- f -At ■ ef ■ (1 - eif- k ' 

i,j,k i'+j'+k'=n 

(1) 

where A • is the number of arrangements i !/(/' !•(/ — /') !), the prime subscript corresponding to the number 
of tagged jets of a given flavour; o t j is the total production cross-section; A pre - tag is the acceptance for 
tt events prior to any ^-tagging requirement (including trigger efficiency, lepton reconstruction and ID 
efficiency, etc.) and L is the integrated luminosity. The assumption that there is no correlation between 
tags in a given event is discussed later and is treated as a systematic uncertainty. 
Finally, the following likelihood can be written: 

L = U{Poisson (N n ,<N n >))) (2) 

where N n is the observed number of events with n tags. In practice only events with one, two, or three 
tags in the lepton+jets channel and one or two tags in the dilepton channel are taken into account in 
the likelihood. Events with no tag suffer from significant background, whilst there are few events with 
more than three (two in the dilepton channel) tags. In the lepton+jets channel, both b- and c-tagging 
efficiencies are allowed to fluctuate in the fit together with the tt cross-section (hence three variables and 
three constraints); the light jet tagging efficiency is fixed in the fit and must be measured elsewhere. In 
the dilepton channel, the c-jet tagging efficiency is also fixed (hence two variables and two constraints). 

Tables 2 and 3 show the F ijk values for the lepton+jets and dilepton channels, estimated from a sample 
of 265k tt events. It is interesting to note that the 'nominal' combination (two b-jets and two light jets 
for the lepton+jets channel; two b-jets for the dilepton channel) represents only about one third of the 
events, due to the presence of ISR/FSR light jets and c-jets from W decays. The significant presence of 
c-jets in the lepton+jets channels allows the c-jet tagging efficiency to also be measured. 

Figures 2 and 3 show the expected yield of events in the lepton+jets and dilepton channels for an 
integrated luminosity of lOOpb -1 , as a function of the number of tagged jets. A b-jet is considered 
tagged if its ^-tagging weight w is greater than 7. 

The method was applied to a sample of tt events corresponding to about 400 pb _1 of data and 
independent of those used to estimate the acceptance factors. Table 4 compares the measured efficiencies 
and cross-section to their true value for three different ^-tagging purity levels. No bias is visible within 
the available statistics. The small deviation for the cross-section measurement in the dilepton channel is 
explained by the lack of statistics in the sample used to estimate the F^ factors; this issue is discussed 
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Number of light jets 



any 



2 b-jets, 1 c-jet 
2 b-jets, 0 c-jet 
1 b-jet, 1 c-jet 
1 b-jet, 0 c-jet 
0 b-jet, 1 c-jet 
0 b-jet, 0 c-jet 



15.9% 10.6% 
24.4 % 
6.46 % 



3.00% 0.85% 
13.8% 3.74% 



2.38 % 
7.60% 
0.38 % 



30.5 % 
42.8 % 
9.75 % 
11.6% 



0.74 % 
3.21 % 
0.11% 0.49 
0.46 % 0.65 



Table 2: Fractions of selected events with a certain number of b, c and light jets in the lepton+jets channel 
for the counting method. Only the dominant contributions are shown. 



Number of light jets 



2 b-jets, 1 c-jet 
2 b-jets, 0 c-jet 
1 b-jet, 1 c-jet 
1 b-jet, 0 c-jet 
0 b-jet, 1 c-jet 
0 b-jet, 0 c-jet 



1 



1.02% 
35.2 % 



0.73 % 
20.9 % 
1.08 % 
18.9% 
0.29 % 



0.27 % 
6.64 % 
0.46 % 
6.26 % 
0.11% 
1.11% 



<0.1% 
1.39% 
0.22 % 
1.66% 
<0.1% 
0.31 % 



any 



2.08 % 
64.3 % 
3.54 % 
27.2 % 
0.40 % 
1.50% 



Table 3: Fractions of selected events with a certain number of b, c and light jets in the dilepton channel 
for the counting method. Only the dominant contributions are shown. 

later together with other systematic uncertainties. With 100pb~\ a statistical precision of 2.7% (4.2%) 
can be achieved on the ^-tagging efficiency, and 2.4% (4.8%) on the tt cross-section, respectively in 
the lepton+jets channel and dilepton channel. In the lepton+jets channel, the statistical uncertainty on 
the c-tagging efficiency is rather large, because this measurement is mostly determined by the number of 
events with three tagged jets. 

4.2 Backgrounds 

In the lepton+jets channel, the main backgrounds are W+jets and single top. Other sources of background 
are Z+jets (where one lepton fails to be identified), WW/WZ/ZZ+jets, and QCD processes (which has 









e b (%) 




S c (%) 


On (pb) 






true 


meas. 


true 


meas. 






w > 4 


72.1 


71.7±0.7 


22.3 


21.9±1.5 


841 ±9 


Lepton+jets 


w > 1 


60.4 


59.8±0.8 


12.8 


13.8±1.3 


844±10 




w> 10 


48.1 


47.4±0.9 


6.7 


8.2±1.4 


832±13 




w>4 


72.9 


72.9±1.0 






882±17 


Dilepton 


w>7 


61.1 


60.5±1.2 






883±19 




w> 10 


48.4 


47.9±1.3 






883±25 



Table 4: Counting method: tagging effciencies and cross-section measured on a control sample and 
compared to their true value for three levels of ^-tagging purity, for the lepton+jets and dilepton channels. 
The tt production cross-section was assumed to be 833 pb. The uncertainties are statistical only and 
correspond to about 400 pb -1 of data. 
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■- s-channel 
t-channel 
Diboson+jet: 



Figure 2: Yield expected for an integrated luminosity of 100 pb in the lepton+jets channel as a func- 
tion of the number of tagged jets. The expected background from IV/Z+jets, single top, and diboson 
production is also shown. 



Figure 3: Yield expected for an integrated luminosity of 100 pb in the dilepton+jets channels (left: 
eel\i\l\ right: eji) as a function of the number of tagged jets. The expected background from Z+jets and 
single top is also shown. 



not been studied, as discussed in Section 2). Figure 2 shows the total background due to W7Z+jets, single 
top, and diboson+jets production and Fig. 4 (left) shows the expected signal over background ratio as a 
function of the number of tagged jets. Events with one, or more than one, tagged jets have good purity, 
with signal over background ratios of 14.4 and 26.8 respectively, but the background does need to be 
taken into account. The estimated background is subtracted from each sub-sample. 

In the dilepton+jets channel, the main source of background is Z+jets production (but in the ejx 
channel, only Z — ► tt — > ejJ,vv is significant). WW fW Z/ZZ+jets and single top also contribute. Figure 3 
shows the total background due to Z+jets and diboson+jets production. The purity of the dilepton+jets 
sample (especially of the e\l channel) is good enough for the background estimate not to be an issue. 
Figure 4 shows that the expected signal over background ratio for events with exactly one tagged jet 
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Channel 




Lepton+Jets 




Dilepton 


Source 


£b 


£ c 


Off 




Off 


Light & T jets 


<0.1 


18 


< 0.1 


0.7 


0.3 


c-jets 








0.8 


0.8 


ft-jet labelling 


1.4 


12 


0.1 


1.4 


0.1 


tag correlation 


<0.2 


<0.2 


<0.2 


<0.2 


<0.2 


Jet energy scale 


0.9 


2.9 




0.5 


+1.3 

-3 


b-jet energy scale 


<0.1 


1.5 


0 9 8 9 


0.2 


<0.1 


(MC statistics 


0.5 


7 


0.5 


3 


3) 


Background 


1.2 


3.5 


4.8 


0.3 


0.4 


AcerMC vs MC@NLO 


<0.1 


9 


(4.9) 


2 


6 


ISR/FSR 


2.7 


12.5 


8.9 


2 


(4) 


top quark mass 


0.3 




2.2 


0.5 


2 


Luminosity 






5 




5 


Lepton ID/trigger/pdf 's 






2.8 




2.8 


Total 


3.4 


27 


™±2.8±5 


3.5 


^±2.8 ±5 


Statistical (lOOpbr 1 ) 


2.7 


18 


2.4 


4.2 


4.8 



Table 5: Relative systematic and statistical uncertainties (in percent) for the lepton+jets channel for a 
^-tagging efficiency of £ true = 0.6 and jet Ej > 30GeV. 



is 80 in the ee and channels and 175 in the ejx channel; the background is completely negligible for 
events with two or more tagged jets. It is remarkable that the e\l 0-tagged jet sub-sample is also quite 
pure and could be used in the fit to improve its statistical power, provided a reliable background estimate 
is available. 



S : ATLAS 



Figure 4: Signal over background ratio vs the number of tagged jets in the lepton+jets channel (left) and 
in the dilepton eeljXjX and e}X dilepton+jets channels (right). 



4.3 Systematic uncertainties 

Table 5 summarizes the resulting systematic uncertainties for the counting method in the lepton+jets and 
the dilepton channels. Some effects specific to this method are discussed in detail below. 

The acceptance factors depend upon the definition of jet flavour, which is arbitrary to some ex- 
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tent. By default, reconstructed jets are matched to the closest heavy quark (after FSR) in AR = y 'r\ 2 + (j) 2 ; 
the jet is attributed the quark flavour if AR < 0.3 (in case of ambiguity, b quarks have priority over c 
quarks). Jets that are not associated to any b or c quark are considered light jets. Another possibility is 
to match jets with hadrons, following the same procedure. It was checked that the choice of matching 
hadrons or quarks has a negligible effect on the definition of jet flavour. To assess the systematic uncer- 
tainty, the cut was shifted from 0.2 to 0.5 (nominal cut: 0.3) and the F^i factors were re-estimated for 
each value. The shift observed on the same pseudo-experiments was taken as systematic effect. 

The counting method assumes that there is no correlation between &-tags within an event, while 
some detector effects or the reconstructed primary vertex might induce such a correlation. To check 
this assumption, events with two reconstructed Z^-jets in the detector acceptance were selected. The 
covariance between the tagging of the two b-jets is: 

N 2b-tags 

cov(tag u tag 2 ) = — e h 

(where N 2b ~ tags is the number of events with two tagged b-jets, N is the total number of events, and St is 
the true ^-tagging efficiency). It was found that cov(tagi,tag2) = (8.8 ±27) • 10~ 4 , which is consistent 
with 0; the statistical uncertainty corresponds to a bias on the measured efficiency of 0.2 %, which is 
conservatively taken as systematic uncertainty. 

Backgrounds are subtracted in each sub-sample and a 100% uncertainty is assumed. In the lep- 
ton+jets sample, for 100pb _1 , 106 events from W/Z+jets are expected to populate the lepton+jets 1-tag 
sub-sample and 34.6 the 2-tag sub-sample. In the dilepton channel, 4.4 events with one tagged jet are 
expected for all three channels (mostly in the ee and }X}X channels), resulting in a small uncertainty. 

A 5% uncertainty on the jet energy scale, a 1% additional uncertainty on the 6-jet energy scale, and 
a 2 GeV uncertainty on the top mass, are also taken into account. 

The statistical uncertainty on the F^ factors was estimated by comparing the results given by two 
different training samples. It is not negligible in the dilepton+jets channel (3 %). However it could easily 
be reduced by employing a larger tt sample (1M events would suffice); thus it was not included in the 
total uncertainty. 

The cross-section measurement is very sensitive to the jet multiplicity, which is driven mostly by 
ISR and FSR. This uncertainty appears also in the comparison of the two MC generators since the 
hadronisation models (Herwig in one case, Pythia in the other) need to be tuned in data and display 
important differences. In order to avoid to count twice the uncertainty on jet multiplicity, only the largest 
of the two estimates (ISR/FSR and MC generators) was considered in the total systematic uncertainty. 

Systematic uncertainties on o t j that are not specific to this analysis are discussed elsewhere [4]. 
Uncertainties relative to integrated luminosity and event selection prior to ^-tagging (namely, lepton 
identification efficiency, trigger efficiency and pdf's) amount to 5 % and 2.8 %, respectively. 

4.4 Tag counting results 

With 100pb _1 of data, the counting method allows the ^-tagging efficiency at a working point of £tme = 
0.6 to be measured with a relative precision of ±2.7(stat.)±3.4(syst.) % in the lepton+jets channel, and 
±4.2(stat.)±3.5(syst.) % in the dilepton channel. A better understanding of ISR/FSR could significantly 
reduce the systematic uncertainty. 

The uncertainty on the tt production cross-section is somewhat larger in both channels, with 
±2.4(stat.)l}^(syst.)±5(lum.) % in the lepton+jets channel, and ±4.8(stat.)+7;7(syst.)±5(lum.) % in the 
dilepton channel, mostly because of the uncertainties on the jet multiplicity (ISR/FSR), jet energy scale, 
and background estimate, uncertainties that can be improved in the long term and have been assessed 
conservatively here. 
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The lepton+jets channel also allows the c-tagging efficiency to be measured. Unfortunately it is very 
sensitive to light jet contamination, and the uncertainty on light jet rejection and jet multiplicity makes 
any precision measurement impossible. Isolating a purer c-jet sample through some kinematic selection 
might help but has not been attempted here. 

5 Selecting b jet samples in data 

Going beyond the ^-tagging efficiency measurement from the counting method discussed above, the 
kinematics and topology of semileptonic tt events also allow pure samples of £>-jets to be identified 
directly. Providing this selection is done without biasing any properties of the selected Z^-jets, they can 
then be used to measure the ^-tagging efficiency, simply by studying the distribution of the tagging 
discriminant variable(s) in the selected jets. With enough statistics, this technique should also allow the 
measurement of the tagging efficiency as a function of other variables (e.g. jet E T and 77). 

Obtaining a pure sample of identified b-jets requires full reconstruction of the tt decay chain, as- 
signing four jets in the event to the two ft-jets coming directly from the t — > Wb decays and the jets 
from hadronic W —>■ qq decay. For any given event, many different assignments are possible, especially 
when spectator jets (not from the tt decay) are also present. The correct combination can be determined 
by making use of kinematic information, including the reconstruction of the hadronic and leptonic top 
masses for each considered combination, and the classification can be further improved by requiring a 
&-tag for one of the jets assigned to the t — > Wb decay, providing the other presumed b-]et is left unbiased. 

Three similar techniques have been explored for making this jet identification, all of which consider 
each combinatorial jet assignment and try to determine which one is correct: 

• A 'topological' selection based on the reconstructed masses for each combination. 

• A 'likelihood' selection, exploiting reconstructed top mass, jet momentum and angular informa- 
tion. 

• A 'kinematic' selection based on applying a kinematic fit to each combination and choosing the 
one with the best % 2 value. 

All of these methods start from the selection of semileptonic tt events discussed in Section 2, i.e. events 
with a high-£V lepton, significant missing energy and at least four jets. None of the methods result 
in a sample which is 100 % pure in ^-flavoured jets, so background subtraction techniques are needed 
to derive pure samples on a statistical basis. The three methods, their associated background subtrac- 
tion techniques, and their application to the measurement of the ^-tagging efficiency for the standard 
IP3D+SV1 tagger, are discussed in Sections 5.1, 5.2 and 5.3 below. Comparisons of their performance 
in terms of ft-jet sample selection and ^-tagging efficiency measurement are then given in Sections 5.4 
and 5.5. 

5.1 Topological selection 

The topological selection of &-jets from the basic sample has two stages: reconstruction of tt pairs, 
followed by fitting the top mass distributions to extract the b-]et sample and estimate the remaining back- 
ground. To improve the purity, an explicit b-t&g is required on the jet from the hadronic top decay, whilst 
the £>-jet from the leptonic top is left unbiased, and is used to form the sample of &-jets for measuring the 
tagging efficiency. 

An attempt is made to reconstruct both hadronic and leptonic top decays in each event. First, the 
invariant mass of every pair of jets is calculated to look for combinations consistent with coming from 
the decay W — »■ jj. Combinations satisfying 60 < nijj < 100 GeV are retained, and combined with a third 
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jet to form a candidate t —> bW — »■ bjj decay. One jet from the W decay and the candidate b-jet must 
have jet Ej > 40GeV, and the second W decay jet must have Ej > 20GeV. To reduce background, this 
candidate &-jet is required to have a ^-tagging weight of w > 3, corresponding to an efficiency of about 
74 % for genuine b-jets from top decay Additionally, the two W jets are both required to have ^-tagging 
weights of w < 3, to reduce the probability that the b-jet from the top decay is included in the W. No 
mass requirements are placed on the hadronic top candidate at this stage. 

A lep tonic top is then reconstructed from one of the other jets and a reconstructed leptonic W decay. 
The leptonic W is formed from the identified lepton and a neutrino, whose transverse momentum (p x , p y ) 
is taken to be equal to the missing transverse momentum vector of the event. The longitudinal momentum 
of the neutrino is inferred using the constraint of the known W mass, which leads to a quadratic equation 
for p z which has either two or no solutions. In the case of two solutions, the one with the smaller p z is 
chosen. If there is no solution, the measured missing E T is scaled down, preserving its direction, until 
a solution is possible. The reconstructed W is then combined with another jet which was not used in 
making the hadronic W, which is assumed to be the b-jet from the leptonic top decay. This jet is required 
to have Ej > 20 GeV. No requirements are placed on the ^-tagging weight for this jet, which is instead 
enhanced in ^-flavour due to the topological reconstruction of the it event. 

This procedure is performed for all possible combinations of jets in each event. In an event with four 
jets, several assignments of jets to the hadronic top b-]et, leptonic top b-]et and W decay jets are possible, 
constrained by the requirements made on the ^-tagging weights for the jets assigned to the hadronic 
top decay. In events with more than four jets, some jets will be left unassigned, and are assumed to 
be spectator jets from initial or final state radiation or the underlying event. As the number of jets in 
the event increases, the number of combinations satisfying all the requirements increases very rapidly, 
from typically one or two combinations in events with four jets to an average of nine combinations for 
events with six or more jets. The combination with the largest scalar sum of the pj of the two tops is 
retained for further analysis. In 100pb _1 of fully-simulated it events plus W+jets background, 3028 
events are selected, of which 80 % are semileptonic it events with t — >• bev or b\xv, with the background 
being dominated by it with b —>■ bzv (11 %) and W+jets (6 %), together with smaller contributions from 
dilepton and single top events. 

5.1.1 Mass fitting 

After all these selections, only one assignment of jets and leptons to the hadronic and leptonic top de- 
cays remains. The resulting distributions of reconstructed hadronic and leptonic top masses are shown 
in Fig. 5. Clear top mass peaks are seen in both distributions, though with significant combinatorial 
background and a small contribution from W+jet and single top background events. Contributions from 
non-f? events (denoted class 1), events where both tops are incorrectly reconstructed (class 2), events 
where one top (hadronic or leptonic) is correctly reconstructed and the other is not (classes 3 and 4), and 
events where both tops are correctly reconstructed (class 5) are shown separately. For these purposes, a 
top decay is considered correctly reconstructed if the associated b-jet is actually labelled as a b-jet by the 
Monte Carlo truth analysis, and the direction of the reconstructed top decay is within AR of 0.5 (2.0) for 
the hadronically (leptonically) decaying top. The hadronic top mass distribution in Fig. 5 shows peaks 
for classes 3 and 5, and the leptonic top mass shows peaks for classes 4 and 5, though with a consider- 
ably worse mass resolution. Both distributions also show some 'peaking' structure in the combinatorial 
backgrounds. 

The analysis aims to extract a pure sample of b-jets from the reconstructed leptonic top decay. The 
leptonic top is chosen since it does not suffer from combinatorial background within the top decay 
itself — by contrast, in the hadronic top decay with three jets combined to give the top mass, a mis- 
assignment of jets to W decay and b will give the same top mass, with only the reconstructed hadronic 
W mass changing. Since the resolution on the latter quantity is relatively poor, many reconstructed b j 
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Hadronic top mass (GeV) Leptonic top mass (GeV) 



Figure 5: Reconstructed hadronic (left) and leptonic (right) top masses for the selected jet combi- 
nation, showing the contributions from correctly reconstructed tt events, combinatorial and non-tt 
background, normalised to lOOpb -1 . The numbers refer to the classes discussed in the text. 



combinations are compatible with the W mass, and the W mass requirement gives little discrimination. 
In the leptonic top decay, the W has no decay jets, and this problem does not arise. 

The resolution on the reconstructed leptonic top mass is poor, and there is a large combinato- 
rial background. This is reduced by first requiring that the hadronic top mass is in the signal region 
(140 < mjjj < 190 GeV), and then dividing the sample into five sub-samples according to the recon- 
structed Ej of the leptonic top &-jet, as the level of background depends strongly on this jet energy. The 
resulting leptonic top mass distributions can be seen in Fig. 6, where it can be seen that the top signal 
is significantly enhanced, especially for high leptonic top jet Ej. Events from classes 4 and 5 have cor- 
rectly reconstructed leptonic tops, where the leptonic top jet is pure b flavour, whilst the other events are 
combinatorial background, with a mixture of jet flavours (they can include ft-jets from the hadronic top, 
and jets which are not unambiguously from a single partem, containing some particles from the decay of 
B hadrons). 

In order to correct for this background of event classes 1-3 under the leptonic top mass peak, both its 
size and flavour composition must be determined. This is done on a statistical basis using the sideband 
region with high mu\ to normalise the background contribution, and a control sample to determine the 
shape of its mass distribution. The control sample is generated from events where the reconstructed 
hadronic top mass satisfies 200 < /wyj < 400 GeV, and where the leptonic top &-jet satisfies a cut on the 
^-tagging weight of w < 4. These samples are dominated by events in classes 2 and 3, and provide a 
reasonable model of the background shapes in the signal sample. 

The amount of background under the signal is then extracted using a simultaneous fit to both signal 
and control sample leptonic top mass distributions. The control sample is described using a fit function 
Fb{m Mv ) given by: 

F b (m Mv ) = E((m Mv - c 0 )/400) , (3) 
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Figure 6: Reconstructed leptonic top mass distributions for different regions of the leptonic top 
6-jet E T , with hadronic top mass in the range 140 < mjjj < 190 GeV. The points with error bars 
show all events, with background contributions indicated by the hatched histograms. The error 
bars show the full Monte Carlo statistics (948 pb -1 ) whilst the event counts indicate the number 
of events expected for 100pb _1 . The lines show fits to the signal and estimated background, and 
are discussed further in the text. 
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where E(x) is given by: 

FM _ / c i xCl exp(-c 3 x) if x > 0 

0 ifx<0 • (4) 

The signal is described by a fit function F s (mt>£ V ) consisting of a scaled background function plus a 
Gaussian representing the signal, given by: 

F s {m Mv ) = c 4 F b (m Mv ) + c 5 G((m Mv -c 6 )/c 7 ) . (5) 

The parameter ca, represents the ratio of class 1-3 background contributions in signal to control samples, 
and c 5 the amount of top signal with fitted mean c 6 and RMS width c 7 . All eight parameters are extracted 
in a simultaneous % 2 fit to the signal and control sample mass distributions, thus determining the shape of 
the background under the signal peak from the control sample, and the normalisation of the background 
from the sideband of the signal distribution at high muv- Separate fits are performed in each leptonic 
top jet Ej bin, and the results are shown in Fig. 6. The fits are performed over the mass range b\ and 
b 2 , where b x is set to 90, 90, 110, 120, and 130 GeV for the 20^10, 40-80, 80-120, 120-160 and 160- 
200 GeV jet Ej bins, reflecting the differing kinematic cut-off in each jet mass bin. 

5.1.2 Background subtraction 

The sample of leptonic top &-jets is divided using the reconstructed leptonic top mass into a signal region 
defined by s\ < mu\ < $2 where s\ = c& — 2cj and s 2 = C6 + 2c 7 (i.e. muy within ±2a of the fitted top 
mass peak position) and a sideband region with mu\ outside this range. The distribution of the ^-tagging 
weight in the signal region contains contributions both from pure b-jets (events of classes 4 and 5), and 
from the mixture of flavours forming the combinatorial background under the signal peak. This latter 
mixture is well described by the jets in the sideband region, so its effect is corrected for by subtracting 
the ^-tagging weight distribution in the sideband region, scaled according to the expected normalisation 
derived from the mass fit discussed in Section 5.1.1. The scale factor S is given by: 

s = J^F b (m Mv )dm Mv ^ 

Ibl Fb{muv) dmuv + !^ F b (m Mv )dm Mv 

and varies between around 4 for the 40-80 GeV bin to 0.3 for the 160-200 GeV bin. Once this scaled 
background has been subtracted, the resulting ^-tagging weight distribution in the signal region is sta- 
tistically compatible with that expected from pure ft-jets. The uncertainty on the amount of background 
to subtract in each bin of the ^-tagging weight distribution has two components: the uncertainty on the 
number of background events in each bin in the sideband distribution, scaled by S, and the uncertainty 
on S itself, which is correlated across all bins. The latter is calculated using equation 6 together with the 
full correlation matrix for the parameters cq to c& which are used in the background function F b (mu\)- 

In order to check that this selection does not introduce a bias, the ^-tagging weight distribution of 
true b-jets in the selected signal region was checked, and found to be compatible with that of all b-jets 
within the 77 and Ej acceptance. Hence this selected sample of ft-jets can be safely used to measure the 
^-tagging efficiency. 

5.1.3 Topological selection results 

The results of applying this analysis procedure to full simulation events (including background) are 
shown as the 'Topological' entries in Table 6 in Section 5.4. For each leptonic top jet E T bin, the number 
of jets selected in the window s\ < m Mv < s 2 , the number remaining after subtracting the scaled sideband 
background, and the purity of the selected sample are given, together with the background-subtracted 
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sample ^-flavour purity, i.e. the fraction of the background-subtracted sample which is made up of b- 
flavour jets according to the Monte Carlo truth information. It can be seen that in all cases the flavour 
purity is compatible with unity within errors, showing that the background subtraction procedure is 
working well and producing a statistically pure sample of ft-jets. Table 6 also shows the results integrated 
from 40-200 GeV. The sample size and purity in the 20-40 GeV jet Ej bin is not sufficient to make a 
sensible measurement of the ^-tagging efficiency, and this bin is not considered in what follows. 

Using this b-]et selection, the ^-tagging efficiency corresponding to any cut on the ^-tagging weight 
can be obtained by integration of the ^-tagging weight distribution, i.e. calculating the fraction of the 
background-subtracted weight distribution above the cut value. This efficiency is shown as a function of 
^-tagging weight cut for each b-]et Ej bin in Fig. 7 as the points with error bars. The solid histogram 
shows the same distribution for an unbiased sample of pure Z^-jets in it events without any event selection, 
derived using Monte Carlo truth information. If the method is correct, the two distributions should agree 
within errors. The statistical errors on the measured ^-tagging efficiency (which are highly correlated 
between bins) have two components: the binomial errors due to the fluctuations in numbers of events 
passing the cut for signal and background, and the uncertainty due to the background subtraction scale 
factor S. Figure 7 also shows the difference 8 = £ meas - £ t me between the estimated and true efficiency, 
which should be compatible with zero if the method is working well. This is seen to be the case except 
for very high ^-tagging efficiencies in the two highest jet Ej bins, where the statistics are limited. The 
combined efficiency curve for &-jet Ej > 40 GeV is shown in Fig. 8(a). These results are derived by 
weighting the individual 6-tagging efficiencies measured in each energy bin by the &-jets E T distribution 
intt events. 

The method has been validated using ensemble tests based on sets of Monte Carlo subsamples of 
various integrated luminosities, derived from a sample of 1793 pb _1 . Studies of the distributions of fit 
results, estimated uncertainties and pulls show that the uncertainty on the ^-tagging efficiency returned 
from the fit underestimates the true uncertainty by about 20 %. They also show that the fit shows sig- 
nificant biases and does not always converge correctly for samples of lOOpb -1 . With 200 pb -1 of data, 
the fit results are unbiased and the fit converges in 98 % of the test samples; hence this is considered the 
minimum amount of data for which this technique can be used. 

Figure 8(b) shows the absolute uncertainty returned by the tagging measurement as a function of the 
efficiency itself, scaled to an integrated luminosity of 200 pb -1 . At £tme = 0.6, the statistical error is 
about 0.038, corresponding to a relative error of 0£ tme /e true =6.4 %. 

Systematic uncertainties have been evaluated as discussed in Section 4.3, with the exception that no 
uncertainties were evaluated for b-jet labelling, Monte Carlo statistics and the top quark mass, which are 
not relevant for this analysis. The systematic errors are summarised in Table 7. 

5.2 Likelihood-based selection 

The likelihood selection uses templates based on event and jet kinematic variables to determine the best 
assignment of jets to tt decay products within an event. These same likelihoods are also used to select 
events for which the jet assignments are most likely to be correct. 

Templates are constructed using several kinematic variables to provide discrimination between 'cor- 
rect' and 'incorrect' permutations. To maximize discrimination, templates are constructed separately for 
each jet multiplicity. All possible permutations of assignments of jets to final-state quarks are considered, 
and, for each permutation within each event, the leptonic and hadronic top quark and hadronic W boson 
are reconstructed, using the reconstructed momenta. The permutation is labelled 'correct' if the jets used 
to reconstruct the particle (t, W, b-quark, or light quark from W decay) are correctly matched to the 
quarks arising from the hard-scatter vertex. To limit the number of jet-parton assignment combinations, 
events with more than six jets are rejected. 
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Figure 7: ^-tagging efficiency vs. weight cut for different Z^-jet Ej ranges, as measured by the 
topological b-jet analysis (upper points with error bars, statistics of full simulation sample), and 
compared to the true efficiency from unbiased 6-jets (histogram). The difference between the two 
is shown as the lower points with error bars. 
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b-tag weight cut b-tag efficiency 



Figure 8: (a) 6-tagging efficiency vs. cut on ^-tagging weight w, as measured from the topological 
b-jet selection (points with error bars), and derived from Monte Carlo truth information in all b- 
jets in tt events (histogram), for 948 pb -1 of simulated tt plus background events; (b) Estimated 
statistical uncertainty on the measured ^-tagging efficiency as a function of tagging efficiency, for 
200 pb '. 



The 18 variables used in the likelihood are described below. The two templates based on the hadronic 
t-quark are shown in Fig. 9. 

1-4. hadronic W boson and top quark p T and mass. The two light quark four-momenta are added to- 
gether to obtain the hadronic W boson four-momentum. The ft-quark four-momentum is then 
added to obtain the hadronic top quark momentum. 

5-6. leptonic top quark pj and mass. The transverse momentum of the neutrino is taken to be the neg- 
ative of the total transverse momentum of the event. The z-component of the neutrino momentum 
is unknown, so is determined by assuming the mass of the W boson (80.403 GeV) and solving 
the resulting quadratic equation. Since there are two solutions, the smaller solution is chosen. If 
the root is imaginary, the real part is used. The energy of the neutrino is taken from the neutrino 
momentum assuming the neutrino is massless. The leptonic top quark four-momentum is then 
determined from the lepton, neutrino and associated &-quark jet. 

7-18. hadronic and leptonic ft-jets and 2 jets from W-decay: rank, E T , and A^(b had ,lep). For each of 
the four jets assigned to the hadronic W quarks and the two b-quark, kinematic variables are used 
to construct additional templates. The jets are ranked by £>, with rank one denoting the jet with 
the highest Ej. This jet rank, along with jet Ej and the separation in 0 between the jet and the 
lepton (A0(jet,lep)) are used as additional templates. 
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Figure 9: Templates for the likelihood-based selection, using (a) hadronic ?-quark mass and (b) hadronic 
?-quark pj. 

5.2.1 Event selection 

The b-purity of the basic sample selected as discussed in Section 2 is enhanced by requiring one jet to 
be ^-tagged, with an IP3D+SV1 weight w > 6. Then a discriminant value Q is calculated for each jet 
combination in the event using the templates, which are taken to be likelihood distribution functions for 
each variable. The 'correct combination' templates are used to get the V values for the i th variable, and 
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Figure 10: Discrimination between correct and incorrect permutations, best permutation chosen for each 
event, for entire range of £F, and for Si > 0.985. Events are chosen which have at least 1 ^-tagged jet. 



the 'incorrect combination' templates are used to get the 'b' values. They are combined according to: 



9 = 



Y\M/bj) 



(7) 



In each event, the combination with the largest value of £F is retained. The resulting distributions are 
shown in Fig. 10, normalised to 100pb _1 . The contributions from the correct combination, incorrect 
combinations and aon-tt events are shown separately. As can be seen from the right-hand plot, very 
high discriminant values are required to select a sample with high purity. 

The ft-jet purity is shown vs. the number of selected events in Fig. 11, for several ranges of jet 
Ej. Higher purity samples can be obtained by tightening the cut on @, but at the expense of selection 
efficiency. A cut of £F > 0.985 was chosen for these studies assuming 100pb _1 samples, which could be 
raised to improve the purity once the data sample size allows. 



5.2.2 Background subtraction 

Assuming the cuts used to discriminate between correct and incorrect jet-parton assignments reduce the 
background from non-?f events to a negligible level, the primary background to be considered arises 
from jets incorrectly assigned to ^-quarks. These jets can be from light, c, or s-quarks from W decay, and 
from ISR/FSR jets. To estimate the effect of the background on the ^-tagging efficiencies, it is necessary 
to determine the tagging rate of jets in background events, as well as the fraction of signal (and hence 
background) events in the data sample. Using Monte Carlo truth information to determine the true flavour 
of the jet assigned to the leptonic top decay (and assumed to be a b-jei), signal (S) and background (B) 
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Figure 11: Likelihood selection: Z?-jet purity vs. number of selected events (discriminant cut value 
ranging from 0.975 to 1) for each Z?-jet Ej bin. 

templates are formed, binned by discriminant value. Using these templates, a likelihood is constructed 
to find the fraction f sgn of signal events. The likelihood formed from data (D) and templates is: 

#bins 

lnJ?(f sgn ) = £ —Dj In (f sgn S t + (1 - f sg n)Bi) , (8) 

where D ( is the number of data events in the i th bin, and S, and fi ( are the values of the i th bins of signal and 
background templates. The associated background ^-tagging efficiencies are determined using events 
passing a looser discriminant cut (@ > 0.9), since the tag rate for jets in the background do not depend 
strongly on the value of this cut. These are shown for several ranges of jet Ej in Fig. 12. The average of 
the ^-tagging efficiencies of non-^-jets over all jet Ej bins is used, giving = 0.021 ±0.004. Once 
the ^-tagging efficiencies for background events and f sgn are obtained, the following equation is used to 
solve for £*"jg ts : 

^r = ^(<af S -( 1 -^) £ b b 5 d ) (9) 



5.2.3 Results for ^-tagging efficiency measurement 

The ^-tagging efficiencies are determined for events binned in Z?-jet Ej, with bin sizes chosen to match 
the kinematic and topological methods. To simulate the 100pb _1 sample, histograms have been scaled 
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Figure 12: ^-tagging efficiencies for signal (b-jets), and tag rate for light and c-jets (background) for 
events passing a discriminant cut of 0.9. Each plot was done for a different range of jet Ej. 

to the number of events expected in the sample. The sample purities are shown for the Ej -binned sample 
in Table 6 for comparison with the other methods, and the ^-tagging efficiencies are shown in Fig. 13. 
The average of ^-tagging efficiencies for jet Ej > 40 GeV is 0.674 ± 0.086, consistent with the actual 
value of 0.658. 

Ensemble testing was done using 100 pseudo-experiments, verifying the same central value and error. 
The pull distribution of the 100 pseudo-experiments has a width of 0.60 standard deviations, so the error 
(0.086) is scaled by 0.60 to get an expected statistical error 0.05 for the 100pb _1 sample. The low pull 
width arises because, for some pseudo-experiments in the ensemble, likelihood fits of signal fractions 
are close to 0 or 1 . Failing to determine asymmetric errors properly for these pseudo-experiments results 
in an overestimation of the error. 

Systematic uncertainties are determined for the effects discussed in Section 4.3, using the full avail- 
able MC statistics for all except the effect of background events in the sample. Ensemble testing was 
used to evaluate this error, using double the expected number of events and no background events. In 
general, the systematic errors are large for the cuts chosen for the 100pb _1 sample. These are expected to 
improve once data samples are available and the discriminants are optimised, especially once the sample 
size allows tighter jet Ej and discriminant cuts to be used in order to reduce the background from non-f? 
and wrong combinations. 

5.3 Kinematic selection 

This section describes the selection of a high purity b-jet sample through the use of a kinematic fit and 
cuts based on the resulting fit x 2 - Other selection criteria, including the use of a b-tag on the hadronic top 



524 



6- Tagging - 6-Tagging Calibration with tt Events 



CD 0. 
O 

S 0. 

CD 

B °" 

-Q 0. 

0. 
0. 
0. 
0. 



ATLAS 



100 pb" 1 "data" 
MC truth 



40 



60 



80 



100 120 140 160 180 200 
E T (GeV) 



Figure 13: Likelihood method: simulation of ^-tagging efficiencies vs. Ej for a 100 pb sample 
using the best permutation for events selected with a discriminant cut. The solid line shows the 
true ^-tagging efficiencies from the full MC sample using truth information. (b-tag weight=6.0, 
discriminant cut 0.985) 



ft-jet are also used to enhance the purity. As with the other methods described above, it is difficult to get 
high purity whilst maintaining sufficient statistics, and a data-based background subtraction procedure is 
employed to derive an effective pure ft-jet sample. 

5.3.1 Kinematic fit and performance 

The analysis uses the kinematic fit HITFIT [11], originally developed at D0. A W mass constraint is 
included and the two reconstructed tops are constrained to have the same mass. The top mass can also be 
constrained. For this study the mean top mass obtained with the unconstrained fit (172 GeV) was used 
as a constraint in subsequent fits. The mean top mass obtained from the fit was 172 GeV. Both neutrino 
solutions are considered and the one with the lower x 2 is used. The fit requires calibrations of the jet 
energy to the partem energy and resolutions of the jets and leptons, which were determined by comparing 
reconstructed quantities with the Monte Carlo truth information. 

Only the four largest Ej jets in the event were used in the kinematic fit, giving rise to 12 permutations 
which were each fitted. The correct combination is not always in the four leading jets. In 49 % of 
the events passing the basic semileptonic event selection discussed in Section 2, the four jets from the 
tt system are reconstructed and pass the jet selection cuts. Out of the events where all four jets are 
reconstructed 44 % contain the ttbar jets in the four leading jets, rising to 92 % of events if the first six 
jets are used. However, this leads to a large increase in combinatorial background and was not considered 
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Figure 14: Kinematic selection: Distribution of the x 2 for the permutation with the minimum x 2 showing 
contributions of the signal and backgrounds with standard selection cuts (left plot), and with additional 
requirements according to selection S 3 described in the text (right plot). 

From the four leading jets, the permutation with the lowest x 2 was chosen, and events where this 
X 2 was below a particular cut value were retained. If the four leading jets are four jets from the tt 
system, then the correct permutation is the one with the lowest x 2 in 58 % of the cases. Figure 14 (left) 
shows the x 2 distribution for the selected combination, including the contributions from correct and 
wrong combinations in tt events, and from W+jets and single top backgrounds. It can be seen that the 
fraction of correct combinations is not high, even for low x 2 values. Several additional selections beyond 
the 'standard cuts' So defined above were tried in order to enhance the purity as follows. Selection Si 
requires a b-tag with weight w > 5 on the jet assigned as the hadronic b-jet; the effect of this is mainly 
to reduce the W+multijet background. Selection S2 adds a b-tag veto (w < 5) on both jets assigned to 
the hadronically decaying W. Selection S3 also requires there are six or less jets with Ej > 20 GeV, and 
selection S4 requires that the hadronic top pj satisfies pj > 150 GeV. The relative reduction of wrong 
combinations and background can be seen in the x 2 distribution after selection S3 as shown in Fig. 14 
(right). 

Figure 15 shows the results of these additional selections, in terms of the fraction of jets assigned as 
the leptonic top 6-jet which are really ft-jets, and the number of selected events for lOOpb -1 . Purities of 
up to 90 % are reachable, but at the cost of low selection efficiency. To retain a reasonable number of 
events, a signal sample is defined using selection S3 with a cut x 2 < 10, which leaves around 600 events 
for 100 pb" 1 . 

5.3.2 Background subtraction 

As for the topological method, a background control sample is used to estimate the composition of the 
remaining background in the sample of selected jets from data. The use of the kinematic fit excludes 
most events with masses outside the top mass window, so it is not possible to use top mass sidebands 
to estimate the background composition. However, events with high x 2 are predominantly wrong com- 
binations (see Fig. 14). The wrong-combination fraction can be enhanced by selecting events that pass 
the standard selection So and requiring that one of the jets assigned to the W is tagged as a b-jet (w > 5). 
The x 2 distribution of this control sample follows closely that of the wrong combination background in 
the selected jet sample. The shape of the x 2 distribution is used to predict the amount of combinatorial 
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Figure 15: Left: The purity (fraction of true b jets) of the jet assigned as the b-jet on the leptonic side as 
a function of the % 2 cut. Right: The corresponding number of events. The different selection criteria are 
described in the text. 

background in the signal sample, normalising the two samples in the region 30 < % 2 < 100. Any test 
distribution (e.g. the ^-tagging weight) extracted from the signal sample can then be corrected for this 
contribution of background events, taking the shape of the test distribution in background from the region 
of the control sample with x 2 > 30. This background subtraction method is very similar to that discussed 
in Section 5.1, with the high x 2 region playing the role of the top mass sideband, and the control sample 
with a ^-tagged jet from the W decay corresponding to the sample with high hadronic top mass in the 
topological selection. 

5.3.3 Results for ^-tagging efficiency measurement 

The results of applying this selection to simulated data including background and scaling the event count 
to lOOpb 1 are shown in Table 6. The purity in the 20-40 GeV jet Ej bin is rather low, and the back- 
ground from the significant contribution of single top is not well-estimated. As for the topological 
analysis, this bin is not used for calculating the ^-tagging efficiency. The combined ^-tagging weight 
distribution for the other bins (Ej > 40 GeV) is shown in Fig. 16. The ^-tagging efficiency correspond- 
ing to any given cut can be calculated by integration, and the corresponding statistical errors for various 
tag working points can be seen in Table 7. 

5.4 Comparison of selection methods 

The performance of the different Z?-jet selection methods are compared in Table 6, which shows the 
number of jets selected as a function of jet Ej, the effective number of jets after background subtraction 
procedures have been performed, the corresponding purity of the original sample, and the ft-jet fraction 
of the final background-subtracted sample, which should be compatible with one. 

The different selections have roughly similar overall performance, with the topological selection 
giving relatively more jets at high Ej, and the kinematic selection more at low Ej. All selections allow 
a sample of several hundred Z?-jets to be selected in 100pb _1 of data, although the selections of low Ej 
&-jets (20-40 GeV) suffer from large backgrounds. 
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Figure 16: Kinematic selection: (Left): The b-tag weight distribution for the uncorrected sample (un- 
filled histogram), for the estimated background sample (filled histogram) and the corrected distribution 
calculated from the difference (data points). Right: The b-tag weight distribution for the corrected sam- 
ple (data points) compared with the distribution for true ft-jets (histogram). Both plots are normalised to 
lOOpb -1 , but use 967 pb _1 of simulated data. 

5.5 Comparison of efficiency measurements 

The performance of the different ^-tagging efficiency measurements is compared in Table 7, which shows 
the statistical and significant systematic errors for a true tagging efficiency of 0.6 for all methods. The 
systematic errors have been evaluated as discussed in [2] and Section 2. The differences in systematic 
uncertainties reflect the different uses made of Monte Carlo information by the various methods. The 
expected statistical errors for other tagging efficiencies, and the simpler IP2D tagging algorithm, are 
shown (where available) in Table 8. 

With lOOpb -1 of data, the counting method can measure the overall ^-tagging efficiency to a preci- 
sion of better than 5 %, and, particularly as the luminosity increases through 200 pb -1 several alternative 
approaches are possible that also enable the efficiency dependence on other variables (e.g. jet Ej) to be 
studied. 



6 Optimisation of Magging for tt events 

The ATLAS multivariate ^-tagging algorithms rely heavily on likelihood reference distributions for light- 
and Z?-quark jets derived from Monte Carlo simulation. The default reference distributions are produced 
using a mixture of physics processes. In principle, using reference histograms from Monte Carlo tt 
events alone could lead to an improvement in tagging performance on tt events. The selection of a pure 
sample of Z?-jets from tt data offers an opportunity to tune the ^-tagging algorithms directly on data, 
reducing the dependence on Monte Carlo simulation. Both these topics are discussed below. 
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Table 6: Summary of b-jet selection method performance, showing the number of selected Z^-jets, the 
number of jets after background subtraction, the selected sample purity, and estimated b-jet purity of the 
final sample after background subtraction, for each selection method. The results correspond to 100 pb _1 
of MC@NLO tt Monte Carlo plus backgrounds from W+jets, Wbb, Wcc and single top production. 
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Table 7: Summary of systematic and statistical uncertainties on the measurement of the ^-tagging effi- 
ciency at a true efficiency of £tme = 0.6, for the counting method in lepton+jets and dilepton channels, 
and the topological, likelihood and kinematic jet selection methods. The uncertainties are expressed as 
relative errors (in %). 
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Table 8: Expected statistical errors (relative errors in %) on the ^-tagging efficiency for 200 pb 1 mea- 
sured using each technique, for IP3D+SV1 and IP2D taggers at various working points. 
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Figure 17: Comparison of b tagging weights and b tag performance using default and special top sample 
reference histograms. 

6.1 Using tt specific reference histograms 

For this study, reference histograms based only on tt events (without the light jet purification procedure 
which discards light jets biased by nearby b-jets [6]) were created using a total of 250k events. By 
contrast, the default reference histograms were produced using a variety of physics processes, light jet 
purification, as well as different versions of the detector geometry, simulation, and reconstruction. The 
following comparison does not attempt to disentangle the various effects of those differences; for more 
details, see [6]. The resulting tt event ^-tagging weights are compared to those calculated using the 
default reference histograms, for the IP3D+SV1 ^-tagging algorithm with the default tower-based Cone 
jet algorithm with AR = 0.4. The results are shown in Fig. 17. The left plots shows the ^-tagging weights 
for &-jets and light jets calculated with both sets of reference histograms. 

The weights for both Z?-jets and light quark jets calculated with the top sample reference histograms 
are on average smaller than those from the default reference histograms. However, because the ^-tagging 
weights are smaller for both b-jets and light jets, the performance should not change significantly. This 
is confirmed by the right plot in Fig. 17, showing the b-tag efficiency vs the light jet rejection. Within 
the statistical uncertainty there is no difference in the performance. So there appears to be no significant 
advantage in using special dedicated tt reference histograms when applying ^-tagging to tt events. 
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6.2 Calibrating ^-tagging using data 

In principle, the b-jet selection methods discussed in Section 5 should allow the reference distributions 
for b-jets in tt events to be determined directly from data, removing the need for Monte Carlo Z^-jets 
completely. However, none of the methods produce a pure ft-jet sample without the need for back- 
ground subtraction, and hence do not identify a pure sample of ft-jets from which multiple variables 
(and their correlations) can be extracted simultaneously on a jet-by-jet basis. Instead, background- 
subtracted distributions are produced. These can be used directly as reference distributions only for 
one-dimensional likelihoods with a single input variable. For ra-dimensional likelihoods (for example 
the standard tagging weight which is a two-dimensional combination of IP3D and SV1 taggers), an n- 
dimensional background-subtracted distribution would be needed in order to properly model both the two 
input distributions and their correlation. For higher-dimensional distributions, the required data statistics 
would quickly become prohibitive. 

At least for initial data, a more feasible approach would be to use the background-subtracted b-]et 
samples to check each of the likelihood input variables individually, to determine how well the Monte 
Carlo simulation models the data. Monte Carlo would then be used to form the likelihood reference 
distributions as before (including correlations between input variables), with the comparison with data 
being used to assess systematic uncertainties, e.g. by reweighting Monte Carlo events to more precisely 
follow the data distributions if discrepancies are seen. 

This technique is illustrated in Fig. 18, which shows background-subtracted distributions of five 
variables related to the ^-tagging weight computation, derived from the b-]et sample selected using the 
topological technique of Section 5.1, compared to the true distributions from Monte Carlo. The inte- 
grated luminosity is 948 pb _1 . The first two variables are IP3D, the weight derived from the 3D-impact 
parameters of all tracks in the jet; and SV1, the weight derived from the SV1 secondary vertex recon- 
struction algorithm. The others are all input variables used to form the SV1 secondary vertex weight: 
N2Track, the number of two-track vertices found; SVMass, the mass of the reconstructed secondary 
vertex, and SVEFrac, the energy fraction of the jets associated to the vertex. The spikes around zero for 
these last three variables correspond to jets where no secondary vertex was found. The variables have 
been transformed linearly onto the range [0,1] to simplify the subtraction procedure. 

It can be seen that the background-subtracted distributions are consistent with the Monte-Carlo truth, 
demonstrating the validity of the method. It would be straightforward to construct reweighting factors 
from these distributions, to explore possible systematic variations. However, it should also be noted that 
the techniques discussed in Section 5.5 already allow the ^-tagging efficiency to be measured with min- 
imal dependency on the Monte Carlo — the main use of this technique would then be to understand how 
to improve the Monte Carlo simulation of &-jets, which could help to increase the ^-tagging performance 
in tt events by giving each variable its optimal weight in the likelihood. 

7 Conclusions 

The studies of jet reconstruction presented above show that the b-]et energy resolution in tt events is not 
very sensitive to the jet algorithm and parameters chosen. The jet angular resolution is more sensitive, 
degrading in particular with large jet cone sizes. 

With 100 pb" 1 of data, several methods are available to make useful measurements of the ^-tagging 
efficiency in tt events. The tag counting method can reach an overall relative precision of around 5 %, in 
both lepton+jets and dilepton channels. However, this method only gives an 'integrated' measurement, 
valid for the Ej and tj distribution of b-jets selected by the tt event selection. Monte Carlo techniques 
would have to be used to extrapolate this measurement for other jet E T and tj distributions, perhaps also 
incorporating information derived from studies of dijet samples [1]. 
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Figure 18: Background-subtracted ^-tagging variable distributions derived from the b-jet sample 
selected by the topological method, with 948 pb" 1 of simulated tt plus background data. The 
derived distributions are shown by the points with error bars, and the Monte Carlo truth for an 
unbiased sample of ft-jets is shown by the solid histograms. 
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Other techniques, based on the topology and kinematics of tt events, can be used to select a sample 
of jets enriched in ^-flavour, which can then be used to measure the ^-tagging efficiency as a function 
of other variables. This requires that the remaining non-b background in the selected jet samples be 
estimated and removed, and a variety of techniques are available for doing that, based either on data or 
Monte Carlo. These techniques are less statistically powerful than the tag counting method, achieving 
overall ^-tagging uncertainties of around 10 % with 100pb _1 , but will become increasingly powerful as 
more data becomes available. 

Calibrating the ^-tagging likelihood reference distributions from Monte Carlo tt events rather than 
the generic event mixture used by default does not significantly affect the performance of the standard 
algorithms, which demonstrates the robustness of the ^-tagger. On the other hand, the b-]et selection 
methods discussed above also allow reference distributions from tt data to be obtained; with moderate 
integrated luminosity these are likely to be more useful to cross-check the Monte Carlo description of 
likelihood input variables rather than as a direct input to 6-tag calibration. 
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Abstract 

This note describes two strategies for data-based measurement of the semi- 
leptonic ^-tagging efficiency of ATLAS lifetime ^-tagging algorithms using jet 
data. The pr, re i method uses templates of the muon pj relative to the jet+muon 
axis for bottom, charm, and light jets to estimate the ft-quark content of a 
jet sample. The ^-tagging efficiency is obtained by measuring the &-quark 
content before and after tagging a sample. The second method, System 8, uses 
two samples of jets of differing ft-quark content and two uncorrected tagging 
algorithms to form a system of 8 equations and 8 unknowns, one of which 
is the ^-tagging efficiency. Both methods use a Monte Carlo scale factor to 
convert the measured semi-leptonic efficiency to an inclusive efficiency. Both 
methods give results binned in pj and tj . Good agreement was found between 
the ^-tagging efficiencies determined by both techniques and true Monte Carlo 
for 15 < pj < 80 GeV, however above 80 GeV both methods have difficulty. 
Initial systematics studies have been performed which indicate that it should 
be possible to control the absolute error on ^-tagging efficiency to 6% for both 
methods. This error does not include the error associated with converting the 
semi-leptonic efficiency to the inclusive efficiency. 

1 Introduction 

Many analyses at the LHC will rely on the presence of ^-quarks. Top production, Standard Model 
Higgs boson searches, and searches for physics beyond the Standard Model, for example, all have final 
states involving ^-quarks. To estimate the backgrounds from well known Standard Model processes 
after applying ^-tagging algorithms it is necessary to know the tagging efficiency for b-jets, for light-jets 
(fake rate or mistagging rate), and for c-jets (charm-tagging efficiency). All three efficiencies should 
be measured as accurately as possible: systematic errors in the ^-tagging calibration can translate to 
large errors in an analysis. Although the fake rate and c-tagging rate are important, only the ^-tagging 
efficiency is considered in this note. 

In order to measure the ^-tagging efficiency we must know the bottom quark content ('^-content') 
of a calibration sample well. Unfortunately, it is difficult to select a pure b-jet sample in the data. Two 
independent methods to calibrate the ^-tagging algorithms are currently under study in ATLAS. The first 
method uses a high purity sample of top quark events. Full kinematic reconstruction enables the proper 
identification of the jet resulting from the fragmentation of the &-quark from the top quark decay. A 
tagging algorithm can be run on that sample of jets and the efficiency measured [1]. Similarly, one can 
count tags in a high purity sample of top events and estimate both the tag rate and the cross section [1]. 

This note describes the determination of the ^-tagging efficiency in QCD jet data. Unfiltered QCD 
jet data have a very small fraction of b-jets at small jet energy and small effective y/s, however. In order 
to increase the fraction of b-jets we require that jets in the sample contain a muon. Though muons come 
from other sources, a major source is the semi-leptonic decay of ^-quarks or c-quarks resulting from the 
^-quark decays. A consequence is that only the lifetime tagging efficiency in semi-leptonic decays of 
b-jets is measured by the techniques described here. The semi-leptonic efficiency must be corrected to 
obtain an inclusive ^-tagging efficiency. The scaling is determined from Monte Carlo; this note only 
briefly touches on the determination of this scale factor. 

Two standard lifetime-based ^-tagging algorithms, IP2D and IP3D+SV1 [2], are used to demon- 
strate the calibration methods and estimate systematic errors. Briefly, IP2D uses reconstructed tracks 
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and their transverse impact parameters to look for jets with displaced tracks inconsistent with light-jets. 
IP3D+SV1 is a combined tagger: IP3D is similar to IP2D, but also takes the longitudinal impact pa- 
rameter into account; SV1 reconstructs a secondary vertex from tracks near a jet. SV1 uses a likelihood 
made up of the invariant mass of the tracks associated with the secondary vertex, the ratio of the energy 
of tracks in the secondary vertex to the energy of tracks associated to the jet, and the total number of 
two-track vertices reconstructed in a jet. Methods that measure the ^-tagging efficiency that work for 
these particular taggers are expected to work for other lifetime taggers as well. 
Two separate calibration techniques are described in this note: 

• The pr,rel method, described in Section 3, uses Monte Carlo-derived templates, for b-, c-, and 
light-jets, of the relative p T of a muon with respect to the jet+muon axis. The ^-content of a jet 
data sample can then be determined by fitting the pr, re i distribution of the data with these templates 
before and after the lifetime tagging algorithms are applied. The ^-tagging efficiency is derived 
from the changing ^-fraction. 

• The System 8 method, described in Section 4, employs two samples with different ^-content and 
two uncorrected tagging algorithms to construct a system of 8 nonlinear equations and 8 un- 
knowns. One of the unknowns is the ^-lifetime tagging efficiency. 

Both of these techniques require a large sample of jets with muons. Section 2 describes a dedicated 
trigger that will be used to collect this sample along with the Monte Carlo samples and selection cuts 
used in this study. 

The performance of the tagging algorithms varies with both jet pj and r\ [2]. This is caused by 
both geometrical acceptance effects and variations in the track reconstruction efficiency. Both methods 
described in this note measure the efficiency as a function of pj and r\. The dependence on other 
variables could also be measured if required. 

The efficiency measured from jet data is largely uncorrected with that measured from the tt data, 
since the data sets and the techniques are uncorrected. Thus the results obtained with the two methods 
can be combined to further improve the understanding of ^-tagging and reduce associated systematic 
errors. At low jet energies the tt method is rather sensitive to background contamination, a region where 
the jet method will perform best. At higher jet energies the tt method will have a better accuracy. 

2 Samples and selection 

Several sets of Monte Carlo QCD jet events were generated specially for this study. The jet samples were 
generated with the standard dijet process using the PYTHIA Monte Carlo generator and full ATLAS 
detector simulation and detector reconstruction software. ATLAS splits generation of its QCD samples 
by requiring the hard scatter parton p T to be within a certain range: between 17 and 35 GeV, between 35 
and 70 GeV, between 70 and 140 GeV, and between 140 and 280 GeV. Approximately 100,000 events 
were generated in each range. After generation the samples are combined for the analysis. 

The b-jet statistics in the QCD sample are not sufficient to test the methods discussed here. To 
increase statistics we also generated muon+jet samples, which further required in each event a muon 
from any source with a true pj > 3 GeV and \r\ \ < 2. The pr cut was chosen as a representative lower 
limit for muons that can accurately be reconstructed and identified (we make a reconstructed muon 
pj > 4 GeV cut). The requirement of a muon has a dramatic effect on the flavor composition of the 
sample, increasing the &-jet content by about a factor of 10, and increasing the c-jet content by about a 
factor of 5. 

Identification of electrons in jets is more challenging than that of muons, and so in the studies reported 
here we restrict ourselves to the muon channel. 



535 



b- Tagging - 6-Tagging Calibration with Jet Events 




Figure 1 : The left plot shows the fraction of all muons with reconstructed pj > 4 GeV that are secondary 
muons vs muon pj in the QCD samples (filled circles) and in the muon+jet samples (open circles). The 
difference is caused by the generator-level filtering of the ji. samples before ATLAS simulation has a 
chance to create the secondary muons. The right plot shows the fraction of muon-tagged jets that are 
due to a ^-quark as identified by the default Monte Carlo labeling algorithm. In both plots, the black 
open circles represent muon-tagged jets found in the muon+jet sample and the red filled circles represent 
muon-tagged jets found in the QCD jet sample. 

2.1 Selection cuts 

This analysis depends, primarily, on two reconstructed objects: jets and muons. Jets are found as AR < 
0.4 cone jets with a pj > 15 GeV [3]. Jets are calibrated with a standard jet energy scale calibration (and 
the jet pr includes the muon pj) [4]. Muons are reconstructed from tracks in the outer muon detectors 
and must match an inner detector track with a fit % 2 < 10. The muons must have a reconstructed pj > 4 
GeV [5]. A muon-tagged jet is a jet with a muon contained within a cone of AR < 0.4 of the jet axis. 

The flavor of each jet must be determined (b-, c-, or light-jet) in Monte Carlo. The ATLAS flavor 
labeling algorithm [2] is used. A jet is labeled as a b-jet if a ft-quark with pr > 5 GeV is found in a cone 
of size AR = 0.3 around the jet direction. If no &-quark is present, but instead a c-quark is found then the 
jet is labeled as a charm jet. Remaining jets are labeled as light-jets. Light-jets include jets that originate 
from a gluon as well as a u-, d-, or s-quark A b- or c-quark that originates from gluon splitting will be 
classified as a b- or c-jet as long as the b- or c-quark is within AR < 0.3 of the jet axis. 

2.2 Monte Carlo biases 

Requiring that a true muon from b or c decay is present as imposed on the muon+jet Monte Carlo sample 
means that almost all muons from decays of 7T/K, or from material interactions, are missing. These latter 
sources are here denoted secondary muons. They are present in the inclusive jet sample, of course. 

The QCD jet sample predicts the secondary muon rate shown by the filled circles in Fig. 1 (left). The 
muon+jet samples have a very different secondary muon average fraction, especially at low muon pj, 
as seen by the open circles in Fig. 1 (left). Fig. 1 (right) shows the fraction of jets with selected muons 
which are b-jets in the two samples. 

The adequacy of the modeling of the secondary muons was checked, firstly by checking that the 
secondary muon rates were consistent separately for Z?-jets, c-jets, and light-jets between the inclusive jet 
and muon+jet samples. Secondly, we compared the lifetime tagging rates of ft-jets and light-jets in the 



536 



b- Tagging - 6-Tagging Calibration with Jet Events 




Figure 2: Tag rates (IP3D+SV1 weight > 4) for jets containing a reconstructed muon. The left plot 
are jets labeled as b-jets and the right plot are jets labeled as light-jets. In both plots, the open circles 
represent muon-tagged jets found in the muon+jet sample and the filled circles represent muon-tagged 
jets found in the QCD jet sample. 

two samples as a function of pj after requiring a reconstructed muon. The results are shown in Fig. 2. 
The ft-jet tag rates (a tag is defined as a jet having an IP3D+SV1 weight greater than 4) in the QCD jet 
and muon+jet samples are consistent within statistics. The light-jet lifetime-tag rate is a factor of about 
three higher in the muon+jet sample than in the muon-tagged inclusive jet sample. This arises from semi- 
leptonic heavy flavor decays in nearby jets which contaminate the light-jets with a muon and a displaced 
vertex. Muon-tagged jets in the QCD jet sample are less likely to suffer from this contamination because 
the muon is more likely to be a secondary, and thus not be associated with an event containing a &-quark. 
Real data is expected to look more like the QCD jet sample with the lower lifetime-tagger tag rate. We 
have checked that neither tagging calibration technique is affected by changes in the relative mix of 
light-jet muon sources as far as current Monte Carlo statistics allow. 

2.3 Trigger 

Dedicated trigger signatures are necessary to obtain the required number of muon+jet events. We con- 
sider the statistics obtainable as a function of jet pj. We also discuss a more sophisticated trigger to 
increase statistics at high jet pj compared to a single jet trigger. A full discussion of the operation of the 
ATLAS jet and muon trigger systems can be found elsewhere [6,7]. 

The most obvious choice for a muon+jet trigger is a simple coincidence of lepton and jet triggers 
requiring no geometrical correlation. The rate for such a trigger is high, so that it must be prescaled, 
especially at low pj. The flexible multi-level trigger system of ATLAS allows geometrical correlations 
to be used, explicitly requiring that the muon is close to the triggering jet direction. 

In the current trigger menu foreseen for running at luminosities around 10 31 cm _2 s _1 [8], various 
muon and jet trigger thresholds are available at Level-1 (LI). Example combinations of these thresholds 
which are promising for the studies reported here are the two triggers L1_MU4J10 and L1JVIU6J10, 
which are LI signatures with a muon with pj > 4 or 6 GeV and a jet with pj > 10 GeV. 

At Level-2 (L2) the muon selection is refined [7]: the selection sequence is started by the mitFast 
algorithm which confirms LI muon candidates and makes a more precise muon momentum measurement 
using muon spectrometer hit information. A fast track combination algorithm, muComb, matches tracks 
found by muFast with tracks found in the inner detector. For the studies reported here, no further selection 
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is made at the third trigger level, the event filter (EF). 

Since the trigger rate is likely to be limited by bandwidth considerations at low pj rather than the 
physics rate, the key parameter in considering the trigger performance is the purity of the sample in 
terms of the final selected events which are desired: the offline muon-tagged jets. We define the purity 
of the triggered sample as the fraction of events passing the trigger that contain such an offline muon- 
tagged jet. For the purpose of this study, the offline muon-tagged jet candidate is as defined in Section 2 
(pr > 4 GeV matching a reconstructed jet with pj > 15 GeV). The purities of jet samples selected by 
the different trigger requirements discussed above are shown in Table 1. 

The second and subsequent rows of Table 1 show that a further refinement is possible by ask- 
ing for angular matching between the trigger muon and jet directions (taken here to be within AR = 
V / A0 2 +AT7 2 < 0.4). The choice between L1JVIU4J10 and L1JV1U6J10 will be driven essentially by 
the required muon p T acceptance. Overall, purities relative to the offline selected sample of around 80% 
are attainable. This refined algorithm will be implemented for data-taking. 

Table 1 : Purities of muon-tagged jet events in the triggered sample for different trigger selections. The 
purity is shown for four triggers with two LI muon trigger thresholds (X). Errors are purely Monte Carlo 
statistics. 



Signature 


X = 4 GeV 


X = 6 GeV 


L1_MUXJ10 


20 ±1% 


40 ±3% 


L1JVIUXJ10 (jU-jet matching) 


51 ±3% 


79 ±7% 


L2_muX(muFast) _J10 (jU-jet matching) 


70 ±5% 


82 ±8% 


L2_muX(muComb: jX+lD) J10 (jU-jet matching) 


78 ±6% 


84 ±8% 



A further concern is to provide a good coverage in jet p T extending from low p T as discussed above to 
much higher jet pj, where the highest jet threshold will run unprescaled. The strategy adopted, similar to 
the one used for the inclusive jet trigger, consists of building up a set of muon-jet triggers with different jet 
trigger thresholds with prescale factors that diminish as pj rises. Figure 3 (left) shows the pj distribution 
of the jet belonging to the muon-jet candidate selected using the signature L2_mu4 _J10. Using the set 
of signatures L2_mu4J10, L2_mu4J18, L2_mu4J23, L2_mu4J35, L2_mu4J42 with prescale factors 
50/15/12/12/1 a more uniform jet pj distribution is obtained as shown in Fig. 3 (right). 

Under the assumption that the rate budget for the muon+jet trigger is 1 Hz, 100 000 muon+jet events 
are expected for around 30 hours of running time, corresponding to lpb -1 of data at a luminosity of 
10 31 cm _2 s _1 . The combined muon-jet sample used in this study is therefore equivalent to roughly 5 
pb -1 of data, assuming that the pj acceptance of the trigger is comparable to that generated in the Monte 
Carlo samples. 

3 The pr,rei method 

The pr,rei method is based on the different relative transverse momentum distributions of muons in b- 
jets, c-jets, and light-jets. This arises because the muon typically originates from the semi-leptonic decay 
of a heavy hadron in heavy-flavor jets. The variable p Tje i is defined as the p T of the muon with respect 
to the jet+muon axis. As can be seen Fig. 4, pr, re i has good discrimination between b-jets and c-jets and 
light-jets. 

The fraction of b-jets in the muon-tagged jet sample is estimated by performing a fit to the pr,rei 
distribution of muons using templates describing the p Tje i shape from ft-jets, c-jets and light-jets. The 
first two templates are determined from Monte Carlo using muons originating either from b- or c-hadron 
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Figure 3: Jet p T distribution for trigger L2_mu4_J10 (left plot) and the jet p T distribution for the sum 
of triggers L2_mu4_J10, L2_mu4_J18, L2_mu4J23, L2_mu4J35, L2_mu4J42 (right plot). In the case 
of the sum of triggers, the triggers were relatively prescaled by factors 50/15/12/12/1. In both plots the 
muon is confirmed at L2 by the muComb algorithm. 



decays. The template for light-jets is obtained by picking random tracks from jets with no heavy flavor 
in close proximity, assuming a uniform probability that such tracks fake a muon. We plan on using the 
same procedure to determine the light-jet templates from data once data is available, although we will 
still use Monte Carlo for the b- and c-templates. A systematic uncertainty will arise from the presence 
of non b- and c-jets in the real data sample, as discussed in Section 3.3. 

The pr.rei templates are determined in bins of jet pj and Y\ to allow the /^-tagging efficiency to be 
measured as a function of these variables. The pr.rd distribution is fitted by allowing the normalization of 
the three templates to vary and minimizing a likelihood that also accounts for the template statistics [9]. 
The fit results are expressed in terms of fit fractions of the b-, c-, and light-jet p Tje i templates, F b , F c and 
flight- The ^-tagging efficiency is obtained from the fit results by: 

N tag,i f tag,i 

„data,i _ "n-jet'b , n 

where N'^_j et and N^ 1 ^ are, respectively, the number of /x-jets in the i'th jet pj and r\ bin before and 
after tagging. 

As described in Section 5, a Monte Carlo-based correction is needed to correct the efficiency for 
Z?-jets with a muon to inclusive &-jets. In this section we discuss only the lifetime-tagging efficiency 
measurement for Z>-jets with muons. 

3.1 Measuring the ^-tagging efficiency using p T , re i 

The full muon+jet sample was split into two equal parts. The first half was used to obtain the pr,rel 
templates for light-, c- and Z?-jets, while the second half was used to measure the tagging performance. 

As mentioned above, the light-jet pr, re i templates were built from the pr, re i of all tracks within a cone 
of AR < 0.4 and pj > 4 GeV of any reconstructed jet labeled as 'light'. The candidate jet was further 
required to be at least AR away from any other non-light-labeled jet. 

Representative templates are shown in Fig. 4. The pr, re i distributions depend on jet pj, especially for 
Z>-jets, and the templates are therefore derived in several bins of jet pj. For high-^7- jets, the separation 
power of the pr.rei variable significantly decreases: both the shape and peak value of the pr, re i distribution 
for b-jets approaches that of c- and light-jets. This can be seen in the right plot of Fig. 4. 
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The ROOT TFractionFitter [9] algorithm is used to fit the templates to the data (the second half of the 
Monte Carlo sample for this study). TFractionFitter uses a standard likelihood fit that takes into account 
both the template and data statistics. It includes the constraint that the templates must sum to the data 
distribution. The histograms in Fig. 5 show the results of the fit applied to muon-tagged jet samples 
before applying the IP3D+SV1 tagger (left) and after (right). Since the shape of the templates for c- and 
light-jets do not differ greatly, the relative rate of c- and light-jets is not so well determined. The shape of 
the template for b-jets does differ significantly, and so the fraction of b-jets can be reliably determined. 
The ^-tagging efficiency is then determined directly using Equation (1). 

As a first step, an inclusive ^-tagging efficiency was measured, averaged over the pj and rj spectra 
of the jets. The results for the inclusive ^-tagging efficiency, for various cuts on the tagger weight w, are 
presented in Table 2. Good agreement is observed between the true Monte Carlo ^-tagging efficiency 
and that measured with the pj.rei method, within 1-2% statistical precision. 

Table 2: True Monte Carlo efficiencies and efficiencies obtained with the pr, r el method for 2 taggers 
using the combined muon+jet samples. The errors are statistical only. 



Tagger 


Weight cut 


Btrue 


£meas 




w > 4 


0.748 


0.758 ±0.018 


IP3D+SV1 


w > 7 


0.627 


0.630 ± 0.013 




w > 10 


0.489 


0.475 ± 0.010 




w > 2 


0.731 


0.715 ± 0.017 


IP2D 


w>3 


0.640 


0.631 ±0.013 




w > 4 


0.550 


0.541 ± 0.009 




Figure 4: p T ^ rd templates obtained at low rff (15 < $ < 28 GeV) (left) and high p 1 ? (163 < p^f < 300 
GeV) region. Intermediate p 1 ^ ranges have distributions lying between these extremes. 

The stability and sensitivity of the algorithm was tested by varying the fraction of b-jets in the Monte 
Carlo data-like test sample. This can be done in two ways: by decreasing the number of Z?-jets or by 
adding more light-jets to the initial sample. We have chosen the second approach. 

We varied the input fraction of light-jets in the Monte Carlo test sample (and thus the fraction of 
ft-jets) and re-measured the efficiency. The true input fractions and those obtained from fits to templates 
are shown in Table 3. We find agreement within statistical errors. 
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Figure 5: A fit of the pr,rel templates to the test sample. The test sample (black error bars) was fit with 
the pr,rel templates obtained from QCD jet Monte Carlo samples (green triangle: light-jet, blue square: 
c-jet, and red dot: ft-jet). The red histogram is the result of the fit. The left plot shows the fit results for 
all muon-tagged jets, and the right shows that obtained after tagging. 

Table 3: The stability of the pr, re i method as a function of changing b-, c-, and light-jet fractions. The 
fraction of b-jets was varied by altering the number of light-jets as described in the text. The first two 
columns show the true and fitted fractions of b-jets in the muon-tagged jet sample (/# - true and fs - 
result of fit). The remaining columns show the efficiency for two taggers (IP3D+SV1 with w > 7 and 
IP2D with w > 3) as measured directly in Monte Carlo (e true , and as determined by the p tre \ method 

{Emeus). 



f B - true 


fs - result of fit 


eZT +svl> ' 




eZ7 D>i 




0.259 


0.260 ± 0.005 


0.627 


0.630 ± 0.024 


0.640 


0.632 ± 0.014 


0.298 


0.298 ± 0.005 


0.627 


0.632 ± 0.013 


0.640 


0.633 ± 0.013 


0.347 


0.349 ± 0.006 


0.627 


0.630 ± 0.015 


0.640 


0.631 ±0.014 


0.398 


0.397 ± 0.006 


0.627 


0.634 ± 0.015 


0.640 


0.635 ± 0.012 


0.457 


0.463 ± 0.010 


0.627 


0.625 ± 0.016 


0.640 


0.625 ± 0.016 



3.2 Magging efficiency as a function of jet p T and r\ 

The ^-tagging efficiency is strongly dependent on jet pj and r\ [2]. The above Monte Carlo test is an av- 
erage and is only applicable for physics analysis that have the same kinematic properties as the muon+jet 
sample. In order to properly account for such effects the ^-tagging efficiency should be parameterized 
and measured as a function of at least these two variables. Since the Monte Carlo statistics are limited, 
the method is tested by binning separately in pj and tj. With more statistics a two-dimensional (pr, T])- 
binning will be used. For the first case, the pr,rel templates are derived in several jet pj bins. Event 
samples in each pj bin are split in half as before: the first half was used to determine the b, c, and light 
jet pT,rel templates, while the second half was used as a data-like test sample to measure the ^-tagging 
efficiency. The efficiency obtained in this way is integrated over the whole jet tj spectrum. The procedure 
was repeated to measure the efficiency dependence on jet | tj \ . In this case, the efficiency was integrated 
over the px spectrum of jets. 

The measured pj dependence of the ^-tagging efficiency is shown in Fig. 6 (left) for the tagger 
IP3D+SV1. We observe a good agreement between measured and true efficiencies in this low jet pj 
region. For jets with pj > 80 GeV, not shown, the statistical error from the fit becomes large and 
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the results become unreliable. At present, therefore, we conclude that the efficiency can be reliably 
determined only in the low jet pj region (pj < 80 GeV). This is related to the poor separation of b-jet 
templates from c- and light-jet templates in the high jet pj region (compare the plots in Fig. 4 for an 
example). In view of this, the 1 77 1 -dependence of the ^-tagging efficiency was studied only for jets with 
Pt < 80 GeV (Fig. 6 right). Good agreement over the whole \t] | range is observed. 



f * i 



Figure 6: Efficiency as a function of jet pj (left) and jet |tj | (right) for the tagger IP3D+SV1 as measured 
using the ptm method. The dots are the true value as measured in the Monte Carlo, and the squares 
(with error bars) are determined from the ptm method. The lines are parameterizations to the measured 
pr,rel points. At high jet pj the pj,rei measurement technique fails and so we do not attempt to measure 
^-tagging performance above 80 GeV. The \r\ \ plot (right) includes only jets with pj < 80 GeV. 



3.3 Expected Errors 

The high yield of muon-tagged jets from the muon-jet trigger means that the statistical error for determin- 
ing efficiencies with the pT, re i method should rapidly become small: for 100 pb _1 of data, for example, 
the overall statistical error would already be well below 1%. 

The study of the systematic errors in the ptm method is in progress, and will require real data for 
detailed evaluation. Sources of systematic error being considered include: 

• The major uncertainty of this method is the use of Monte Carlo to model the ptm templates for 
b- and c-jets. A change in the fragmentation function, for example, will likely change the muon 
Pr,rei spectrum. We estimated the contribution of these effects by scaling the pr, re i shape of the b 
and c templates, resulting in an change in e meas of 5%. 

• The light-jet templates will be drawn from data and thus will have some heavy flavor contamina- 
tion. QCD Monte Carlo predicts a jet sample will contain 2.5% ft-jets and 5% c-jets. We expect 
little error to be introduced by c-jets as their ptm distribution is so similar to light-jets. We tested 
the effect of 2.8% ft-jets in the pool of jets that was used to determine the light-jet template and 
re-ran the pj.rei fit- We observed a systematic shift to smaller ft-jet efficiencies by 3%. We add 
±3% as a systematic error due to heavy flavor contamination of the light jet templates. There is 
evidence we can mitigate the size of this error by rejecting jets from the light-jet template that 
satisfy a loose lifetime tagging requirement. 

• Detector modeling has also been examined as a source of further systematic error. Jet pj resolu- 
tion, the muon pj resolution, and the jet and muon direction resolution are all potential contribu- 
tors. Studies indicate the error due to this are of order a few percent. Detector modeling of muon 



542 



b- Tagging - 6-Tagging Calibration with Jet Events 



systematics have not been studied as it is assumed that the Monte Carlo modeling errors will be 
larger. 

• We have an implicit assumption that all tracks in jets correctly model the pr, re i distribution for 
light jets. A further systematic will exist because pr, re i distribution for fake muons in light jets is 
not the same as the tracks associated with light jets. 

These preliminary studies to date indicate that the systematic error sources should be controllable at the 
level of 6% or better on the lifetime tagging efficiency. 

4 The System 8 method 

The System 8 technique is designed to measure the ^-tagging efficiency with reduced dependence on 
Monte Carlo [10], [11]. This method uses two data samples with different ^-fractions and two uncorre- 
lated tagging algorithms. A system of 8 nonlinear equations involving known quantities - like the total 
number of jets tagged in each sample - and 8 unknown quantities - like the ^-tagging efficiency - can be 
written down and solved. 

Before writing down the system of 8 equations we define the taggers and two samples more fully. 
The two taggers used must be as uncorrected as possible. For this study we use the muon-in-jet tagger 
algorithm (SMT) [5] as one tagger. The SMT algorithm forms a 1 -dimensional likelihood using the pr.rd 
of the muon. In the present analysis an event is said to be tagged by the SMT tagger if the likelihood 
value is greater than 1.4. The second tagger, the one whose efficiency we want to measure is either 
the IP3D+SV1 or the IP2D algorithm. We abbreviate this second algorithm as "LT", short for "lifetime 
tagger". 

The two samples are denoted the "n-sample" and the "p-sample." The n-sample is the full sample of 
jets containing a muon described in Section 2. The p-sample is a subset of the n-sample selected to have 
an enhanced ^-content. A muon-tagged jet in the p-sample is required to have at least one back-to-back 
(A(j> > 2.5) lifetime-tagged (IP3D+SV1 weight > 3) jet. This selection criterion increases the ^-fraction 
of the p-sample relative to the n-sample by about a factor of two. The cuts for the p-sample were chosen 
to keep the statistics of the p-sample as large as possible and also to ensure the stability of the method. 

4.1 The 8 equations 

Four numbers are measured in each sample: the number of jets before tagging (n in the n-sample, p in 
the p-sample), the number of jets tagged by the LT algorithm (n LT , p LT ), the number of jets tagged by 
the SMT algorithm (n SMT , p SMT ), and the number of jets tagged by both algorithms (n both , p both ). 

While the total number of muon-tagged jets in each sample is an experimental observable, their flavor 
composition is not. We denote the number of &-jets in each sample as (rij,, Pb) and c- and light-jets as 
{n c i, Pel)- The tagging efficiencies of the algorithms on the selected &-jets are (efc T , £^ MT ) and non-&-jets 
tagging efficiencies are (e^ r , £™ T ) — which are also unknown. Unless otherwise stated, each efficiency 
is that which would apply to tagging of the n-sample. 

These 8 quantities can be related by 8 equations. These equations also contain parameters encoding 
the extent to which the following assumptions are valid: that the efficiency of each tagger is the same on 
the n- and p-sample, and that the two tagging algorithms are uncorrelated. The parameters introduced 
are a, as follows: 



£™ T (on p-sample) 



= a 3 £^ T (on n-sample) 




(2) 
(3) 
(4) 
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Eel (on p-sample) = 0:4 s d T (on n-sample) (5) 
e b MT (on p-sample) = a 5 e b MT (on n-sample) (6) 
e b T (on p-sample) = a 6 e b T (on n-sample) (7) 

The (X\ and CC2 coefficients measure how correlated the two taggers are on b-jets and non-ft-jets. The 
CC3, CC4, (X5, and CC(, are sensitive to any tag rate differences caused by the selection of the p-sample. The 
method is constructed in such a way that the a,- should each be approximately unity. The values of the 
a, must be determined from Monte Carlo and possible differences of each a, from that in data must be 
included in the systematic error. The current method used to evaluate this systematic is described in 
Section 4.3. 

The eight jet counts enumerated above can be related by the following eight equations, including the 

a,-: 



n 


= n h + n d 


(8) 


P 


= Pb + Pel 


(9) 


n LT 


= ej; T n b + e^n c i 


(10) 


P LT 


= a 6 sj; T p h + a 4 s d p d 


(11) 


n SMT 


— t b nb-\- t d n d 


(12) 


p SMT 


— 0C5 e h pb + a^e d p c i 


(13) 


n both 


_ „ p LT p SMT , p LT p SMT 

— ui£ b e b nb + «2fc c / t cl n c i 


(14) 


p both 


= oti a 5 a 6 et T e s b MT Pb + a 2 a 3 a 4 e c 7 e s f T Pd 


(15) 



This system of eight equations (giving the method its name) is well specified if the value of each 
a,- is known. The e b T is the unknown that is of most interest — most of the others are not directly 
usable for other analysis. The System 8 method is based on a technique used by the DO experiment 
at the Tevatron [10]. The CDF experiment has also used jet events to calibrate their lifetime 6-tagging 
efficiency [12] using a different technique. 

4.2 Solving System 8, statistical error, and stability 

Solving the system of 8 equations is in general straightforward with tools like Mathematica (analytical 
solution) or MINUIT (numerical solution). However, the nonlinearity of the 8 equations makes eval- 
uating the statistical errors nontrivial. As a result Monte Carlo methods are employed. The System 8 
observables, the jet counts {n,p,n LT ,p LT ,n SMT ,p SMT ,n hoth ,p both } are correlated; from them 8 uncor- 
rected jet counts {jci,..,jc8} are defined by dividing the n-sample into non-overlapping subsets. These 
parameters are varied according to Gaussian distributions with standard deviation y/x}. The variation of 
all Xi was performed simultaneously assuming no correlations between them. The counts were varied in 
this way 100,000 times and the 8 equations were solved each time. The resulting distribution of e b T was 
used to determine the one-sigma statistical error. Statistical errors are shown for current Monte Carlo 
statistics in Table 5. 

The System 8 solution is not well constrained when the equations are close to being linearly-dependent. 
As the equations near linear dependence, the System 8 solution becomes increasingly sensitive to statis- 
tical fluctuations, resulting in unreliable values of e b T . The term 'stability' will be used to describe the 
sensitivity of System 8 to statistical variations in the data. 

The 8 equations are linearly independent as long as the ^-content of the n- and p-samples differ, and 
as long as e b / e c i for both the SMT and LT tagging algorithms. In practice the first condition is easily 
met by construction of the p-sample (which requires a back-to-back ^-tagged jet), but in the case of the 
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SMT the second stability requirement (e^ MT / £^ Mr ) becomes increasingly difficult to satisfy at high jet 

PT- 

The SMT algorithm is highly correlated with the ptm distribution of muons to distinguish between 
b-, c- and light-jets. These distributions become very similar at high jet pj (see Fig. 4), leading to the 
poor ability of the SMT to separate b and non-ft-jets at high momentum. It is for this reason we restrict 
the analysis to pj < 80 GeV. 

4.3 Correlation systematic error 

It is important to determine how well the Monte Carlo describes the a, in the data. Possible deviations 
should be reflected in the systematic error. We evaluate this contribution as the change in e^ T obtained 
by shifting each a, from its value in Monte Carlo to unity. The total contribution to the systematic error, 
formed by adding the individual shifts in quadrature, is abbreviated as the correlation error. 

Each CCi is measured from Monte Carlo and the results are shown for the IP3D+SV1 tagger (w > 4) 
in Table 4. The errors on each a, come only from Monte Carlo statistics. We have checked other tagging 
algorithms and weight cuts and the results are similar. It can be seen that only a 3 differs by much more 
than one sigma from unity, and the others lie within 3% of unity. 

Table 4: The a coefficients measured on Monte Carlo for jet 15 < p T < 80 GeV, for the IP3D+SV1 tagger 
weight w > 4 and SMT cut of 1.4 (a™ eas ). The last two columns show how the ^-tagging efficiency {Eb) 
is affected by different variations in a,-: a 1% variation (a,- = 1 ±0.01) and the measured offset of each 
a, from unity (a,- = 1 ± (a™ eas — 1)). The change in Eb is labeled A Eb . The latter column is added in 
quadrature to determine the total systematic error for the correlation error (see text). 



Assumption 


Value {a™™) 


A Eb for a ; = 1 ±0.01 


A efc fora, = 1 ± {af eas - 1) 


ai 


1.016±0.012 


0.027 


0.043 


a 2 


1.018 ±0.024 


0.002 


0.004 


a 3 


1.052 ±0.019 


0.006 


0.031 


a 4 


1.028 ±0.020 


< 0.001 


< 0.001 


a 5 


1.011 ±0.011 


< 0.001 


< 0.001 


a 6 


1.005 ±0.010 


0.016 


0.008 



We vary each a,- by ±0.01 to give an idea of how each a, contributes to the error in ej^ T . Table 4 shows 
that variations in a 2 , «4 and a 5 should have rather less impact on the ^-tagging efficiency measurement 
than «i, 0:3 and OC(,. The measured values of the coefficients oti and Ofe are statistically consistent with 
unity and show little statistically-significant dependence on jet pj or SMT cut. The coefficient a 3 , on the 
other hand, is not consistent with unity and shows a pj dependence, and has a pronounced effect on the 
measured efficiency at high jet pj. The non-unity a 3 is understood to be primarily due to the different 
relative fractions of charm and light in the n- and p-samples. 

In order to work out the correlation error we evaluate the shift in A e lt for a variation in a, of ±(a, - 
1), as shown in Table 4. Adding the errors in quadrature gives an estimate of the correlation error of 6%. 
Larger Monte Carlo samples will help improve our understanding of this error and help better understand 
which a,- really aren't unity. 

4.4 Tests on Monte Carlo 

We evaluated the System 8 method on the muon+jet Monte Carlo sample, for the following standard 
ATLAS taggers: IP2D, and the default combination of IP3D+SV1 for jets with p T < 80 GeV. 
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To derive pj and 77 dependent efficiency curves, the System 8 method was applied in bins of pj and 
rj . Due to lack of Monte Carlo statistics it is not currently possible to bin in both pj and rj simultane- 
ously; instead, System 8 was tested in p T bins for |tj| < 2.5, and separately in 77 bins for 15 < p T < 80 
GeV. With larger Monte Carlo statistics and much larger data statistics (such as with 50 pb _1 ) we should 
have enough for four bins in pj and four in 1 77 1 simultaneously for jet pj < 80 GeV. The exact binning 
configuration will be optimized to give the best possible resolution at low jet pj where the ^-tagging ef- 
ficiency is changing most rapidly. For this study, three bins in pj (15-30 GeV, 30-50 GeV, 50-80 GeV), 
and four bins in |tj | (0.0-0.5, 0.5-1.0, 1.0-1.5, 1.5-2.5) were considered. 

Table 5 shows the results of calibrating the IP3D+SV1 tagger (w >4) and IP2D tagger (w >3) in bins 
of px and r\ . The statistical error is shown. Recall the correlation error is an additional 6% which should 
be added in quadrature. 

Table 5: Measured efficiencies, the statistical error, and their deviation from the true efficiencies in bins 
of p T and r\, as described in the text. The statistical error is for the muon+jet Monte Carlo statistics. The 
correlation error, estimated to be ±0.06, must be added as well. 



Tagger 


Bin 


Measured e b 


\A meas ., true \ 




Pj 


15-30 


0.672 ±0.036 


0.040 




PT 


30-50 


0.723 ±0.026 


0.004 


IP3D+SV1 


PT 


50-80 


0.755 ±0.029 


0.017 




\n\ 


0.0-0.5 


0.748 ±0.041 


0.008 




\n\ 


0.5-1.0 


0.734 ±0.045 


0.029 




\n\ 


1.0-1.5 


0.736 ±0.038 


0.017 




\n\ 


1.5-2.5 


0.707 ±0.043 


0.001 




PT 


15-30 


0.520 ±0.030 


0.002 




PT 


30-50 


0.619 ±0.027 


0.003 


IP2D 


PT 


50-80 


0.671 ±0.033 


0.011 




\n\ 


0.0-0.5 


0.663 ±0.043 


0.003 




\n\ 


0.5-1.0 


0.656 ±0.052 


0.016 




\n\ 


1.0-1.5 


0.606 ±0.036 


0.034 




\n\ 


1.5-2.5 


0.587 ±0.046 


0.018 



To conclude, System 8 works well for different types of lifetime tagging algorithms for jet pj < 80 
GeV. It demonstrates stable results for different cuts on ^-tagging weights. 

5 The inclusive ^-tagging efficiency 

This note describes two methods which measure the lifetime-tagging efficiency for b-jets containing 
muons. In this section we briefly discuss how to convert from the measured tag efficiency to a lifetime 
tagging efficiency relevant to inclusive jet samples. 

Fig. 7 shows the ratio of the muon-jet lifetime tag rate to the inclusive-jet tag rate for the default 
IP3D+SV1 tagger (w > 4) as a function of pj. The difference between the tag rates in the two samples 
is primarily due to about a 50% difference in the p T of the highest p T track and in the average number 
of tracks per jet. 

We define a scale factor, Sb, to describe the difference in tag rates. The scale factor is defined in 
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Figure 7: The right hand plot shows the ratio of efficiencies for semi-leptonic jet tagging and hadronic 
jet tagging in the QCD jet sample. 

Monte Carlo: 

e*5had(Pr,i7) =Sb(PT,V)e^ vX (PT,n) (16) 

and thus, given a hadronic Monte Carlo b-jet, one calculates its calibrated lifetime tagging efficiency as 
follows: 

efc-had(pr,»?) = £b(PT,ri)S h ^ had (pT,ri) (17) 

where £/, is the tagging efficiency measured with System 8 or pr,rel in muon-tagged jets. 

To understand the systematic error on the scale factor we need a better understanding of the causes 
of the differences in tag rate. This work is in progress. 

6 Conclusions 

We have presented two methods to calibrate lifetime ^-tagging algorithms using dijet data. Trigger stud- 
ies were performed in order to assure that there will be enough data for the ^-tagging calibration. Detailed 
^-tagging performance can be obtained with data corresponding to 50 pb _1 of integrated luminosity. 

The suggested methods, pr, re i and System 8, were studied using Monte Carlo jet events, at least one 
of which has an associated muon. Good agreement is observed between true semi-leptonic ^-tagging 
efficiency and that measured by pr.rei and System 8. Both methods were verified in different jet T7 and 
p T regions. We found that the current version of the p T , re l method can be used for jets below 80 GeV in 
entire r\ region. System 8 was also proved to work well up to 80 GeV in the entire T7 region. 

The total error of both methods is expected to be rapidly dominated by systematic uncertainties. The 
systematic errors studied so far indicate that both methods should be able to control the absolute error on 
£b to 6%. The additional systematic uncertainties associated with converting the semi-leptonic efficiency 
to the inclusive efficiency have not yet been determined The two methods are complementary to those 
discussed in [1], in that they are most useful in measuring the turn-on of the ^-tagging efficiency at low 
jet p T . 
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The Trigger for Early Running 

Abstract 

The ATLAS trigger and data acquisition system is based on three levels of 
event selection designed to capture the physics of interest with high efficiency 
from an initial bunch crossing rate of 40 MHz. The selections in the three 
trigger levels must provide sufficient rejection to reduce the rate to 200 Hz, 
compatible with offline computing power and storage capacity. The LHC is 
expected to begin its operation with a peak luminosity of 10 31 cm~ 2 s -1 with a 
relatively small number of bunches, but quickly ramp up to higher luminosities 
by increasing the number of bunches, and thus the overall interaction rate. 
Decisions must be taken every 25 ns during normal LHC operations at the 
design luminosity of 10 34 cm~ 2 s _1 , where the average bunch crossing will 
contain more than 20 interactions. Hence, trigger selections must be deployed 
that can adapt to the changing beam conditions while preserving the interesting 
physics and satisfying varying detector requirements. In this paper, we provide 
a menu of trigger selections that can be deployed during the startup phase at 
each trigger level and show its evolution to higher luminosities. The studies 
presented in this paper are based on simulated data. 

1 Introduction 

The ATLAS trigger [1,2] is composed of three levels of event selection: Level 1 (LI) [3] which is 
hardware-based using ASICs and FPGAs, the Level 2 (L2) and Event Filter (EF) (collectively referred 
to as the High Level Trigger or HLT [4]) based on software algorithms analyzing the data on large 
computing farms. The three levels of the ATLAS trigger system must reduce the output event storage 
rate to ~200 Hz (about 300 MB/s) from an initial LHC bunch crossing rate of 40 MHz. It is evident 
from Fig. 1 that large rejection against QCD processes is needed while maintaining high efficiency for 
low cross section physics processes that include searches for new physics. 

During the ATLAS startup phase, where low luminosity conditions (10 31 cm~ 2 s _1 ) are expected to 
prevail, the focus of the trigger selection strategy will be to commission the trigger and the detector and to 
ensure that established Standard Model processes are observed. It is therefore important to deploy loose 
selection criteria at each stage. In early operations many triggers will operate in pass-through mode, 
which entails executing the trigger algorithms but accepting the event independent of the algorithmic 
decision. This allows the trigger selections and algorithms to be validated to ensure that they are robust 
against the varying beam and detector conditions that are hard to predict before data-taking. As the 
luminosity increases, the use of higher thresholds, isolation criteria and tighter selections at HLT become 
necessary to reduce the background rates while achieving selection of interesting physics with high 
efficiency. 

This note describes the possible triggers that can be deployed during the initial low luminosity 
running and discusses the strategy for triggering as the LHC ramps up to its design luminosity of 
10 34 cm~ 2 s _1 . The performance of the various trigger algorithms at each of the three trigger levels 
is described in a set of additional accompanying notes. It should be emphasized that the rate estimates 
discussed in this note are based on simulations and are subject to several sources of uncertainty which in- 
clude lack of knowledge of the exact cross-sections, detector performance, and beam related background 
conditions. Observations with early data will allow validation of these estimated rates and extrapolations 
to higher luminosities. 
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Figure 1 : Expected event rates for several physics processes at the LHC design luminosity. 

2 Level 1 trigger 

The Level 1 trigger system receives data at the full LHC bunch crossing rate of 40 MHz and must make 
its decision within 2.5 jits to reduce the output rate to 75 kHz (~40 kHz during ATLAS start-up). The 
LI trigger has dedicated access to data from the calorimeter and muon detectors. The LI calorimeter 
trigger [5] decision is based on the multiplicities and energy thresholds of the following objects observed 
in the ATLAS Liquid Argon [6] and Tile [7] calorimeter sub-system: Electromagnetic (EM) clusters, 
taus, jets, missing transverse energy ($t), scalar sum Ej (X-EV) in calorimeter, and total transverse 
energy of observed LI jets (££y(jets)). These objects are computed by the LI algorithms using the 
measured Ej values in trigger towers of 0.1 x 0.1 granularity in At] x A(j). The LI muon trigger [8] uses 
measurement of trajectories in the different stations of the muon trigger detectors: the Resistive Plate 
Chambers [9] (RPC) in the barrel region and the Thin Gap Chambers [8] (TGC) in the endcap region. 
The input to the trigger decision is the multiplicity for various muon pj thresholds. 

There are a limited number of configuration choices that are available at LI. The most common 
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difference between configuration choices is the amount of transverse energy or momentum required, so 
we refer to these configurations as "thresholds," but note that in addition to the Ej threshold condition, 
three different isolation criteria can be applied for LI EM and tau objects, and three different window 
sizes can be specified for LI jet objects. Table 1 gives the number of these so-called thresholds that can 
be set for each object type. The total number of thresholds allowed for EM and tau objects is 16, where 
8 are dedicated to be EM objects and 8 can be configured to be either EM or tau objects. The forward 
jets have four thresholds that can be set independently in each of the detector arms. 



Object 


EM 


Taus 


Jets 


For. Jets 


E T 




ZE T {jets) 


M<10 GeV 


M>10 GeV 


# of thresholds 


8-16 


0-8 


8 


4+4 


8 


4 


4 


3 


3 



Table 1: Number of LI thresholds that can be set for each LI object type at any given time (see text for 
details). 

The total number of allowed LI configurations (also called LI items) that can be deployed at any 
time is 256. Each of these LI items, programmed in the Central Trigger Processor (CTP) [10], is a 
logical combination of the specified multiplicities of one or more of the configured LI thresholds. As 
an example Ll_EM25i and L1_EM25 (A single LI EM object with E T > 25 GeV with and without 
isolation respectively) uses two LI EM thresholds while Ll_2EM25i (Two LI isolated EM object with 
Ej > 25 GeV) uses the same LI threshold as the Ll_EM25i item. Furthermore, for each of the 256 
LI items, a prescale factor N can be specified (where only 1 in N events is selected and passed to the 
HLT for further consideration). As the peak luminosity drops during a fill, the LI prescale value can 
be adjusted to keep the output bandwidth saturated without stopping and restarting a data-taking run, if 
desired. A given data-taking run is sub-divided into time intervals of the order of one minute. These 
sub-divisions, called luminosity blocks [11], provide the smallest granularity at which various data will 
be monitored and available for physics analysis. The trigger configuration, including the LI prescale 
settings, remains unchanged within this luminosity block and adjustments to LI prescale factors will be 
made on luminosity block boundaries. 

3 Level 2 

The L2 trigger is software -based, with the selection algorithms running on a farm of commodity PCs. 
The selection is largely based on regions-of-interest (Rol) identified at LI and uses fine-grained data 
from the detector for a local analysis of the LI candidate. A seed is constructed for each trigger accepted 
by LI that consists of a p T threshold and an r\-§ position. The L2 algorithms use this seed to construct 
an Rol window around the seed position. The size of the Rol window is determined by the L2 algorithms 
depending on the type of triggered object (for example, a smaller Rol is used for electron triggers than for 
jet triggers). The L2 algorithms then use the Rol to selectively access, unpack and analyze the associated 
detector data for that r\ -0 position. The ability to move, unpack, and analyse the local data only around 
the seed position greatly reduces both the processing times and the required data bandwidth. 

The L2 algorithms provide a refined analysis of the LI features based on fine-grained detector data 
and more optimal calibrations to provide results with improved resolution. They provide the ability to use 
detector information that is not available at LI , most notably reconstructed tracks from the Inner Detector. 
The information from individual sub-systems can then be matched to provide additional rejection and 
higher purity at L2. For each LI Rol, a sequence of L2 algorithms is executed which compute event 
feature quantities associated with the Rol. Subsequently, a coherent set of selection criteria is applied on 
the derived features to determine if the candidate object should be retained. 
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The L2 farm will consist of around 500 quad-core CPUs. On average, the L2 can initiate the process- 
ing of a new event every 10 /xs. The average processing time available for L2 algorithms is 40 ms, which 
includes the time for data transfers. The L2 system must provide an additional rejection compared to LI 
of about 40 to reduce the output rate down from ~75 (40) kHz to ~2 (1) kHz during nominal (startup) 
operations. 

4 Event Filter 

The final online selection is performed by software algorithms running on the Event Filter (EF), a farm 
of processors that will consist of 1800 dual quad-core CPUs. The EF receives events accepted by L2 at 
a rate of 2 kHz (1 kHz) during nominal (startup) operations and must provide the additional rejection to 
reduce the output rate to ~200 Hz, corresponding to about 300 MB/s. An average processing time of 4 
s per event is available to achieve this rejection. The output rate from the Event Filter is limited by the 
offline computing budget and storage capacity. 

As in L2, the EF works in a seeded mode, although it has direct access to the complete data for a 
given event as the EF selection is performed after the event building step. Each L2 trigger that has been 
accepted can be used to seed a sequence of EF algorithms that provide a more refined and complete anal- 
ysis. Unlike L2, which uses specialized algorithms optimized for timing performance, the EF typically 
uses the same algorithms as the offline reconstruction. The use of the more complex pattern recognition 
algorithms and calibration developed for offline helps in providing the additional rejection needed at the 
EF. 

5 Trigger rate estimation 

Trigger rates have been estimated using a sample of simulated events. The design of a specific trigger 
menu often requires several iterations of selection optimization to ensure that the output rate is within 
allowed bandwidths and that interesting physics is triggered with high efficiency. 

The first step in approximating trigger rates is to choose an appropriate input simulation sample. Most 
trigger selections are dominated by backgrounds from common processes, so samples with large physics 
cross-sections are generally used. However, these typically contain very few events that satisfy the trigger 
criteria and hence a very large number of events are required to obtain adequate statistical uncertainties 
on the estimated rates. In order to design the trigger menu for a luminosity of 10 31 cm~ 2 s~ 1 , a minimum- 
bias dataset containing seven million non-diffractive events with a cross-section of approximately 70 mb 
was used. To estimate the trigger rates with comparable statistical uncertainties for higher luminosities 
would require prohibitively large generated samples, hence other approaches are being pursued. These 
include using a combination of QCD and minimum bias event samples or alternatively using the so-called 
enhanced bias sample. The enhanced bias sample is a loosely filtered minimum bias sample requiring the 
lowest LI pr thresholds for muon, EM, or jet to have been fulfilled. Only events that pass the filtering 
process are reconstructed, resulting in a much more effective use of the computing resources. In addition 
to QCD processes, other high cross-section physics processes, such as W and Z boson production, need 
to be considered for estimating trigger rates at very high luminosities. Although such simulated samples 
provide a reasonable starting point to establish a data taking menu, these trigger menus will evolve as 
our understanding of the detector and trigger evolve, and as the physics requirements mature. Once data 
taking operations begin, dedicated data samples for further menu optimization and rate estimations will 
be collected. 

In order to compute the initial trigger rates, the full trigger simulation (LI, L2, and EF) is executed on 
a minimum bias data sample generated with PYTHIA [12] and simulated with realistic detector effects 



553 



Trigger - Trigger for Early Running 



in Geant4 [13]. The rate estimates presented in this paper are based on an analysis of seven million 
minimum-bias events. At a luminosity of 10 31 cm~ 2 s _1 each unprescaled event corresponds to a rate of 
about 0.1 Hz. These raw rates are subsequently weighted for any applied prescale factors, which allows 
more accurate rate measurements as it makes use of all the events in the data sample. For each level, the 
individual trigger rates are computed using the equation 

R^x^ x curr J ievel xa (1) 

ntotal newest level P l 

where R is the rate at the current level, Jz? is the instantaneous luminosity, P[ is the prescale factor applied 
at a specific level, n accepted is the number of events accepted after the specific trigger level studied (and 
hence all the previous lower levels as well), and n tota i is the total number of events in the dataset and a 
is the cross section. 

Trigger rates are estimated by assigning a probability for an event to have been accepted. The com- 
putation of rates based on probabilities makes maximal use of the available set of simulated events from 
a statistical point of view. The probability Pri of an event being accepted by a trigger item i is given by: 



Pri (event) = Dj (event) /P h (2) 

where D, is the decision probability accepting an event when no prescale factor is applied, and Pj is the 
overall prescale associated with the trigger item. 

In order to compute the overall acceptance rate of a specific menu, the overlap in acceptance from 
different trigger items needs to be correctly taken into account. The probability that two triggers accept 
an event simultaneously is then given by: 

Pr 12 (evt) = Pri (evt) x Pr 2 (evt). (3) 

The overall probability of accepting an event in a menu of two triggers, including their correlations, 
is thus given by 



Pr ra enu(evt) = Pri (evt) +Pr 2 (evt) - Pri (evt) x Pr 2 (evt) (4) 

This computation, although simple in the case of two triggers, becomes increasingly complex as the 
number of items in the menu increases. Fortunately, this problem can be solved recursively. Dedicated 
tools employing these methods have been developed to compute trigger acceptance rates and overlaps 
for various trigger menus. 



6 Trigger menu for a luminosity of 10 31 cm 2 s 1 

The initial LHC startup luminosity is expected to be approximately 10 31 cm~ 2 s _1 with a low number 
of bunches in the machine. These conditions will be ideal for commissioning the trigger and detector 
systems, as well as for the initial data taking, which will be dedicated to high cross section Standard 
Model signatures. Hence, the trigger selections deployed during this early running phase will primarily 
be a combination of low pj thresholds and loose selection criteria. Triggers at higher selection stages 
will be operated in pass-through mode wherever possible. 

Trigger menus are tables of triggers that incorporate the signatures for various physics objects at 
each of the three trigger levels. These signatures fully specify the thresholds and selection criteria at 
each level, providing a recipe for triggering on various physics processes. Each signature is considered 
carefully addressing its physics goals, the efficiency and background rejection it provides for meeting 
these goals and the trigger bandwidth it consumes. Trigger menus also must include additional triggers 
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for trigger validation, monitoring, calibration and measuring the performance of the physics triggers. 
With these considerations, a trigger menu for the startup phase has been developed. The rates have been 
estimated using the previously described simulated minimum-bias events. 

The following notation is used to label the different trigger items: e (electron), g (photon), EM 
(electromagnetic), J (jets), FJ (forward jets), XE (Missing Ej), TE (Total scalar sum Ej), JE (Scalar sum 
of jet Ej), MU (muons), and tau (tau leptons). A typical example of a trigger item is 2el5i (two isolated 
electrons with a pj greater than 15 GeV) or tau20i_XE30 (an isolated tau decaying hadronically with 
visible p T above 20 GeV and tf T above 30 GeV). A prefix to the item name is used to specify the trigger 
level at which the item is deployed. If the presence of several trigger object types are required, the AND 
of these multiple object types is shown using an for example, tau25i JXE30 requires an isolated tau 
lepton with a pj above 25 GeV AND a Missing ET above 30 GeV (when objects are combined with no 
qualifier a simpler notation can be used, for example, e + }l for a trigger with an electron and a muon). 

6.1 LI items foreseen for a luminosity of 10 31 cm -2 s _1 

All rates given in this section are measured using simulated events with a luminosity of 10 31 cm~ 2 s _1 . 
Table 2 shows a potential set of EM LI signatures, their prescale factors and estimated rates that could 
be deployed during start-up for an assumed luminosity of 10 31 cm~ 2 s _1 . It shows the eight LI EM 
thresholds and the multi-EM object thresholds that are direct combinations of the single EM thresholds. 
At low luminosities, a single non-isolated trigger of 7 GeV can be used without the application of prescale 
factors at the first trigger level with a rate of about 5 kHz. All multi-object EM triggers at LI have 
sufficiently low trigger rates and can be deployed without the application of any prescale factors. The 
total LI output rate out for the EM objects defined in Table 2 is about 10 kHz after correctly accounting 
for events that have passed multiple LI EM object signatures. The LI EM object triggers are then used 
to seed both electron and photon signatures at the HLT 



Trigger Item 


EM3 


EM7 


EM 13 


EM 131 


EM18 


EM 181 


EM23I 


EM 100 


Prescale 


60 


1 


1 


1 


1 


1 


1 


1 


Rate (Hz) 


674 


4900 


950 


480 


369 


143 


53 


1.5 


Trigger Item 


2EM3 


2EM7 


2EM13 


2EM13I 


2EM18 


2EM18I 


2EM23I 


3EM7 


Prescale 


1 


1 


1 


1 


1 


1 


1 


1 


Rate (Hz) 


6500 


534 


108 


8 


47 


2 


0.6 


53 



Table 2: LI trigger items and estimated rates at 10 31 cm 2 s 1 for electromagnetic objects. 

Several higher threshold signatures are deployed with and without isolation, even though lower 
thresholds can be deployed without the application of prescale factors at low luminosities. This allows 
the validation of the higher thresholds and trigger items that will be needed at high luminosities when 
the lower threshold triggers can only be deployed with prescale factors due to rate considerations. 

The inclusive LI jet trigger items and corresponding prescale factors, as shown in Table 3, are chosen 
to given an approximately flat trigger rate across the steeply falling jet Ej spectrum. At a luminosity of 
10 31 cm~ 2 s -1 , a single jet trigger with a threshold Ej of 120 GeV can be deployed without prescale 
factors and has a LI output rate of about 8 Hz. Figure 2 shows that triggered jet Ej spectrum is fairly 
flat up to the threshold value of 120 GeV and then falls with the jet cross section. This strategy provides 
sufficient statistics across the E T spectrum for the measurement of the differential cross sections and for 
measuring the efficiencies and performance of different algorithms. 

At a luminosity of 10 31 cm~ 2 s _1 , the HLT algorithms for the inclusive jet triggers are run in pass- 
through mode with the rate controlled only by using LI prescale factors. This allows validation of the 
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Trigger Item 


J10 


J18 


J23 


J35 


J42 


J70 


J120 


3J10 


3J18 


4J10 


4J18 


4J23 


Prescale 


42000 


6000 


2000 


500 


100 


15 


1 


150 


1 


30 


1 


1 


Rate (Hz) 


4 


1 


1 


1 


4 


4 


9 


40 


140 


40 


20 


7 



Table 3: LI trigger items and estimated rates at 10 31 cm 2 s 1 for jet objects. 




Figure 2: Jet E T spectrum at 10 31 cm 2 s 1 before (dashed) and after (solid) pre- scaling at LI. 
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HLT algorithms for jet triggers that can then be deployed at higher luminosities. 

Unlike single jet triggers, where the output rates are controlled by applying prescale factors at LI, 
jet triggers with higher multiplicities (multi-jets) have a larger allocated LI output bandwidth to allow 
the use of additional selection algorithms only applicable at the HLT (e.g. b-tagging). Their total output 
rate to disk storage is subsequently controlled at the HLT with either a combination of jet algorithms 
and additional prescale factors (leading to normal jet signatures) or additional highly rejective selections, 
(e.g. b-tagging algorithms leading to b-jet signatures). 

The prescale factors and rates for forward jet triggers are shown in Table 4. The HLT algorithms at 
low luminosity are executed in pass-through mode with rates controlled using LI thresholds and prescale 
factors. 



Trigger Item 


FJ18 


FJ35 


FJ70 


FJ120 


2FJ18 


2FJ35 


Prescale 


7000 


700 


20 


1 


100 


1 


Rate (Hz) 


1 


1 


1 


1 


1 


2 



Table 4: LI trigger items and estimated rates at 10 31 cm 2 s 1 for forward jet objects. 

Table 5 shows the rates for the suite of single muon and dimuon signatures; these triggers can all 
be deployed at 10 31 cm~ 2 s _1 without LI prescale factors with further selection at HLT to control the 
output rates. Three of the six available muon thresholds must be established below pj of 10 GeV and 
are based on a coincidence of only two of the inner three stations. The lowest possible muon threshold 
of 4 GeV is set by opening the coincidence window in the two stations to the maximum allowed size. 
The remaining three high pj threshold LI muon triggers (pj > 10 GeV) require a coincidence in all 
three muon trigger stations. The largest contribution to the rates shown in Figure 3 come from b, c quark 
decays and in-flight decays of pions and kaons. The LI muon trigger is highly efficient (99%) for pj 
above the threshold values within the fiducial acceptance of the detector. 



Trigger Item 


MU4 


MU6 


MU10 


MU15 


MU20 


MU40 


Rate (Hz) 


1730 


640 


360 


30 


20 


10 


Trigger Item 


2MU4 


MU4.MU6 


2MU6 


2MU10 


2MU20 


3MU6 


Rate (Hz) 


70 


45 


14 


7 


0.2 


0.7 



Table 5: LI trigger items and estimated rates at 10 31 cm 2 s 1 for muon objects. 

Some of the signatures and estimated rates for single and double tau triggers at 10 31 cm~ 2 s _1 are 
shown in Table 6. The triggers at low luminosity are chosen to collect large statistics of W and Z boson 
decays to tau leptons. For W boson decays, additional cuts on tfj help reduce the rates. However, 
reliance on Iflj is limited during startup as it is very sensitive to several detector and acceptance effects, 
and will take time to validate. Alternative approaches that use lf.j only in the Event Filter seeded with a 
single tau signature at LI are being studied. In addition to the 2x triggers at LI, x + e and x + fl triggers 
have been implemented at LI to trigger on Z —> xx decays with one of the taus decaying leptonically. 

The eight Jf.j thresholds likely to be deployed at the LI stage are shown in Table 7. The strategy here 
is similar in nature to that of the jet triggers, with LI prescale factors tuned to provide a flat rate across 
the $t spectrum. The reliance on inclusive Ifij trigger will be small especially during the early running 
period as it is sensitive to various detector effects that will require time to understand. Most of the 
thresholds are expected to be used in combination with other signatures. 

In addition to the Jfij trigger thresholds, there are four thresholds available for the scalar sum Ej (TE) 
and the scalar sum Ej of observed LI jets (JE). As for the #j triggers, reliance on such triggers will be 
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3 4 5 6 7 8 910 20 30 40 

Muon p T threshold (GeV) 



Figure 3: Estimated muon trigger rate for a luminosity of 10 31 cm 2 s 1 . Shown are the total rates and 
various contributions. 



Signature 


tau6 


tau9I 


taulll 


taul6I 


tau25 


tau25I 


tau40 


Prescale 


750 


300 


1500 


10000 


20 


10 


1 


Rate (Hz) 


19 


16 


2 


<0.1 


16.1 


25 


83 


Signature 


2tau6 


2tau9I 


2taul6I 


tau6_taul6I 


tau9I_EM13I 


tau9LMU6 


tau9IJCE30 


Prescale 


100 


1 


1 


10 


1 


1 


1 


Rate (Hz) 


19 


413 


65 


46 


100 


25 


160 



Table 6: LI trigger items and estimated rates at 10 31 cm 2 s 1 for tau objects. 



Trigger Item 


XE15 


XE20 


XE25 


XE30 


XE40 


XE50 


XE70 


XE80 


Prescale 


30000 


7000 


1500 


200 


20 


2 


1 


1 


Rate (Hz) 


2.5 


3 


4 


7.5 


7.5 


14 


2 


1 



Table 7: LI trigger items and estimated rates at 10 31 cm 2 s 1 for Missing E T objects. 



558 



Trigger - Trigger for Early Running 



limited in the early running period, but could prove to be valuable to ensure that very high Ej events 
are recorded. Table 8 shows the selections, the pre-scale factors necessary to achieve the desired rate 
reduction, and estimated LI output rates for the TE and JE triggers that could be deployed at startup. 



Trigger Item 


TE150 


TE250 


TE360 


TE650 


JE120 


JE220 


JE280 


JE340 


Prescale 


100k 


1100 


40 


1 


150 


10 


2 


1 


Rate (Hz) 


2 


3 


1 


0.5 


0.5 


0.5 


0.5 


0.1 



Table 8: LI trigger items and estimated rates at 10 31 cm 2 s 1 for the scalar ££j- (TE) and J^E^' (JE) 
triggers. 

In addition to LI triggers that rely on thresholds and multiplicities of a single object type (EM, Jet, 
muon etc.), several triggers are formed by combining multiple LI object types (e.g: EM + muon, Jet 
+ missing Ej etc.). Table 9 shows some of the typical combined signatures and their estimated rates. 
The rates for these triggers are generally small because they combine objects of different types at LI 
and can thus be executed without prescale factors. Combining object types at LI provides a mechanism 
to control the rates while maintaining low enough thresholds to meet the physics goals of the trigger. 
Even though single object triggers may suffice for low luminosity running, it is necessary to deploy the 
multi-object triggers at low luminosity to validate them and ensure their reliability as the LHC moves to 
high luminosity operations. 



Trigger Item 


EM13_XE20 


EM7.MU6 


MU11JXE15 


MU10J18 


Rate (Hz) 


225 


10 


13 


33 


Trigger Item 


2J42_XE30 


4J23_EM13I 


4J23JVIU11 


EM13IJ42JCE30 


Rate (Hz) 


13 


6.5 


1 


6.5 



Table 9: A representative list of LI trigger items and estimated rates at 10 31 cm 2 s 1 for triggers 
combining several object types. The "_" notation is used to show the AND between two object types. 

6.2 HLT signatures foreseen for a luminosity of 10 31 cm 2 s 1 

During low luminosity running, most of the triggers at Level 2 and Event Filter are either executed in 
pass-through mode or with loose selections. Table 10 shows some of the lowest threshold trigger items 
that can be executed without applying prescale factors and their estimated rates for 10 31 cm~ 2 s -1 . Higher 
threshold triggers, which will become important during high luminosity running are also deployed so that 
they can be validated with early data. 



Trigger Item 


el2 2e5 g20 


tau60 


tau25iJ£E30 


MU10 


2MU4 


elOJVIU6 


J120 


4J23 2b23 


Rate (Hz) 


19 7 7 


10 


3.5 


18 


2.3 


0.5 


9 


7 3 



Table 10: Examples of low threshold trigger terms executed without prescale factors and estimated rates 
that can be deployed at 10 31 cm~ 2 s -1 . 

Figure 4 shows a graphical summary of the EF output rates for each trigger group and the cumulative 
rates, which provide a running total of the rates. The sum of the rates for all the trigger groups is more 
than the cumulative rates due to overlaps between the groups. The grouping is done as follows: single 
and multi-object triggers of the same object type are grouped together, hence "Electrons" refers to the 
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total rates estimated for all single and multi-electron triggers including triggers executed with prescale 
factors and in pass-through mode. "fi-Physics and Topological" refers primarily to B-physics triggers 
and other triggers where invariant mass cuts have been applied during the selection process, such as in 
selection of J/y , T , and Z decays. The "Other Topological" triggers require two or more object types, 
such as e + jets, t + $t etc. 

The trigger grouping and associated rates are shown in finer detail in Table 1 1 for each of the trigger 
levels. The rates for each trigger grouping accounts for overlaps between signatures in that group, but not 
across groups. The "Total" row gives the cumulative rates for this trigger menu, accounting for overlaps 
between the trigger groups as well. The total output rates for each trigger level, for the proposed trigger 
menu at a luminosity of 10 31 cm~ 2 s -1 , is estimated to be within the available bandwidth, although 
there are large uncertainties inherent in the simulation. The estimated rate out of L1/L2 is 12 kHz/620 
Hz well below their respective targets of 40 kHz and 1 kHz available during the LHC startup phase. 
The selections have been tuned to yield the targeted EF output rate of 200 Hz, but it is evident that this 
preliminary trigger list will need to be optimized based on early experience with real data. 

7 Data streams 

ATLAS has adopted an inclusive streaming model whereby raw data events can be streamed to one or 
more files based on the trigger decision. A proposed initial streaming configuration consists of four raw 
data streams called egamma, jetTauEtmiss, muons, and minbias. Each stream consists of events that pass 
one or more trigger signatures. The stream names indicate the type of trigger signatures they will contain, 



Cumulative Rates 



Electrons . 

Photons _ 

Muons _ 

Missing Et ! 

Total E I 

Total Jet E E 
B-physics & Topological 



Other Topological 



0 20 40 60 



100 120 140 160 180 200 
Rate (Hz) 



Figure 4: HLT unique (black) and cumulative (gray) estimated rates at 10 31 cm 
groups as described in the text. 



1 for different trigger 
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Object | LI (Hz) | L2 (Hz) | EF (Hz) 



Single-electrons 


5580 


176 


27.3 




6490 


41.1 


6.9 


Multi -photons 


common 


2.9 


< 0.1 


Single-photons 


common 


33.4 


9.1 


Multi-Jets 


221 


7.9 


7.9 




24.4 


24.4 


24.4 


Multi-Fjets 


2.7 


2.7 


2.7 


Single-Fjets 


3.7 


3.7 


3.7 


Multi-bjets 


common 


12.9 


2.6 


Single-bjets 


common 


11.6 


11.6 


Multi-taus 


465 


14.5 


12.4 


Single-taus 


148 


32.9 


22.3 


Multi-muons 


68.6 


5.8 


2.3 


Single-muons 


1730 


204 


21.8 


Missing Ej 


37.9 


31. 


3.8 


Total E T 


6.3 


6.3 


1 


Total Jet E T 


1.6 


1.6 


1.6 


BPhysics 


common 


25 


13 


Muti-Object 


5890 


134 


48 


Minimum Bias 


1000 


10 


10 


Total 


12000 


620 


197 



Table 11: LI, L2, and EF estimated rates for several groups of trigger items at a luminosity of 
10 31 cm~ 2 s _1 . The total rate accounts for overlaps between the groups. The LI objects labeled "com- 
mon" have the same LI triggers as other object types and hence do not require any additional LI band- 
width. 



for example, events passing electron or photon triggers will be written to the egamma stream. Events 
passing certain topological triggers could be written to more than one stream. Two examples, e + #r and 
e + /J,, can be used to demonstrate the two possible modes for streaming events that pass a topological 
trigger. For the e + $t signature, events are only written to the egamma stream unless they also pass an 
inclusive tfj signature after pre-scaling, in which case they are also written to the jetTauEtmiss stream. 
For the e + }l signature, events are written to both egamma and muon stream regardless of whether they 
pass the inclusive single electron or single muon trigger. 

Streams are chosen to have approximately the same proportion of events and to keep the total overlap 
(event duplication across streams) to less than 10%. The final optimization of these streams can only be 
achieved with an understanding of the overlaps observed with real data. Furthermore, the number and 
type of raw data streams may be optimized for use at different luminosity settings. 

Table 12 shows the total and the unique rates of the proposed early stream configuration. The unique 
rate reflects the number of events written solely to the specified stream, hence the difference between the 
total rate and the unique rate is the rate of replicated events in each stream. 

As shown in the Table, in addition to the raw data streams, an express stream and a calibration stream 
have also been defined. An express stream containing a subset of triggers can be used to provide rapid 
feedback on the quality of the data. It is therefore reconstructed first and any relevant knowledge of 
the quality of data is incorporated into the reconstruction of the remaining streams. Events in express 
stream are primarily intended for monitoring and not for physics analysis. Hence, by definition, events 
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appearing in express stream would also appear in one of the primary raw data streams. 

Additional triggers required for detector calibration can be run in a parasitic mode during data taking 
operations. These include triggers that provide data needed for detector alignment and energy scale 
determination. Such data can be written to their own raw data stream with the advantage of being 
processed early and the extracted calibration constants used as part of the bulk reconstruction of the 
primary raw data streams. 

8 Evolution to higher luminosities 

Experience with early running will allow further optimization of the trigger algorithms and menus and 
improve the ability to estimate rates in the high luminosity regime. As the LHC ramps up to its de- 
sign luminosity, complex trigger signatures with multiple observables, higher pj thresholds and tighter 
selections will be deployed to maintain the output Event Filter rates at about 200 Hz. 

As noted in Section 2, there are a limited number of LI thresholds available for each object type. The 
intent is to keep the LI thresholds as stable as possible. With increasing luminosity, higher LI thresholds 
need to be introduced at the expense of some of the lower thresholds. However, many of the thresholds 
will be retained providing common points of comparison across luminosity regimes. At high luminosity, 
the luxury of running in pass-through mode or with loose selections will not be possible, and tighter 
HLT selections will be implemented to achieve the required rejection. For example, jet triggers that used 
only the LI prescale to control the rates at low luminosity, will enable the HLT at high luminosity to 
obtain the additional rejection needed to control the EF output rates. In addition, topological triggers that 
include jets at lower thresholds than the inclusive jet triggers but in combination with other objects or 
requirements to achieve reduction in output rate, are deployed to increase the physics acceptance. 

At high luminosities, the trigger software and selection must be robust against high detector oc- 
cupancies, pile-up effects and cavern backgrounds. Pile-up effects becomes significant with increas- 
ing luminosities with an average of more than 20 interactions per crossing expected at a luminosity of 
10 34 cm~ 2 s _1 . The trigger should ensure coverage of the full physics programme, including searches 
for new physics and precision measurements of Standard Model parameters. The signatures include lep- 
ton, photon, and jet triggers, but with higher thresholds and tighter selection criteria employed to control 
the rates. Additional requirements that operate in pass-through mode at low luminosities, such as isola- 
tion, large # T and other complex criteria such as flavour tagging, must also be deployed to achieve the 
necessary rate reduction. 

Table 13 shows a representative sample of LI and HLT trigger items that can be expected to be 
deployed without prescale factors at a luminosity of 2xl0 33 cm~ 2 s _1 . A comparison of this menu to 
the one at a luminosity of 10 31 cm~ 2 s _1 illustrates the evolution of the rates and thresholds as a function 
of luminosity. The evolution is not linear as some of the triggers also have employed tighter selection 
conditions at higher luminosity. While Table 13 gives a flavour of some of the primary triggers and 



Stream 


Total Rate (Hz) 


Unique Rate (Hz) 


egamma 


55 


48 


muon 


35 


29 


jetTauEtmiss 


104 


89 


minBias 


10 


10 


express 


18 


0 


calibration 


15 


13 



Table 12: Total and unique rates at 10 31 cm 2 s 1 for a selected raw data stream configuration. 
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LI item 


Rate (kHz) 


EM 181 


12.0 


2EM11I 


4.0 


MU20 


0.8 


2MU6 


0.2 


J140 


0.2 


3J60 


0.2 


4J40 


0.2 


J36JXE60 


0.4 


taul6I_XE30 


2.0 


MU10_EM11I 


0.1 


Others 


5.0 



HLT item 


Rate (Hz) 


e22i 


40 


2el2i 


< 1 


g55i 


25 


2gl7i 


2 


MU20i 


40 


2MU10 


10 


J370 


10 


4J90 


10 


J65JXE70 


20 


tau35iJXE45 


5 


2MU6 for B-physics 


10 



Table 13: Subset of trigger items from two illustrative trigger menus at LI (left) and at the HLT (right) 
for a luminosity of 2x 10 33 cm~ 2 s _1 . 



expected rates, the full physics trigger menu will consist of many additional signatures for precision 
measurements and the discovery program, as well as triggers required for calibration and background 
studies. 

Preliminary rate studies suggest that about 30% of the 200 Hz bandwidth will be available for electron 
and photon triggers, 25% for muon triggers, 15% for jet triggers, and 15% for triggers involving taus and 
$t- About 5% of the bandwidth is allocated to B-physics related triggers which will involve low pj 
di-muon signatures with additional mass cuts to select J/\\r and other rate B-meson decays, with the 
balance of the bandwidth used for calibration and background triggers. This proposed distribution, will 
of course evolve with experience, and will be tuned to ensure full coverage for discovery and precision 
physics at all luminosities. 

9 Summary 

The LHC is expected to begin its operation at a low luminosity of about 10 31 cm~ 2 s _1 and ramp up 
to the design luminosity of 10 34 cm~ 2 s _1 over the first few years of operation. The three levels of the 
ATLAS trigger have been designed to handle the high rates and occupancies at high luminosity. The 
trigger items and their performance have been studied in detail in both the low and high luminosity 
regimes and a comprehensive trigger menu has been developed for the LHC startup phase. Details of 
the triggers comprising this menu have been discussed in this note and primarily consist of low pj 
thresholds and loose selections that would allow for rapid commissioning and preparation for the high 
luminosity regime. The rates estimated for a trigger menu that will likely be deployed during the initial 
phase of the LHC run are within the limits of the TDAQ bandwidths, but are subject to fairly large 
uncertainties due to the use of simulations that extrapolate from the 2 TeV center-of-mass energy of 
the Tevatron to the 14 TeV expected for the LHC. The trigger menu in the high luminosity regime will 
use high pj thresholds and complex triggers involving multiple objects to provide efficient selection of 
physics processes and high background rejection. A flavour of such signatures has been discussed in this 
note. The initial running will allow further optimization of the trigger menus, which will evolve as the 
luminosity increases over three orders of magnitude to the design luminosity. 
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Abstract 

This note reviews the tracking algorithms used at the L2 and Event Filter stages 
of the High Level Trigger of ATLAS. The tracking performance (efficiency, 
resolution) is studied for different topologies (single tracks, high and low pj 
jets) using simulated data. Detailed information on the execution time of the 
algorithms is also given. 

1 Introduction 

The aim of this note is to describe the tracking algorithms used at the L2 and Event Filter stages of the 
High Level Trigger of ATLAS and to study their performance. 

The definitions of the relevant quantities (efficiency, fake rate, and resolution) given in this note can 
differ with the ones used in the different selection algorithms of specific trigger objects (e/y, muon, taus, 
&-jet and B-physics): here the purpose is to define a common language to study and compare the different 
tracking algorithms. 

A detailed description of the complete ATLAS detector and its performance can be found in [1]. 
In addition to a description of the trigger system, it also contains relevant information on the ATLAS 
tracking detectors, the Inner Detector (ID) and its subsystems (Pixel, SCT and TRT). 

2 Track reconstruction at L2 

2.1 Data preparation 

Detector data should be converted before it can be used by tracking algorithms. The conversion process 
includes the bytestream decoding, the cluster formation in the Pixel and SCT detectors [1], and their 
conversion in spatial coordinates (space points). 

2.2 IDScan 

IDScan is a set of algorithms for fast pattern recognition and track reconstruction at the second level trig- 
ger, using space points provided by the tools described in Section 2.1. These algorithms first determine 
the z-position of the interaction point along the beam axis and then perform combinatorial tracking only 
inside groups of space points that point back to that determined position. 

The first algorithm, aiming to determine the z-position of the primary vertex, divides the region- 
of-interest (Rol) into many equally-sized 0 slices, whose width is tuned according to the individual 
Rol type, based on the lowest track momenta desired and the level of background hits in the detector. 
While tracks of high momenta produce most of their hits in a small number of neighbouring slices, hits 
from lower-momentum, curved tracks populate several different slices. Every space point is paired (or 
optionally every space point from the innermost three silicon layers) in each slice to the other space 
points in that slice and in a few neighboring slices, and each pair is used to calculate a z-position by 
linear extrapolation to the beam line. (This exploits the fact that helical trajectories of charged particles 
in a solenoidal magnetic field are straight lines in the p-z projection.) A one-dimensional histogram 
accumulates all the calculated z values, and the peak(s) in this histogram provide the rest of the IDScan 
algorithms with the z-coordinate of the pp interaction point. The correct position is identified in more 
than 98% of the Rols, with a resolution between 150 and 200 ;Um (depending on the type of Rol) for the 
central Rols. 
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Using the z-position previously reconstructed, the second algorithm computes the pseudorapidity for 
all the space points in the Rol and fills a two-dimensional histogram in (77 , 0). Since all hits from a given 
(sufficiently) high-Pr track tend to be contained in a small solid angle (with its apex at the origin for the 
track), the space points from each track that originates from the computed z-position on the beam axis 
form a cluster of neighbouring bins in this histogram. When the bin size is small enough, the occupancy 
for each bin is low and each cluster, called a group, often contains the space points of a single track. 
Fake candidates are reduced by keeping track of which detector layers contribute space points to each 
bin and requiring that at least four out of an expected seven layers to have contributed to a given bin or 
its immediate neighbours, before that bin is included in a group. 

After the groups have been identified random space points and/or space points from multiple tracks 
in each group are separated. This is achieved by considering all possible triplets of space points within 
a group and making use of the fact that any three hits from a track can be used to extract the same 
track parameters 0o and l/Pj in the transverse plane. The algorithm fills a two-dimensional histogram 
with extracted (0o, I/Pt) values and considering combinations containing space points from at least four 
different silicon layers. 

Finally the cleaned groups are subjected to the clone removal algorithm, which identifies groups 
sharing at least a certain number of space points (currently 2 or 3 depending on the Rol type) and 
removes all but the one with the highest number of space points. Furthermore, a group is removed if it 
shares more than 45% of its space points with others. This step significantly reduces the number of fake 
groups that contain a few random space points in addition to a small number of space points from an 
actual track. 

After all these steps, the remaining groups are passed on to a track fitter. The default fitter used by 
IDScan is described in Section 2.5. 

2.3 SiTrack 

The SiTrack L2 algorithm adopts a combinatorial pattern recognition approach to reconstruct tracks 
starting from space points formed in the ID silicon detectors. 

In order to perform space point combinations, these are first of all grouped into sets from which the 
entries of each combination will then be extracted; the grouping is implemented in SiTrack, using the 
idea of "logical layers". These correspond to a list of physical detector layers, i.e. barrel layers and 
end-cap disks, and are labeled with increasing numbers moving away from the beam line. The same 
physical layer can be included in more logical layers, to increase the robustness of the track finding 
process. To provide a tangible example, the first logical layer adopted for the reconstruction of high-pr 
isolated leptons includes the innermost two pixel barrel layers and the innermost pixel end-cap disk. 

Once the space points have been associated to the logical layers they belong to, the track reconstruc- 
tion algorithm proceeds through the following five steps: 

• formation of track seeds; 

• optional primary vertex reconstruction along the beam line; 

• extension of track seeds; 

• merging of extended seeds; 

• clone removal. 

The formation of track seeds corresponds to a combinatorial pairing of space points coming from the 
innermost two logical layers. For each seed, the extrapolation to the beam line is evaluated, using a 
straight line approximation; this process is depicted in Fig. 1. At this point a cut on the transverse 
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Figure 1 : Pictorial scheme of the SiTrack combinatorial strategy for track seeds formation (left) and track 
seeds extension (right). 

impact parameter is applied. This cut, meant to reduce the number of seeds to be further processed, is 
particularly important, as it fixes the lowest reconstructible track pr value. 

The subsequent step is the reconstruction of the position of the primary interaction vertex along the 
beam line, used to reject tracks not coming from the primary interaction. The vertex reconstruction 
is performed filling a histogram with the longitudinal impact parameter of the seeds and searching for 
histogram maxima; more vertex candidates can be retained and seeds not pointing to any of the recon- 
structed vertexes are discarded. This optional step is useful for high track multiplicity topologies like 
jets, but is typically skipped in the case of low multiplicity event topologies, e.g. for the reconstruction 
of single isolated leptons. Each retained seed is extended, as depicted in Fig. 1, extrapolating it to the 
outer logical layers and forming one or more space point triplets for each seed; extensions are selected 
applying a cut on the distance between the outer space point and the extrapolated seed. Each extended 
seed is then fitted by a straight line in the longitudinal plane and parametrized as a circle in the transverse 
plane. 

All the extensions found for each seed must then be merged into a single full track, grouping the 
triplets having similar track parameters after the fit. The full track is thus formed by the union of the 
space points from all the merged extensions. All the triplets not involved in the merging process are 
discarded, while track parameters are re-evaluated for the full track, fitting it with a straight line in the 
longitudinal plane and a circle in the transverse one. 

Two full tracks obtained from different track seeds may still share most of their space point; these 
tracks are defined as clones. To eliminate these ambiguous cases, only the clone track containing the 
largest number of space points is retained; in case more clone tracks contain the same number of space 
points, the one with the lowest % 2 value prevails. The retained full tracks are finally refit using one of the 
available common fit tools. 

2.4 TRT tracking 

Information from the TRT part of Inner Detector can be used as the basis of a L2 tracking algorithm. 
The core of the algorithm is a set of utilities from the offline reconstruction package xKalman [2] for 
the reconstruction of tracks in the TRT detector. It is based on the Hough-transform (histogramming) 
method. At the initialization step of the algorithm, a set of trajectories in the (j) — R(Z) space is calculated 
for the barrel and endcap parts of the TRT. The value of the local magnetic field is taken into account 
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at each straw position and coordinate along it when calculating the trajectories. After initialization, a 
histogram (with a size of 500 bins in 0 and 70 bins in curvature) is filled for each event with the TRT hit 
positions. The track candidates can be identified from peaks in the histogram. Bins with at least eight 
hits are considered as track candidates. These track candidates should satisfy some quality parameters 
like the number of unique hits and the ratio of hits to number of straws crossed by the trajectory. For each 
track candidate, the parameters are tuned so that the track lies on the maximum number of drift circle 
positions. It is at this stage that drift information is taken into account to further improve the resolution 
of the track parameters. 

2.5 Track fitting tools 

The track fitting procedure used by the L2 ID algorithms in Pixel and SCT detectors is based on a Kalman 
filtering technique. 

At first, space points are dissolved into clusters and a filtering node is created for each cluster. The 
filtering nodes encapsulate implementations of the Kalman filter algorithm for various measurement 
models. The fitter object uses the filtering nodes to update a track state described as a 5-dim vector of 
track parameters (local x, local y, angles 0 and 6 given in the global coordinate system, and track inverse 
momentum Q) and corresponding covariance matrix. 

The track state update consists of three steps. First, the track state is extrapolated using a sim- 
ple parabolic approximation of a trajectory in uniform magnetic field. The material-related corrections 
(multiple scattering, energy losses) are added to the covariance matrix during this step. The extrapolated 
track state is used to validate the next hit: if the % 2 distance between the hit and state is less then a 
predefined cut for this filtering node the track state is updated. These "extrapolate-validate-update" steps 
are repeated for every node. After that, a standard backward smoother is applied. 

There exists another tool which performs track fit and simultaneous pattern recognition in the TRT. 
The implementation of this tool is based on a distributed approach. More details on the L2 TRT track 
extension tool can be found in Ref . [3] . 

2.6 Vertex Fitting tools 

An essential part of the L2 event selection (e.g. B-physics event selection) is vertex finding and fitting us- 
ing tracks reconstructed by the L2 tracking algorithms as input. Due to the L2 timing constraints a vertex 
fitting algorithm for the L2 application has to be fast. An additional requirement stems from the L2 track 
reconstruction which provides input track parameter errors in form of a covariance matrix. In contrast, 
vertex fitting algorithms proposed in literature assume uncertainties of the input track parameters to be 
described by weight (inverse covariance) matrices. However, if only track covariance matrices are avail- 
able, these algorithms requires them to be inverted beforehand thus resulting in substantial computing 
time overhead. 

To alleviate this drawback a fast vertex fitting algorithm capable of using track covariance matri- 
ces directly (i.e. without time-consuming inversion) has been developed. The specific feature of the 
algorithm is that track momenta at perigee points rather than "at-vertex" momenta are selected as the 
fit parameters. Such a choice of fit parameters makes it possible to apply a decorrelating measurement 
transformation so that the transformed measurement can be partitioned into two uncorrected vectors 
- measured momenta and its linear combination with measured track coordinates at the perigee. This 
linear combination comprises a new 2-dim measurement model while the measured momenta and the 
corresponding blocks of the input track covariance matrices are used to initialise a vertex fit parameter 
vector and covariance matrix of the Kalman filter. This approach provides a mathematically correct and 
numerically stable initialisation of the vertex fit. A reduced size (2-dim instead of the usual 5-dim) of 
the measurement model makes the proposed Kalman filter very fast and therefore suitable for an online 
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application in the ATLAS Level 2 Trigger. A detailed description of the L2 vertex fitting algorithm can 
be found in Ref. [4]. 

3 Track reconstruction at EF 

ID reconstruction at the EF is performed using ATLAS "New Tracking" software [5]. A common ap- 
proach between offline and online is possible thanks to the modular New Tracking design which allows 
the replacement of time-critical components and full-featured offline modules by trigger-specific im- 
plementations. New Tracking currently covers two sequences, the main inside-out track reconstruction 
(track finding starts from the Silicon and then is extended to the TRT) and outside-in tracking (from 
the TRT to the Silicon). The primary pattern search concepts for both sequences have been to a large 
extent adopted from the already existing ATLAS ID reconstruction program xKalman [2] , but integrated 
and incorporating additional components in the common New Tracking approach. In the following note, 
only the inside-out reconstruction of tracks is described since is it the only one used in the EF ID online 
reconstruction. In the future, an outside-in approach is intended to be used in the trigger in cases where 
photon conversions are present. 

The EF ID reconstruction runs for many different signatures, such as electrons, muons, taus, and b- 
jets. Each of these triggers contains a very similar algorithm sequence as the ID inside-out tracking and 
is followed, depending on a given object, by dedicated event reconstruction algorithms including vertex 
finding, ^-tagging or electron processing. In the EF realization of New Tracking dedicated algorithms 
steer the underlying tools with Rol-seeded input collections x . The tools used are directly taken from 
the offline reconstruction chain but operated in a Rol-seeded mode, where the trigger signature defines a 
width of the Rol. 

The EF ID algorithm sequence is divided into three stages defined as pre-processing, inside-out track 
finding and post-processing. The pre-processing stage is responsible for building clusters and drift circles 
in the Silicon and TRT detectors, respectively, and the creation of space points as three-dimensional 
representations of the Silicon detector measurements. 

The inside-out track finding starts from the Pixel and SCT to find track seeds and creates track 
candidates based on the seeds primarily found. The seeded track finding results in a very high number of 
track candidates, that have to be resolved before an extension into the TRT detector can be done. Many of 
these tracks share hits, are incomplete, or describe fake tracks, hence ambiguity resolution is necessary. 
The track extension from the Silicon to the TRT is divided into two modules. First, tracks found in the 
Silicon detector are used as an input to find a compatible set of TRT measurements. Then each extended 
track is evaluated with respect to the original Silicon track. A track scoring mechanism is then used to 
compare the original track with the one after refitting, and the best track is chosen. 

The last stage, post-processing, starts from a primary-vertex search, which is based on the Billoir 
fitting method [6]. A track object is created which is a representation of the track reconstruction results 
aimed for analysis applications. 

Currently several different fitting techniques are implemented in New Tracking and can be chosen at 
a configuration level: 

• Kalman Fitter as a straightforward implementation of the Kalman filter technique [7] that has been 
adopted for the track fitting in high energy physics experiments. For the ATLAS Silicon detector, 
the Kalman Fitter has a dedicated extension for fitting of tracks from electrons, that lose stochas- 
tically a significant part of their energy due to bremsstrahlung effects. In that case, an assumption 
about purely Gaussian noise is far from being optimal. A special Dynamic Noise Adjustment tech- 

'The FullScan operation is an exception which assumes track reconstruction in the entire detector. 
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nique has been developed [8]. It still uses a Gaussian error assumption but modifies the applied 
variance based on the amount of traversed material. 

• Deterministic Annealing Filter is a deterministic annealing technique [9] which combines the stan- 
dard Kalman filter formalism with a probabilistic description of the measurement assignment to a 
track. 

• Gaussian Sum Filter is a special multi-Gaussian extension of the standard Kalman fitter [10], 
dedicated to reconstruction of electron tracks. In the GSF approach, the highly non-gaussian 
probability density function of electron energy loss is modeled by a mixture of several Gaussians. 

• Alignment Kalman Filter is an extended version of the Kalman filter [11] that integrates an update 
of the detector surface orientation and position into an intrinsic measurement update of a Kalman 
Fitter step. 

• Global Chi2 Fitter [12] is a track fit through a minimization of % 2 value. Given purely Gaussian 
process noise, the minimization of the x 2 value that is built from hit residuals at every measurement 
surface gives the best set of estimators of the track trajectory. Material effects enter the % 2 function 
as additional fitting parameters. 

The Kalman Fitter approach without the Deterministic Annealing Filter extension is a default fitter 
at the EF ID. Other fitters described above can be chosen during the configuration step. 

4 Performance 

4.1 Timing measurement 

This section presents a summary of CPU timing measurements for various steps of data preparation and 
track reconstruction in L2 ID and EF ID algorithms. 

4.1.1 L2 ID 

The L2 ID timing has been measured on a quad-core 3GHz Woodcrest CPU machine. Timers provided 
by the standard trigger monitoring framework have been used for these measurements. 

The data preparation timing measurements obtained on it data for the Pixel and SCT are shown in 
Table 1 for e/y, muon, and tau triggers. The average cluster collection and space point multiplicities per 
Rol are presented in Table 2. 

The track reconstruction timing measurements are summarized in Table 3 (note that SiTrack fit timing 
is included in the pattern recognition time). The average track multiplicity is presented in Table 4. 

Tables 5,6, 7, and 8 present a comparison between Rol-based and FullScan running for B— physics 
triggers on bb — »■ jiX data. These show, respectively, the time of data formation, average space point 
multiplicity, time of L2 ID track reconstruction, and average track multiplicity for the two cases. 

4.1.2 EF ID 

The EF ID timing measurements are shown in Table 9. They were done on a 3 GHz Intel Xeon 5160 
CPU with 8GB of memory. The EF ID software was run in the emulator of the event filter processing and 
execution times for the algorithms were obtained as mean values from the timing histograms provided 
by the HLT framework. The results are representative in terms of algorithm execution times but do not 
account for data collection times, which are platform dependent. Two event samples were used in the 
tests, one corresponding to a single electrons with lOOGeV, the other was a simulation of tt production. 
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Data preparation step 


Mean time [ms] 




H 


e/y 


T 


RegionSelector 


0.23 


0.26 


0.32 


RobDataProvSvc 


0.02 


0.02 


0.02 


Cluster IDCs retrieval 


0.03 


0.03 


0.02 


BS-to-clusters, Pixel 


0.63 


0.73 


1.02 


BS-to-clusters, SCT 


0.60 


0.69 


0.95 


Pixel space point formation 


0.24 


0.27 


0.35 


SCT space point formation 


0.31 


0.37 


0.40 



Total time | 2.11 | 2.45 | 3.23 



Table 1 : The timing measurements of the data formation per Rol. 



Data preparation step 


Multiplic 


ity 






e/y 


T 


Pixel cluster coll. 


46.2 


45.6 


104.4 


SCT cluster coll. 


82.8 


83.5 


245.8 


Pixel space points 


66.4 


65.9 


131.6 


SCT space points 


36.5 


38.8 


87.1 



Table 2: Average cluster/space point multiplicities per Rol. 

The most CPU demanding operation is the track finding in the silicon detectors with typical exe- 
cution times of 30 ms per Rol ( 300 ms in FullScan mode). Next is the processing of TRT extensions 
and resolution of ambiguities each with a CPU cost of about 20 ms per Rol (approximately 170 ms in 
FullScan). Another important contribution to the processing time is the data preparation step which takes 
about 20 ms per Rol for all detectors (and about 300 ms in FullScan mode). The total time to process it 
events in full-scan mode is about Is. 

4.2 Efficiency and resolution definition 

In order to evaluate the performance of any tracking algorithm, three kinds of information have to be 
provided: track reconstruction efficiency, the percentage of fake reconstructed track candidates, and the 



Track reconstruction step 


Mean time [ms] 






e/y 




IDScan : Pattern recognition 


0.60 


0.70 


1.45 


IDScan : Track Fit 


0.28 


0.29 


0.49 


IDScan : TRT data preparation 


1.14 


1.32 




IDScan : TRT tracking 


2.96 


2.87 




SiTrack : Pattern recognition 


0.62 


0.66 




SiTrack : TRT data preparation 


1.15 


1.39 




SiTrack : TRT tracking 


2.85 


3.66 





Table 3: The timing measurements of the L2 ID track reconstruction with IDScan and SiTrack. 
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L2 ID algorithm 


Tracks/Rol 

H | e/y | T 


IDScan 


1.92 


1.63 


4.50 


SiTrack 


2.05 


2.66 





Table 4: The average track multiplicity per Rol for L2 ID track reconstruction with IDScan and SiTrack. 



Data preparation step 


Mean time [ms] 




Rol 0.75 x 0.75 


FullScan 


RegionSelector 


0.67 


4.76 


RobDataProvSvc 


0.02 


0.41 


Cluster IDCs retrieval 


0.03 


0.06 


BS-to-clusters, Pixel 


2.13 


11.7 


BS-to-clusters, SCT 


1.89 


13.1 


Pixel space point formation 


0.78 


5.27 


SCT space point formation 


0.68 


5.65 


Total time 


6.67 


44.5 



Table 5: The timing measurements of the data formation for Rol-based and FullScan running of 
B— physics triggers. 

resolution of the track parameters. 

All the results shown in this section refer to reconstructed Monte Carlo (MC) simulated data samples, 
where the track parameters for both the reconstructed and the simulated tracks are available. In addition, 
each space point used to build a given track can be traced back to the MC particle that generated the cor- 
responding charge deposit. This information is used by the ID trigger analysis packages to evaluate two 
additional quantities for each reconstructed track, which prove fundamental for the efficiency definition: 

• the number of space points which trace back to the same MC track; 

• the link between a reconstructed track and the MC track that generated most of its space points. 
In this context we define: 

• reconstructive MC particle: a particle passing a set of geometrical selection cuts (being contained 
in one of the processed Rols, pointing to the primary vertex) and a pj cut; 



Data preparation step 


Multiplicity 




Rol 0.75 x 0.75 


FullScan 


Pixel cluster coll. 


266.2 


1744 


SCT cluster coll. 


824.7 


8176 


Pixel space points 


246.2 


1591.5 


SCT space points 


119.6 


910.3 



Table 6: The average cluster/space point multiplicities for Rol-based and FullScan running of B-physics 
triggers. 
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Track reconstruction step 


Mean time 


[ms] 




Rol 0.75 x 0.75 


FullScan 


IDScan : Pattern recognition 


3.00 


34.6 


IDScan : Track Fit 


0.55 


2.19 


IDScan : TRT data preparation 


2.05 


9.50 


IDScan : TRT tracking 


5.43 


19.5 



Table 7: The timing measurements of the L2 ID track reconstruction for Rol-based and FullScan running 
of B— physics triggers. 



L2 ID algorithm 


Tracks/Rol 
Rol 0.75 x 0.75 | FullScan 


IDScan 


4.26 16.9 



Table 8: The average track multiplicity for Rol-based and FullScan running of the B— physics triggers. 

• good track: a reconstructed track linked to a reconstructible particle by the majority of its space 
points; the geometrical and p T cuts used for the MC particles are applied to the reconstructed 
tracks too; 

• best track: for each reconstructible particle, more than one good track can be available; the best 
track is defined as the one sharing the largest percentage of space points with the linked particle; 

• fake track: a reconstructed track (passing geometrical and pj cuts) which is not a good track; 
The most natural choice for the definitions of efficiency, fake fraction and resolution is then: 

• efficiency: ratio between the best tracks and reconstructible particles; 

• fake fraction: ratio between fake tracks and all the reconstructed tracks passing the applied cuts; 

• resolution: difference between the track parameter of a good track and that of the linked recon- 
structible MC particle; 

These definitions are used to produce the set of plots used in the following subsections to summarize the 
performance of a given tracking algorithm. 

The track reconstruction efficiency is shown as a function of the absolute value of tj and pj of the 
reconstructible MC particle while the fake fraction is shown as a function of the absolute value of T7 and 
Pt of the reconstructed track. The track parameter resolutions are shown as a function of the absolute 
value of T] and pj of the reconstructible MC particle: the resolutions on 0, l/pr, transverse impact 
parameter (<f 0 ) and longitudinal impact parameter (zq) are shown as a function of the absolute value of 
T] while resolutions on d 0 and l/p T are shown as a function of p T . The track parameter resolutions are 
evaluated with a gaussian fit to the resolution distribution for each parameter. 

4.3 Results with isolated electrons 

The tracking performance for electrons was evaluated with a data sample of single electrons uniformly 
distributed over a transverse momentum range of 7 to 80 GeV. Figure 2 shows the reconstruction effi- 
ciency and the fake fraction as a function of r\ and pj for SiTrack, IDScan and EF tracking algorithms. 
Figure 3 summarizes, for the same algorithms, the track parameters resolutions. As shown in the figures, 
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Mean time [ms] 


Algorithm 




single e lOOGeV 


ft 






Electron 


Electron 


Muon 


Tau 


FullScan 




Pix 


2 


2 


2 


3 


61 


Data preparation 


SCT 


8 


8 


10 


10 


85 


TRT 


8 


6 


8 


9 


140 


Space point finder 


1 


1 


1 


2 


30 


Track rinding in Si 


6 


29 


6 


31 


310 


Ambiguity solving 


4 


15 


5 


11 


135 


TRT track extensions 


1 


3 


1 


1 


31 


TRT extension processing 


5 


19 


8 


13 


170 


Vertex finding 


0.1 


1 


1 




23 


Particle creation 


0.4 


2 


1 


1 


24 


Total 


35 


86 


43 


81 


1009 



Table 9: Timing of EF reconstruction steps per Rol in two samples of events, single e~ of 100 GeV and 
tt events. Electron, muon, and tau times were measured in the corresponding triggers, FullScan mode 
comes from the execution of B-physics triggers. The Rol sizes were 0.1 x 0.1 (A(p x At]) for the electron 
and muon triggers, 0.2 x 0.2 for tau triggers. 

the efficiency is typically 95% or greater, except for low p T or high r\ where it drops to about 90%. The 
fake fractions tend to be below 2% except at low p T or high T7 where they can exceed 6%. Resolutions 
are observed to be quite good and are also somewhat degraded at low p T or high r\. 

4.4 Results with isolated muons 

The tracking performance for muons was evaluated with a data sample of single muons with transverse 
momenta of 6, 9, 21 and 30 GeV. Figure 4 shows the reconstruction efficiency and the fake fraction as a 
of function 77 and pj for SiTrack, IDScan and EF tracking algorithms. Figure 5 summarizes, for the same 
algorithms, the track parameter resolutions. As shown in the figures, the muon performance is even better 
than for electrons, with close to 100% efficiency throughout the kinematic range and an extremely low 
fake rate. Resolutions are slightly better than for electrons, and show similar p T and angular dependence. 
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Figure 2: Electron track reconstruction efficiency (top) for single electrons as functions of pj and r\ for 
SiTrack (full triangles), IDScan (empty triangles) and EF tracking (empty circles). Bottom plots show 
fake fraction as a function of the reconstructed p T and r\. 
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Figure 3: Track parameter resolutions for single electrons as a function of T7 and p T for SiTrack (full 
triangles), IDScan (empty triangles) and EF tracking (empty circles). 
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Figure 4: Muon track reconstruction efficiency (top) for single muons as functions of pr and r\ for 
SiTrack (full triangles), IDScan (empty triangles) and EF tracking (empty circles). Bottom plots show 
fake fraction as a function of the reconstructed p T and r\. 
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p T [MeV] P T [MeV] 



Figure 5: Track parameter resolutions for single muons as a function of T7 and pj for SiTrack (full 
triangles), IDScan (empty triangles) and EF tracking (empty circles). 
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4.5 Results with jets 

The tracking performance has also been evaluated using a sample of b-jets produced in the decay of a 
Higgs boson (mass 120 GeV) produced in association with a leptonically-decaying W boson. This is 
the benchmark sample for ^-tagging selection. Only tracks formed with at least four space points are 
considered. 

Figure 6 shows the reconstruction efficiency and the fake fraction as a function of tj and p T for 
SiTrack, IDScan and EF tracking algorithms. Figure 7 summarizes, for the same algorithms, the track 
parameter resolutions. Not surprisingly, the efficiency for this sample is a bit lower (80 to 90 %) than the 
single electron and single muon samples, and has a bit higher fake rates. Resolutions are comparable or 
slightly worse. 




Figure 6: Reconstruction efficiency (top) for tracks in &-jets yielded by Higgs decay as functions of 
Pt and tj for SiTrack (full triangles), IDScan (empty triangles) and EF tracking (empty circles). The 
efficiency as a function of tj is computed for tracks with pj > 3 GeV . Bottom plots show fake fraction 
as a function of the reconstructed p T and tj. 



4.6 Results with n and K in B-physics events 

Using B s decay to (j) n —>■ KKn the reconstruction efficiency for both kaons and pions have been evaluated. 

Figure 8 shows the reconstruction efficiency as a function of tj and pj for SiTrack, IDScan and EF 
tracking algorithms. The efficiency is typically close to 95% dropping somewhat at low pj or high tj. 
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Figure 7: Track parameter resolutions for tracks in b-jets yielded by Higgs decay as a function of r\ and 
p T for SiTrack (full triangles), IDScan (empty triangles) and EF tracking (empty circles). 
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Figure 8: Reconstruction efficiency for kaons (top plot) and pions (bottom plot) yielded from a decay 
of the B s meson to —> KKn (sample 16701 with misaligned geometry), as functions of pj and tj for 
SiTrack (full triangles), IDScan (empty triangles) and EF tracking (empty circles). 
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4.7 Comparison between the EF and the offline tracking performance 

Since the track reconstruction in the EF is based on the same software as used for offline reconstruction, 
the EF tracking performance was studied under conditions equivalent to those used offline. This study 
was done using release 13.0.30.4 with perfectly aligned detector geometry. 

To factorize out the bare EF performance, fake LI Rols were produced and passed through the L2 to 
seed the EF tracking, based on the MC truth information from particles with a p T > 1 GeV and a |?] | < 3. 

The same selection as normally used for offline studies was applied, where only particles within 
\t] \ < 2.5, \d0\ < 2 mm and \zo~ z v \ x sin$ < 10 mm were taken into account. The accepted tracks had 
to pass the requirement that at least 80% of the their hits were caused by the matched MC particle as well 
as the quality requirement of having at least 7 hits in the Pixel or SCT detectors together. 

Figure 9 (left) shows the reconstruction efficiency as a function of tj , for single electrons and muons 
with a pt = 5 or 100 GeV. The efficiency is defined as the fraction of particles that produce an accepted 
track and the average efficiencies based on all electrons (muons) were found to be 83.9 ± 0.2% (99.50 ± 
0.03) for a p T = 5 GeV and 92.9 ±0.1% (99.52 ±0.03) for a p T = 100 GeV. Figure 9 (right) together 
with Fig. 10 shows the muon resolution of l/pr, 0 and do for accepted tracks as functions of r\, where 
the inverse pj resolution is scaled by the pj value for easier comparison. In accordance with the offline 
studies, the resolution is determined from the root-mean-squared of the tracks within a region containing 
99.7% of the distributions, i.e. within 3 standard deviations from the mean of a Gaussian distribution. 
All results are shown together with the offline results presented in [1], which is represented in the plots 
by the superimposed lines. The results agrees well, however, small deviations are seen due to slightly 
different software setup with respect to Ref. [1]. 




°% 0.5 1 1.5 2 2.5 0 0.5 1 1.5 2 2.5 

hi H 



Figure 9 : Reconstruction efficiency (left) and scaled l/p T resolution (right) as functions of 1 77 1 . The superimposed 
lines represent results from the offline track reconstruction. 



5 Summary and conclusions 

We have reviewed the track reconstruction algorithms used at the L2 and Event Filter stages of the High 
Level Trigger of ATLAS. 

The algorithms performance and timing have been studied running on simulated data different chains 
of trigger selection. 
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Figure 10: 0 (left) and do (right) resolution as functions of |tj |. The superimposed lines represent results from the 
offline track reconstruction. 
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Data Preparation for the High-Level Trigger Calorimeter 
Algorithms 

Abstract 

This note describes the data preparation necessary to enable the ATLAS High 
Level Trigger Calorimeter Algorithms. An overview of the infrastructure, 
which provides the transition from the calorimeter electronics to data recon- 
struction and trigger algorithm implementation, is given. This infrastructure 
is detailed as a separate note since it is relevant to all trigger algorithms re- 
quiring calorimeter information (electrons, photons, taus, jets, missing Ej and 
muons). 

1 Introduction 

The calorimeters and part of the muon system were designed to participate in the ATLAS first level 
hardware-based trigger (LI) [1-4], while all sub-detectors participate in the software-based high level 
trigger (HLT) [5], comprised of Level 2 (L2) and the Event Filter (EF). One important phase for any 
trigger software algorithm is the data preparation step which provides the conversion of the bytes of data 
produced by the detector electronics into a convenient form for the trigger algorithms. In the case of the 
calorimeters, the digital information provided by the detector must be converted into calorimeter cells as 
input to the reconstruction algorithms. A good data preparation step will provide the input to the trigger 
software in an organized manner, so that access to the prepared data is optimized. This note describes 
this step for the calorimeter trigger software in the HLT. The same software data preparation layer is used 
in algorithms that are used to identify electrons, photons, taus, jets and muons [6]. 

1.1 Calorimeter readout 

The fundamental LAr calorimeter readout unit is the calorimeter cell. The cell electrodes receive the 
current due to the drift electrons in the liquid argon and form a triangular shaped signal [1]. The shaping 
and readout of this signal is performed by the Front-End Electronics. To preserve the dynamic range and 
the energy resolution, the signal is shaped with three possible gains. The Front-End Boards (FEBs) save 
analog samples of the signals coming from the detector at the bunch crossing rate (25 ns). Each FEB can 
process up to 128 LAr calorimeter cells. 

The signals are converted by the FEBs to a digital format if the event is accepted by the LI Trigger. 
The digital information is sent to the ReadOut-Drivers (RODs). These are Digital Signal Processor (DSP) 
based machines, fast enough to deal with a number of input channels (2 FEBs feed one ROD DSP). From 
the pulse shape digitized at the FEB, the energy deposited in any cell can be calculated. 

Data from each ROD are sent to a ReadOut Buffer (ROB). The ROBs keep this data fragment until 
it is requested by L2 or the Event Builder (EB). While L2 only requests a limited amount of data frag- 
ments, the EB will request fragments from the whole detector for events approved by L2 for subsequent 
processing in the EF computer nodes. 

In the Tile Calorimeter [2], the photons produced in the scintillators are measured by photomultipli- 
ers, which produce a negative shaped pulse. The digitized electronic signal is saved into an on-detector 
memory waiting for the accept signal from the LI trigger. For each of the 256 Tile Calorimeter modules, 
a so-called drawer (inserted in the back of the calorimeter structure) contains up to 48 photomultipliers 
and all the readout electronics. 

The analog signals from the detector cells are also summed up by dedicated hardware by detector 
regions in depth. Trigger Towers (TT) are coarse granularity combinations of the detector cells and can be 
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Figure 1 : Different parts of the data preparation processing and their relation to the calorimeter algorithm 
at the L2. For details, see text. 



provided in analog mode to the hardware LI processing. Except for the very forward regions, the TT size 
is 0.1 x 0.1 in 77 x 0. The LI hardware algorithm uses some minimal TT energy and isolation quantities 
to define a possible LI calorimeter candidate. A pointing to the found candidate 77 x 0 position is sent 
as a seed for software trigger processing. This seed is used to open a region (usually defined in terms of 
TT coordinates) called Region of Interest (Rol), where the full detector granularity ca be accessed by the 
reconstruction algorithms. 

2 Data preparation 

From a general point of view the preparation of the LAr and Tile Calorimeters data is similar. Figure 1 
depicts the global scope of the data preparation for the L2 calorimeter algorithm as described below. The 
EF structure is similar and is discussed briefly later in the note (see Ref. [7] for additional details of the 
full HLT data preparation). 

The L2 software component called Steering receives the LI information on the acceptance of an 
event along with the tj and 0 coordinates corresponding to the LI triggered object. The reconstruction 
algorithm gathers a list of ROB identifiers which contain data for a given Rol. Each ROB may partially 
contain data from TTs not pertaining to the Rol (ROB data access is not usually defined by the Rol, but 
rather by the hardware cabling). An optimal way to map cells to the towers and to the addresses of the 
ROB must be provided. 

The mapping of the ROBs and TT are part of the geometry description and are also used in the offline 
reconstruction software framework. The access to the offline detector description databases which maps 
any physical position into a set of identifiers is typically very slow, as the full detector description is 
comprised of a great amount of data. In order to provide faster access, compatible with the L2 speed 
requirements, a look-up table called the Region Selector is prepared in the initialization phase of the 
trigger software. 

The ROB identifiers are translated to network addresses of the ROB machines and the data are sub- 
sequently requested. The processing of the trigger selection algorithms is inhibited while the network 
acquires the data. Operating in a multiprocessing environment, as forseen for the ATLAS trigger, reduces 
this dead time [5, 8]. The ROB data provider receives the list of ROB identifiers and returns the detector 
data to the algorithms. 

When data are received, pointers to the beginning of the different fragments are passed to the 
detector-specific bytestream conversion code. Bytestream conversion is the decoding of the data format 
produced by the detector RODs, and packaging of the data into an accessible format for the algorithms, 
in this case calorimeter cells. The last part of the data preparation is to provide the cells in a manner 
organized for the reconstruction algorithms. For instance, cells are provided by detector layer. 
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2.1 Data processing in the Read-Out Drivers 

The LAr Digital Signal Processors (DSPs) are able to prepare data in different formats, the most im- 
portant one is termed "physics mode." In this mode the DSPs process the nominal 5 samples per cell 
provided by the front-end electronics. These samples are used to compute the energy deposited in the 
cell by the particles using an optimal filtering (OF) [9]. This processing is a simple weighted sum of the 
samples. The weights include the noise autocorrelation, electronics calibration constants, and a normal- 
ization factor that converts ADC counts to MeV. For cells with energy above a programmable threshold 
the timing of the signal and the quality of the pulse shape compared to the expectation are calculated. 
Finally, for each cell, the choice of electronic gain applied to the analog signal in the FEB is recorded. 

Beyond the cell-based data, the DSP can also extract global information at the FEB or TT level, 
which can be used to improve the L2 and EF processing speed. The DSP can sum up the energy in a 
given region in space providing E x , E y , and E z sums for these regions. The cell energies are added within 
these regions using cell-position-based projection coefficients loaded in the DSP from a database. Zero 
suppression is applied for cells below a given threshold. FEB's or TT's can be used to reconstruct jets 
or missing Ej at L2 and EF if unpacking the full detector is too time consuming. Currently FEB's are 
being used to provide the energy sums, however, using the TT information instead is under evaluation to 
improve jet and missing Ej resolutions. 

The pulses from the Tile Calorimeter photomultipliers are also sampled and digitized by 10-bit 
ADCs. During a physics data taking, 7 samples (175ns) of the signal pulses are acquired and trans- 
mitted to the RODs. The information is processed using DSPs which also apply optimal filtering for cell 
energy reconstruction [10]. 

2.2 Region selector 

As mentioned earlier, part of the information is stored in lookup tables for fast access to the detector 
description. In the LAr calorimeter case, the information unit to be correlated to the LI position is the 
Trigger Tower. The r\ x 0 minimum and maximum and the ROD identifier for each TT is arranged 
in a large matrix. Multiple tables corresponding to the different calorimeter layers are available. For 
the transition region from barrel to endcap calorimeter the data of the fiducial volume covered by a TT 
may be provided by more than one ROD. In the Tile Calorimeter case, the geometry information is 
associated with the calorimeter module identifier and, again, the ROB identifiers. The Look-Up tables 
with geometry information for LAr and Tile are prepared by accessing the relevant conditions database. 

2.3 Data containers 

The data structure of a calorimeter cell includes a part common to LAr and Tile, and parts specific to 
both subdetectors. In the software these cells are organized in vectors, called collections. For the Liquid 
Argon Calorimeter each cell collection holds data for a LAr ROD, corresponding to two FEBs or, at most 
256 cells. In the case of the Tile cell collection, there are either 23 cells (in the Barrel) or 13 cells (in 
the Extended Barrel) per collection. Data for four cell collections are associated with a single ROD. A 
Tile Calorimeter ROD has data for at most 92 Tile cells. Finally, the collections are organized in a vector 
which is called a container. 

The containers for LAr and Tile are stored permanently in memory and the cells and collections 
are never deleted. This way, on-the-fly memory allocation, which is typically a slow operation in a 
computing system, is avoided. One problem with reusing collections is that the container must keep 
track of which collections have already been decoded in a given event. This information is provided by 
the tools that access the container. If requested subsequently in the same event, the collection will not be 
decoded again. 
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2.4 Bytestream conversion 

The ROD fragments, containing the energy encoded information are provided to the appropriate HLT 
bytestream conversion code. Based on the ROD fragment identifier, the corresponding cell collection is 
located by the proper container (LAr or Tile containers). Subsequently, subdetector specific code is used 
to perform the data unpacking. 

The LAr bytestream conversion code automatically identifies the fragment type using the ROD ver- 
sion encoded in the bytestream itself. Depending on the detected format, the corresponding internal 
infrastructure is selected. 

The bytestream conversion software unpacks the energy information using the DSP physics output 
format as described in Section 2.1. The conversion provides the cell energy, hardware gain, pulse peak 
time, and pulse-fit quality information (if available) for each of the ROD fragment channels. The channel 
number is used as an index to the cell position in the cell collection, so that each LAr channel is associated 
to a single predefined cell object in the collection. Each cell is updated with the current values of energy, 
time, quality and hardware gain. In the unpacking step, typically more cells are requested than those 
contained in the Rol as data from one FEB may extend over several TTs. Furthermore, the trigger 
reconstruction algorithms require data access on a layer-by-layer basis. This results in a very complex 
operation with many checks of cell layer and position. Maps between TT identifiers and the associated 
groups of cells are prepared prior to algorithm execution, to speed up the process. Using the TT identifier 
list obtained from the Region Selector, a chain of cells for those TTs can be obtained, simplifying the 
algorithm code. 

The bytestream conversion software for the Tile calorimeter data also checks the ROD format, ensur- 
ing that the correct method of unpacking the data is chosen. The data is decoded and the energy values 
are stored in a pre-allocated raw data structure. This is again used to avoid online memory allocation. The 
energy, time, and quality are stored together with the ADC identifier for each cell in a Tile Calorimeter 
drawer. This raw data is copied into the cell structure. The mapping of raw data to cells is the same for 
every drawer in a given calorimeter sector. To speed up the processing a mapping is built to the indices 
of the cells that correspond to each raw data. 

Each Tile drawer is unpacked into a cell collection. The data providing in this case is much simpler 
than in the LAr case. Algorithms are able to iterate through the whole collection after the unpacking is 
done. 

2.5 Data preparation in the EF 

The Data Preparation tools in the EF make use of the same data unpacking approach that is used by L2, 
dumping this information into an offline cell container. As the EF has a larger time budget, however, 
more sophisticated algorithms and tools developed for offline reconstruction are used to process the cells 
stored in this container. 

For each of the subdetectors (EM, HEC, FCal and Tile), the EF cell container is filled. This provides 
the possibility of unpacking only selected calorimeter sections, as needed. Once the container has been 
filled with the corresponding calorimeter cells, a set of software tools are executed to organize and check 
the container, and to perform cell-based calibrations. 

3 Algorithm performance 

In this section, the performance of representative HLT algorithms (e/y for L2 and missing Ej for EF) 
and data preparation is studied. The results are based on bytestream files prepared with a format similar 
to the ATLAS raw output data. 
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Figure 2: Number of cells used in the L2 e/y selection algorithm as a function of tj for the LAr (EM, EM 
endcaps and HEC) Calorimeters (left) and for the Tile Calorimeter (right). The Rol size was 0.4 x 0.4 in 
tj xf 




Figure 3: Cumulative time spent in the different phases (Region Selector, bytestream conversion, and 
algorithm) of the L2 e/y selection as a function of tj for the electromagnetic part (left) and for the 
hadronic part (right) for an Rol size of 0.4 x 0.4 in tj x 0 (a 2.3 GHz machine was used to perform these 
measurements). 



The primary performance issue is processing time. The processing time depends on the number of 
cells required, which is a function of tj. Figure 2 shows the number of cells separately for LAr (left) 
and Tile (right) calorimeters; the overlapping bins in the figure are due to variable 0 segmentation in 
transition regions between different calorimeter modules. The distribution on the left shows that the 
number of active cells in the barrel is quite uniform; since the endcap granularity is smaller, the number 
of unpacked cells decreases with increasing tj. The distribution of the number of cells unpacked for the 
Tile Calorimeter depends on the number of drawers to be unpacked. In the very central region (| tj | < 0.4), 
data from negative and positive rapidities must be accessed to complete the Rol, doubling the amount of 
data to be unpacked. A similar effect is observed in the region between the TileCal Barrel and Extended 
Barrel. 

For the standard L2 e/y selection based on a Rol size of 0.4 x 0.4 in tj x 0, the execution time for each 
of the processing steps was measured. The results, based on a sample of about 15,000 single electron 
events are shown in Fig. 3 separately for the EM (left: LAr) and for the hadronic (right: Tile and HEC) 
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Reco. step 


Region Selector 


bytestream conversion 


Algorithm 


Total 


EM 2 nd layer 


29jUs 


169jus 


146^s 


347ms 


EM I st layer 


13jus 


171ms 


113ms 


301ms 


EM other layers 


21jus 


158jUs 


56ms 


243ms 


Hadronic 


46;Us 


334ms 


43ms 


438ms 


Total 


109 Ais (8%) 


833ms (63%) 


358ms (27%) 


1.33 ms 



Table 1: Processing time for different algorithm steps and for different actions. Improvements for the 
Tile calorimeter data preparation are envisaged. Time measurement excludes ROB data retrieval time (a 
2.3 GHz machine was used). 

sections. It is important to stress that the processing time per Rol does not depend on event type, since 
the cluster sizes are constant for a given Rol. It can be seen that the Region Selector comprises a small 
portion of the total time, and that the bytestream conversion is the dominant source of time, with the 
algorithm itself taking only about 35% (10%) of the total time for the EM (Hadronic) calorimeters. Even 
though fewer cells are used in the EM calorimeter crack region (around 17 = 1.5 as shown in Fig. 2), 
these cells are distributed in two ROBs (one from the Barrel and another from the EM endcap), resulting 
in an overall increase in the processing time. Finally, we note that the conversion times are especially 
large in the regions covered by the Tile calorimeter and in proportion to the number of Tile calorimeter 
modules accessed. Work is ongoing to reduce these large processing times. The timing results averaged 
over r\ are also summarized in Table 1 . ROB data retrieval times are not included, since they can only be 
evaluated during real data taking. 

These time measurements indicate that the preparation of the Tile calorimeter data needs to be im- 
proved. Even though about six times fewer cells are accessed for the barrel region, the data preparation 
time to run the hadronic part is comparable to the EM part. 

Other trigger algorithms such as tau or jet identification need larger Rol sizes and consequently 
require more time. As an example, a jet algorithm which uses a 1.0 x 1.0 Rol size takes 10 to 12 ms. 

3.1 EF missing Ej performance 

The EF missing Ej reconstruction algorithm accesses data from all the calorimeters and computes the 
missing Ej with its E x , E y components as well as the total scalar energy sum. In addition, corrections 
due to energy deposits from muons can be taken into account by including the results from the EF muon 
reconstruction. 

To access the calorimeter data the algorithm uses the same data preparation layer used by L2. Since 
the ATLAS calorimeters contains about 200,000 cells, the access to every single cell can become too 
time consuming at the trigger level. A faster option is to use energy sums at the FEB level (discussed 
earlier in Section 2.1). FEB unpacking has only been implemented for LAr data, where the impact on 
the unpacking time is most significant. 

In Fig. 4 the processing time of the EF missing Ej algorithm is shown for full unpacking and FEB- 
based unpacking. On average, the time to process the whole calorimeter is dramatically reduced from 
57 ms for the cell method to 2.4 ms for the FEB method. 

The total scalar sum and the Missing E T calculations were performed using the two unpacking meth- 
ods. The missing Ej calculation does not depend on the method, while the scalar sum is systematically 
reduced in the FEB calculation due to the effect of the zero suppression. However, due to the drastic 
improvement in speed, the FEB algorithm is a valid option for the missing Ej reconstruction. 

In addition to the timing studies detailed here, a thorough study of the memory usage and initial- 



589 



Trigger - Data Preparation for the High-Level Trigger Calorimeter . 



I" 3 












T 


-- FEB-based method 












1 r 




i 



Figure 4: Processing time of the EF missing Ej algorithm. The timing distributions for the cell and FEB 
method are shown. A 2GHz machine was used for this test. 



ization time was performed. A substantial fraction of the initialization time is taken by the detector 
geometry preparation, including the filling of the cell coordinates and the Region Selector tables. This 
initialization step requires access to databases containing information on detector conditions, and pos- 
sibly files with complementary information. It was determined that the initialization time is acceptable 
and does not inhibit the running of any of the desired algorithms. Furthermore, the memory usage of the 
algorithms was measured to be stable and within acceptable operating limits. 

4 Data preparation summary 

This note describes the implementation of the whole data preparation step for the HLT calorimeter trigger 
from the detector electronics up to the reconstruction level. It is fundamental that a data preparation layer 
is efficient and fast, leaving time for the real physics algorithms. The High-Level Trigger Calorimeter 
tools described here have been used extensively with simulated data to commission the ATLAS trigger. A 
unique interface provides access to detector physics quantities (calorimeter cells) obtained with complex 
computations from the readout data. Knowledge of the detector details is, of course, a fundamental input 
into optimizing the strategy to be followed in this unpacking procedure. The critical performance issue 
for calorimeter data preparation is that it be accomplished within the online time budget, and this goal has 
been achieved with the current system. Even for special algorithms, like the missing Ej which process 
cells from the whole detector, the data preparation performance is still within the required processing 
interval restrictions. Whenever FEB summary information can be used, significant timing reductions can 
be achieved. Further optimization studies are still in progress. 

In addition to studies with simulated data, the tools and algorithms discussed here have been applied 
to commissioning runs of the ATLAS detector using cosmic rays. Until the LHC begins taking data, this 
is the only exercise that can approximate the real trigger usage in LHC conditions. Many trigger objects, 
such as taus, jets, and missing Ej are being successfully debugged in this manner, providing important 
feedback to the algorithm developers. 
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Abstract 

The selection of events with handronically decaying tau leptons is challenging 
due to high background rates at the LHC. On the other hand, efficient selection 
of events with tau leptons increases the discovery potential of ATLAS in many 
physics channels, notably Standard Model or Supersymmetric (SUSY) Higgs 
boson production. In this note we describe the ATLAS tau trigger system, 
focusing on the early data taking period, and present results from studies based 
on simulated events, including trigger rates and the acceptance of tau leptons 
from W and Z boson decays, Higgs Boson decays, and SUSY processes. In 
order to cope with the rate and optimize the efficiency of important physics 
channels, the results of the current simulation studies indicate that ATLAS tau 
triggers should include either relatively high transverse momentum single tau 
signatures, or low transverse momentum tau signatures in combination with 
other signatures, such as missing transverse energy, leptons, or jets. 

1 Introduction 

Tau triggers are designed to select hadronic decays of tau leptons, which mainly consist of one or three 
charged pions accompanied with a neutrino and possibly neutral pions. Leptonic tau decays are typically 
selected by electron [1] or muon [2] triggers. Tau triggers are an important part of the ATLAS trigger 
system, a fundamental component of the ATLAS detector [3]. 

ATLAS will collect data at different luminosities, starting from 10 31 cm~ 2 s _1 . At the lowest lumi- 
nosity the focus of tau triggers is to collect samples that are useful for understanding the detector and 
the tau reconstruction software. Tau signatures combined with missing transverse energy signatures are 
essential to provide data samples enriched in TV events, which provide an important sample of real 
taus needed to refine tau identification algorithms. Additionally, single tau triggers with large prescale 
factors will provide samples for tau fake rates studies. 

At higher luminosities, tau triggers will cope with the event rate increase by using higher Ej thresh- 
old requirements, more restrictive identification requirements, or demanding a combination of different 
signatures, such as missing transverse energy, jets, or leptons. At high luminosity, tau triggers will be 
essential to enable the collection of data samples for searches based on single tau lepton final states, like 
Minimal Supersymmetric Standard Model (MSSM) — > TV [4] decays. They will also be used for 
final states with more than one tau lepton, like SM Higgs boson [5], MSSM neutral Higgs boson [6], or 
Z' boson [7] decays. 

The results presented in this paper are based on events fully simulated with GEANT4. The text is 
organized as follows. An overview of the three-level trigger selection for single tau triggers is presented 
in Section 2, while the performance in terms of resolution, rates, and efficiencies is described in Section 3. 
Timing studies of tau trigger algorithms are presented in Section 4. The current tau trigger menu, which 
includes various single and combined tau triggers, is described in Section 5, with a focus on the early 
data taking period corresponding to a luminosity of 10 31 cm~ 2 s _1 . Finally, a summary is presented in 
Section 6. 
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2 Overview of single tau trigger selection 

2.1 Level 1 selection 

The LI tau trigger selection is closely related to the LI electron/photon trigger (e/y), and is fully doc- 
umented in [8]. It is a hardware trigger based on electromagnetic (e.m.) and hadronic calorimeter 
information, and uses trigger towers of approximate size A 17 x A0 = 0.1 x 0.1, with a coverage up to 1 77 1 
< 2.5 (given by the inner-detector coverage and the high-granularity e.m. calorimetry). 

The algorithm considers a rectangular Region of Interest (Rol) of 4 x 4 towers (0.4 x 0.4 in r\ x 0) in 
both the e.m. and hadronic calorimeters, and makes use of different elements, each formed by summing 
Ej over a group of towers. The algorithm uses the following quantities: 

• the central 2x2 core cluster is the energy measured in the central 2x2 e.m. and hadronic towers 

• the TauCluster is the energy defined by the two most energetic neighboring central towers in the 
e.m. calorimeter plus the central 2x2 towers of the hadronic calorimeter 

• Emlsol is the energy in the e.m. isolation ring (the region between 2x2 and 4x4 towers in the e.m. 
calorimeter). 

• Hadlsol is the energy in the hadronic isolation ring (region between 2x2 and 4x4 towers in the 
hadronic calorimeter). 

The LI tau trigger candidate is accepted if the core cluster is a local Ej maximum and also satisfies 
additional conditions on TauCluster, Emlsol and Hadlsol [8]. Its position is taken as the center of the 
4x4 tower Rol, and its energy is calibrated using a procedure derived for jets (see Section 2.1.1). A 
maximum of eight trigger thresholds are available at LI for taus. Each threshold is a combination of 
requirements of Ej thresholds for TauCluster, Emlsol and Hadlsol. A LI tau trigger candidate passing 
the requirements is then passed to L2 for further examination. 

2.1.1 Level 1 Calorimeter Calibration 

Trigger towers are formed by the analogue summation of calorimeter cells. Calibration of the trigger 
consists of adjusting the overall gains of the towers. The e.m. towers are calibrated to optimize the e.m. 
trigger response. The gains of the hadronic towers are adjusted to provide a uniform jet response in T7 
using a jet sample with an Ej range 50-100GeV and making ^-dependent adjustments to the trigger 
thresholds. 

2.2 Level 2 selection 

The L2 tau trigger selection uses the full calorimeter granularity and the inclusion of tracking information 
from the Inner Detector to refine the LI selection. The selection is designed to further reject QCD jet 
backgrounds by exploiting more of the characteristics of a hadronic tau decay, such as collimation and 
low track multiplicity. 

2.2.1 Level 2 calorimeter selection 

At this stage, the selection of the single tau triggers is based on the calorimeter shower shape variables 
and the energy of a reconstructed cluster within the Rol to enrich the sample of tau candidates. Shape 
variables are calculated using only the second e.m. sampling layer (out of four). The second sampling 
is where most of the e.m. energy is deposited, therefore it provides the most information about the 
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e.m. shower shape. The cluster energy is calculated with all available e.m. and hadronic layers. Three 
different rectangular windows are defined, centered on a seed cluster, with areas of r\ x <p = 0. 1 x 0. 1 
(narrow, Nar), 0.2 x 0.2 (wide, Wid), and 0.3 x 0.3 (normal, Nor) l . The Nor window is equivalent to the 
Rol used at L2. 

The algorithm consists of several steps. First, a seed cluster is found, using the second e.m. sampling. 
The algorithm unpacks cells in the Nor window centered around the LI Rol position, and finds the cell 
with the highest energy deposition. In a Wid window around the most energetic cell, the cluster position 
is defined as the energy weighted mean position of the cells. Then, shape variables are reconstructed 
using the different windows around the seed, as described in the following. At the same time, the total 
energy in all calorimeter samplings is computed. Finally, the total energy is corrected with a simple 
sampling calibration. 

The variables used by the L2 Calorimeter selection are the following: 

• EMRadius is the energy weighted squared radius of the seed, which is obtained from the sum of the 
individual energy weighted squared cell distances from the seed. It is calculated in a Nor window 
around the seed, in the second sampling of the e.m. calorimeter, i.e. 



Y,E cel rR 2 ce 

_ Nor 

£ E cell 



> IsoFrac is the difference in energy between the Nar and Wid window, normalized to the Wid 
window. It is calculated in the second sampling of the e.m. calorimeter. The definition is 

$2 E cell - EceU 



< StripWidth is the width of the energy deposition, defined as the energy weighted standard devia- 
tion in r\. It is calculated in a Nor window around the seed, in the second sampling of the e.m. 
calorimeter. The formula is 



stripWidth = 



12 ^cell ' E ce, 



J2 E cell 



y. Tjcell ' E cei 



J2 E cell 



• EtCalib is the calibrated total transverse energy, calculated in the e.m. and hadronic calorimeters, 
in a Nor region around the seed. 

The distributions of EMRadius, IsoFrac and stripWidth are shown in Fig. 1 for tau trigger candidates 
from generated tau leptons decaying hadronically and for QCD jets for two different Ej regions. In 
the top row distributions for low Ej tau leptons from W — ► TV decays are compared to background 
QCD jets distributions, while in the bottom row high Ej tau leptons from Supersymmetric neutral Higgs 
decays (with a mass of 800 GeV) are compared to QCD background. The top row shows the difficulty 
in triggering on low Ej taus as the separation between signal and background is not so dramatic; the 
separation is much better for high Ej taus. 

'The window sizes are currently subject of optimization studies. 
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Figure 1 : L2 Calorimeter variables to distinguish QCD jets from low Ej taus from W — > TV decays (top) and from high Ej 
taus from Supersymmetric Higgs A — > tt decays (bottom). 



2.2.2 Level 2 calorimeter calibration 

A simple Sampling calibration is used in the L2 calorimeter algorithm for reconstruction of the tau en- 
ergy. This method is fast (« 7^s) because it applies a global weight to the energy of each sampling 
to compute a total calibrated energy, as opposed to other slower algorithms, such as the offline calibra- 
tion [9], which use a cell-by-cell correction. In the current implementation only two layers are considered 
(e.m. and hadronic compartments) and the weights are energy and r\ dependent. 

2.2.3 Level 2 tracking selection 

The standard L2 track reconstruction [10] is used in the L2 tau triggers.To keep the L2 execution time 
within budget, data from the Transition Radiation Tracker is not used. 

Tracks with pj > 1.5GeV are reconstructed in a rectangular Rol of size tj x 0 = 0.6 x 0.6 centered 
on the L2 Calorimeter seed. The output of the L2 Tracking algorithm is a list of tracks found in the Rol. 
Within this region, two selection cones are defined, corresponding to a distance AR = \J (At]) 2 + (A0) 2 
of 0.15 (Core) and 0.3 (Nor) with respect to the direction of the highest pj track found in the Rol. An 
isolation ring (Iso) with AR between 0.15 and 0.3 is also defined. 

The following selection variables are then calculated from the track list: 

• P t leading is the pj of the track with the highest pj. By requiring a minimum pj this criterion 
also effectively requires that at least one track is found in the Rol. 

• P t Iso/Core is the ratio of the scalar sum of pj$ of all tracks in the Core and Iso region £ pf ve 

• N Slow tracks is the number of slow tracks found in the Core region. A slow track is defined as a 
track with pj below a certain threshold, typically 7.5GeV/c. The rejection power of this variable 
might depend on the pile-up conditions. 



595 



Trigger - Tau Trigger: Performance and Menus for Early Running 



• Charge is defined as the absolute value of the sum of charges of all tracks found in the Nor region. 

• N Tracks is the total number of tracks found in the Nor region. 

The L2 tracking selection places requirements on these five variables. The set of requirements can be 
different for different single tau signatures. The three most important criteria are requiring that at least 
one track be found with a minimum Pt leading, an isolation cut on the maximum amount of energy de- 
posited in the Iso region, and a cut on the maximum number of slow tracks. The requirements on charge 
and total tracks are very loose, due to the higher number of fake tracks found by the L2 tracking algo- 
rithm compared to the more sophisticated offline reconstruction. In addition, the offline reconstruction 
of single tau lepton final states (e.g. from W — > TV) relies on an unbiased track distribution to estimate 
backgrounds and extract the number of signal events. The distributions of Pt leading, N Slow tracks and 
Pt Iso/Core are shown in Fig. 2 for tau trigger candidates from generated tau leptons decaying hadron- 
ically and for QCD jets for two different Ej regions. QCD jet background is compared to low Ej tau 
leptons from W — > TV decays (top row), and high Ej tau leptons from Supersymmetric neutral Higgs de- 
cays (bottom row). Tau trigger candidates are required to pass different LI and L2 Calorimeter selection 
criteria based on their Ej. A L2 tau trigger candidate passing the calorimeter and tracking requirements 
is passed to the EF for further consideration. 




Figure 2: L2 Tracking variables to distinguish QCD background from low Ej taus from W — > TV decays (top) and from high 
Ej taus from Supersymmetric Higgs A — > TT decays (bottom). 



2.3 Event Filter selection 

At the EF level, the selection follows the offline reconstruction procedure as closely as possible. The EF 
calorimeter algorithm collects cells in a rectangular Rol of size r\ x (j) = 0.6 x 0.6 centered around the 
L2 tau trigger candidate. The EF tracking reconstruction is described in [10]. Tracking is performed in a 
rectangular Rol of size r\ x (j) = 0.4 x 0.4 centered around the L2 tau trigger candidate. 
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To create an EF tau trigger candidate, the EF single tau triggers execute the calorimeter based iden- 
tification algorithm of the offline reconstruction [11] in the following way. The cells collected in the 
Rol are used to reconstruct the direction of the EF tau trigger candidate. Additionally, some very loose 
criteria are applied to tracks reconstructed in the Rol, and if more than one track is found a secondary 
vertex reconstruction is attempted. The transverse energy and the calorimeter shower shape variables 
are built from the cells collected in the Rol using the newly reconstructed direction. Unlike the offline 
reconstruction algorithm, no noise suppression is applied at the EF. The addition of electronic noise in 
the calorimeter cells gives a generally small change in the calorimeter shower shape variables. Finally, 
an overall hadronic calibration [9] is applied to all cells, and a tau specific jet calibration is applied to 
the tau trigger candidate. This final calibration is derived from simulated samples of high and low pj 
tau lepton sources. A procedure using taus from Z bosons decays and exploiting the correlation between 
the tau absolute energy scale and the visible reconstructed Z boson mass will be applied to determine 
calibration constants from real data [11]. 

The following variables are used for the EF selection: 

• EMRadius is an energy weighted radius, which exploits the small transverse shower profile of tau 
leptons. The mathematical expression used is 

^E TjCe u ■ AR ce u 

EMRadius = -!— = . (4) 

Nor 

where the sum includes not only the second sampling layer of the e.m. calorimeter (as in L2) but 
also all other samplings except the last one. 

• IsoFrac is the energy deposited in an annular region between 0. 1 < AR < 0.2 divided by the energy 
deposited in AR < 0.3. A similar expression is given in Eq. 2, where the square regions should be 
replaced by the annuli. 

• ./V Tracks is the number of associated tracks with p r > 2GeV found in a AR < 0.2 region around 
the trigger candidate from L2. 

• Pt leading track is the highest p T track among the tracks found in a AR < 0.2 region around the 
trigger candidate from L2. 

• EtCalib is the energy calculated in all e.m. and hadronic cells found in the AR < 0.3 region around 
the trigger candidate from L2, and calibrated with the procedure described in [9] and an additional 
tau specific jet calibration. 

Some of the variables used in the EF selection, namely EtCalib, IsoFrac and EMRadius, are shown 
in Fig. 3 for tau trigger candidates from generated tau leptons decaying hadronically and for QCD jets for 
two different E T regions. As for previous figures, QCD background is compared to standard tau signal 
samples. Events with a tau candidate that pass the EF thus pass the trigger and are recorded for offline 
analysis. 

3 Performance of single tau triggers 
3.1 Performance overview 

This section describes in detail the expected trigger efficiencies and background event rates obtained from 
the various single tau triggers available in the ATLAS trigger menu, while combined triggers including 
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Figure 3: EF variables to distinguish QCD background from low Ej taus from W — > TV decays (top) and from high Ej taus 
from Supersymmetric Higgs A — > tt decays (bottom). 
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other types of signatures is given in Section 5. In addition, the optimization procedure applied to the 
selection at each level is introduced. The goal is to obtain at least 90% efficiency at each trigger level 
with sufficient rejection of background. 

The various types of samples of GEANT4 simulated events used in this note together with the total 
number of events and the cross section are summarized in Table 1 . It is appropriate to note at this point 
that the rates reported in this note are clearly affected by large uncertainties, because of the limited 
statistics available and because the simulation has yet to be validated with real data. 



Sample 


Events 


a(pb) 


Min.Bias 


2900k 


7.0E+10 


QCDdijet 8< p T < 17GeV 


742 k 


1.74E+10 


QCDdijet 17 < p T < 35GeV 


395 k 


1.38E+9 


QCD dijet 35 < p T < 70GeV 


824 k 


9.33E+7 


QCD dijet 70 < p T < 140GeV 


272 k 


5.88E+6 


W T ^ hx (filtered p T > 12GeV) 


24000 


5.54nb 




23000 


0.246 nb 


H^ ehx (\20) 


22924 


0.145 


H^ hhx (\20) 


40950 


0.073 


A„(800) 


26250 


10 


SU3 


22250 


18.59 


tt 


41700 


800 



Table 1 : GEANT4 simulated data samples used in the note. 



3.1.1 Naming convention 

In the ATLAS trigger menu, different single tau triggers are implemented, corresponding to different Ej 
threshold requirements 2 : taulOi, taul5i, tau20i, tau25i, tau35i, tau45i, tau60. For taulOi and 
taul5i the isolation criteria are only applied at the L2 and EF level. For some E T thresholds, additional 
triggers are defined using looser isolation criteria (e.g. taulO). Hadronic isolation at LI is not applied 
in any selection. Due to the Ej resolution at the three trigger levels, the actual cut applied on Ej at each 
stage is generally lower than the nominal Ej to maintain high efficiency. 

3.1.2 Trigger efficiency definition 

The tau trigger efficiency is optimized with respect to generated tau leptons decaying hadronically, where 
the visible tau momentum (p® k = p® — p% ) is in the sensitive region of the detector (|tj| < 2.5) and is 
greater than the nominal Ej threshold requirement for a given signature. Furthermore, the efficiency is 
optimized to select those tau leptons which are likely to be selected by the tau identification algorithms of 
the offline reconstruction software. This constitutes what we subsequently call the tau trigger reference. 

More specifically, a geometrical match of a trigger candidate to a generated tau lepton within a cone 
region AR < 0.2 is requested. Generated tau leptons are considered for calculating signal efficiencies 
only if the tau lepton is also selected by either of the two tau offline reconstruction algorithms [11]: 
the calorimeter based or the track based algorithm. It should be noted here, that detailed optimization 
with respect to 1 -prong (1 charged pion) and 3 -prong (3 charged pions) tau lepton decays has not been 

2 The first symbol of the signature represents the particle type, the following number is the Ej threshold and the "i" indicates 
that an isolation requirement is applied. 
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performed for this study. This results in different efficiencies for these classes of events, as pointed out 
in the following sections. Total trigger efficiencies shown in this section are estimated on various physics 
processes which cover different kinematic ranges. The samples used are therefore specified in the text. 

3.2 Energy and angular resolution 

The relative resolutions (in %) on Ej, J], and 0 as a function of visible Ej and tj for tau trigger candidates 
at the different trigger levels are shown in Fig. 4 for tau20i. The visible four-momentum of the tau 
lepton is reconstructed from all decay products except neutrinos. The resolutions are rather flat as a 
function of the generated visible Ej, however, some dependence on the distributions as a function of the 
generated visible tj is observed. In particular a degraded resolution in Ej and tj in the transition region 
between the barrel and endcap calorimeters is clearly visible. Due to the inclusion of track information, 
the angular resolution at L2 is better than that of LI. The tj and 0 of the tau trigger candidate at the 
end of the EF algorithm execution is set according to the calorimeter based tau reconstruction algorithm, 
hence the EF angular resolution is slightly worse than at L2 where tracking information is used. 




Figure 4: Relative angular and energy resolution for T candidates as a function of generated visible Ej and r/. Resolution (in 
%) is calculated using a variety of simulated tau lepton samples passing the tau20i trigger. 



3.3 LI performance studies 

The standard reconstruction of the visible energy of LI tau candidates {TauCluster) reveals a serious 
limitation when trying to obtain a high efficiency above a given Ej threshold at LI. A systematic shift 
of about 30% and a relative resolution of 33% are observed for the reconstructed energy with respect to 
the generated values. Therefore other possible reconstruction methods have been explored. The energy 
reconstructed with the standard algorithm (TauCluster) has been compared with other reconstruction 
methods: (1) E4x4, defined as the sum over all towers (e.m. and hadronic part) in the 4 x 4 Rol region, 
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(2) Jet4 x 4, obtained as the total energy in the 4 x 4 Rol region calculated with the LI Jet algorithm 
(which is described in Ref. [8]), and (3) Jet6x6. An improvement of 10% in resolution can be achieved 
with E4 x 4 and Jet4 x 4 algorithms with respect to TauCluster. However, a comparison of the efficiencies 
of the different methods, shown in Fig. 5, indicates that the TauCluster algorithm efficiency is system- 
atically the highest. While these other algorithms that more fully contain the Ej of the tau lepton give 
better resolution than TauCluster, this gain in resolution is accompanied by a loss in efficiency and dis- 
crimination, motivating the continued use of TauCluster as the default tau algorithm. 




Offline Et (GeV) 

Figure 5: Efficiency curves as a function of the offline Ej for different LI reconstruction methods; threshold requirements 
are chosen to give the same rate among the different cases. 

Using TauCluster, the rates for a luminosity of 10 31 cm _2 s _1 are evaluated by using simulations 
of the relevant QCD backgrounds (see Table 2). The expected rate including all physical processes 
(minimum bias rate) is shown as a function of the cut value applied to TauCluster in Fig. 6. The effect 
of the isolation cut can also be seen in Fig. 6. 

The LI selection used in tau trigger implementation is presented in Table 2; the cuts have been tuned 
to maintain a high efficiency, and the signatures are all observed to have an efficiency > 95% 3 . The 
efficiency has been obtained using simulated samples of W — ► TV and 800 GeV Supersymmetric Higgs 
A —>■ xx events, while background is evaluated using QCD events with two jets with hard parton 8 < 
Pt < 140 GeV. In Fig. 7, the efficiency is plotted as a function of the generated visible Et for different 
tau signatures. The rather slow increase of the efficiency curves limits the overall performance of tau 
triggers. 

3.4 L2 performance studies of the calorimeter selection 

At L2, the full granularity calorimeter and tracking information is available, allowing a more sophisti- 
cated selection of tau leptons. As shown in Section 3.2, the angular and energy resolutions are improved 
with respect to LI. 

The L2 calorimeter based selection of tau trigger candidates is optimized with respect to generated 
tau leptons that pass the LI selection and are successfully reconstructed by the offline reconstruction 
software (offline details are given in Section 3.1.2). The optimization is obtained using simulated samples 

3 The efficiency definition is described in Section 3.1.2 
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10 20 30 40 50 60 70 
L1 reconstructed tau energy (GeV) 



Figure 6: Expected rate including all physical processes at a luminosity of jSf = 10 31 cm 2 s ', as a function of the cut 
applied to TauCluster and with and without the isolation cut Emlsol. 



Signature 


E T threshold (GeV) 


Emlsol (GeV) 


Eff.(%) 


Rate (Hz) 


taulOi 


6 




97 


13945 


taul5i 


6 




98 


13945 


tau20i 


9 


6 


96 


4823 


tau25i 


11 


6 


96 


2822 


tau35i 


16 


6 


96 


990 


tau45i 


25 


6 


96 


263 


tau60 


40 




97 


97 



Table 2: Single tau signatures, associated LI thresholds, and efficiency and rates determined from signal and background 
samples as described in the text («^=10 31 cirr 2 s" 1 ). 



of W — > TV and 800 GeV Supersymmetric Higgs A — > TT events, and simulated background samples of 
QCD events with two jets with hard partem 8 < pj < 140 GeV 

The L2 variables based on calorimetry introduced in Section 2.2 are optimized in the following way: 

• Since some correlation among the calorimeter shower shape variables (EMRadius, isoFrac and 
stripWidth) is expected, the three variables are simultaneously optimized. The thresholds are de- 
termined by scanning the possible cut values for each variable and choosing the set of cuts giving 
the lowest background rate for a requested minimum signal efficiency. 

• The threshold for the variable EtCalib is determined by requiring that the efficiency becomes flat 
starting at the nominal E T of the trigger (see Fig. 8). 

The optimized L2 calorimeter requirements for tau signatures and the corresponding efficiencies 
and rates are presented in Table 3. The cut value on EtCalib increases for higher Ej signatures, as we 
aim to select higher Ej taus. The shower shape variables, a measure of the narrowness of the shower, 
are tighter for higher E T signatures, since higher E T taus are more boosted and their decay products 
are more collimated. The three shower shape variables requirements have been optimized to give an 
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Figure 7: LI efficiency curves for different tau signatures. 



efficiency of ps 95%, although the observed efficiency values are a little lower due to the additional 
EtCalib requirement. 

The efficiency is detailed further in Table 4 for two typical tau signatures, one with low and one with 
high Ej. For each signature, efficiency is recalculated using as a reference different tau lepton samples 
reconstructed offline. Either the sample from each tau algorithm is considered separately, or the sample 
reconstructed by either of them ("OR"), or by both simultaneously ("AND"), is considered. The errors 
quoted are statistical only. In order to achieve the needed rejection of QCD backgrounds, the calorimeter- 
based algorithm of the offline reconstruction biases the distribution of decays towards 1 -prong decays 
using calorimeter information. A similar rejection and efficiency is achieved in a different manner using 
the track-based algorithm, which requires a large Uansverse momentum for the leading track in the tau 
lepton decay. Therefore, the L2 calorimeter selection has a high and uniform efficiency for the sample 
reconstructed offline by the calorimeter-based algorithm, while it is lower on the sample reconstructed 
using the track-based algorithm in the 3-prong category. 



Signature 


EtCalib (GeV) 


EMRadius 


IsoFrac 


StripWidth 


Eff.(%) 


Rate(Hz) 


L1/L2 rate 


taulOi 


8.0 


0.023 


0.74 


0.058 


95 


9251 


1.51 


taul5i 


9.7 


0.022 


0.71 


0.057 


94 


5933 


2.35 


tau20i 


12.2 


0.019 


0.66 


0.055 


94 


3070 


1.57 


tau25i 


17.0 


0.016 


0.65 


0.051 


92 


1224 


2.31 


tau35i 


26.5 


0.015 


0.6 


0.048 


91 


351 


2.82 


tau45i 


33.5 


0.00807 


0.43 


0.0465 


93 


119 


2.21 


tau60 


44.9 


0.00345 


0.35 


0.04 


93 


47 


2.06 



Table 3: Single tau signatures, associated L2 calorimeter thresholds and efficiency on W — > TV sample and rates on QCD 
background samples for «5f=10 31 cm~ 2 s _1 . 
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tau20i 


tau60 


Offline Identification method 


all taus 


1 -prong 


3-prong 


all taus 


1 -prong 


3-prong 


Calo-based 


97.6±0.1 


97.5±0.1 


98.3±0.4 


97.8±0.1 


97.8±0.1 


96.8±0.5 


Track-based 


92.5±0.2 


95.2±0.2 


88.4±0.4 


94.9±0.2 


95.7±0.2 


91.7±0.5 


Calo- OR Track-based 


94.2±0.2 


96.3±0.2 


88.9±0.4 


96.6±0.1 


97.3±0.1 


92.8±0.4 


Calo- AND Track-based 


97.2±0.2 


97.0±0.2 


98.2±0.4 


96.6±0.2 


96.6±0.2 


96.1±0.9 



Table 4: Tau signatures and corresponding efficiencies (in %) for different reference samples of tau leptons identified by 
the offline reconstruction. Efficiencies have been estimated on low and high Ej generated samples of tau leptons. Errors are 
statistical only. Efficiencies are for L2 calorimeter selection only. 
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Figure 8: Efficiency curves of L2 selection relative to LI for different tau signatures. 



3.5 L2 performance studies of the tracking selection 

The addition of tracking information at L2 allows an improvement in the discrimination between hadronic 
decays of tau leptons and QCD jet backgrounds, although at the cost of a non-negligible loss of trigger 
reconstruction efficiency. This loss is most noticeable for low Ej tau signatures, mainly due to the re- 
quirement of one leading track above a fixed pj threshold. Overall, the L2 tracking algorithm is about 
95% efficient for isolated tracks with Ej above 5 GeV (for example from W —>■ tv). In W — ► TV events, 
however, only 90% of generated tau leptons that are identified by the offline reconstruction and pass the 
L2 calorimeter selection have at least one track with p^ > 5 GeV reconstructed at L2. 

The L2 tracking selection of tau trigger candidates is optimized with respect to generated tau leptons 
that pass the LI and L2 calorimeter selection and are found by the offline reconstruction software. Details 
of the offline reference and efficiency definition are described in Section 3.1.2. The optimization is 
obtained with the standard samples described previously. 

The efficiency and rejection power of some of the L2 tracking variables can be seen in Fig. 9, where 
the integrated efficiency for tau samples and background are shown as a function of the applied cut value. 
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For low Ej signatures, the minimum requirement on Pt leading along with the upper limit on Pt Iso/Core 
are the two most important criteria. For higher Ej signatures, the upper limit on N Slow tracks becomes 
more important, as background QCD jets tend to have a higher multiplicity of soft tracks compared to 
multi-track hadronic decays of tau leptons. The multiplicity requirement, N Track, is potentially a useful 
discriminant for higher Ej signatures, although a tight cut tends to bias the multiplicity distribution of 
the tau leptons found by offline reconstruction. To avoid such bias a very soft cut is applied. 

The optimized L2 tracking requirements for tau signatures and the corresponding efficiencies and 
rates are presented in Table 5. Aside from the Pt leading requirement, which is tightened with increasing 
Ej threshold, the requirements on Pt Iso/Core<0.\, N Slow tracks<2, Charge<2 and 1<N Track<l are 
kept constant at values which work reasonably well for all signatures, and therefore are not shown in the 
Table. 

The efficiency is detailed in Table 6 for two typical tau signatures, one with low and one with high 
Ej. For each signature, efficiency is recalculated using different reference samples based on offline 
reconstruction as in the previous section. Again, the errors quoted are statistical only. For the high Ej 
signatures, there is not a strong efficiency dependence on the particular reference sample chosen. For 
the low Ej signatures, however, the different selections used in the offline algorithms, particularly in 
selecting 1 -prong tau lepton decays, are apparent. Since the track-based algorithm also requires a track 
with some minimum pj when applied to these events the L2 tracking selection is quite efficient. The 
L2 tracking selection is considerably less efficient when applied to events selected with the calorimeter 
based algorithm, where no such requirement on the tracks is made. 

The overall L2 trigger efficiency for various single tau signatures, using the optimized selection 
described in Sect. 3.4- 3.5, is shown in Fig. 8. 



Signature 


Pt lead (GeV) 


Eff. (%) 


Rate (Hz) 


L2Calo/L2Track 


taulOi 


1.5 


86 


3980 


2.32 


taul5i 


2.5 


85 


2900 


2.04 


tau20i 


5.0 


81 


947 


3.24 


tau25i 


5.0 


74 


351 


3.48 


tau35i 


5.0 


75 


107 


3.28 


tau45i 


5.0 


80 


39 


2.01 


tau60 


5.0 


79 


15 


3.13 



Table 5: Single tau signatures, associated L2 tracking thresholds and efficiencies for W — > TV sample and rates for QCD 
background samples for Jzf=10 31 cirT 2 s -1 . 





tau20i 


tau60 


Offline Identification method 


all taus 


1 -prong 


3-prong 


all taus 


1 -prong 


3-prong 


Calo-based 


87.1±0.3 


86.6±0.3 


91.8±0.8 


97.5±0.1 


97.9±0.1 


90.3±0.9 


Track-based 


95.5±0.2 


96.5±0.2 


93.9±0.3 


95.8±0.2 


97.3±0.2 


89.3±0.6 


Calo- OR Track-based 


88.9±0.2 


87.5±0.3 


92.8±0.4 


96.5±0.1 


97.5±0.1 


89.3±0.5 


Calo- AND Track-based 


97.3±0.2 


97.4±0.2 


96.6±0.6 


97.9±0.1 


98.2±0.1 


92.4±1.3 



Table 6: Tau signature efficiencies (in %) for different reference samples of tau leptons identified by the offline reconstruction. 
Efficiencies have been estimated on low and high Ej generated samples of tau leptons. Errors are statistical only. Efficiencies 
are for L2 tracking only. 
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3.6 EF performance studies 

As described in Section 2.3, the tau identification algorithm developed for offline reconstruction is 
adapted for EF use. The resolution in Ej and direction achieved at the EF are shown in Section 3.2, 
and provide a clear improvement in Ej determination with respect to L2. 

The EF selection of tau trigger candidates is optimized with respect to generated tau leptons that 
pass the LI and L2 selection and are found by the offline reconstruction software. Details of the offline 
reference and efficiency definition are described in Section 3.1.2. The optimization is performed using 
the standard samples described previously. 

The cuts on calorimeter shape variables and Ej are optimized separately, in analogy to the L2 pro- 
cedure previously described. First, the correlation between the Ej of the tau lepton and the pj of the 
leading track associated with the tau lepton is studied in simulated samples, and then a combined re- 
quirement on these two variables is applied to each tau signature. Finally, the calorimeter shower shape 
variables are studied, and a further requirement is applied on these variables. 

In Table 7 the optimized threshold requirements and corresponding efficiencies and background 
rates are presented. As for L2, the shower shape variables are tightened and the cut value on EtCalib 
and pr lead are raised as a function of the Ej of the tau signature. The two shower shape variables 
requirements have been optimized to give an overall efficiency of « 90%. The requirement on nTracks 
is not stringent, so as not to bias the distribution for this variable, which is a key offline variable for the 
evaluation of tau lepton purity. Fig. 10 shows the efficiency for various single tau signatures, using the 
optimized selection summarized in Table 7. 

The efficiency is detailed in Table 8 as in previous sections. At EF clearly the efficiency is highest 
for the tau leptons identified by the calorimeter based algorithm of the offline reconstruction. 



g 1 

% 0.8 

CM 
_l 
LL 

W 0.6 

0.4 
0.2 
0 




■ ■ ■ 
20 40 



60 



ATLAS 



• tau10i 
a tau15i 
■ tau20i 
° tau25i 

tau35i 
□ tau45i 

tau60 



80 100 120 
True visible E T (GeV) 



Figure 10: Efficiency curves of EF selection relative to L2 for different tau signatures. 
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Signature 


EtCalib (GeV) 


p T lead. (GeV) 


EMRadius 


IsoFrac 


nTracks 


Eff.(%) 


Rate (Hz) 


taulOi 


10. 


5. 


0.13 


0.38 


1-8 


90 


1611 


taul5i 


14. 


6. 


0.13 


0.32 


1-8 


85 


897 


tau20i 


19. 


6. 


0.11 


0.33 


1-8 


86 


349 


tau25i 


22. 


7. 


0.1 


0.3 


1-8 


86 


175 


tau35i 


31. 


8. 


0.09 


0.24 


1-8 


87 


64 


tau45i 


36. 


8. 


0.08 


0.19 


1-8 


92 


24 


tau60 


51. 


8. 


0.09 


0.24 


1-8 


93 


8 



Table 7: Single tau signatures, associated EF thresholds and efficiencies for the W — > TV sample and rates for the QCD 
background sample for ,if=10 31 cm~ 2 s _1 . 





tau20i 


tau60 


Offline Identification method 


all taus 


1 -prong 


3-prong 


all taus 


1 -prong 


3-prong 


Calo-based 


90.9±0.3 


90.3±0.3 


95.5±0.6 


98.5±0.1 


98.5±0.1 


98.7±0.4 


Track-based 


84.7±0.3 


88.9±0.4 


77.4±0.6 


97.3±0.1 


97.7±0.1 


95.8±0.4 


Calo- OR Track-based 


85.0±0.3 


87.9±0.3 


77.9±0.6 


98.0±0.1 


98.2±0.1 


96.4±0.3 


Calo- AND Track-based 


93.7±0.3 


93.2±0.3 


97.1 ±0.5 


98.2±0.1 


98.2±0.1 


98.6±0.6 



Table 8 : Tau signature efficiencies (in %) for different reference samples of tau leptons identified by the offline reconstruction. 
Efficiencies have been estimated on low and high E T generated samples of tau leptons. Errors are statistical only. Efficiencies 
are for EF only. 



3.7 Combined performance of single tau triggers 

After evaluating each trigger level separately, we now present the overall combined performance (LI + 
L2 + EF) of the single tau triggers. Figures 11 and 12 show the dependence of the efficiency on E T . 
The shape of the efficiency curves is determined by the Ej resolution, which is different for the different 
trigger levels. The Ej threshold requirement of the signature is selected such that the efficiency curve 
reaches a plateau at the visible Ej of the generated tau lepton greater than this threshold. Since the 
efficiency turn-on is slowest at LI, the cut on Ej at LI is set significantly lower than for L2 and EF. 

The efficiency is detailed in Table 9 for two typical tau triggers as in the previous sections. In addition 
to the observations of the individual trigger level sections, one can see that the overall performance for 
the low Ej tau trigger is about 70% with respect to the reference, while at high Ej it is about 90%. This 
is due to relatively tighter cuts needed to control the large backgrounds present for the low Ej signature. 
One can also see that the efficiency at low E T is low in particular for 3-prong decays when the reference 
is the sample of tau leptons identified by the track-based algorithm. This reflects the previous observation 
of Section 3.4, and points to the need for the execution of the track-based algorithm at the EF as well as 
atL2. 

4 Trigger timing studies 
4.1 Online setup 

Prior to first LHC collisions, it is desirable to test the trigger software and data acquisition system in 
conditions resembling real data taking. There are two type of tests which can be performed: cosmic runs, 
where the detector signals left by cosmic rays activate the LI trigger, and technical runs where simulated 
LI trigger signals are used. These runs allow the monitoring system to be assessed, the algorithm timing 
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tau20i 


tau60 


Offline Identification package 


all taus 1 -prong 3 -prong 


all taus 1 -prong 3-prong 


Calo-based 

Track-based 

Calo- OR Track-based 

Calo- AND Track-based 


76.5±0.4 75.5±0.4 85.2±1.0 
70.9±0.4 79.5±0.4 58.3±0.7 
68.3±0.3 72.5±0.4 58.6±0.6 
87.9±0.4 87.4±0.4 91.1±0.9 


93.8±0.2 94.3±0.2 86.1±1.0 
88.4±0.3 90.9±0.3 78.1±0.8 
91.3±0.2 93.1±0.2 79.6±0.7 
92.8±0.3 93.1±0.3 87.1±1.6 



Table 9: Tan nigger efficiencies (in %) for different reference samples of tau leptons identified by the offline reconstruction. 
Efficiencies have been estimated on low and high Ej generated samples of tau leptons. Errors are statistical only. Efficiencies 
is for LI + L2 + EF selection. 
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Figure 1 1 : Overall trigger efficiency (LI + L2 + EF) for different tau triggers. 
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Figure 12: Efficiency for tau20i trigger. 



to be measured (although the composition of the input events is not representative of the expected LI 
trigger output when real data are taken) and the interplay between triggers to be studied. All studies are 
performed on CPUs dedicated to timing studies in order to minimize the possible influence of other users 
on timing results. Each of these machines is a dual-core Intel(R) XEON(TM) CPU 2.20 GHz machine. 

4.2 Timing results 

The CPU time performance of the trigger is a crucial ingredient in the optimization of the trigger. Current 
design constrains the total execution time per event to 40 ms at L2 and 1 s at EF for the whole trigger. 
Recent timing studies performed on tau triggers include: 

• tests of individual triggers to gauge the impact of increasing threshold levels on the total execution 
time of the trigger. 

During this test, each of the tau triggers is run individually (and no other triggers are run at the 
same time). Although the average execution time of each algorithm remains roughly equal between 
triggers (see Table 10), the average total time per event decreases as a result of the lower number 
of Rols per event for high energy triggers. The results shown in Table 10 indicate that the time 
performance of the tau trigger should be well within the constraint for total execution time at L2 
and EF. 

• a verification of caching of data from the liquid argon calorimeter in L2. This test requires running 
the tau algorithms and other calorimeter-based algorithms (electron algorithm, for example) for 
the same event. In case data in a given Rol is needed by more than one signature, then the caching 
procedure ensures that data are requested only once. Recent tests verify that caching is correctly 
implemented and provides roughly a factor of four time savings in the L2 calorimeter part. 
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Threshold Signature 


Algorithm 


taulOi 


taul5i 


tau20i 


tau25i 


tau35i 


L2 Calo 


8.1 


8.0 


8.0 


8.1 


8.1 


L2 Tracking 


15.4 


15.5 


15.0 


14.9 


14.7 


L2 Combined 


1.9 


1.9 


2.0 


2.0 


2.2 


L2 TotalTime 


41.6 


35.9 


19.7 


14.1 


7.9 


EF Calo 


12.3 


12.5 


13.4 


13.0 


14.0 


EF Tracking 


289.7 


297.7 


269.5 


268.4 


247.8 


EF Combined 


77.0 


76.9 


80.7 


77.8 


78.9 


EF TotalTime 


149.1 


133.6 


67.5 


51.2 


24.6 



Table 10: Mean algorithm execution time for each of the tau triggers. All times are given in ms and per Rol, except the L2 
and EF total times are given per event, in ms. The measurements are performed on a simulated sample of limited statistics, 950 
QCD background events with hard parton 35 <p T < 70GeV. 



• a timing comparison between the calorimeter-approach and tracking-approach in L2. Currently, 
the default order of L2 algorithms is that calorimeter data are analysed first and tracking data 
afterward. The alternative approach should also be evaluated. For the taul5 signature alone, 
using a simulated sample of limited statistics (950 QCD background events with hard parton 35 
< pt < 70GeV), the calorimeter and tracking approaches have been evaluated and show roughly 
equal total execution times, 34.1 and 34.0ms respectively. On a simulated sample of 400 QCD 
background events with hard parton pj > 2 TeV, for the same tau trigger, the tracking approach is 
slightly faster than the calorimeter approach, 299 versus 330ms respectively. The longer execution 
time on high energy jet samples with respect to low energy ones is mainly due to the longer 
execution time of the L2 tracking algorithm. This result shows that the L2 tracking selection 
provides greater background rejection power than the L2 calorimeter one against high energy jets, 
and therefore suggests that the tracking-approach is advisable for high Ej tau triggers. 

5 Tau trigger menu 

The single tau signatures are the basic element in the ATLAS trigger menu for collecting hadronic decays 
of tau leptons. Therefore, Section 2 and 3 have been devoted to the selection and performance of the 
single tau signatures. In this section we describe the full tau trigger menu, with an emphasis on triggers 
that combine tau signatures with other signatures and the physics goals they address. Such combined 
triggers are very important for collecting samples of tau leptons of moderate E T from several SM or 
even beyond the SM processes. As shown in Sect. 3, the minimal requirement on Ej for a single tau 
trigger must be high, even at low luminosity, to be allowed to run without prescale factors. A combined 
requirement of a moderate Ej tau signature with other signatures provides a way to achieve the necessary 
rejection against backgrounds and avoid prescale factors. 

5.1 Combined tau triggers 

The following triggers aiming at collecting hadronic decays of tau leptons are currently implemented in 
the ATLAS trigger menu, in addition to the single tau triggers already introduced: 

• tau+missing E T {t&u+Ef lss ). This type of trigger covers a wide spectrum of physics channels. At 
low luminosity, when the trigger rejection can be relaxed, the selection of events with W — > TV is 
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the priority, tt events with tau leptons in the final state are also selected by this trigger. Such events 
are characterized by relatively soft Ej range of tau leptons as well as low Ef lss . The tm+Ef lss 
triggers at design luminosity are intended for SM or SUSY Higgs (neutral or charged) searches as 
well as for searches of new exotic particles like Z'. The Ef lss trigger [14] uses the same threshold 
requirement at all trigger levels. The turn-on of the Ef lss efficiency curve is mainly limited by 
the LI missing Ej resolution. Since the average Ef lss inW — > TV events is low, and the EF Ef lss 
resolution is better, additional triggers where the Ef lss requirement is applied only at EF are under 
study. 

• tau+^(+jets), £ = e,mu. This type of trigger aims at selecting events with two relatively soft tau 
leptons in the final state. Two tau leptons are found in events with Z boson, neutral SM or SUSY 
Higgs. In addition, the tm+£ combination selects events with multiple leptons like tt or lepton 
flavor violating processes. The combination of two trigger signatures allows the use of lower 
threshold requirements than for the case of the single tau trigger. In case of excessive rates for this 
type of trigger at design luminosity, the additional requirement of jets or E^ lss can be introduced. 

• tau+tau(+jets). This type of trigger records events where both tau leptons decay hadronically. 
While the rejection rate is less favorable than the tau+^ case, the sample collected is complemen- 
tary to the above and both increases statistics and allows the reduction of systematics uncertainties 
due to lepton identification. This trigger is highly relevant for searches of Higgs boson or new 
exotic particles like Z' and will also be beneficial for SUSY double tau end point analyzes. 

• tau+jets, tau+ft-jets. This type of trigger is an interesting alternative for tt studies. At low lu- 
minosity, it allows the study of events with low jet Ej thresholds, while at high luminosities it is 
necessary to reduce QCD and multiple interaction background events. 

5.2 Commissioning triggers 

In addition to single tau triggers there are single track triggers, highly optimized to select Rols with 
one track only. The main purpose of this type of trigger is alignment of the tracker or hadronic calibra- 
tion. These triggers work in parasitic mode, taking all LI Rols passing 6GeV or 9GeV with isolation 
requirements in a given event. In this manner, a sufficient amount of tracks for commissioning can be 
obtained. 

Furthermore, additional triggers like taul5i_PT and 2tau25i_PT are included in the ATLAS trigger 
menu (PT stand for pass-through). For pass-through triggers, all events that are accepted by the LI 
algorithm are recorded for offline study of the HLT algorithms. At L2 and EF the algorithms are executed 
and their result is stored in the event record, however, their decision is not considered in the global trigger 
decision. As the output rate of these triggers is equal to the LI rate, only a small fraction can be recorded, 
typically ~ 0. 1 Hz after prescale factors. 

5.3 Tau trigger menu performance 

In this section we summarize tau trigger efficiencies for various physics signals, see Table 1 1 , and the 
corresponding rates estimated on minimum bias samples for lOO/^ -1 , see Table 12. For the results 
shown, the physics signal efficiencies are calculated per event, using the following references: 

• trigger with one (two) tau leptons: at least one (two) tau leptons at generator level with visible Ej 
greater than the tau trigger threshold(s) are required. Furthermore, the generated tau leptons are 
identified by the offline reconstruction algorithms. 
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• trigger with tau+e and tau+mw: besides the tau lepton requirement (see item above), the addi- 
tional lepton is required at generator level to have an Ej greater than the chosen lepton trigger 
threshold. The generated lepton additionally is identified by the "loose" offline reconstruction 
algorithm [1], [2]. 

• trigger tau+E™ lss : besides the tau lepton requirement (see item above), the missing Ej at generator 
level as well as the missing Ej reconstructed by the offline reconstruction algorithms [14] are 
required to be above the chosen trigger threshold. 

Introducing the triggers described above is important for most physics signals with tau leptons in the 
final state because: 

• allows increased statistics 

• it is a robust approach against failure or inefficiency of a particular trigger (e.g. due to detector 
problems) 

• allows reduction of systematic uncertainty by comparing results of the same analysis repeated on 
samples selected with different triggers. 

In Table 11 and Table 12 the symbol tau+xe stands for the tau+i?™ 1 " triggers, and j stands for the 
jet triggers. 

5.4 Tau trigger menus for different luminosity periods 

According to the foreseen LHC start-up plans, ATLAS will commence data taking at a low luminosity 
of about 10 31 cm~ 2 s _1 . Over the course of the first year or so, it is expected that the luminosity will 
increase to a rather high luminosity of about 10 33 cm~ 2 s _1 . While the input rate rapidly increases during 
this period, the maximal output rate of the complete trigger is expected to be constant at about 200 Hz 
(due to limitations on storage capacities and recording speed), putting a significant burden on the trigger 
system. 

As mentioned in Section 1, the low luminosity period will be used to commission the detector and 
trigger system. During this period, the focus of the trigger selection is Standard Model physics and 
events necessary for calibration and efficiency studies. The total sample collected is expected to be of 
the order of 100pb _1 . Concerning hadronic decays of tau leptons, the absolute scale of tau leptons and 
the characteristics and rate of QCD jets misidentified as tau leptons are estimated from simulated data 
studies only, and need to be verified and understood with collider data. To address this, several triggers 
are proposed, and outlined in the next section. For each trigger, a method of measuring the efficiency on 
real data should also be proposed. 

The transition to higher luminosity is foreseen to be done smoothly, dropping triggers of the low 
luminosity period menu which were used for commissioning or with unacceptably large rate, and keeping 
or introducing triggers without any prescale factors which focus on discovery physics. 

The high luminosity menu will be obtained either by tightening the threshold requirements and cuts 
of triggers present in the low luminosity menu, or adding new triggers, especially combinations of several 
signatures. Only some ideas for a tau trigger menu for a luminosity of Jz?=10 33 cm~ 2 s _1 currently exist. 

Given the current background rates, shown in Section 3, the high luminosity menu will include single 
high Ej signatures with threshold requirements higher than what shown in Section 3 (most probably 
taulOO). Furthermore, the high luminosity menu will be composed mostly of combined triggers like 
tau35i_xe45, tau25i_e25i, tau25ijnu20, 2tau35i, tau20i_j 150 and tau20i_4j50. 
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Trigger Item 






tt 


A TT (800) 


SU3 




H^hhx 


taulO 


oz.o it v.j 


91 9 ± 0 2 


93 6 ± 0 4 


97 1 ± 0 1 


94 3 ± 0 3 


93 8 ± 0 2 


96 9 ± 0 1 


"t cLU 1 0 1 




89.8 ±0.2 




96.4 ±0.2 


92.0 ±0.4 


92.1 ±0.3 






78.2 ±0.4 


88.7±0.3 


91.9 ±0.4 


96.5 ±0.1 


92.4 ±0.4 


91.6±0.3 


95.4 ±0.1 


"t cLU 1 5 1 




OU.U It U.J 






QO A A- O A 


yVJ.VJ It U.J 




tau20i 


68.5 ±0.5 


79.9 ±0.3 


83.2 ±0.6 


89.8 ±0.2 


82.5 ±0.5 


85.0±0.4 


89.8 ±0.2 




fi.fi. <c _i_ n 
DO. J in U.O 


76.0 ±0.4 


80.1 ±0.7 


89.0 ±0.3 


79.7 ±0.6 


82.0 ±0.4 




"ta.u.35i 


65.8 ±0.9 


70.0 ±0.6 


77.4 ±0.9 


87.5 ±0.3 


76.9 ±0.7 


78.2±0.5 


82.0 ±0.2 




82.7 ±1.3 


78.7 ±0.8 


88.0 ±0.9 


94.9 ±0.2 


89.6 ±0.6 


86.2 ±0.5 


88.5 ±0.2 


tau45i 


72.1 ±1.5 


68.5 ±0.9 


76.0 ± 1.2 


86.1 ±0.3 


75.1 ±0.9 


75.8 ±0.7 


78.5 ±0.3 


tau60 


77.5 ±2.6 


74.4 ±1.5 


74.7 ±1.7 


91.4±0.2 


78.2± 1.1 


76.1 ±0.9 


77.5 ±0.4 


taulOO 


83.9 ±6.6 


78.2±4.1 


80.2±3.5 


90.0 ±0.3 


81.7± 1.9 


79.1 ±1.6 


80.7 ±0.7 


2tau25i 


0.0 ±0.0 


47.2 ±1.5 


60.0 ±11.0 


62.6 ±1.2 


62.6 ±2.7 


61.5±6.7 


59.3 ±0.6 


2tau35i 


0.0 ±0.0 


43.1 ±3.1 


57.1 ±18.7 


60.6±1.3 


62.0 ±3.8 


50.0 ±9.1 


55.6 ±0.9 


taul5i_xe20 


56.3 ±0.6 


48.4 ±0.8 


80.1 ±0.7 


92.7 ±0.2 


89.5 ±0.4 


80.8 ±0.5 


80.2 ±0.3 


tau20i_xe30 


45.4 ±0.9 


38.6±1.2 


70.1 ±0.9 


84.5 ±0.3 


81.4±0.6 


73.9±0.6 


73.2 ±0.4 


tau25i_xe30 


44.2 ±1.0 


38.0±1.3 


67.5 ±1.0 


83.8±0.3 


78.7 ±0.6 


71.4±0.7 


71.1 ±0.4 


tau35i_xe20 


55.1 ±1.2 


42.0±1.2 


68.7 ± 1.1 


84.5 ±0.3 


76.3 ±0.7 


70.8 ±0.7 


69.5 ±0.4 


tau35i_xe30 


47.7 ±1.5 


38.5 ±1.7 


63.3 ±1.2 


82.3 ±0.3 


76.2 ±0.7 


69.2 ±0.8 


66.9 ±0.4 


tau35i_xe40 


42.1 ±2.5 


39.2 ±2.4 


58.0±1.4 


80.8 ±0.4 


75.6±0.8 


68.3 ±1.0 


65.3 ±0.5 


tau45_xe40 


54.7 ±3.6 


48.3±3.0 


67.4 ±1.6 


87.6±0.3 


88.2 ±0.7 


76.5 ±1.0 


71.0±0.6 


tau45i_xe20 


60.1 ±2.1 


43.8 ±1.7 


67.3 ±1.4 


83.2±0.3 


74.7 ±0.9 


69.0 ±0.9 


66.7 ±0.4 


tau20i_el0 


0.0 ±0.0 


68.1 ±1.2 


73.7 ±2.9 


79.6±0.8 


77.1 ±1.7 


73.2 ±0.8 


0.0 ±0.0 


tau20i_mu6 


0.0 ±0.0 


72.0 ±0.9 


81.4±2.4 


80.3 ±0.7 


83.3 ±1.3 


79.4 ±0.7 


0.0 ±0.0 


tau20i_j70 


70.8 ±1.4 


78.8±0.7 


81.6±0.7 


89.4 ±0.2 


82.7 ±0.6 


65.6 ±0.7 


72.6 ±0.3 


tau20i_3j23 


61.1 ±2.1 


80.2 ±0.8 


82.0 ±0.7 


89.9 ±0.2 


83.2 ±0.6 


83.7 ±0.7 


80.7 ±0.3 


tau20i_4j23 


58.2±4.1 


82.7±1.6 


80.5 ±0.9 


90.3 ±0.3 


84.5 ±0.6 


86.9 ±1.7 


83.9 ±0.7 



Table 1 1 : Physics signal efficiency without prescale factors. The requirements at generator level are 
summarized in Section 5.3. The Higgs boson mass for the last two columns is 120 GeV. Errors are 
statistical only. 



614 



Trigger - Tau Trigger: Performance and Menus for Early Running 



Trigger 


Level 1 


Level 2 


E\ 


ent Filter 


Selection 


Events 


Rate (Hz) 


Events 


Rate (Hz) 


Events 


Rate (Hz) 


tauNoCut 


79802 


23342 


79252 


23181 


77231 


22590 ±81 


taulO 


48854 


14290 


16235 


4749 


7748 


2266 ±26 


taulOi 


48854 


14290 


14066 


4114 


5413 


1583±21 


taul5 


48854 


14290 


10472 


3063 


4054 


1186±19 


taul5i 


48854 


14290 


9764 


2856 


2953 


864 ±16 


taul5i_PT 


48854 


14290 


10237 


2994 


10237 


2994 ±30 


tau20i 


16298 


4767 


3322 


972 


1229 


359 ±10 


tau25i 


9404 


2751 


1565 


458 


631 


185±7 


tau35i 


3149 


921 


473 


138 


215 


63±4 


tau45 


1062 


311 


321 


94 


255 


75±5 


tau45i 


750 


219 


154 


45 


79 


23±3 


tau60 


262 


77 


35 


10.2 


25 


7.3±1.5 


taulOO 


262 


77 


7 


2.0 


4 


1.2 ±0.6 


taul5i_xe20 


5625 


1645 


1329 


389 


198 


58±4 


tau20i_xe30 


525 


154 


139 


40.7 


23 


6.7 ±1.4 


tau20i_xe30_PT 


525 


154 


142 


41 


142 


41±4 


tau25i_xe30 


385 


113 


83 


24.3 


17 


5.0±1.2 


tau35i_xe20 


784 


229 


129 


37.7 


42 


12.3 ±1.9 


tau35i_xe30 


203 


59 


41 


12.0 


10 


2.9 ±0.9 


tau35i_xe40 


58 


17 


15 


4.4 


3 


0.9 ±0.5 


tau45_xe40 


47 


14 


18 


5.3 


6 


1.7 ±0.7 


tau45i_xe20 


278 


81 


54 


15.8 


19 


5.6± 1.3 



Table 12: Minimum bias event rates without prescale factors. The cross section value used is o = 70mb 
and the peak luminosity used is J? = 10 31 cm _2 s _1 . Errors are statistical only. 
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5.4.1 Running at ^f=10 31 cm 2 s 1 

The trigger menu for a luminosity of Jz?=10 31 cm~ 2 s _1 covers largely SM physics channels (W — ► XV, 
Z —>■ xx, it). A fair amount of the bandwidth is given to commissioning triggers (< 1 Hz each), which 
are needed to monitor trigger rates and variables used for tau lepton identification in different Ej ranges. 

Table 13 gives the trigger menu proposed for initial data taking, including expected prescale factors 
and corresponding trigger rates, with overlap between triggers taken into account. The prescale factors 
are necessary to restrict the total rate from low E T tau triggers, since there are many physics topics that 
need to share limited bandwidth. The initial prescale values have been determined from simulation, and 
will be optimised based on experience with early data. Event yields for several tau-based physics topics 
are shown in Table 14. In this table the results of only one typical trigger are reported for a set of triggers, 
corresponding to the one currently considered as the most suitable for the physics signals targeted at low 
luminosity. However, as physics simulation studies are progressing and as background rates are verified 
on data, it is likely that the trigger menu will be modified. 

6 Summary 

This note describes in detail the ATLAS tau trigger system, which is dedicated to the selection of hadronic 
decays of tau leptons. These results are obtained from simulations prior to LHC operations. In the 
selection at LI, energy deposition in the e.m. and hadronic calorimeters are used, while at L2 and EF 
calorimeter shower shape and energy information is combined with tracking information from the Inner 
Detector. At design luminosity, tau triggers are envisaged to be used without prescale factors to select 
single tau trigger candidates with Ej above lOOGeV. The events with tau trigger candidates with lower 
Ej will be selected using combined triggers which employ tau signatures in conjunction with missing Ej 
electron, muon, jet or another tau signatures. The largest contribution to the rate will be misidentified 
low Ej QCD jets, that are narrow and contain few particles mimicking hadronic decays of tau leptons. 

In the first months of LHC operation a low luminosity of 10 31 cm~ 2 s _1 is anticipated, and the data 
will primarily be used for commissioning the detector and trigger system. At that time the main focus 
of tau triggers will be on Standard Model physics such as W — > XV, Z — > xx and it events with hadronic 
decays of tau leptons in the final state. The typical tau trigger efficiency at low luminosity for generated 
tau leptons reconstructed by the algorithms of the offline reconstruction is expected to be around 80% 
for a wide range of physics channels, resulting in a total tau trigger rate of 28 Hz. 



Trigger 


Prescale 


Rate (Hz) 


Cumulative Rate (Hz) 


taulOO 


1 


2.4±0.5 


2.4±0.5 


tau60 


1 


10.7±1.0 


10.7±1.0 


tau45i 


10 


2.5±0.2 


12.6±1.1 


tau45 


20 


4.1 ±0.1 


16.0±1.4 


tau20i_xe30 


1 


4.9±0.7 


20.1±1.4 


tau20i_el0 


1 


1.2±0.4 


21.2±1.5 


tau20ijnu6 


1 


2.8±0.5 


23.7±1.6 


2tau25i 


1 


2.6±0.5 


25.3±1.6 


tau20i_4j23 


1 


0.1 ±0.1 


25.4±1.6 


tau20i_3j23 


1 


0.9±0.3 


25.8±1.6 


tau20i_j70 


1 


7.1±0.9 


28.6±1.7 
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Table 13: Tau trigger menu for ^=10 31 cm- 2 s" 1 . 



Tri er Item 

rigger em = 


W T ^hx 


2 TT 


tf 


A TT (800) 


SU3 


n t;T—>thX\. LZ -v) 




t au 1 5 i 


128.4 


9.53 




0.37 


0.002 


0.003 


0.004 


tau45 


816 




63.9 


26.2 


0.17 


0.1 1 


0.17 


tau45i 




211 4 












tau60 


4399 


670 


568 


484 


3.07 


1.18 


1.87 


taulOO 


699 


84 


116 


406 


2.58 


0.31 


0.48 


2tau25i 


0 


544 


13 


37 


0.24 


0.02 


0.68 


2tau35i 


0 


114 


4 


35 


0.22 


0.01 


0.33 


tau20i_xe30 


29005 


668 


2097 


429 


2.72 


2.30 


2.01 


tau35i_xe40 


3379 


178 


831 


374 


2.37 


1.03 


1.00 


tau20i_el0 


0 


1191 


182 


83 


0.53 


1.36 


0.00 


tau20ijnu6 


0 


2003 


227 


96 


0.61 


1.94 


0.00 


tau20i_j70 


8672 


1278 


2457 


2.96 


320 


2.04 


0.00 


tau20i_3j23 


3852 


1019 


2697 


2.31 


288 


1.30 


0.00 


tau20i_4j23 


957 


232 


1803 


1.31 


226 


0.23 


0.00 



Table 14: Number of events expected in lOOpb 'at 10 31 cm 2 s 1 for different physics signals. Cross-sections are given in 
Table 1. The requirements imposed at generator level are summarized in Section 5.3. 
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Physics Performance Studies and Strategy of the Electron and 
Photon Trigger Selection 

Abstract 

This note gives an overview of the implementation and performance of the 
electron and photon selection by the ATLAS trigger system. Trigger menus 
for commissioning as well as for the first physics run are presented together 
with the strategy for the early data taking phase. The physics performance 
in terms of selection efficiency and background rejection has been estimated 
using Monte Carlo simulations for various luminosity scenarios. An example 
of a method to determine trigger efficiency from real data using Z —>■ ee events 
is discussed. 

1 Introduction 

This note gives an overview of the implementation and performance of the electron and photon selection 
by the ATLAS trigger system. Electron and photon trigger baseline signatures and menus for LHC 
commissioning and first physics run are presented. The principles that have determined their design are: 

• Coverage of the physics needed for commissioning the trigger and detector systems as well as the 
offline reconstruction. 

• Coverage of the physics channels that allow standard model studies and searches for new physics. 

• Keeping the rates within the allowed bandwidth. 

Events with electrons and photons in the final state are important signatures for many physics ana- 
lyses envisaged at the LHC. A good selection by the electron/photon (e/y) triggers will be important for 
many analyses from searches for new physics, such as the Higgs boson, SUSY, Z' boson to standard 
model (SM) precision physics such as top quark and W boson mass measurement, rare B decays, etc. 
In the early running processes such as Z — > ee, J /iff — ► ee, W — > ev and y-jet events will be crucial for 
the understanding of the detector. These decays are important benchmark channels for the calibration, 
alignment and monitoring of the detector performance. The e/y trigger needs to cover the transverse 
energy range between a few GeV and several TeV. An overview of some relevant physics channels 
with electrons and photons in the final state, classified according to the corresponding transverse energy 
thresholds is given in Table 1 . 

To achieve this physics reach the trigger algorithms have to be optimized in terms of physics perfor- 
mance (signal efficiency, background rejection) and system performance (execution time, data require- 
ments, etc). The trigger menu has to ensure a good selection of the above physics channels within the 
allocated rate for the various luminosities during LHC running. 

In the following sections the implementation, performance and selection strategy are described in 
detail. Section 2 explains how e/y candidates are reconstructed and selected at the different trigger levels. 
Section 3 presents the selection strategy for LHC start-up, describing the schema currently envisaged for: 
commissioning, first menu for an initial luminosity of L~10 31 cm~ 2 s _1 and the subsequent development 
towards higher luminosity scenarios. In Section 4 the performance of the various electron and photon 
triggers for a startup luminosity of L~10 31 cm~ 2 s _1 and a luminosity scenario of L~10 33 cm~ 2 s _1 
is discussed. Examples of trigger efficiencies for physics processes with electrons and photons in the 
final state over the transverse energy (Ej) spectrum from a few GeV up to several TeV are included. A 
discussion of the robustness of the trigger selection follows in Section 5. Section 6 explains how the 
trigger efficiency will be determined from real data using Z — > ee events. 
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Momentum range 


Examples of some important processes 


low p T ~ 5-15 GeV 


Bj -^J/y/Kl' -> eeKK 








J/\jf^ee, Drell-Yan 


high p T -20-100 GeV 


H -> 77 (for m(H)< 130GeV) 




H -> ZZM -> eeee,eejXjX (for 130 < m(H) < 700 GeV) 




top physics, Z — > ee, W — > ev, 




direct photon production 


very high p T ~ 100 - 1000 GeV 


Z' — * ee, W — * ev 




G^77,G^ee, 




-> ee* -> ee7 



Table 1 : Examples of some important processes requiring a good electron and photon trigger selection 



2 Electron and photon trigger selection 

In this section the electron and photon trigger reconstruction and selection are described. The recon- 
struction and selection variables for each of the trigger levels are summarized in separate subsections. 

2.1 LI selection 

At LI trigger information from the electromagnetic (EM) and hadronic calorimeter system in the form 
of so-called trigger towers is used. A trigger tower has a dimension Ar) x A0 ~ 0.1 x 0.1. In this region 
all the cells are summed over the full depth of either the electromagnetic or hadronic calorimeter. The 
LI selection algorithm for electromagnetic clusters is based on a sliding 4x4 window of trigger towers 
which looks for local maxima [1]. 




Electromagnetic 
isolation ring 



Hadronic inner core 
and isolation ring 



Figure 1: LI calorimeter trigger schema, showing how trigger towers (each spanning a0.1x0.lT]x^ 
region) are used to determine the energy for the electromagnetic cluster as well as for the electromagnetic 
isolation, hadronic core and hadronic isolation. 

The trigger object is considered to contain an electron or photon candidate if the following require- 
ments are satisfied: 

• The central 2x2 'core' cluster consisting of both EM and hadronic towers is a local Ej maximum 
This requirement prevents double counting of clusters by overlapping windows. 
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• The most energetic of the four combinations of two neighbuoring EM towers passes the electro- 
magnetic cluster threshold. 

Figure 1 shows the LI trigger tower schema used to determine the LI selection variables. Isolation 
requirements can be imposed if required to control the rate: 

• E™ : The total Ej in the 12 EM towers surrounding the 2 x 2 core cluster is less than the electro- 
magnetic isolation threshold. 

• E^ 0 j^: The total Ej in the 4 towers of the hadronic calorimeter behind the 2 x 2 core cluster of the 
electromagnetic calorimeter is less than the hadronic core threshold. 

• Ef^f: The total Ej in the 12 towers surrounding the 2 x 2 core cluster in the hadronic calorimeter 
is less than the hadronic isolation threshold. 

The distributions of these isolation variables for signal and background are shown in Fig. 2 from Monte 
Carlo simulations. For signal, single electrons with an Ej between 7 and 80 GeV with a flat distribu- 
tion are used (solid line, hatched histogram). In comparison, background candidates from a simulated 
sample of QCD background (referred to as dijets) with Ej> 17 GeV are shown (dashed line, hollow his- 
togram). Most of the samples discussed in this note, unless otherwise specified, use the Pythia 6.403 [2] 
Monte Carlo event generator. The ATLAS detector Monte Carlo simulation is based on GEANT4 [3]. 
The Monte Carlo simulations store details of the generated particles including the type and kinematic 
variables, this so-called MC-Truth information can then be used as a control in subsequent analyses of 
the simulated data. In this case the truth information has been used to guarantee that no real electrons 
are included in the background distributions in Fig. 2. The distributions shown have been normalized to 
unit area. It can be seen that isolation, in particular LI EM isolation, provides a good handle to reduce 
jet background rate, typically needed at low transverse energy thresholds. As isolation depends on the 
topology and energy distribution of the event, as well as on other parameters such as luminosity and 
beam conditions, isolation cuts need to be well understood and applied carefully during data taking. 




Figure 2: LI isolation variables for single electrons with an Ej between 7 and 80 GeV with a flat 
distribution (solid line, hatched histogram). In comparison, background candidates from the Ej> 17 GeV 
dijet sample are shown (dashed line, hollow histogram). For the background, only clusters that do not 
match to a true electron within a AR cone of 0. 1 are considered. The distributions for electromagnetic 
isolation (left), hadronic core energy (middle), and hadronic isolation (right) are shown. 



2.2 L2 selection 

L2 is seeded by the LI EM Region of Interest (Rol). Thus L2 receives the reconstructed LI object with 
the T] and (j) positions and the transverse energy thresholds passed. L2 accesses a subsample of the 
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detector data around the given r\ and (j) position and applies trigger specific reconstruction algorithms 
characterized for their speed and robustness. Both photon and electron selection use the full granularity, 
full precision calorimeter information now available in the first selection step. The transverse cluster 
energy and various shower shape variables calculated in the different layers of the EM calorimeter are 
used to identify e/y candidates. The electron selection uses in addition inner detector information. Tracks 
are reconstructed in the inner detector and matched to the calorimeter energy clusters. Thus track finding 
and track-cluster matching variables can be used to select electrons. 

2.2.1 Calorimeter based electron and photon selection 

L2 calorimeter reconstruction is seeded by the 77 and 0 positions provided by LI. Calorimeter cells in a 
window of size Atj x A0 = 0.4 x 0.4 are retrieved (for more details on the data preparation see [4]). At 
the L2 trigger the cluster building algorithm scans the cells in the second layer of the EM calorimeter and 
searches for the cell with highest Ej. Subsequently, a cluster of 0.075 x 0.175 in tt. x <j) is built around 
this seed cell. The larger cluster size in 0 reduces the low-energy tails due to photon conversion and 
electron bremsstrahlung. Electrons and photons deposit nearly all of their energy in the EM calorimeter 
and deposit typically less than 1% of their energy into the hadronic calorimeter. In addition, showers 
from electrons and photons are typically smaller in the plane transverse to its direction than showers 
from jets. These quantities are used to select a low-background sample of electrons and photons. 

In detail, the L2 electron and photon calorimeter algorithms select events base on the following 
quantities: 

• Transverse energy of the EM cluster (Ej M ): Due to the energy dependence of the jet cross-section, 
a cut on E™ provides the best rejection against jet background for a given high pj signal process. 

• Transverse energy in the first layer of the hadronic calorimeter (Ej ad ): This is required to be below 
a given threshold. This cut is relaxed for high Ej triggers (90 GeV and above) as the leakage into 
the hadronic calorimeter increases with energy. 

• Shower shape in 77 direction in the second EM sampling: The ratio of the energy deposit in 
3x7 cells (corresponding to 0.075 x 0.175 in Atj x A0) over that in 7 x 7 cells is calculated: 
Rcore = E^xj/Ejxj. Photons and electrons deposit most of their energy in 3 x 7 cells and thus the 
corresponding ratio is typically larger than 80 %. 

• Search for a second maximum in the first EM sampling: After applying the cuts in the hadronic 
calorimeter and the second sampling of the EM calorimeter, only jets with very little hadronic 
activity and narrow showers in the calorimeter remain. The fine granularity in rapidity in the first 
sampling of the EM calorimeter allows checks to be made for substructures within a shower for 
a further rejection of background such as single or multiple 7T°s or tts decaying to photons. The 
energy deposit in a window At] x Atj) = 0.125 x 0.2 is examined. The shower is scanned for local 
maxima in the r\ -direction. The ratio of the difference between the energy deposited in the bin with 
highest energy E\ st and the energy deposited in the bin with second highest energy E^nd divided by 
the sum of these two energies is calculated: R str ips = {E\ st —E2 n d)/{Ei s t +E2m)- This ratio tends 
to one for isolated electrons and photons, and tends to zero for photons coming for example from 
7T° decay. 

Figure 3 shows typical distributions of the above shower shape variables for a signal sample of Higgs 
decaying into 77 and a the dijet background sample. A clear rejection power against background of the 
E T is seen for a cut above 20 GeV. The shower shape variables in the first and second EM sampling of the 
calorimeter provide a good discrimination power above 0.8. These shower shape variables will have a 
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narrower distribution for photons compared to electrons. The high granularity of the first EM sampling of 
the ATLAS detector permits efficient photon identification using only calorimeter information, tracking 
information is not used at all in the selection. 




Figure 3: Selection variables for a L2 calorimeter energy cluster. The distributions are shown for signal 
candidates from a simulated H yy sample (dashed line) and for dijet background candidates that do not 
have a photon or electron matched within a AR cone of 0.1 and that have at least 1 jet with Ej> 17 GeV 
(black solid line). Both distributions have been normalized to unity. The plots show the transverse energy 
of the EM cluster (top left), transverse energy deposited in the first layer of the hadronic calorimeter (top 
right), shower shape in the 77 direction in the second EM sampling (R core ) (bottom left), and the search 
for a second maximum in the first electromagnetic sampling (R str ip S ) (bottom right). 



2.2.2 Inner detector electron selection 

If all the criteria of the calorimeter based electron selection are fulfilled, a search for tracks is performed 
in front of the cluster, electron trigger candidates are identified by the presence of a matching recon- 
structed tracks [5]. 

2.2.3 Combined calorimeter and inner detector based electron selection 

A further rate reduction, while maintaining a high electron efficiency, can be achieved by combining 
the calorimeter and inner detector information. The background rate can be reduced by cutting on T7 
and 0 between the EM cluster and the extrapolated track into the calorimeter. As shown in Fig. 4 these 
distributions are narrower for electrons than for jets. Another quantity useful for electron selection is the 
ratio of the cluster Ej and the track pj. This quantity is affected by bremsstrahlung effects which cause 



623 



Trigger - Physics Performance Studies and Strategy of the Electron and . 




Figure 4: L2 (top) and EF (bottom) electron selection variables based on the combined calorimeter and 
inner detector information. From left to right the following distributions are shown: the difference in 
77 (left) and 0 (middle) between the cluster and track (extrapolated to calorimeter) position, and are 
shown: ratio of the Ej of the EM cluster and the pj of the reconstructed tracks (right). Distributions are 
shown for signal (solid line, hatched histogram) and background (dashed line, hollow histogram). The 
reconstructed elections come from a 7 < Ej < 80 GeV sample. The background candidates come from 
a filtered dijet simulated sample, only candidates with no match to a truth electron within a AR cone of 
0. 1 rad are selected. 



2.3 EF selection 

At the EF trigger level offline reconstruction algorithms and tools are used as much as possible. An 
important difference, however, between the offline and the EF reconstruction is that the offline recon- 
struction is run once per event accessing the whole detector, while the EF uses a seeded approach; runs 
several times per event, once for each Rol given by L2, accessing only the corresponding subsample of 
the detector. 

Currently, in the photon trigger selection only calorimeter information is used. EM clusters are 
searched for and reconstructed in Rols of size Atj x = 0.4 x 0.4. The EF calorimeter clustering 
algorithm searches for a local energy maximum with calorimeter trigger tower granularity. For electron 
and photon reconstruction only the data from the EM calorimeter is used, in contrast with L2 where the 
hadronic energy is also computed. The clusters should have an Ej above a given threshold. The default 
cluster size used is 0. 125 x 0. 125 in T] x 0 (whilst the offline reconstruction algorithms perform clustering 
with different window sizes, a single clustering option is used in the EF ). Once found by the clustering 
algorithm the cluster parameters (position, energy, etc.) are computed and further refined by a set of 
cluster correction (position and energy calibration) tools [5]. Also, corrections for the transition region 
between barrel and end-cap calorimeters are possible using the information from a set of scintillators. 

For electron triggers, tracks are subsequently reconstructed in the inner detector. The EF tracking 
currently implements an inside-out track reconstruction (track finding starts from the inner silicon de- 
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tectors and then is extended to the transition radiation tracker). In the future, an outside-in approach is 
intended to be used in the trigger in cases where photon conversions are present. There is the possibility 
to attempt bremsstrahlung recovery using offline tools. This option is not foreseen to run in the electron 
triggers for start-up but might be applied when running at higher luminosities with tighter selections. At 
L~10 31 cm~ 2 s _1 bremsstrahlung effects do not affect the trigger efficiency since the selection cuts are 
sufficiently loose to be insensitive to the performance improvements. 

Electron and photon identification in the EF is very similar to the offline [6]. Calorimeter shower 
shapes, leakage into the hadronic calorimeter and the E T of the EM cluster are used for the calorimeter 
based selection for electrons and photons. Compared to L2 more shower shape variables are used. 
Together with improved calibrations this results in a further rate reduction. For electrons track-cluster 
matching variables, track quality cuts, transverse impact parameter and for high luminosity running 
potentially transition radiation information could be used to further reduce the rate. 

As an example a loose electron EF selection will use the following selections: longitudinal leakage, 
shower shapes in the middle layer of the EM calorimeter, and very loose track-cluster matching cuts. 
Tighter selections might also use the shower shapes in the first EM calorimeter layer, information on 
the transverse impact parameter and on the track quality (number of hits in the pixels and strip silicon 
detectors and number of hits in the first pixel layer). Distributions for the track-cluster matching variables 
at EF are shown in Fig. 4. The distributions are very similar to those of the L2 track-cluster matching, 
but more refined algorithms are available at this stage, with more up-to-date calibration and alignment 
information. 

3 Electron and photon trigger selection strategy 

In this section the foreseen trigger selection strategy from the start-up phase up to physics running at 
nominal low luminosity is discussed. First the various commissioning steps at start-up are explained. 
This is followed by a discussion of a possible electron and photon trigger menu for the first physics run 
assuming a luminosity of L~10 31 cm~ 2 s _1 . 

When LHC turns on the trigger menus will need to provide data for commissioning and for analy- 
sis. In these paragraphs analysis is used to mean standard model measurements and searches for new 
physics. Typically these two objectives require very different trigger selection criteria. Commissioning 
usually demands loose selections to allow for more basic understanding of the detector and trigger, but 
loose selections accept more background events. Analysis is favoured with tight trigger selections that 
maximize the signal to background ratio in the allocated bandwidth. At start-up commissioning tasks 
would be prioritized, though signatures that provide data for a physics analysis will be included in the 
menu whenever possible. Following the progress on commissioning, loose trigger selections will be sub- 
stituted by tighter ones that increase the analysis capabilities. The main challenge for the ATLAS trigger 
is the big difference (approximately six orders of magnitude) between the collision rate and the rate at 
which data can be stored. This imposes tight constraints on the trigger menus. 

These principles used to determine the trigger menus, are developed in more detail in the following 
subsections, which also include early trigger menu tables. 

The naming conventions for the various e/y triggers used in the following sections are as follows. 
The name for LI electromagnetic candidates is EM. This is followed by the Ej threshold applied for this 
triggers and if isolation criteria are applied an "i" is added at the end of the name. The number preceding 
EM indicates the object multiplicity. For example 2EM13I requires that at least two isolated electron or 
photon candidates are identified at LI passing a Ej = 13 GeV threshold. The naming conventions for L2 
and EF are very similar. For example 2yl7i denotes a trigger which selects events in which two isolated 
photons are found passing a E T threshold cut of 17 GeV at EF level. Similarly a possible e20_xel5 
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trigger selects events in which at least one electron candidate with Ej (EF) > 20 GeV is identified, in 
addition, the missing transverse energy at EF exceeds 15 GeV. 

3.1 Commissioning 

At LHC start-up an initial luminosity of L~10 31 cm~ 2 s _1 is expected. This will allow lower Ej thresh- 
olds compared to those at the nominal LHC luminosity. The start-up menu has to provide the data 
samples needed to commission the trigger and detectors and at the same time provide useful data to be 
used for physics analysis. Therefore, the trigger has to guarantee the selection of the following standard 
model channels: W —>■ ev, J/yr —>■ ee,T —>■ ee, Drell-Yan, and direct photon production. For example: 
Z — > ee, J/yf —>■ ee and T —>■ ee will provide input for the electromagnetic calibration, alignment, ef- 
ficiency measurements, etc. Electrons from bottom and charm quark decays will be useful for studies 
of E/p. Direct photon production will provide input for the jet calibrations using /-jet events where 
the photon and jet are back-to-back. For comparison, in the first 100pb _1 of data we expect 235k of 
J/yr -► ee, 40k of T -► ee, 10k of Drell-Yan events, 10M b,c -> e, 100k direct photons and 250k of 
W ->ev. 

The strategy is to apply LI selections and run the High-Level Trigger (HLT) in pass-through mode 
(the selection criteria are tested and the trigger decision is recorded but no event is rejected). Table 2 
shows the e/ triggers foreseen. This will provide the first data samples for the low and high-py spec- 
trum and the rates for the various e// triggers. Fig. 5 shows the expected LI e//rate for a luminosity 
of L~10 31 cm -2 s _1 with and without isolation criteria applied as a function of the transverse energy 
threshold. 



Trigger 


LI 


Rate w/o 


Pre- 


HLT Rate 


item 


prescale [Hz] 


scale 


[Hz] 


em5_passHLT 


EM3 


40000 


20000 


2 


emlCLpassHLT 


EM7 


5000 


1300 


4 


eml5_passHLT 


EM13 


800 


200 


4 


eml5i_passHLT 


EM 131 


390 


100 


4 


em20_passHLT 


EM18 


280 


70 


4 


em20LpassHLT 


EM 181 


100 


25 


4 


em25i.passHLT 


EM23I 


41 


10 


4 


eml0543assHLT 


EM 100 


1 


1 


1 


2em5_passHLT 


2EM3 


6500 


1600 


4 


2eml54)assHLT 


2EM13 


80 


20 


4 


2em204)assHLT 


2EM18 


35 


10 


4 



Table 2: 'LI -only selection' trigger menu items for the LHC start-up including prescale factors. Depend- 
ing on the rate, prescale factors might be readjusted. The LI selection is applied and HLT selection is 
run in pass-through mode. 

This will provide the input, based on real data, to devise a trigger menu best suited for the first com- 
missioning and physics run with HLT selection enabled. Note that current rate estimates are affected by 
an uncertainty factor of two or three coming from the theoretical uncertainties in the jet cross-section. 
Though the LI pass-through triggers will mainly select background events they are useful samples and 
can be used to look for clusters and tracks in the whole r\ — (j) space and check that known dead and noisy 
channels and/or disconnected regions are correctly flagged. In addition the distributions of the e// selec- 
tion variables from data can be compared to those from Monte Carlo simulations. For example, signal 
over background distributions will be studied for variables with good discrimination power (e.g. Rcore). 
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L2 and EF performance can be checked and studies undertaken to evaluate the tracking performance of 
the different L2 tracking algorithms. 

In the next phase of the commissioning the HLT selection will be progressively enabled. Several mon- 




E T (GeV) E T (GeV) 



Figure 5: LI rate for single (left) and double (right) e/y triggers for a luminosity of L~10 31 cm 2 s 
The open correspond to non-isolated triggers. Errors are statistical only. 

itoring triggers will be kept, though with increased prescale factors applied. For example, L2 and EF in 
pass-through mode or loose y triggers (to monitor tighter calorimeter cuts as well as tracking efficiency 
for the electron triggers). 

3.2 First physics run 

After the commissioning phase of the detector the first physics run is foreseen. At the physics run, each 
of the trigger menu signatures including the HLT will be enabled as soon as the understanding of the 
detector and trigger allow. Table 3 gives an overview of the main electron and photon physics triggers. 
The aim is to select events with at least one electron above ~ 10 GeV or one photon above ~ 20 GeV, 
in addition to the relevant double object triggers, e.g. for selecting //y, Y, and Z events. J/\y — > ee 
and T — > ee events are particularly demanding events for the trigger system. The trigger rates errors for 
each trigger menu are statistical. Due to their relatively low masses, the electrons produced in the J/y 
and T decays are very soft (with an average transverse momenta of less that 5 GeV). This poses a huge 
challenge to the LI calorimeter trigger. Its performance at the low-energy end is limited by the noise of 
typically 0.5 GeV per Rol. A 3 GeV threshold is the limit for the LI trigger. The 6.5 kHz LI output rate 
for 2e5 makes it one of the biggest consumers of the total bandwidth. 

To keep the rate at an acceptable level selections for the electron triggers at low-pi (e.g. 2e5) have 
to apply tighter HLT selections compared to the higher-px triggers. A prescaled e5 trigger will allow 
measurement of the efficiencies and optimization of the selection cuts. A range of signatures is foreseen 
in the trigger menu to adapt to running conditions. If the trigger rate should prove to be too high back-up 
items with higher thresholds are included and/or prescale factors can be adjusted. Note, the uncertainties 
related to the detector response and the theoretical uncertainties on the jet cross-section. To ensure the 
selection of important physics channels, redundant triggers are present. As an example Table 4 shows 
the various triggers which will be useful for triggering on W — > ev, these include triggers which combine 
electrons and missing transverse energy. 
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Signature 


LI 
item 


EF 

selection 


LI Pre- 
Rate scale 


HLT 
Rate 


Motivation 


elO 


EM7 


medium 


5.0 kHz 1 


21 Hz 


e ± from b,c decays, E/p studies 


y20 


EM18 


loose 


0.3 kHz 1 


5.4 ±0.2 Hz 


direct photon production, jet calibration 












using y-jet events, high-p T physics 


e20 


EM18 


loose 


0.3 kHz 1 


4.3 ±0.2 Hz 


high-p T physics, Z — ► ee, W — > ev 


eml05_passHLT 


EM 100 




1 Hz 1 


1.0±0.1Hz 


New physics, check for possible problems 


2e5 


2EM3 


medium 


6.5 kHz 1 


6 Hz 


J / y/ — > ee, Y — ► ee, Drell-Yan production 


2yl0 


2EM7 


loose 


0.5 kHz 1 


<0.1 Hz 


di-photon cross-section 


2el0 


2EM7 


loose 


0.5 kHz 1 


0.4 ±0.2 Hz 


Z^ee 



Table 3: Summary of the main electron and photon triggers envisaged for the first physics run at 
L~10 31 cnT 2 s -1 . In this table the main physics triggers are listed including their expected rates and 
their physics motivation. 



Signature 


LI 
item 


EF 

selection 


Pre- Rate 
scale [Hz] 


Motivation 


e20 


EM18 


loose 


1 4.3 ±0.2 


main physics trigger 


y20 


EM18 


loose 


1 5.4±0.2 


redundancy, check of tracking eff. and 










performance 


el5_xe20 


EM13_XE20 


loose 


1 1.0±0.4 


access to lower pj -range 


el0_xe30 


EM7_XE30 


medium 


1 0.3±0.3 


access to lower pj -range 


e20i 


EM 181 


loose 


1 2.8±0.1 


backup if rate too high 


e25i 


EM23I 


loose 


1 1.4±0.1 


backup if rate too high 


e20_xel5 


EM18_XE15 


loose 


1 1.6±0.1 


backup if rate is too high 


y20_xel5 


EM18_XE15 


loose 


1 1.9±0.2 


backup if rate is too high, check tracking eff and 










performance 



Table 4: Summary of the main and redundant triggers for selecting W —>■ ev events foreseen for the 
L~10 31 cm~ 2 s _1 trigger menu. 
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3.3 Electron and photon trigger menus 

This section collects detailed examples of the foreseen trigger menus. Table 5 shows a summary of the 
trigger menu for the first physics run assuming a luminosity L~10 31 cm~ 2 s _1 . Each row corresponds to 
a different trigger signature. The convention to interpret the name of a trigger signature is explained in 
Section 3. For each signature the table has three blocks of information. The first block of three columns 
gives LI information: 

• Name of the LI trigger item. Which includes the transverse energy threshold required and an "I" 
if isolation cuts (as described in Section 2.1) are applied. 

• Prescale factor applied after LI selection. 

• Corresponding rate after prescale factors applied. 
The second block of three columns summarizes the EF: 

• Tightness of the EF selection cuts. 

• Prescale factor. 

• Corresponding HLT rate. 

The last column illustrates some relevant channels that the corresponding trigger would collect. 

Tables 6 and 7 have the same structure described above. Table 6 gives examples of trigger signatures 
with tighter selection cuts, to be used if the rates given by the ones presented in Table 5 are too high. 

A summary of the main trigger signatures foreseen for a higher luminosity scenario of L~ 10 33 cm~ 2 s~ 1 , 
is shown in Table 7. 

4 Electron and photon trigger physics performance 

In this section the performance of the different signatures is presented with trigger efficiency plots as a 
function of transverse energy Ej and pseudo-rapidity r\ . 

The trigger efficiency optimization of a given electron/photon trigger menu is a compromise between 
several factors: trigger efficiency for signal, QCD background rate which depends on the luminosity 
(constrained by allowed HLT bandwidth) and constrains of the average execution time at each trigger 
level. The performance of the main electron and photons triggers has been evaluated for a start-up lumi- 
nosity of L~10 31 cm~ 2 s _1 and for a higher luminosity of L~10 33 cm~ 2 s _1 . In sections 4.2 and 4.3 the 
performance of these triggers is discussed for electrons and photons respectively. For these studies two 
types of events have been used, the so called ideal and the misaligned detector geometry. The misaligned 
detector geometry simulation contains expected distortions of the detector and in addition contains ex- 
tra material coming from a more accurate description of cooling, powering and cabling services. This 
increases significantly the amount of material in the region 1.4 < \r\ \ < 1.8 compared to the ideal geom- 
etry, thus affecting the calorimeter energy calibration which was extracted using the ideal geometry. In 
section 4.2 a comparison is given for the performance based on these two layouts for one of the electron 
triggers. 

4.1 Data Samples 

The signal samples used for electron and photon trigger performance studies in this note are summarized 
in Table 8. For the electron and photon trigger optimization and turn-on curves samples with an energy 
range of 7 < Ej < 80 GeV (flat energy spectrum) are used. 
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Level- 1 






HLT 






Signature 


Item 


Pre- 




Sel- 
ection 


Pre^- 


Rate 
[Hz] 


Motivation 


e5 


EM3 


60 


0.7 


medium 


1 


4.8±0.2 


J/W -> ee, Y -» ee, Drell-Yan 


2e5 


2EM3 




6.5 


medium 


1 


6 


J/y^ee,Y^ee, Drell-Yan 


Jpsiee 


2EM3 


1 


6.5 


medium 


1 


1 


J/y^ee,Y^ee 


elO 


EM7 


1 


5.0 


medium 


1 


21 


e ± from b,c decays, E/p studies 


ylO 


EM7 


1 


5.0 


medium 


100 


0.6±0.1 


e ± direct photon cross-section, 
e-no-track trigger 


el0_xe30 


EM7_ 
XE30 


1 


0.2 


medium 


1 


0.3±0.3 


access low pj -range for 

W ->ev 


2yl0 


2EM7 


1 


0.5 


loose 


1 


< 0.1 


di-photon cross-section 


2el0 


2EM7 


1 


0.5 


loose 


1 


0.4±0.2 


Z — ► e + e~ 


Zee 


2EM7 


1 


0.5 


loose 


1 


< 0.1 


e + e~ 


2el2iX33 


2EM7 


1 


0.5 


tight 


1 


<0.1 


trigger for L^IO 33 cm -2 s _1 


yl5 


EM13 


1 


0.7 


medium 


10 


1.3 ±0.1 


e ± direct photon cross-section 


el5_xe20 


EM13_ 
XE20 


1 


0.2 


loose 


1 


1.0±0.4 


access low pj -range for 

W -» ev 


2gl7i_L33 


2EM13I 


1 


0.1 


tight 


1 


< 0.1 


trigger for L^IO 33 cm -2 s _1 


y20 


EM18 




0.3 


loose 


1 


5.4±0.2 


direct photons, jet calibration 
using y-jet events, high-pr 
physics,check tracking eff. 


e20. 


EM18 




0.3 


loose 


200 


<0.1 


check L2EF performance 


passL2 
















e20 


EM18 




0.3 




125 


0.1 


check L2EF performance 


passEF 
















em20_ 


EM18 




0.3 




750 


0.5±0.1 


check HLT performance 


passEF 
















em20i_ 


EM 181 




0.1 




300 


0.5±0.1 


check LI isolation 


passEF 
















e22iX33 


EM 181 




0.1 


tight 


1 


1.2±0.1 


trigger for L^IO 33 cm~ 2 s _1 


y55X33 


EM18 




0.3 


tight 


1 


1.2±0.1 


trigger for L^IO 33 cm -2 s _1 


eml05_ 


EM 100 




1 




1 


1.0±0.1 


New physics, check for possible 


passHLT 














problems 


yl50_ 


EM 100 


1 


1 




1 


<0.1 


check for possible problems in 


passHLT 














express stream 



Table 5: Summary of triggers for the first physics run assuming a luminosity of L~10 31 cm 2 s 1 . For 
each signature rates and the motivation for this trigger are given. 
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Level- 1 




Event Filter 




Signature 


Item 


Pre- 


Rate 


Sel- 


Pre- 


Rate 


back-up for trigger 






scale 


[kHz] 


ection 


scale 


[Hz] 




e5.e7 


2EM3 




6.5 


medium 




4.3±0.2 


2e5 


e5_el0 


EM3J3M7 




6.5 


medium 




4.3±0.2 


2e5 


3gl0 


3EM7 




<0.1 


tight 




<0.1 Hz 


2gl0, 2el0 


e20jtel5 


EM18_XE15 




0.1 


loose 




1.6±0.1 


e20 for W -> ev selection 


720_xel5 


EM18_XE15 




0.1 


loose 




1.9±0.2 


g20 for W — ► ev selection 


e20i 


EM 181 




0.1 


loose 




2.8±0.1 


e20 


e25 


EM 181 




0.1 


loose 




2.4±0.1 


e20 


e25 


EM 181 




0.1 


loose 




2.4±0.1 


e20 


e25i 


EM23I 




< 1 


loose 




1.4±0.1 


e20 


yl05_ 


EM 100 




« 1 






<0.1 


eml05.passHLT 


el05. 


EM 100 




1 






<0.1 


eml05.passHLT 



Table 6: Summary of backup triggers defined in case the rate is too high for a start-up luminosity of 
L~10 31 cm~ 2 s _1 . For each signature rates and the motivation for this trigger are given. 







Level- 1 






HLT 






Signature 


Item 


Pre- 


Rate 


Sel- 


Pre- 


Rate 


Motivation 






scale 


[kHz] 


ection 


scale 


[Hz] 




2el2i 


2EM7 




0.5 


tight 




1 


Z^ee 


2yl7i 


2EM13I 




0.1 


tight 




~ 1 


new physics e.g. H — > yy 


e22i 


EM 181 




10 


tight 




120 


high-pr electron physics 


7 55 


EM18 




30 


tight 




20±3 


high-pr photon physics 


yl05 


EM 100 




0.1 


loose 




1 


new very high-p^ physics 


el05 


EM 100 




0.1 


loose 




< 1 


new very high-p^ physics 



Table 7: Summary of triggers assuming a luminosity of L~10 33 cm 2 s For each signature rates and 
the motivation for this trigger are given. 
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Physics 


£" r (GeV) 


Geometry 


single photons 


60 


misaligned 


single photon scan 


7-80 


misaligned-ideal 


single photons 


20 


misaligned 


single electrons 


25 


misaligned 


single electron scan 


7-80 


misaligned-ideal 


ee 


- 


misaligned-ideal 


direct J/Psi 




misaligned 


77 


120 


misaligned 


ev 




misaligned 


ee 


500 


misaligned 


Z — ► ee 


1000 


misaligned 


G^yy 


500 


misaligned 


Photon+Jetl 


17-35 


misaligned 


Photon+Jet2 


35-70 


misaligned 


Photon+Jet3 


70-140 


misaligned 


Photon+Jet4 


140-280 


misaligned 


Photon+Jet5 


280-560 


misaligned 


Photon+Jet6 


560-1120 


misaligned 



Table 8: Main electron/photon signal and physics samples. 



^r(hard) 


jet filter 


a after filtering 


6GeV 


default 


4.214mb 


7GeV 


loose 


6.531 mb 


7GeV 


default 


3.788 mb 


17GeV 


loose 


0.373 mb 


17GeV 


default 


0.191 mb 


35GeV 


loose 


0.039 mb 


35GeV 


default 


0.021 mb 



Table 9: Main background samples and their cross sections. 
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Background samples and their expected cross-sections are summarized in Table 9. The minimum bias 
Monte Carlo sample corresponds to our best knowledge of the inclusive expected backgrounds (including 
both hard and soft processes). Unfortunately most of the events in it are at low energies < 7 GeV and it 
is not very practical for high-statistics studies above this energy. To increase our efficiency for the events 
with electrons and photons we use a filter which is based on the combination of the EM cluster Ej cut 
and the area of the EM cluster. Two types of the filter, "default" and "loose" are used to check for the 
potential biases in the LI trigger rate which could occur if the area required by the filter is too small or 
the E T cut too high for the trigger threshold studied. 

For the trigger rate studies in the intermediate energy range 6—17 GeV we use minimum bias events 
with a transverse energy above 6 GeV and default filter (a rejection factor of 16.61). Above 17 GeV the 
dijet processes become the most prominent background for electrons and photons. Dijet samples with a 
threshold cut of 15 GeV and different level of filters (default and loose respectively) are used to provide 
background estimates in that area (the physics processes such as W, Z, direct photons have been added). 
For the very high energy studies dijet samples with a threshold cut of 35 GeV are used. 

For this note two detector geometries are used: 

• "ideal" which uses the detector geometry to our best knowledge. 

• "misaligned" where the detector has misalignments and material distortions. 

The misaligned detector geometry has been generated to test the robustness of our reconstruction and 
trigger algorithms with respect to incorrect alignment and calibration of the detector due to unexpected 
excess of material. Extra material was added in the misaligned detector geometry with respect to the 
ideal: 

• For the inner detector, extra thin layers of material were added in the azimuthal angle range of 
0 < 0 < K only. The amount of material added varies in the z direction, from a few percent of X 0 
on the active detector elements up to 1 Xq in the areas occupied by services. 

• For the electromagnetic calorimeter, more material was added in the barrel cryostat (~ 8 — 1 1 % of 
Xq), between the barrel presampler and first calorimeter sampling (~ 5% Xq, always for positive 
0), and in the gap between the barrel and endcap cryostats (factor 1.7 increase of material density). 

4.2 Electron trigger performance 

The performance of the electron trigger has been evaluated for the trigger menus foreseen for the lumi- 
nosity expected at early running L~40 31 cm~ 2 s _1 and for a higher luminosity of L~10 33 cm~ 2 s _1 . 
The trigger efficiencies are quoted with respect to electrons identified with offline particle identification 
cuts [6]. The trigger efficiencies have been evaluated using simulated single electron samples with a 
transverse energy spectrum of 7 < Ej < 80 GeV and other physics samples such as / j\\f —> ee, Z —> ee, 
etc. Rates have been estimated using simulated dijet events or minimum bias background depending on 
the trigger thresholds. 

In the following the performance for the start-up trigger items e5, elO, e20, el05 and for a possible 
e22i trigger for higher luminosity running are discussed. 

Figure 6 shows the efficiencies as a function of Ej and \r\ | of the 1ow-/?t e5 trigger for the three trigger 
levels LI, L2 and EE The e5 trigger menu is a trigger with prescale factor at L~10 31 cm~ 2 s _1 and the 
main trigger is the 2e5 trigger, which applies the same selection cuts. The efficiencies are obtained for 
electrons from a misaligned detector geometry J fry — > ee sample. In the offline analysis these electrons 
typically will be identified via the "medium set" of offline "soft-electron" selection cuts (see [7] for 
more details). Therefore, the efficiencies shown in Fig. 6 are normalized with respect to such an offline 
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Figure 6: Trigger efficiencies at LI (solid circles), L2 (open squares) and EF (solid triangles) as a function of true electron Ej 
(left) and 1 77 1 (right) for the e5 (top), elO (second from top), e20 (third from top) el05 (bottom) menu items. The efficiencies are 
obtained from the following Monte Carlo simulated samples: J/yf — ► ee decays simulated with misaligned detector geometry 
fore5 trigger item, Z' — » ee (lTeV)forel05 and single electrons simulated with ideal detector geometry fore 10 ande20. Trigger 
efficiencies are normalized with respect to the medium set of offline soft-electron cuts for e5, with respect to the medium set 
of offline electron cuts for elO and with respect to the loose set of offline electron cuts for e20 and el 05. For trigger efficiency 
versus |?7| plots, an E T cut according to the corresponding menu item has been applied: for e5 Ej > 10 GeV, for elO Ej > 15 
GeV, for e20 E T > 30 GeV and for el 05 E T > 130 GeV. For e5 trigger item no data is shown for electrons for | tj | > 2 as this is 
beyond the coverage of the transition radiation tracker whose information is used for the offline electron selection. Errors are 
statistical only. 
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selection. As seen in the right figure, the efficiencies drop significantly in the transition region between 
the barrel and end-cap calorimeter. A better optimization of the trigger selection cuts in this region might 
help to recover part of the inefficiencies. Figure 6 shows LI, L2 and EF efficiencies as a function of 
E T and of \r\ \ of the e 10 trigger, which is the lowest p T unprescaled single electron trigger foreseen for 
running at L~10 31 cm~ 2 s _1 . The elO trigger selects events with at least one good electron candidate 
with Ej > 10 GeV. The efficiencies are obtained for single electrons using ideal detector geometry and 
are normalized with respect to the medium set of offline electron cuts as discussed in Section 2.3. The 
efficiency reaches a plateau value for E T approximately above 15 GeV and is quite uniform as a function 
of \r\ |, except for a 10 — 20% dip in the transition region between the barrel and end-cap calorimeters. In 
Fig. 7 elO trigger efficiencies are compared for samples that use ideal and misaligned detector simulation. 
The biggest effects are observed at lower Ej for the turn-on curve (left) and in the region 1.4 < \r\ \ < 1.8 
(right) where a significant increase of material in the so-called misaligned geometry with respect to the 
ideal one was introduced. Trigger efficiency plots as a function of Ej and | tj | for the electron triggers 
e20 and el05 are shown in Fig. 6. Compared to the elO trigger signature, the e20 trigger applies looser 
electron identification cuts. The el 05 trigger is aimed at selecting very high pj electrons with very loose 
selection cuts for L~10 32 cm~ 2 s _1 . At L~10 31 cm~ 2 s _1 this trigger will only apply the LI selection. 

For several physics channels a global trigger efficiency with respect to the offline loose selection is 
shown in Table 10 for each trigger level. An average trigger efficiency of ~ 98% is obtained after the EF 
level trigger. An efficiency close to 100% is expected for very high pj electrons, therefore the selection 
has been optimized towards this goal. 

Figure 8 (left) shows LI, L2 and EF efficiencies as a function of Ej of the signature e22i, the menu 
item selecting an electron with Ej > 22 GeV. The efficiencies are obtained for single electrons using 
ideal detector geometry are normalized with respect to a loose set of offline electron cuts as discussed 
in [6]. The efficiency reaches a plateau value for Ej approximately above 25 GeV. 

Figure 8 (right) shows an example of trigger efficiency dependency on the offline electron identi- 
fication cuts. Trigger efficiency has been determined with respect to loose, medium and tight offline 
electron identification selections as described in [6] for el5i. This menu item applies medium electron 
identification cuts at EF, therefore its efficiency with respect to loose offline reconstructed electrons is 
significantly lower than with respect to medium or tight offline electrons. 



2 1r 
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Figure 7: Ratio of trigger efficiencies for single electrons reconstruction in misaligned and ideal detector 
geometry as a function of true electron Ej (left) and | r\ | (right) for the elO menu item. Events in the \r\\ 
plot are required to verify Ej > 15 GeV. Errors are statistical only. 
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Figure 8: Trigger efficiencies at LI, L2 and EF as a function of true electron Ej for the e22i menu 
item (left). The efficiencies are obtained for single electrons using ideal detector geometry and are 
normalized with respect to loose set of offline electron cuts. Trigger efficiency dependency on the offline 
electron identification (right). Trigger efficiency for the el5i signature is determined with respect to 
loose, medium and tight offline electron identification selection (described in [6]). Errors are statistical 
only. 

4.3 Photon trigger performance 

The physics performance of the photon trigger menus presented in Section 3 has been evaluated for early 
running at L~10 31 cm~ 2 s _1 and for a luminosity of L~10 33 cm~ 2 s _1 . The performance has been eval- 
uated in terms of trigger efficiency after each trigger level using simulated single photons with transverse 
energies between 7 and 80 GeV. Rate estimates were calculated using dijet background simulations. 
A y20 trigger with loose selection has been defined for the first physics run at L~10 31 cm~ 2 s _1 . At this 
stage the selections still need to be understood and the photon trigger is also used to check the tracking 
part of the electron selection. Therefore, the same calorimeter selection cuts are used as for electrons. 

Figure 9 (top right) shows the relative rate for each trigger level as a function of | r\ \ . Photons in the 
transition region between barrel and end-cap calorimeters (1.37 < |tj | < 1.52) are not well measured and 
are excluded in the physics analysis. 

The single photon trigger at L~10 33 cm~ 2 s _1 selects photons above Ej =55 GeV. To keep the rate under 
control, this trigger has to apply very tight L2 and EF calorimeter selections. 

Figure 9 shows the trigger efficiencies as a function of Ej and |tj| for the y55 trigger using single 
photons. Only photons with \r\ \ < 1.37 or 1.52 < |tj| < 2.45 are considered. With respect to loosely 
selected offline photons the y55 trigger is 95.5% efficient. 

The double object trigger 2/17/ is another of the main photon triggers for running at L~ 10 33 cm~ 2 s~ 1 . 
Since the single photon dataset only contains one photon per event, the 2yl7i trigger efficiency was 
estimated as the square of the single photon trigger efficiency (Eff 2r i7i= Eff 2 17l ). The overall trigger 
efficiency estimated from this sample is 93.9 ± 0.2%. 

Figure 9 shows the 7 17i turn-on curve and efficiency as a function of | r\ | using single photon events. 
The trigger efficiencies shown for the turn-on are normalized with respect to tight offline photon selection 
criteria. In addition, the photons are required to be within the region \r\ \ < 1.37 or 1.52 < \r} \ < 2.45. 
This plot shows that the yl7i trigger is well set up and is fully efficient at 25 GeV. 

The photon trigger performance has been studied not only for single photon samples but also for 
several physics channels. Table 1 1 shows a summary of photon trigger efficiencies for simulated samples 
of standard model direct photon decays, Higgs decaying into two photons (low mass range 120 GeV) 
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Dataset (geometry) 


Trigger Level 


Trigger efficiency (%) 


W —> ev (misaligned) 


LI 


98.2 + 0.1 


(e20) 


LI + L2 


96.0 ±0.2 




LI + L2 + EF 


94.3 ±0.2 


Z' — > ee (misaligned) 


LI 


99.7+;;^ 


(el 05) 


LI + L2 


98.9 ±0.7 




LI + L2 + EF 


98.8 ±0.7 


electrons 500 GeV (misaligned) 


LI 


99.1 ±0.6 


(el05) 


LI + L2 


97.6 ±0.5 




LI + L2 + EF 


97.5 ±0.5 



Table 10: Global trigger efficiencies for the trigger items e20 and el 05. The signal efficiencies are 
determined from several different signal samples: W —>■ ev Z' —>■ ee and a sample of single electrons 
of fixed transverse energy of 500 GeV. The simulation used misaligned detector geometry. Trigger 
efficiencies are determined with respect to loose offline electron selection. In the W — » ev case the offline 
reconstructed electron is required to be within the | tj | < 2.5 region and to have a transverse energy above 
25 GeV. Efficiencies for the el05 item are determined in the kinematic region |tj| < 2.5, the transition 
region between barrel and end-cap calorimeters removed (1.37 < \ < 1.52), and with a minimum true 
transverse energy of 130 GeV. 



Dataset (luminosity) 


Trigger Level Trigger efficiency (%) 


7+Jet (L~10 31 cm" 2 s" 1 ) 


LI 






100.0 ±0.0 


(720) 


LI 


+ L2 




99.8±0.1 




LI 


+ L2h 


-EF 


95.0 ±0.3 


G — > 77 (L~10 32 cm" 2 s" 1 ) 


LI 






99.5 ±0.1 


(7105) 


LI 


+ L2 




98.8±0.1 




LI 


+ L2h 


-EF 


98.7 ±0.2 


H — ► 77 (L~10 3;i cm" 2 s" 1 ) 


LI 






95.6 ±0.2 


arm 


LI 


+ L2 




93.6 ±0.2 




LI 


+ L2h 


-EF 


91.2±0.2 



Table 1 1 : Trigger efficiencies after each trigger level, normalized with respect to the loose y selection, 
for different physics channels covering a wide range in Ej. In the case of H — > 77 standard kinematical 
cuts are applied in addition. For each physics process the efficiencies for the main physics trigger to 
select these events for a given luminosity scenario are shown. 



and an exotic signature: graviton decaying into a pair of photons G^ 77 (500 GeV). 

Trigger efficiencies after each level are given. Efficiencies are determined with respect to a loose 
offline selection using the main physics trigger for a given luminosity to select these events. In the case 
of H — > 77 standard kinematical cuts are also applied. A more detailed study of trigger efficiencies for 
different physics channels can be found in [8] and [9]. 

5 Trigger robustness studies 

The electron/photon HLT selection must be robust against detector effects such as mis-calibration, mis- 
alignment, dead and noisy read-out cells or sectors, luminosity and beam conditions changes etc. This 



637 



Trigger - Physics Performance Studies and Strategy of the Electron and . 




Figure 9: Trigger efficiencies at LI, L2 and EF as a function of the generated photon Ej (top left)and 
| r\ | (top right) for the y20 trigger. The efficiencies are obtained for single photons simulated with ideal 
detector geometry and are normalized with respect to the loose set of offline photon cuts. Note, the \r\\ 
plot includes an additional cut of E T > 23 GeV. Trigger efficiencies at LI, L2 and EF as a function 
of the generated photon Ej (middle left) and \r\\ (middle right) for the y55 trigger. The efficiencies 
are normalized with respect to photons with Ej > 55 GeV passing the loose set of offline photon cuts. 
Trigger efficiencies at LI, L2 and EF as a function of the generated photon Ej (bottom left) and \r\\ 
(bottom right) for the yl7i trigger. The efficiencies are normalized with respect to the tight set of offline 
photon cuts. Errors are statistical only. 



will be especially important during early running. Depending on the bunch structure of the LHC, the 
effect of pile-up might already be important even at a luminosity of L~10 32 cm~ 2 s _1 . More than 
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Trigger Level 


el2i no pile-up 


el2i with pile-up 


e22i no pile-up 


e22i with pile-up 


LI 


94.8 ±0.1% 


94.8 ±0.3% 


96.0 ±0.1% 


95.5 ±0.3% 


LI + L2 


86.8 ±0.2% 


86.7 ±0.4% 


90.2 ±0.2% 


89.4 ±0.4% 


LI + L2 + EF 


81.5 ±0.2% 


81.3 ±0.5% 


88.7 ±0.2% 


88.0 ±0.4% 



Table 12: The el2i and e22i trigger efficiency from a sample of electrons of 7< Ej < 80 GeV, with and 
without pile-up at a luminosity of L~2x 10 33 cm~ 2 s _1 



one proton-proton interaction might occur per bunch crossing, the proton-proton interactions that are 
not interesting for analysis but happen in the same bunch crossing that the interesting one are typically 
denoted as 'pile-up'. At L~2xl0 33 cm~ 2 s _1 around 4.6 minimum bias events are expected per bunch 
crossing. In this section only the effects of pile-up and mis-calibrations due to additional detector material 
in front of the calorimeter are discussed. 

5.1 Effect of pile-up 

The robustness of several electron triggers was studied with simulated data which included overlapping 
events ("pile-up") for a luminosity L~2x 10 33 cmT 2 s _1 . The presence of pile-up might have effects in 
reconstruction efficiency, (mostly for tracks but also for calorimeter energy clusters), as well as in the 
identification of isolated electrons and photons, both at the trigger and offline reconstruction level. 

Table 12 shows the effect of pile-up on the el2i and e22i trigger for single electron events. As can 
be seen the effect of pile-up on the trigger efficiency at L~2x 10 33 cm~ 2 s _1 is ~ 3%. There is a ~ 2% 
effect due to the isolation criteria at LI when tight isolation cuts are applied (el2i) and a ~ 1% loss 
at the HLT level. This results show that the trigger efficiency is only slightly more affected by pile-up 
effects than offline electron reconstruction. The effect of pile-up on trigger performance determined with 
respect to MC-Truth electrons as well as on the background rate is being studied. 

5.2 Effect of additional detector material 

A simulated data sample consisting of single electrons as described above was used to study the effect 
of additional inactive material in the detector. Despite the efforts to accurately describe the detector in 
Monte Carlo simulation it is not expected to be perfect. The biggest and more problematic differences 
between simulation and reality are expected to come from the description of inactive material (for exam- 
ple pipes for cooling, power and signal cables, etc). Trigger robustness against those possible distortions 
has to be tested. Methods to identify and correct those effects have to be developed. The detector simula- 
tion used to produce this data sample included distorted material distributions in both the inner detector 
volume and the electromagnetic calorimeter. 

A detailed description of the material added in the Monte Carlo sample is given in Section 4.1 The 
amount of extra material with respect to the non-distorted simulation, grows from a few percent of a 
radiation length at tj = 0 up to ~ \Xq at 1.5 < \ < 1.8, and then decreases towards higher values of 
|tj|. The amount of material added in front of the active elements was larger than the uncertainty on the 
material distribution. 

The effect of the extra inactive material on the electron trigger was studied by comparing the trigger 
efficiency for 0 > 0, where extra material was added in the detector simulation, and for 0 < 0, where 
no extra material was added. The resulting efficiencies are shown in Fig. 10. The efficiency is plotted 
as a function of the kinematic variables of the electron candidate reconstructed offline. A loose offline 
electron selection was used for normalization. 
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Figure 10: Effect of additional inactive material in the detector on the electron trigger efficiency. The 
trigger efficiency is compared for the nominal material distribution (at 0 < 0) and for increased inactive 
material (at 0 > 0) for the electron triggers elO and el5i. The efficiency is plotted as a function of |tj| 
(left) and Ej (right) of the electron candidate reconstructed offline. The left histograms correspond to 
the el5i trigger only. Errors are statistical only. 

It can be seen that the effect of the added material is more pronounced for the el5i signature than 
for elO. The el5i signature applies tighter selection cuts than the elO. In particular el5i requires the 
so-called medium electron identification cuts in EF, this explains the lower trigger efficiency for el5i. If 
trigger efficiency was computed with respect to medium or tight offline electron identification selection, 
the trigger efficiency would be higher, as shown in Fig. 8. 

6 Trigger efficiency determination from real data 

The trigger efficiency will need to be determined from data, reducing the dependence on Monte Carlo 
simulation as much as possible. Before data-taking starts some of the methods to study the trigger 
performance without relying on the MC-Truth information are being developed, an example is given in 
this section. 

The so-called "tag and probe" method has been studied using a Monte Carlo simulation sample of 
Z — ► ee. In the first subsection the basis of this method is explained, followed by the detailed explanation 
of the selection criteria chosen for this study, the trigger efficiency computation and its comparison with 
MC-Truth information. In the following subsection the performance of the method is presented for some 
loose trigger selections that will be used in early running. 

6.1 Efficiency extraction method 

The so-called "tag and probe" method uses offline identification of Z — > ee decays to select a clean 
sample of electrons, which are then used to determine the electron trigger efficiency. During data-taking 
a data sample to perform an analysis will be characterized by a given trigger signature being satisfied. 
In the study summarized in this section the data sample is defined by a given single electron trigger 
signature. The electron candidate that has satisfied the trigger is reconstructed and identified offline, it 
is the so-called "tag" electron. Z —>■ ee decays are selected requiring a second electron to be identified 
offline together with some identification conditions on the Z particle. This second electron is the so- 
called "probe" and it is used to study the trigger performance, since it is know to be a "good electron" it 
can be used to verify if electron trigger selection cuts are efficient to identify electrons. 
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The discrimination criteria applied to identify offline the tag and probe electrons and the Z can vary. 
In the following paragraphs we specify the identification criteria chosen to perform the study presented 
in this section. 

Selection of the tag electron involves tight electron identification cuts [6] in the offline reconstruction 
to reduce background, and this electron must also trigger the event on its own, through a single electron 
trigger chain. As trigger efficiencies are measured with respect to one of the established offline electron 
definitions (the so-called "loose", "medium" or "tight" electron identification set of cuts, described in 
[6]), this selection is the one initially applied to the probe electron. 

Additionally, both offline reconstructed electrons must pass kinematic cuts, i.e. Ej > Ej ut and \r\ \ < 
2.4, and not lie in the region between the barrel and endcap calorimeters (1.37 < \t]\ < 1.52 region is 
excluded). The value of Ej ut is chosen to be where the efficiency reaches its high-energy plateau in curves 
that show trigger efficiency versus transverse energy (such as the one shown in Fig. 11 ). When a trigger 
efficiency is plotted as a function of Ej or r\ , the relevant kinematic selection is relaxed. Finally, there 
are topological constraints, namely that the two electrons have opposite charge, and that their combined 
invariant mass lies in the range 70 < m ee < 110 GeV. 

At this point, the pair of electrons (tag and probe) may be considered for analysis. The trigger 
efficiency is defined by the frequency with which the probe electron in this sample passes the relevant 
trigger selection. 

It is perfectly possible for the probe electron to satisfy the tag selection as well. In this case, the 
roles of tag and probe may be swapped, the tag becoming the probe and vice versa. Because the tag 
selection is at least as tight as the probe selection at every stage, the new probe passes the trigger selection 
automatically. Thus, tag and probe events fall into one of three categories: 

• N\f events where only one electron passes the tag cuts and the probe fails the trigger selection 

• Nip events where only one electron passes the tag cut and the probe passes the trigger selection 

• N2 P events where both electrons pass the tag selection 
Counting these events, the measured efficiency may be expressed as 



where N p and Nj are defined by this equation. Assuming that the uncertainty in each variable is 
simply \/N~, the statistical error on e is 



Equation 2 may be generalized to allow for a more comprehensive uncertainty in N p and Nj. Equa- 
tions 1 and 2 can be extended to be used for differential trigger efficiency measurements. 

When using Monte Carlo simulated events, the truth record (MC-Truth) can be used to provide an- 
other estimate of the trigger efficiency. To understand the systematic uncertainty of the tag and probe 
method, the fractional efficiency difference between the trigger efficiency determined using tag and probe 
method (e« tag ani j pm be") an d the trigger efficiency determined with respect to the MC-Truth information 
(£MC-Truth) can be used: 

Fractional Efficiency Difference = £ ta § and P robe £mc Truth ^ 

^MC-Truth 

To reduce kinematic bias, the MC-Truth events have been required to satisfy the same kinematic cuts as 
the tag and probe events, although some bias can remain from the detector resolution. To reduce this, and 



£"tag and probe" — 



Ni p + 2N 2p _N P 
Ni f + Ni p + 2N 2 p~ N T 



(1) 




(2) 
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estimate just the bias from the tag and probe method itself, reconstructed quantities (£> and T]) are used 
to determine the acceptance. Non-reconstruction of a true electron is not a problem here, as the trigger 
efficiency is always measured relative to some level of offline reconstruction. 



Trigger Level 


w.r.t. loose (%) 


w.r.t. medium (%) 


w.r.t. tight(%) 




(truth) 


(truth) 


(truth) 


LI 


99.995 ± 0.005 


99.995 ± 0.005 


99.997 ± 0.005 




(99.994 ± 0.002) 


(99.995 ± 0.002) 


(99.998 ± 0.001) 


L2 


98.74 ± 0.07 


99.59 ± 0.04 


99.68 ± 0.04 




(98.67 ± 0.03) 


(99.54 ± 0.02) 


(99.62 ± 0.02) 


EF 


98.66 ± 0.07 


99.15 ±0.06 


99.96 ± 0.01 




(98.59 ± 0.03) 


(99.12 ±0.02) 


(99.96 ± 0.06) 


LI + L2 + EF 


97.41 ± 0.09 


98.74 ± 0.07 


99.63 ± 0.04 




(97.28 ± 0.04) 


(98.65 ± 0.03) 


(99.57 ± 0.02) 



Table 13: Single object tag and probe efficiencies for the elO selection used in the 2el0 signature, with 
comparison to efficiencies derived using truth information from simulation. Efficiencies are given with 
respect to the previous trigger level(s), as well as for the whole trigger (rows) and the given offline 
electron identification selection (columns). The errors given are statistical only. Errors on tag and probe 
values are scaled up, to correspond to 50 pb _1 of data. For this table, the invariant mass cut is 70 < m ee < 
100 GeV. A signal sample of Z° —> ee Monte Carlo simulation without background was used to obtain 
these results. 



6.2 Efficiency extraction for early running 

Early running at ATLAS will be vital for understanding the performance of the detector as well as the 
trigger and the offline reconstruction. There will be several single object triggers with low thresholds 
and loose selections that will be impossible to use later on in the experiment. Table 13 shows example 
results for the elO trigger selection used to build the 2el0 signature. The single object efficiency is 
shown, as the tag electron has been required to pass the tighter single electron trigger elO. Both trigger 
signatures are listed in Table 3. Table 13 displays the relative trigger efficiency for each trigger level 
as well as the overall one, comparing results obtained from tag and probe method and from simulation 
truth information, that show a good agreement within the statistical error. Figure 1 1 shows these results 
differentially, along with the fractional efficiency difference defined in Equation 3. This shows good 
agreement between the two methods, and that the loose elO trigger has high efficiency for Z° — ► ee 
electrons. The results presented in this Section have been obtained from a sample of Z° — >• ee Monte 
Carlo simulation without adding possible background. Studies performed adding Monte Carlo simulated 
background have shown that the presence of background could degrade the resolution of the method 
to a few percent level. The use of a low Ej threshold and the loose electron identification cuts on the 
probe offline selection means that backgrounds will be significantly higher than in standard data-taking 
at higher luminosities, in which tighter selection cuts would be applied. The Ej low threshold will, 
however, mean that background will be present in the invariant mass distribution of the two electrons for 
values smaller than the Z mass, allowing both sidebands to be used for background subtraction below 
the Z peak. Even with higher Ej thresholds, where only one sideband can be used, it is possible to 
extract the correct signal with a small additional systematic error [10]. 

The e20 signature foreseen for early running phase has looser electron identification cuts than the 
other single electron signature present in this menu, elO. Even though no isolation to other electro- 
magnetic or hadronic activity in the calorimeters is required, the threshold of the transverse energy is 
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Figure 11: Single object tag and probe efficiencies for the elO selection of the 2el0 trigger signature. The 
efficiencies shown are relative to a tight offline electron identification selection as described in [6], as a 
function of the reconstructed £V(left) and 77 (right). The tag and probe method (points) is compared to 
MC truth (lines). The lower two plots show the fractional efficiency difference (see Equation 3) between 
the two. The number of events used corresponds to 50 pb _1 . For this figure, the invariant mass cut is 
70 < m ee < 100 GeV. Errors are statistical only. A signal sample of Z° — > ee Monte Carlo simulation 
without background was used to obtain these results. 

sufficient to sustain a manageable rate. The trigger efficiencies for the different trigger levels with re- 
spect to the offline electron identification selections for the probe electron are shown in Table 14, both 
for the "tag and probe" method and with respect to MC-Truth, showing good agreement within the sta- 
tistical error, and high trigger efficiencies. In Fig. 12 the trigger efficiency for e20 signature as a function 
of Ej and 77 is presented. It can be seen that the L2 efficiency is close to 100% with respect to the LI 
efficiency already at very low transverse energy, the EF is almost 100% efficient with respect to L2. The 
optimization for this trigger item made to minimize the loss of efficiency in these areas, especially not 
losing efficiency in the HLT with respect to LI at large transverse energy. In the bottom part of the same 
figure the fractional efficiency difference between the "tag and probe" and MC-Truth method as defined 
in Eq. 3 is shown, good agreement between both methods is observed. 
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Trigger Level 


w.r.t. loose (%) 


w.r.t. medium (%) 


w.r.t. tight(%) 




(truth) 


(truth) 


(truth) 


LI 


99.74 ± 0.03 


99.83 ± 0.03 


99.88 ± 0.02 




(99.84 ± 0.01) 


(99.93 ± 0.01) 


(99.95 ± 0.01) 


L2 


98.55 ± 0.07 


99.48 ± 0.04 


99.58 ± 0.04 




(98.48 ± 0.03) 


(99.44 ± 0.02) 


(99.53 ± 0.02) 


EF 


98.67 ± 0.07 


99.16 ±0.06 


99.96 ± 0.01 




(98.60 ± 0.03) 


(99.13 ±0.02) 


(99.959 ± 0.06) 


LI + L2 + EF 


97.0 ± 0.01 


98.48 ± 0.08 


99.41 ± 0.05 




(96.95 ± 0.04) 


(98.51 ± 0.03) 


(99.44 ± 0.02) 



Table 14: Tag and probe trigger efficiencies for the e20 signature, with comparison to efficiencies de- 
rived using truth information from simulation. Efficiencies are given with respect to the previous trigger 
level(s), as well as for the whole trigger (rows) and the given offline electron identification selection 
(columns). The errors given are statistical only. Errors on tag and probe values are scaled up, to corre- 
spond to 50 pb _1 of data. For this table, the invariant mass cut is 70 < m ee < 100 GeV. A signal sample 
of Z° — > ee Monte Carlo simulation without background was used to obtain these results. 
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Figure 12: Trigger efficiency from the "tag and probe" method with Z — »■ ee for the e20 trigger signature. 
The efficiencies are shown w.r.t. a tight offline electron selection as described in [6], as a function of the 
reconstructed E T (left) and the reconstructed 77 (right). The "tag and probe" method (points) is compared 
with the MC-Truth method (solid line) for all three trigger levels, LI (solid circles), L2 (open triangles), 
and the EF (solid squares), The fractional efficiency difference (see Eq. 3) between the "tag and probe" 
and MC-Truth methods is shown in the bottom two plots. The number of Z — > ee events used corresponds 
to lOOpb -1 . Errors are statistical only. A signal sample of Z° — ► ee Monte Carlo simulation without 
background was used to obtain these results. 
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7 Summary 

An electron and photon trigger baseline for LHC commissioning and the first physics run have been 
presented. Many studies and tests have been summarized, without finding any problems that could com- 
promise the successful start-up of ATLAS data-taking. Electron and photon trigger performance have 
been studied in detail in a wide energy range using single electron and single photon simulated samples. 
Trigger efficiency dependencies on transverse energy (Et) and pseudo-rapidity (77) have been observed 
and studies are ongoing to explore the possibility to minimize them. Trigger efficiencies for the e/y selec- 
tion have been evaluated for simulations of different physics samples such as H —>■ yy,Z —>■ ee, W —>■ ev 
and some exotic channels. The electron and photon trigger menus for these selections have proven to 
be efficient above the threshold of the corresponding transverse energy cut with respect to offline recon- 
struction. The corresponding background rates have been estimated to be within the allocated bandwidth. 
It should be noted that due to current uncertainty in the cross-sections used for background estimations, 
rates could vary significantly. This could require the use of tighter selections with corresponding loss 
of signal efficiency. Some examples of backup signatures for such situations have been shown. Trig- 
ger robustness studies have been started, and are currently being extended. Trigger reconstruction and 
selection efficiencies do not show significant drops with respect to offline reconstructed electrons and 
photons. A method of trigger efficiency determination from data using Z — > ee decays has been studied 
in depth. This results compare well to trigger efficiency computed with respect to Monte Carlo truth. 
The extension of this method to other channels and samples are being developed. 
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Abstract 

The overall functionality and performance of the muon trigger system with re- 
spect to data produced as part of the ATLAS Computing System Commission- 
ing effort is described. The physics performance in terms of trigger efficiency 
and accepted rates is studied for the muon inclusive signatures for different 
luminosity scenarios. Dedicated studies on physics samples with single and 
double muon final states are also performed in order to evaluate the trigger ef- 
ficiencies on realistic data and background rejection capabilities. Methods to 
evaluate muon trigger efficiencies from real data are discussed. Furthermore, 
strategies to use the ATLAS calorimeters to tag and select isolated muons are 
presented. 

1 Introduction 

Triggering and identifying muons will be crucial for many LHC physics analyses. In accordance with 
the ATLAS general trigger scheme, the muon trigger system has three distinct levels: LI, L2, and the 
Event Filter (EF). The paper discusses the software tools used for muon trigger reconstruction and the 
algorithm selection strategy and trigger configuration. Next, the resolution and selection efficiencies of 
the various muon triggers are presented, followed by a discussion of the trigger rates for various lumi- 
nosities. Subsequently, the rejection of background from in-flight meson decays and selection of isolated 
muons using calorimeter information is discussed. Finally, the trigger performance on the di— muon fi- 
nal states Z — > /x/xand Z' — > /x/x is presented along with a description of plans to determine the trigger 
efficiency from collider data. 

2 Detector simulation and data samples 

The samples used in this paper were produced using a full GEANT4 based simulation of the ATLAS de- 
tector. The trigger simulation options included both standard and B-physics trigger simulation configura- 
tions, which correspond to the standard and the low trigger thresholds (see Section 3). The deterioration 
of efficiency due to the geometrical acceptance and the limited size of coincidence window are taken into 
account in the LI simulation and trigger logic emulator. 

Large samples of single prompt muons, simulated uniformly in tj — 0 , with fixed pj ranging from 

2 GeV to 1 TeV, have been used to study the muon trigger performance. One of the main backgrounds 
for the muon trigger selection comes from in-flight decays of charged kaons and pions. This has been 
evaluated using samples of minimum bias events and single pions, where the mesons are forced to decay 
inside the Inner Detector cavity in order to facilitate the production of a sizable sample of in-flight k/K 
decays. Muon trigger rates were determined using both single muons and minimum bias events. The 
selection of muons using the Tile Calorimeter has been studied using low pj (4 and 6 GeV) single muons 
and semi-inclusive b quark decays, bb — »■ jxX. Muon trigger studies on high-/^ dimuon final states and 
the determination of trigger efficiency from data have been performed using Z — >• /x/xand Z' — >• /x/x as 
signal processes and B — >• /x/x, W boson decays, Z — > XT and top-pair events as background processes. 

3 Muon trigger algorithms and configuration 

The LI muon trigger selects active Rols, in the event using Resistive Plate Chambers (RPC) [1] in 
the barrel (|t]| < 1.05) and Thin Gap Chambers (TGC) [1] in the endcaps (1.05 < \rj\ < 2.4). The 
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trigger algorithms look for hit coincidences within different RPC or TGC detector layers inside the 
programmed geometrical windows which define the transverse momentum region. A coincidence is 
required in both r\ and 0 projections. The information about muon candidates in both the barrel and the 
end-cap is transmitted to the Muon to Central Trigger Processor Interface (MuCTPI) [1], which calculates 
the number of LI muon candidates for six different pj thresholds and takes overlaps between the trigger 
sectors into account by using look-up-tables (LUT). There are several LI signatures each corresponding 
to a different p T threshold: 

• muO, mu5, mu6, mu8, mulO for the low pj selection; 

• mull, mu20, mu40 for the high pj selection. 

The integer numbers after the "mu" symbolize the required pj threshold. LI also provides the coordi- 
nates in 77 and 0 of the selected Rols. The muO threshold represents a LI configuration with completely 
open coincidence windows; it is also called the "Cosmic" threshold as it can be used to trigger on cosmic 
rays during the detector commissioning phase and between the LHC fills. Similar thresholds, labeled 
with "muXX", have been defined for L2 and EF. 

The muon HLT runs L2 and EF algorithms. It starts from the Rol delivered by the LI trigger and 
applies trigger decisions in a series of steps, each refining the existing measurement by acquiring ad- 
ditional information from the ATLAS detectors. A list of physics signatures, implemented in the event 
reconstruction and selection algorithms, are used to build signature and sequence tables for all HLT steps. 
This stepwise and seeded processing of events is controlled by the trigger steering. The reconstruction 
progresses by calling feature extraction algorithms. These typically request detector data from within the 
Rol and attempt to identify muon features. Subsequently, a hypothesis algorithm determines whether the 
identified feature meets the criteria necessary to continue. The decision to reject the event or continue is 
based on the validity of signatures, taking into account prescale and pass-through factors. Thus, events 
can be rejected after an intermediate step if no signatures remain viable. 

The main algorithm of the muon L2 system, muFast, runs on full granularity data within the Rol 
defined by LI. An optimized strategy is used to avoid heavy calculations and access to external services 
to reduce the execution time of the algorithm. After pattern recognition driven by the trigger hits which 
selects Monitored Drift Tubes (MDT) regions crossed by the muon track, a track fit is performed using 
MDT drift time precision measurements. The /^evaluation is performed using LUT. Reconstructed 
tracks in the Inner Detector can be combined with the tracks found by muFast by a fast track combination 
algorithm called muComb. 

The L2 algorithm (mulso) is used to discriminate between isolated and non-isolated muon candidates 
by examining energy depositions in the electromagnetic and hadronic calorimeters. The algorithm is 
seeded by muons selected by muFast or muComb and decodes LAr and Tile Calorimeter quantities in 
cones centered around the muon direction. For the muon selection two different concentric cones are 
defined: an internal cone chosen to contain the energy deposit deposited by the muon itself, and an 
external cone, containing energy only from detector noise, pile-up and jet particles. 

A strategy for tagging muons at L2 in the TileCal is implemented in the TileMuId algorithm. It can 
provide additional redundancy and robustness to the muon trigger, as well as enhance the efficiency in 
the low pi region. The search starts from the outermost calorimeter layer, which contains the cleanest 
signals, and once a deposited energy is compatible with a muon, the algorithm checks the energy de- 
position in the neighboring cells for the internal layers. Candidates are considered tagged muons when 
muon compatible cells are found following a 77 -projective pattern in all the three TileCal layers. There 
are two different variants of this algorithm : one (TrigTileLookForMuAlg) is fully executed on the L2 
Processing Unit (L2PU) while the other (TrigTileRODMuAlg) has a core part executed on the Readout 
Driver (ROD). 



648 



Trigger - Performance of the Muon Trigger Slice with Simulated Data 



The EF accesses the full event with its full granularity. Given the larger admissible latency, it is pos- 
sible to adapt algorithms developed for the off-line reconstruction to the on-line framework, an approach 
that minimises algorithm development. The EF processing starts by reconstructing tracks in the Muon 
Spectrometer around the muons found by L2 and is done by three instances of the EF algorithm; the first 
instance reconstructs tracks inside the Muon Spectrometer, starting with a search for regions of activity 
within the detector, and subsequently performing pattern recognition and full track fitting. The second 
step extrapolates muon tracks to their production point. Finally the information from the first two steps 
is combined with the reconstructed tracks from the Inner Detector. 

The hypothesis algorithms define a set of HLT trigger thresholds by applying cuts on the pj of the 
muon candidate. The muon trigger efficiency is defined as 

The number of events with a triggered muon ^ 
The number of events with a muon 

The effective trigger thresholds are obtained in such a way that at the nominal threshold value the ef- 
ficiency is 90% of the corresponding efficiency without cuts. For this reason effective thresholds are 
slightly lower than nominal thresholds. 

4 LI performance 

4.1 Barrel muon trigger performance 

As mentioned earlier, studies of muon trigger performance were conducted using simulated samples 
of single muon events generated over a large pj and angular range. LI selection algorithms show an 
efficiency greater than 99% for muons with p T above threshold. The overall acceptance (82% low-p T , 
78% high-^r) is due exclusively to geometrical regions of the Muon Spectrometer not covered by the 
RPC. Figure 1 shows the inefficient regions corresponding to the magnet support structures (—2.3 < 
(j) < —1.7 and — 1.4 < <j> < 0.9) and the spectrometer central crack at r\ ~ 0, not covered by RPCs. The 
overall loss in geometrical acceptance due to the r\ = 0 crack is approximately 7% while the loss due to 
the support structure is about 5%. Moreover smaller inefficiency patterns are clearly visible which are 
due to magnetic ribs in small trigger sectors. The geometrical acceptance effects are visible also in Fig. 2 
where the LI efficiency above threshold is shown with respect to r\ and 0. 




•1 -0.5 0 0.5 1 

1 



Figure 1: LI geometrical acceptance in the tj-0 plane. 
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Figure 2: r\ and 0 dependence of barrel trigger efficiency for single muons with a pj-15 GeV. 



Threshold 


Plateau Efficiency 


Effective Threshold (GeV) 


Sharpness(GeV) 


mu6 


0.82 


5.3 


2.2 


mu8 


0.82 


6.1 


1.9 


mulO 


0.82 


6.7 


2.2 


mull 


0.78 


10.9 


3.7 


mu20 


0.78 


15.3 


7.1 


mu40 


0.78 


27.8 


19.7 



Table 1: Plateau efficiencies, effective thresholds, and sharpness for LI signatures. Sharpness is defined 
as the difference of pj corresponding to 90% and 10% of the plateau efficiency. 

Figure 3 shows turn-on curves for low-pr and high-p^ thresholds; the efficiencies at plateau and 
effective thresholds are summarized in Table 1 . 

Muon tracks are deflected in the r — r\ plane under the action of the toroidal magnetic field. Their 
trajectories are symmetrical under charge exchange and reflection with respect to the plane z = 0, but 
the layout of the Muon Spectrometer is not. This asymmetry, could, in principle, produce a bias in the 
trigger efficiency calculation. From the single muon data sample, it was found that for muons with pj 
greater than the LI threshold the asymmetry in the efficiency is quite small (< 1%). 

Particular attention was devoted to the study of muons with 2 < p T < 3.5 GeV in the barrel region 
(| T] I < 1.05). Given the large inclusive cross-section with muons in the final state, this very low-pj region 
represents the most significant contribution to the total expected muon rate. Table 2 shows the fraction 
of these events that produce hits in the RPCs and the LI barrel efficiency for passing the mu6 trigger. 
Most of the low /? r muons that pass LI have \rj \ ~1. 



Muon pj (GeV) 


Percentage of events with hits in RPC 


LI Efficiency 


2 


0.15% 


(1.4±0.1)-10- 3 


2.5 


0.35% 


(3.4±0.1)-10- 3 


3 


0.48% 


(4.8±0.1)-10- 3 


3.5 


3.36% 


(33.4 ±0.3) -10^ 



Table 2: LI RPC efficiency for very low-pr single muon events with \r\ \ <1.05. 
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Figure 3: LI barrel efficiency as a function of pj for low-pt (left) and high-pt (right) thresholds. 

Very low-pr muons produced in the acceptance of the TGC (|tj| >1.05) sometimes give hits in the 
RPC (|t]| < 1.05) because they are strongly deflected by the magnetic field. The fraction of muons with 
| T] | > 1 .05 that give RPC hits ranges from 46% at p T = 2 GeV to about 10% at 3.5 GeV, and is negligible 
above 4 GeV. The overall efficiency for such muons to pass the mu6 trigger is about 10~ 3 . 

Figure 4 shows LI end-cap efficiency curves for low pj (left) and high pj thresholds (right). Ef- 
ficiencies at the threshold and plateau are summarized in Table 3. The efficiency of mu6 at threshold 
is 77%, relatively lower than other cases. This is due to the limited window-size of the three-station 
coincidence for muons having pj of 6 GeV. The rj dependence of the mu6 and mu20 efficiency are 
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Figure 4: The end-cap trigger efficiency curves for each pj threshold. The left plot shows the low-p^ 
thresholds of 6, 8, and 10 GeV and the right plot shows the high-^r thresholds of 1 1, 20 and 40 GeV. 



p T threshold (GeV) 


6 


8 


10 


11 


20 


40 


Threshold 


77% 


84% 


88% 


88% 


92% 


90% 


Plateau 


95% 


95% 


95% 


95% 


94% 


93% 



Table 3: Trigger efficiencies at threshold and plateau for various muon pj thresholds. 

shown in Fig. 5 (at threshold) and Fig. 6 (at plateau) with respect to the sign of charge of muon qxrj. 
Because the two muon end-cap stations are made as mirror images, with rj > (<) 0 behaves the 

same as with rj < (>) 0. The difference of efficiency between the two signs of q x rj is large 

at the geometrical boundary for muons near the pj = 6 threshold as shown in Fig. 5. One more point 
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worth noting is the dip at 77 =2 for the mu6 signature. Muons in the dip region pass through chambers 
which belong to different trigger sectors, consequently the requirement of a three-station coincidence is 
not satisfied and trigger efficiency is reduced. Figure 7 shows the § dependence of mu6 (left) and mu20 
(right) trigger efficiencies at the plateau and at threshold. The effect of octant symmetry of the magnetic 
field is seen in the plot of the mu6 efficiency for threshold muons. For the mu6 signature at plateau and 
for the mu20 signature, this effect is not observed, resulting in an approximately uniform efficiency. 




Figure 5: 77 dependence of end-cap trigger efficiency for mu6 (left) and mu20 (right). The solid circles 
represent q x 77 > 0, the open circles represent q x 77 < 0. 




Figure 6: 77 dependency of end-cap trigger efficiency at plateau for mu6 (left) and mu20 (right). The 
solid circles represent q x 77 > 0, the open circles represent q x 77 < 0. 



5 Performance of L2 muon algorithms 

As described in Section 2, algorithm performance is evaluated on samples of single muons generated with 
different transverse momenta. The resolution of the inverse of the measured momentum with respect to 
the generated transverse momentum is studied. Due to the non-uniform magnetic field in the Muon 
Spectrometer it is divided into four regions according to the pseudorapidity of the muon candidate: the 
Barrel region with |tj| < 1.05, and three end-cap regions with 1.05 < \r\\ < 1.5, 1.5 < \r\\ < 2.0, and 
2.0 < 1 7] I <2.4. 

For the Muon Spectrometer standalone reconstruction (muFast), the resolution of inverse pj as a 
function of the muon transverse momentum is shown in Fig. 8(left). The degradation in the resolution 
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Figure 7: 0 dependence of end-cap trigger efficiency for mu6 (left) and mu20 (right) for muons with 
Pt = 45 GeV. The open circles show the efficiency at threshold and the solid circles show the efficiency 
at plateau. 

with respect to previous results [2] is caused by the realistic geometry misalignment introduced in the 
muon simulation. Resolution as function of r\ and §l oc 1 is shown in Fig. 8(right). The degradation of 
the resolution in the endcap regions is evident. The muFast efficiency with respect to LI selection as a 




Figure 8: \lpj resolution (Muon Spectrometer StandAlone) as a function of pj (left) and r\ — §l oc (right). 

function of muon momentum are shown for mu4, mu6, and mu20 selections in Fig. 9. The low rejection 
at small momentum (2.5 < pj < 4.5 GeV), in particular in the barrel region, is caused by candidate 
tracks not pointing to the nominal interaction vertex due to large scattering angles. 

Efficiencies of the MS standalone reconstruction (muFast) (mu6 trigger selection) with respect to LI 
selection as a function of Y\ and §l oc for muons with Pj = 6 GeV are shown in Fig. 10. 

The combination of a Muon Spectrometer standalone muon candidate with an Inner Detector track 
found by the L2 tracking algorithms is performed by muComb. For muons with pj < 50 GeV the Inner 
Detector measurement has a better resolution than the Muon Spectrometer standalone measurement. 
Therefore, the combination of the two measurements gives better resolutions in the low-pj range. Figure 
11 shows the \lpj resolution as a function of pj and also the resolution as function of r\ and §u, c . The 
muComb efficiency with respect to muFast as a function of pj for mu4, mu6, and mu20 is shown in 
Fig. 12. The problem of low rejection for low-pr muons is partially solved when the Muon Spectrometer 
candidates are combined with Inner Detector tracks. 

l (j>L OC is the azimuthal angle folded up in [0, ft/16] such to cover half of an odd MS sector ([0, ft/32]) and half of an even 
MS sector [ft/32, ft/ 16]. 
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P T (GeV) " P T (GeV) 
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Figure 9: muFast efficiency with respect to LI selection for the mu4, mu6, and mu20 triggers for different 
77 regions. 




Figure 10: Efficiency of muFast as a function of X] (right) and ^ oc (left) for muons with pj = 6 GeV. 
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Figure 11: The muon combined XI pj resolution as a function of pj (left) and as a function of 77 — 0£ OC 
(right). 




P T (GeV) " P T (GeV) 



Figure 12: The muComb algorithm efficiency with respect to mu4, mu6, and mu20 triggers for the 
different r\ regions. 
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Efficiencies of muComb (mu6 trigger selection) with respect to muFast selection as a function of X] 
and §u>c for muons with pj = 6 GeV are shown in Fig. 13. 




Figure 13: Efficiency of muComb as a function of r\ (right) and <fc oc (left) for single muons with p? = 6 
GeV. 



6 Event filter performance 

The full reconstruction in the Muon EF has been executed on the simulated samples described in Sec- 
tion 2. The reconstruction in the Muon Spectrometer is carried out by the MOORE algorithm, the 
extrapolation to the vertex of the muon track found in the Muon Spectrometer is performed by the Muld 
standalone algorithm and the combination of the tracks found in the Muon Spectrometer and in the Inner 
Detector by the Muld Combined algorithm. 

Efficiency for single muon events is defined as the ratio of events with a reconstructed track at the 
EF after the execution of each reconstruction step to all events which have passed LI and L2. The 
efficiency with respect to L2 as a function of muon pj is shown in Fig. 14 for all three EF algorithms. 
The efficiency is defined on an event basis, and counts only once events having L2 muon-feature or EF 




Figure 14: Efficiency as a function of muon pj for MOORE, Muld Standalone and Muld Combined. 



track multiplicity greater than 1 . According to this definition, the efficiency to trigger an event with more 
than one muon is expected to be higher with respect to what is estimated here. 
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The efficiencies are lower for 3 < pj < 6 due to multiple scattering and energy loss fluctuation 
effects. Moreover, in the case of Muld Combined, at very high pj the increasing probability of muon 
showering is responsible for a small loss in efficiency. The efficiencies as a function of 77 and 0 show 
a structure, especially at low momentum, explainable with some residual dependence in r\ and 0 on the 
Muon Spectrometer geometrical acceptance and on the magnetic field inhomogeneities which affect less 
previous levels. It can be seen in Fig. 15 where the efficiency is shown for 6 GeV muons. In Fig. 16 



Figure 15: Efficiency of Muld Combined as a function of Y\ (left) and of (j) (right) for 6 GeV muons. 



the Muld combined efficiency with respect to L2 for different thresholds is shown on the left, while the 
overall trigger efficiency (LI + L2 + EF) with respect to the generated muons is shown on the right. All 
efficiency values are averaged over the whole |tj | <2.4 range. 




Figure 16: Muld combined efficiencies for various pj thresholds with respect to L2 (Left) and with 
respect to generated truth muons (Right). 



In Fig. 17, the \lpj resolution is shown as a function of muon pj for all EF algorithms. For a muon 
with pj below 50 GeV the Inner Detector dominates the reconstruction precision so the combination of 
measurements greatly improves the resolutions. For a muon with pj above 100 GeV, the Muon System 
dominates the measurement of the muon combined transverse momentum. II pt resolution as a function 
of 77 is shown in Fig. 18 for 20 GeV muons. The worsening of the resolution in the region 1.0 < |tj| < 
1.5 can be attributed to the highly inhomogeneous magnetic field in the transition regions of the Muon 
Spectrometer. This effect is recovered by means of the combined reconstruction which exploits the Inner 
Detector performance. 
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Figure 17: \lpj resolution as a function of pj for MOORE, Muld Standalone and Muld Combined. 
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Figure 18: l/pr resolution for Muld Combined as a function of tj for muons with pj = 20 GeV. 



7 Muon trigger rates 

The trigger rates for single muon event originating from all the physical processes expected in ATLAS 
were obtained using the EF MuiD combined algorithm (the events must of course also pass the LI and 
L2 muon algorithms). Various luminosity scenarios expected during LHC operation (from J£ = 10 31 
cm~ 2 s~ ! to 10 34 cm~ 2 s _1 ) were considered. Trigger rates were typically computed by convolving, 
over a given pj range, the estimated efficiencies with the cross-sections of processes representing the 
main muon sources at LHC. For the i-th process with cross-section a,, the rate is 

Ri = &l ^e( PT )dp T (2) 
J dp T 

where JSf is the instantaneous luminosity and e(pr) is the muon trigger efficiency for a given pj value. 
In order to take into account the r\ dependence, separate estimates for different r\ regions have been 
considered. An 0.5 GeV step has been applied in the numerical integration. The inclusive muon cross- 
sections at the LHC for b — ► [l and c — ► decays have been parameterized by using PYTHIA 6.403 [3] 
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LI muon trigger rates 




Jz^ = 10 31 cm- 2 s- 1 


.'/ 10- cm - s 1 


J^ = 10 34 cm" 2 s- 1 




Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


n/K 


"Co 

454 


smic" 

199 


6 

8600 


GeV 

5300 


20 

1100 


GeV 

5200 


beauty 


85 


74 


4400 


5100 


2500 


3300 


charm 


124 


104 


6100 


6900 


2800 


4400 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.3 


0.5 


W 


<0.1 


<0.1 


3.0 


4.4 


26 


41 


TOTAL 


663 


377 


19100 


17300 


6400 


12900 


n/K 


5 

162 


GeV 

81 


8 

2200 


GeV 

3800 


40 

470 


GeV 

1900 


beauty 


54 


53 


2900 


4000 


1100 


1300 


charm 


76 


73 


3800 


4700 


1200 


1400 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.3 


0.3 


W 


<0.1 


<0.1 


4 


4.5 


23 


33 


TOTAL 


292 


207 


8900 


12500 


2800 


4600 



Table 4: Single muon trigger rates at LI, for various low and high pj thresholds, at Jz? = 10 31 cm 2 s \ 
££ = 10 33 cm- 2 s- 1 and ££ = 10 34 cm" 2 s" 1 . 



L2 muon standalone trigger rates 




Jz^ = 10 31 cm" 2 s- 1 


££ = 10 33 cm" 2 s- 1 


J^ = 10 34 cm" 2 s- 1 




Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


n/K 


4 

190 


GeV 

140 


6 

4300 


GeV 

3700 


20 

410 


GeV 

1800 


beauty 


50 


67 


3000 


3900 


540 


1500 


charm 


70 


94 


4000 


5200 


520 


1700 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.2 


0.4 


W 


<0.1 


<0.1 


3 


4 


24 


38 


TOTAL 


310 


301 


11300 


12800 


1494 


5038 


n/K 


5 

82 


GeV 

120 


8 

840 


GeV 

1500 


40 

200 


GeV 

690 


beauty 


37 


59 


1000 


2200 


87 


280 


charm 


49 


81 


1300 


2900 


83 


290 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.1 


0.2 


W 


<0.1 


<0.1 


3 


4 


17 


23 


TOTAL 


168 


260 


3143 


6604 


387 


1283 



Table 5: Single muon trigger rates at L2 muon standalone, for various low and high pj thresholds, at 
= 10 31 cm- 2 s _1 , 10 33 cm- 2 s _1 and 10 34 cm -2 s _1 . The large expected rate in particularly in 
the endcap, caused by the relatively low rejection of low pj muons, can be reduced by improving the 
selection algorithm. 
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L2 muon combined trigger rates 




J^ = 10 31 cm" 2 s- 1 


££ = 10 33 cm- 2 s- 1 


J^ = 10 34 cm" 2 s- 1 




Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


n/K 


4 

130 


GeV 

124 


3500 


GeV 

2600 


20 

68 


GeV 

890 


beauty 


48 


66 


2700 


3400 


320 


830 




66 


91 


3800 


4400 


280 


840 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.2 


0.4 


W 


<0.1 


<0.1 


3 


4 


22 


L 35 


TOTAL 


244 


281 


10000 


11000 


690 


2590 


n/K 


5 

44 


3eV 

55 


8 

400 


GeV 

530 


40 

6 


GeV 

310 


beauty 


31 


45 


660 


1100 


31 


92 


charm 


41 


61 


780 


1300 


26 


99 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.1 


0.1 


W 


<0.1 


<0.1 


3 


4 


7 


12 


TOTAL 


116 


161 


1840 


2900 


70 


513 



Table 6: Single muon trigger rates at L2 muon combined, for various low and high pj thresholds, at 
££ = 10 31 cm- 2 s"\ 10 33 cm- 2 s" 1 , and 10 34 cm" 2 s" 1 . 



Event Filter muon trigger rates 




Jz^ = 10 31 cm" 2 s- 1 


££ = 10 33 cm" 2 s- 1 


Jz^ = 10 34 cm" 2 s- 1 




Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


Barrel (Hz) | Endcaps (Hz) 


n/K 


4 

125 


GeV 

119 


6 

1890 


GeV 

1230 


20 

46 


GeV 

40 


beauty 


44 


56 


1870 


2190 


260 


380 


charm 


60 


76 


2390 


2780 


220 


330 


top 


<0.1 


<0.1 


<0.1 


<0.1 


0.2 


0.3 


W 


<0.1 


<0.1 


2.9 


3.9 


21 


31 


TOTAL 


229 


251 


6150 


6200 


550 


780 


n/K 


5 

36 


GeV 

25 


8 

290 


GeV 

260 


40 

0.14 


GeV 

0.2 


beauty 


27 


33 


550 


800 


10.5 


16.3 


charm 


36 


43 


640 


930 


7.1 


11.1 


top 


<0.1 


<0.1 


< 0.1 


<0.1 


<0.1 


<0.1 


W 


<0.1 


<0.1 


2.8 


3.8 


3.9 


6.1 


TOTAL 


99 


101 


1480 


1990 


21.7 


33.7 



Table 7: Single muon trigger rates at EF muon combined, for various low and high p T thresholds, at 
= 10 31 cm- 2 s"\ 10 33 cm- 2 s" 1 , and 10 34 cm" 2 s" 1 . 
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Figure 19: Expected EF rates at Jzf = 10 31 cm 2 s 1 for single muon processes as a function of muon 
Pt threshold integrated over \ f]\ < 2.4. 



which produces conservative estimates since it predicts cross-sections about 2 to 3 times higher than 
previous descriptions [4]. Top quark and W/Z decays were simulated using PYTHIA 5.7 [5]. Rates of 
muon in-flight decays from %IK mesons have been computed using the DPMJET Monte Carlo program 
[6]. 

To verify the results obtained with this method and to understand the systematics, an alternative 
approach, relying on event counting, has been applied to the minimum bias events (counting method). 
The convolution and counting methods give EF final rates which are in good agreement, within statistical 
errors due to the limited size of the minimum bias sample, starting from pj threshold of 6 GeV. The 
values obtained with the counting method for lower pj thresholds (4 and 5 GeV) are a factor of two to 
four less for muons from nIK decays with respect to the convolution (provided by DPMJET) of Eq. 2. 

The rates obtained for some low and high pj thresholds in the barrel and in the endcaps after LI, L2 
muFast, L2 muComb and EF selection are shown in Tables 4, 5, 6 and 7. 

In Fig. 19 the total (barrel+endcaps) EF rates at L = 10 31 cm~ 2 s _1 are shown as a function of the pj 
threshold. In this figure, to keep uniformity among the rate results, mostly provided by PYTHIA 6.403, 
it has been chosen to report for the 4 and 5 GeV thresholds the EF rates obtained with the counting 
procedure. 

7.1 Fake dimuon trigger rate 

A single muon can be detected in multiple muon trigger sectors, causing such events to erroneously 
satisfy dimuon triggers. Such fake triggers can be suppressed by the overlap handling implemented in 
the MuCTPI . For the single muon samples used in the analysis, the fake dimuon trigger probability is 
defined as 

p Number of events with more than one muon triggered ^ 
Number of events with a triggered muon 
Four sources of fake double-counts have been considered : 

• Barrel-Barrel double counts (BB): When a single muon is detected by two overlapping RPC sec- 
tors. 
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• Barrel-Endcap double counts (BE): When a single muon is detected by an overlapping RPC-TGC 
sector pair. 

• Endcap-Endcap double counts (EE): When a single muon is detected by two overlapping "Endcap" 
TGC sectors. 

• Forward-Forward double counts (FF): When a single muon is detected by two overlapping "For- 
ward" TGC sectors. 

The probabilities that a single muon would cause any of these fake double counts have been calcu- 
lated separately. The effect of the MuCTPI overlap handling can be seen in Fig. 20, where the left plot 
shows the BE fake dimuon trigger probabilities for the 6 pj thresholds in the 2 to 50 GeV range with- 
out using the overlap handling, while the right plot shows the probabilities after applying the MuCTPI 
overlap handling. 




Figure 20: Barrel-Endcap fake dimuon trigger probabilities without (a) and with (b) using the 
overlap handling of the MuCTPI. 

The probabilities for 6 and 20 GeV single muons to produce a fake dimuon trigger if they caused a 
single-muon trigger, for all available LI muon thresholds, can be seen in Table 8. 

The fake probabilities can be used to calculate the single-muon trigger rates according to 



fifake=^ J O p {p T )e{pT)P^{PT)dPT (4) 

where ££ is the instantaneous luminosity of the accelerator, o p is the inclusive muon production cross- 
section at LHC and e is the LI trigger efficiency. The fake dimuon trigger rates are presented in Table 9. 



8 Rejection of muons from nIK decays 

Despite the large theoretical uncertainties on the rates of the muon production processes at low transverse 
momentum, it is clear that in flight decays of pions and kaons are a significant source of single muons 
and, therefore, a strategy must be developed to reject these events in the trigger. Rejection of muons from 
n and K decays at the EF is described below. A study describing rejection at L2 can be found in [7]. 
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Trigger item 




BB prob. [%] 


BE prob. [%] 


EE prob. [%] 


FF prob. [%] 


2mu4 


6.0 


1.56 ± 0.07 


1.39 ± 0.08 


1.00 ± 0.07 


0.81 ± 0.06 




20.0 


1.43 ± 0.06 


0.13 ± 0.02 


0.49 ± 0.05 


0.55 ± 0.05 


2mu5 


6.0 


1.14 ± 0.06 


1.17 ± 0.07 


0.40 ± 0.05 


0.56 ± 0.06 




20.0 




n i a _i_ n no 
u. i j m u.uz 


u.t? in u.uj 


u.jj in u.uj 


2mu6 


6.0 


1.11 ± 0.05 


0.97 ± 0.06 


0.39 ± 0.04 


0.55 ± 0.05 






l.^O It U.UD 


U.l J It u.uz 


U.M-7 III U.UJ 


U.JJ III U.UJ 


2mu8 


6.0 


0.87 ± 0.05 


0.38 ± 0.06 


0.31 ±0.05 


0.58 ± 0.07 




20.0 


1.33 ± 0.06 


0.10 ±0.02 


0.42 ± 0.05 


0.45 ± 0.05 


2mul0 


6.0 


0.68 ± 0.05 


0.12 ±0.08 


0.21 ± 0.09 


0.54 ±0.13 




20.0 


1.26 ± 0.06 


0.10 ±0.02 


0.36 ± 0.04 


0.36 ± 0.04 


2mull 


6.0 


0.43 ± 0.21 


0.00 ± 0.00 


0.32 ±0.15 


0.42 ±0.16 




20.0 


0.86 ± 0.05 


0.00 ± 0.00 


0.33 ± 0.04 


0.32 ± 0.04 


2mu20 


6.0 


0.28 ± 0.28 


0.00 ± 0.00 


0.48 ± 0.34 


0.00 ± 0.00 




20.0 


0.75 ± 0.05 


0.00 ± 0.00 


0.24 ± 0.03 


0.18 ±0.03 


2mu40 


6.0 


0.42 ± 0.42 


0.00 ± 0.00 


0.00 ± 0.00 


0.00 ± 0.00 




20.0 


0.49 ± 0.04 


0.00 ± 0.00 


0.08 ± 0.03 


0.08 ± 0.03 



Table 8: Probabilities that single muons with transverse momenta 6 and 20 GeV which caused a single 
muon trigger, to also passes a fake dimuon signature at the same threshold. 



Trigger item 


BB rate [Hz] 


BE rate [Hz] 


EE rate [Hz] 


FF rate [Hz] 


Total fake rate [Hz] 


2mu4 


1846.6 ± 119.2 


271.6 ± 14.1 


136.2 ± 24.5 


69.2 ± 12.3 


2323.7 ± 123.1 


2mu5 


243.9 ± 13.0 


203.1 ± 10.6 


35.3 ± 10.5 


33.3 ± 6.5 


515.5 ±20.8 


2mu6 


193.9 ± 12.4 


82.6 ±7.1 


24.6 ± 6.0 


24.7 ± 4.7 


325.7 ± 16.2 


2mu8 


114.1 ±9.8 


16.1 ±2.1 


9.7 ± 3.0 


12.4 ± 3.3 


152.3 ± 11.0 


2mul0 


79.2 ± 8.0 


4.9 ± 1.2 


4.8 ± 2.2 


5.5 ± 2.0 


94.4 ± 8.6 


2mull 


11.7 ± 1.8 


0.1 ±0.1 


3.9 ± 2.0 


4.5 ± 1.8 


20.1 ± 3.2 


2mu20 


2.4 ± 0.4 


0.1 ±0.0 


2.4 ± 1.9 


0.7 ± 0.5 


5.5 ± 2.0 


2mu40 


0.8 ±0.1 


0.0 ± 0.0 


1.7 ± 1.7 


0.1 ±0.1 


2.6 ± 1.7 



Table 9: Rates of various fake dimuon triggers at Jz? = 10 33 cm 2 s l . 
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8.1 Data samples and their validation 

Minimum bias samples would be the most suitable for studies involving pion and kaon decays. However, 
the probability that pions or kaons produced in low or moderate pj QCD scattering would decay before 
interacting hadronically in the calorimeters is low, between 0.1% and 1% depending on the meson pj. 
In order to enhance the number of charged pion and kaon decays in the sample, the simulation of events 
without any charged meson with p T above a given threshold is aborted and one or is forced to 
decay in the Inner Detector cavity. 
The samples produced are: 

• 10 6 single pions with p T > 2.5 GeV and kinematics (p T x Tj) generated according to a double 
differential cross-section of primary pions in minimum bias events 

• 10 5 minimum bias events, where one charged % or K with pj > 2 GeV per event is forced to decay; 

In order to estimate cross-sections or trigger rates, the abundance of forced decays must be re-weighted 
on an event by event basis according to meson decay probability. In addition to the above samples, 
standard minimum bias events have been used as a reference to cross check the results obtained from 
these dedicated productions. 

The muon p T spectra observed in minimum bias events and single pions, forced to decay, were 
found to be consistent, after appropriate re-weighting, with each other and in agreement with previous 
predictions and unforced minimum bias events. 

8.2 Rejection strategy at the event filter 

The fraction of in flight decay muons retained at the EF, normalised to the L2 efficiency, for the mu6 
trigger item, has been measured as a function of the muon pj. There is a very poor rejection capability 
(< 90%) for muons coming from pion decays, which demonstrates that the standard muon identification 
procedures are not very sensitive, as expected, to the small kink between the pion and muon tracks. The 
kinematics of charged kaon two-body decays, which are the dominating kaon contribution to the muon 
rate, is much more favorable toward rejection due to the larger average value of the angle between the 
kaon and the muon tracks. In order to improve the rejection capability, additional measured parameters 
providing some discriminating power between background and primary muons have been identified: 

• the impact parameter, do, of the track reconstructed in the inner tracker; 

• the number of hits associated to the Inner Detector track in the Pixel Detector (Nhits (Pixel)), in the 
pixel B-layer (N hits (Bi ayer )) and in the Silicon Tracker (N hits (SCT)); 

• the ratio Ptid/Ptms between the transverse pj in the Inner Detector and in the Muon Spectrometer, 
after back-extrapolation to the interaction point and correction for the measured energy loss in the 
calorimeters; 

• the ^matching °f tne matching between the track parameters as reconstructed in the Muon Spectrom- 
eter and in the Inner Detector. 

The discrimination power of each variable has been studied by measuring the fraction of accepted events 
as a function of the cut applied for both isolated muons and fake muons above a given pj threshold. 
The results, shown in Fig. 21, are based on the simulations of single muons and single pions with forced 
decays. For each variable, the fraction of events retained after the cut is normalised to the number 
of events passing the EF reconstruction before the application of any hypothesis algorithms. From the 
analysis of the exclusive rejection power of the individual variables, the set of cuts listed below have been 
defined. These cuts try to minimize the efficiency loss for prompt muons while reducing the background: 
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Figure 21: Efficiency for prompt muons and muons from pion decays as a function of the cut on some 
discriminating variables. 

• |d 0 | < 0.15 mm, N hits (B layer ) > 1, N hits (Pixel) > 3, N hits (SCT) > 6, 

• PTID/PTMS< 1-25, hatching < 26 - 

In particular, these values have been chosen by considering efficiency and background rejection at pr = 
4 GeV. It is assumed that cuts will be optimized for each muon item in the trigger menu. 

From the application of these cuts on the reference sample of events accepted at the EF, the efficiency 
for prompt muons and in flight decay muons shown in Fig. 22 have been obtained. A loss of efficiency 
between 25% at the 4 GeV threshold and 10% at 20 GeV correspond to a reduction in background of 
65% and 75%, respectively. The rejection achieved for kaon decays is slightly better than that achieved 
for K decays, as expected from the different decay kinematics. These results are derived from nominal 
detector performance and algorithm resolutions. However, they demonstrate that cuts can be adjusted to 
obtain reasonable trigger rate at the very low pj threshold of 4 GeV which is reached mostly by reducing 
uninteresting events at the cost of some efficiency loss for prompt muons. An optimization of the cuts, 
with specific tuning for each trigger element, will eventually further improve the signal to background 
ratio. 



9 Muon isolation 

9.1 Optimization procedure 

The L2 isolation algorithm is seeded by either muFast or muComb. The algorithm decodes LAr and Tile 
Calorimeter quantities (i.e. transverse energy deposit or sums of calorimetric cells above a predefined 
energy threshold) in cones centered around the muon direction. The geometrical definition of these 
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Figure 22: EF efficiency as a function of pj for different rejection cuts for prompt muons (a), muons 
from single pion decays (b), minimum bias events (c and d). In (c) n/K contribution has been separated. 
Each efficiency curve shows the data reduction obtained by the addition of the corresponding cut to the 
overall selection procedure. The specific values of the cuts are discussed in the text. 



cones is given by the condition AR < ARmax, where AR = ^f(Ar} 2 + A(j> 2 ), and A 77, A0 are the distances 
in pseudorapidity and azimuthal angle between the calorimetric cell and the cone axis. Because the 
muon itself contributes to the energy deposit inside the cone, to improve the discriminating power of 
the isolation algorithm, two different concentric cones are defined: an internal cone chosen to contain 
the energy deposit released by the muon itself, and an external one, supposed to include contributions 
only from detector noise, pile-up and jet particles if present. The optimization of the muon isolation 
algorithm consists of determining the optimal size of the inner and outer cone radius, the values of the 
cell energy thresholds, used to compute the transverse energy and number of cells sums, and the isolation 
requirements. 

Table 10 summarizes the samples that have been used to optimize the algorithms and to measure their 
performance. Half of the events in the samples number 1 and 2 in the Table have been used as signal 
and background, respectively, in the optimization of the algorithm parameters, the remaining events for 
sample 1 and 2 and the other samples listed in the Table have instead been used to estimate the algorithm 
performances. 

Only the parameters relative at the muon trigger in the barrel region (|tj| < 1.05) have been studied. 
Simulation of the electronic readout noise for both LAr and Tile Calorimeters has been also included. 

A muon track passing through the calorimetry will deposit energy in the cells which immediately 
surround it. The deposited energy can be contained within some cone of radius Ri nn er, where R = 
\/At] 2 + A(j) 2 . If the muon is isolated, there will be little energy deposited in cells which lie in an outer 
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Process 


Generator 


Number of events 


1 


z - ii+ii- 


Pythia 


1 10 4 


2 




Pythia 


1.5 10 4 


3 


bb^ii{6)X 


Pythia 


1 10 4 


4 


qq^llX 


Pythia 


2.1 10 4 


5 


Single-jU Q? r =100 GeV) 


Single-Mu gun 


2 10 5 


6 


Single-;UQ? r =38 GeV) 


Single-Mu gun 


2 10 5 


7 


Single-jU(/?7-=19 GeV) 


Single-Mu gun 


2 10 5 



Table 10: Data samples used in the muon isolation algorithm optimization. 



annulus around this (R e [Ri m ,er,Routei])- The radius of the inner cone (i.e. the cone fully containing the 
muon) has been determined from the distribution of the summed transverse energy contained within a 
cone of increasing radius around the muon direction from Z —> nn, as shown in Figure 23. The value 
of R corresponding to the inner cone radius is visible as a change in the slope of the curve. Once the 
radius for which all the muon energy is contained in the cone is reached, for each further increase of the 
cone radius only noise will be summed, resulting in a reduction of the slope of the energy sum curve. 
The reduction in the slope depends on the level of electronic readout noise per cell, as shown in Fig. 23, 
where curves for several values of the threshold cut on the calorimetric cell energy is shown, ranging 
from 40 to 90 MeV The effect of the electronic noise is only relevant for the LAr calorimeter. From 
the two figures it can be seen that a cone of radius 0.1 (one readout cell), is sufficient to contain the 
muon energy deposition in the hadronic calorimeter, while a radius of about 0.07 (one to three readout 
cells, depending on position), is sufficient for the electromagnetic calorimeter, due to the finer readout 
granularity. The value of the outer cone radius is instead constrained by timing requirements. Increasing 
the outer cone radius requires a larger fraction of the calorimeter to be read out and decoded. Because 
the readout step of the algorithm dominates the execution time (> 90% of the overall algorithm time) the 
requirement to keep the overall timing below O(10) ms constrains the maximum outer cone radius to be 
below about 0.4. We have verified that optimal background rejection is obtained by keeping the outer 
cone radius at is maximum value. 

An analysis has been performed over all the quantities used in the isolation hypothesis testing, with 
a goal of minimizing the number of variable used in the optimization step. Each variable used in the 
optimization is listed in Table 11, together with the respective separation power expressed in term of 
minimum variance bound [8]. The optimal value of the cell energy cut thresholds, used to compute the 
transverse energy and number of cell sums, has been obtained by maximizing the background rejection 
after applying a fixed cut on the isolation variables (varl and varl in Table 11), giving a 95% efficiency 
for the Z —> fifi signal. A common threshold value of 60 MeV has been obtained with this procedure for 
both the LAr and Tile calorimeter. Algorithm performances are stable for threshold variations of ±10 
MeV around the optimal values. The distributions of some of the most powerful variables for signal 
selection and background rejection are shown in Fig. 24. 

Optimal cut values for the isolation variables described above have been obtained in a multivariate 
optimization procedure by simultaneously varying all the cuts in sensible ranges and by minimizing the 
bb — > iiX background efficiency at fixed Z —>■ iiji signal efficiency. In the optimization procedure, both 
background and signal efficiencies are calculated with respect to muons satisfying the L2 muFast mu20 
requirement. In Fig. 25 the background rejection,defined as I/Ebg where Ebg is the efficiency for the 
bb background sample, and (1 — Ebg) as a function of the Z — > nn signal efficiency, obtained after the 
optimization procedure, are shown. The chosen working point for the isolation algorithm yields a factor 
of 10 reduction for the bb background at a 95% signal efficiency for muons with pj >20 GeV. 
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0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0.4 



Figure 23: The total transverse energy contained within a cone of increasing radius around the muon 
candidate track from Z — > jiji signal events in the LAr calorimeter (left) and Tile calorimeter (right). The 
different curves on each figure correspond to different thresholds applied on the cell energy. 



Label 


variable 


Separation 


varl 


Iso LAr = ZEf<^ l /ZEf<» A 


0.21 


var2 


IS0 Ti le=LEF << » A /LE* 1{ <U A 


0.29 


var3 


F 0 -pARe [0.07,0.4] 
^LAr — L^T 


0.75 


var4 


F 0 v p ARe[0.1,0.4j 
n Tile ~ ^ T 


0.40 


var5 




0.23 


var6 


E l _ YE Ak<0A 


0.06 


varl 


Number of LAr cells above threshold with AR £ [0.07,0.4] 


0.72 


varS 


Number of Tile cells above threshold with AR £ [0.1,0.4] 


0.34 


var9 


Number of LAr cells above threshold with AR < 0.07 


0.31 


rarlO 


Number of Tile cells above threshold with AR < 0. 1 


0.07 



Table 1 1 : Variable used in the muon isolation optimization procedure. The separation is zero for identical 
signal and background shapes, and it is one for shapes with no overlap. 
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(a) Number of LAr cells above threshold in (b) Transverse energy sum in the outer ring 
the outer ring of LAr 




(c) Transverse energy sum in the Tile in (d) Isolation variable in Tile calorimeter 
the outer ring 



Figure 24: Most powerful variables for calorimetry -based muon isolation. 



9.2 Performance 

The performance of the isolation algorithms in terms of bb and dijet background reduction, efficiency 
of benchmark signal channels and timing is presented below. The performance of isolation algorithms 
can be affected by the instantaneous luminosity since the pile-up requires higher thresholds for the same 
nominal efficiency. This is particularly true for calorimetry-based isolation, while for track-based iso- 
lation the effect can be reduced by requiring that the contributing tracks come from the same primary 
vertex as the muon. For this reason, the results of this study should be taken as preliminary and valid 
only in the framework of the approximations used in the simulated events production for these studies. 
Possible changes and further development may occur as soon as real data is available. 

The effect of isolation algorithms on various sources of non-isolated muons at L2 is shown in Ta- 
ble 12. The quantity 1 - £ BG is shown for the isolation requirements corresponding to a working point for 
the isolation algorithm with a nominal Z — »■ jx\x signal efficiency of 95%. Results from dijet decays give 
an estimate of the rejection power of the isolation algorithm for high-pj muons from K and n in flight 
decays. The rejection power for low-pr muons from bb decays selected by the level 2 mu6 requirement 
has also been estimated. The reduction in rejection power, from a factor 10 to a factor of about 2, at the 
low- pr limit is expected, given the low energy associated with the jets. As already mentioned, calorime- 
try based isolation algorithms are not effective against these kind of muons, and the track-based isolation 
is expected to be much more powerful in reducing this kind of background. 
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Figure 25: Background rejection (1/ebg) (left), and 1 - £bg (right), as a function of the signal efficiency 
as obtained in the muon isolation algorithm optimization procedure. 



Process 


Trigger item 


Average muon p T (GeV) 


l-e BG (%) 


bb^n{\5)X 


mu20 


25.0 


89.4 ±0.7 


bb^n{6)X 


mu6 


9.0 


54.6 ±0.9 


qq^\lX 


mu20 


40.0 


99.6 ±0.1 


qq^/xX 


mu6 


20.0 


97.3 ±0.2 



Table 12: Muon isolation algorithm 1 - £bg for muons from several background samples. Efficiencies 
are calculated with respect to muons passing the level 2 mu20 or mu6 requirements, as specified. 
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The isolation algorithm has been tuned such that the efficiency is 95% at the chosen working point. 
The efficiency of the isolation requirement has been studied as a function of pj using samples of single 
muons with a pj of 1 1, 39, and 100 GeV. No sizable change in the muon efficiency are visible, indicating 
that radiation effects are small for pj in this range. Evaluation of the effect of the muon radiation for 
very high pj muons (500 to 1000 GeV) is ongoing. The efficiencies for muons from Z — > jiji decays 
and for single muons are reported in Table 13. 



Process 


Trigger path 


e(%) 




mu20 


95.5 ±0.4 


Single ju p T =\W GeV 


mu20 


98.68 ±0.07 


Single n p T =39 GeV 


mu20 


98.97 ±0.07 


Single ;U p T =6 GeV 


mu6 


98.54 ±0.09 



Table 13: Muon isolation algorithm efficiencies for muons from several processes and thresholds. 

The time available for running L2 algorithms in the on-line trigger is limited to approximately 20 ms. 
The CPU processing time is therefore a relevant parameter for the feasibility of algorithms to be included 
in the trigger chain. The results obtained indicate a typical overall time of less than 10 ms. Further and 
more detailed timing studies performed on the actual L2 processors are ongoing. 

10 Muon identification using the tile calorimeter 

The muon signatures in the three radial layers of the Tile Calorimeter are well measured quantities with 
a typical pattern that can be used to identify the muons efficiently down to very low pj. This information 
can be used to confirm the Muon Spectrometer Triggers (i.e. provide redundancy in noisy/dead regions) 
or to enhance the selection efficiency for very soft muons typically out of reach for the spectrometer. 

The algorithm exploits the radial and transverse calorimeter segmentation. The search starts from 
the outermost layer, which is the one with the cleanest signal, and once a cell is found with energy 
compatible with a muon, the algorithm checks the energy deposition in the neighbor cells for the most 
internal layers. These "candidate patterns" are considered as muons when cells compatible with the 
typical muon energy deposition are found following a r\ -projective pattern in all the three TileCal layers. 
More details can be found in Ref. [9]. 

10.1 Performance 

The performance of the TileMuId algorithms has been studied with MonteCarlo single muons and semi- 
inclusive muon production (bb — > /x(4)X) samples. The effect of minimum-bias pileup at low luminosity 
(jSf = 10 33 cm _2 s _1 ) has been investigated as well. 

Two algorithms, implementing complementary strategies are described, one (TrigTileLookForMuAlg) 
is fully executed on the LVL2 Processing Unit (PU) the second (TrigTileRODMuAlg) has a core part ex- 
ecuted on the Read Out Driver Digital Signal Processor (ROD-DSP) in order to save time. This allows a 
very fast processing of the entire detector (full scan) as opposed to the Rol based processing typical of the 
trigger algorithm running on the LVL2 PUs. Since each ROD-DSP processes a small part of the detector 
readout the TrigTileRODMuAlg acceptance is lower compared with that of TrigTileLookForMuAlg. 
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10.1.1 Spatial resolution 

The spatial resolution of the algorithms can be studied using the distributions of the residuals Arj = 
V (MTile) — H (MTruth) and = 0 (jUxiie) — 0 (jUxruth) in single muon events with 2 < pj < 15 GeV. The dis- 
tributions are well described by a Gaussian and resolution can be defined as ov, = 0.05 for TrigTileROD- 
MuAlg (On = 0.04 for TrigTileLookForMu) and c§ = 0.03 rad. To characterize the performance of the 
algorithms with MC physics events a matching region with the MC truth will be used. For this analysis 
a matching region of Arj x A0 = 0.2 x 0.12 is used. 



10.1.2 Efficiency 

The muon-tagging efficiency is defined as the ratio of the number of tagged muons which match a truth 
muon (A^g) to the number of generated truth muons (N gen ). Figure 26 shows the efficiency as a function 



Figure 26: Efficiency as a function of Tj (left), <j) (center) and pj (right)for TrigTileLookForMu (filled 
circles) and TrigTileRODMu (open squares) using single muon events. 




Figure 27: Efficiency as a function of X] (left) and pj (center) for TrigTileLookForMu (filled circles) 
and TrigTileRODMu (open squares) in bb —> ju(6)X events. Right plot show for TrigTileLookforMu 
the effect of pileup of Minimum Bias events at low luminosity (filled circles) compared with the case 
without pileup (open squares). 



of Tj, 0, and pj of the muon for the two algorithms as obtained using the single muon sample. The 
efficiency of TileRODMu is lower than that of TileLookForMu. In the region 0. 8 < | Tj \ < 1 . 1 the towers 
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are split between the barrel and the extended barrels, and the cells belonging to different partitions are 
processed by different ROD DSPs. Similar effects are observed for the boundary at r\ ~ 0. Except 
for these two regions of low geometrical acceptance both algorithms show efficiency ~ 85% with good 
agreement. Since TileCal is homogeneous in (j), the efficiency is uniform as a function of 0, see Fig. 26 
(center). The efficiency decreases with the muon pi for px < 3 GeV and is about 42% at pj = 2 GeV. 
Most of the muons with pj < 2 GeV stop in the Tile calorimeter. For pj > 4 GeV the efficiency is flat 
at about 60%. Figure 27 (left) and (center) show the efficiency curves for both algorithms as obtained 
in bb — > jli(6)X events. These results are in good agreement with the performance obtained using single 
muons, indicating that the algorithms are not too sensitive to the additional hadronic activity in bb events. 

To evaluate the performance in a realistic LHC operation scenario a sample of bb — > £i(6)X events 
simulated with pileup of minimum-bias events at a luminosity Jz? = 10 33 cm _2 s _1 was used. As shown 
in Fig. 27 (right), the efficiencies as a function of pi for two cases are similar for pj > 5 GeV. The 
additional muons from minimum-bias events make the efficiency worse in the low pj region. The average 
efficiency in the sample with pileup (67.97±0.81)% is slightly lower than the one obtained without pileup 
(74.25±0.79)%. It can be concluded that the efficiency is not substantially affected by minimum-bias 
pileup. 

10.1.3 Fraction of fakes 




Figure 28: Fraction of fakes as a function of tj (left) and (j) (center) for TrigTileLookForMu (filled circles) 
and for TrigTileRODMu (open squares) in bb —>■ jJ.{6)X events. The right plot compare performance of 
TrigTileLookForMu in samples with (filled circles) and without pileup (open squares). 



The muon tags which are not matched with truth muons are considered fake. The same At] x A(j) 
matching cuts are used for the efficiency and fake computation. The fraction of fakes in a given data 
sample is defined as the ratio of the number of misidentified muons to the total number of events. 

The left and center plots of Fig. 28 show the fraction of fakes as a function of r\ and (j) obtained 
by the two algorithms. Both algorithms show a very small fake rate in the central region (0. 12% for 
|t]| < 0.7). The main contribution of fakes comes from the extended barrel and gap regions, where the 
cell segmentation is coarse and the projectivity is the worst. The fraction of fakes in the whole range 
| T] | < 1.4 is 2.7 ±0.1% for TrigTileRODMu and 4.1 ±0.1% for TrigTileLookForMu. The fraction of 
misidentified muons as a function of (j) is flat as expected. 

Figure 28 (right) shows the performance of TrigTileLookForMu in bb —> jJ.(6)X events with and 
without the pileup of minimum-bias events at low luminosity. The fraction of fakes increase from 3.7 ± 
0.1% to 6.0 ±0.1% when the minimum bias pileup at Jz? = 10 33 cm _2 s _1 is taken into account. The 
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fake rate increases at larger values of Y\ (gap and extended barrel), where the cell granularity is worse 
and more minimum bias event are expected, compared to the central 77 region. 

10.2 Combined performance with the inner detector 

In order to measure the pj of the identified muon, the secondary Rol produced by the TileMuId algorithm 
is used to seed the Inner Detector (ID) track reconstruction algorithm. The size of the ID Rol that requires 
processing is defined by the Tile algorithm resolution and by the bending in the central solenoid. For 
PT(MTmth) > 2 GeV, A0 = 0(jUTiie) - ^/^Track) ~ 0-2 is required. If at least one track is found within the 
region At? x A0 = 0.1 x 0.2 and with pj > 2 GeV, the calorimetric tag is confirmed to be a muon and 
the trigger sequence is successful. 




N(tracks) N(tracks) P T (GeV) 



Figure 29: The number of tracks within the given Rol for bb —>■ ju(4)X (left) and with the fixed Rol 
size of Ar) =0.1 and A0 = 0.2 for bb — > /x(6)X with/without pileup (center). The right plot shows the 
efficiency as a function of pj for the muons tagged by only TileCal (TileLookForMu) and the muons 
combined with the associated track. 

Figure 29 shows the multiplicity of track in the ID Rol; left plot shows that a region with A<p =0.1 
misses the low pj tracks and results in more events with zero track within the Rol. The Rol with Ar) = 0.2 
does not give any advantage. The Rol with a size At] =0.1 and A0 = 0.2 is a good compromise; the 
efficiency to reconstruct the muon track is good and the multiplicity of tracks (ambiguity) is acceptable. 
In the case of reconstruction of multiple tracks, the closest is chosen as the best-matched for the }X tagged 
by TileCal, and all tracks are saved since the ambiguity cannot be further resolved at this level. As shown 
in Figure 29 (center), the multiplicity of tracks within the Rol is not significantly affected by the pileup. 

Figure 29 (right) shows the overall combined (TileCal+ID) efficiency for A(j) =0.1 and Atj) = 0.2 
as a function of muon pj. The combined efficiency obtained with A<j) = 0.2 is approximately equal to 
that of the TileCal stand-alone except for pj < 3.5 GeV. The efficiency from the matched track shows 
no dependence on tj or 0. The efficiency, purity and acceptance using the different sizes of Rol are 
summarized in Table 14. The efficiency and acceptance are significantly improved from A0 = 0. 1 to 
A<j) = 0.2. For A(j) = 0.2, 97% of tagged muons by TileCal match the associated track. The purity and 
acceptance of A0 = 0.3 are similar to those of A0 = 0.2. However, the size of Ar) does not affect the 
efficiency of the matched track with }X significantly. The differences due to the minimum-bias pileup are 
observed to be about 2 to 3% due to the small number of events from pileup samples. 
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TileLookForMu 


Rol size with 

(0.1,0.1) 


Atj, A<j>) for m 
(0.1,0.2) 


atching tracks 

(0.1,0.3) 


Efficiency (%) 
Unmatched jUxiie (%) 
Efficiency (p T > 4 GeV) 
Purity (p T > 4 GeV ) 
Acceptance (pj > 4 GeV) 


73.08 ±0.17 


42.02 ±0.19 

42.50 ± 0.36 
44.09 ± 0.20 

98.51 ±0.88 
40.78 ±0.31 


70.91 ±0.18 
2.98 ± 0.08 
72.94 ±0.18 
98.79 ± 0.67 
71.93 ±0.45 


72.06 ±0.18 
1.40 ±0.05 
73.06 ±0.18 
98.69 ± 0.67 
72.01 ± 0.45 



Table 14: Performance with the matched track for bb — > jJ.(4)X. 

11 Muon trigger performance for Z — > jU + jU~ 
11.1 The "tag and probe" method 

The trigger efficiency is a fundamental parameter in physics analyses and therefore it is important to have 
several independent methods for estimating it. The "Tag and Probe" method is a concrete application of 
a data-driven technique for performance analysis. This method is based on the definition of a "probe- 
like" object, used to make the performance measurement, within a properly "tagged" sample of events. 
Physics processes suitable for this method are generally those characterized by a double-object final state 
signature. The decay of the Z provides two high-pr muons that can lead to two trigger tracks in the Inner 
Detector and Muon Spectrometer and to a combined object. These two measurements are in principle 
independent, thought not necessarily uncorrected. 

"Tagged" events require one triggered track with pj > 20 GeV and "Probe" objects can be defined 
as Inner Detector offline reconstructed tracks (ID-Probe), where measurements are referred to the offline 
Inner Detector reconstruction efficiency (~100%), or as Muon Spectrometer offline reconstructed tracks 
(MS-Probe), where values are normalized to the offline Muon Spectrometer reconstruction efficiency 
(standalone or combined with the Inner Detector). 

The trigger performance is measured by checking for LI, L2, and EF trigger tracks associated with 
each probe object. A schematic illustration of the method is shown in Fig. 30. It must be verified that se- 
lected tracks come from a Z decay. A background process with two isolated tracks in the Inner Detector, 
of which only one is a real muon, would introduce a systematic error in the efficiency evaluation. For 
this reason, cuts have to be applied in order to select a clean signal sample. 

A significant background contribution is expected from QCD processes, which have large cross-sections. 



Process Generation cuts a [pb] 

xx+xx~ M MA , > 60 GeV/c 2 1497 

1/x: |77|<2.8,p r >5GeV 

W -> JTv 1/x: <2.S,p T > 5 GeV 11946 

BB^nnX 1/x: |r/| <2.5,p T > 15 GeV 4000 

1/x: |7?|<2.5, P7 ->5GeV 

tt^W + bW~b only leptonic decay 461 
Z^x+x- tt — > ll.Mf+n >60GeV/c 2 77 
lH: |T]|<2.8,p r >5GeV 



Figure 30: Illustration of the Tag and Probe method (left) and cross-sections with generation cuts for 
signal and background processes (right). 
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This background has been studied by considering the dominant contribution of muons from decays of B- 
meson pairs. Also the muonic W boson decay, which can give a higher energetic muon plus an additional 
muon from a QCD jet and the Z —> T + T~ — ► jU+V^Vf lx^V^v r process have been considered. 

Moreover the top-pair production cross-section at LHC is of the same order of magnitude as Z boson 
cross-section. Top quarks decay with a 99.9% probability into a Wboson and a b quark. Therefore muons 
originating from W boson and ft-quark decays can also give a signal-like signature. Cross-sections and 
generation cuts of the processes considered are reported in Fig. 30. PYTHIA [5] is used to generate the 
processes. 

Another possible source of background is muons from cosmic-rays. An estimation of cosmic rates 
in the trigger system has been done in Ref. [10] and shows a negligible effect on trigger performance. 

The isolation variables chosen for this analysis are the number of reconstructed tracks in the Inner 
Detector (Nj.g ne ), the sum of pj of the Inner Detector tracks (£ p^ co „ e ), the energy of a jet candidate 
(Econe) and the sum of reconstructed energy in the cells of the Calorimeter (XZ?™). Muons from QCD 
processes tend to be produced within a large cascade of other particles and therefore should not appear 
isolated in the detector. In the case of the decay of top pairs, one highly energetic and isolated muon 
can come from one W boson decay while the second W boson can decay leptonically into a high-pr 
electron which appears as an isolated track in the Inner Detector. In order to not count this as a false 
probe, electrons are vetoed. The values of the selection cuts applied in this analysis have been defined 
in [11]. The isolation cuts allow for background rejection of approximately 99% while retaining a signal 
efficiency of about 76%. After applying the probe selection cuts the signal to background ratio is more 
than 10 3 . In addition, probe muons selected from background processes can be associated to trigger 
tracks and hence have no negative impact on trigger efficiency measurements. 

11.2 Determination of trigger efficiencies 

Two measurement scenarios have been studied: 

• Low luminosity (J S£dt ~ 50 pb _1 ): in order to not rely on the combined reconstruction based on 
Inner Detector and Muon Spectrometer matching, only the tracks from the Muon Spectrometer are 
used. The isolation cuts are also based only on Inner Detector quantities; 

• High luminosity (J J£dt ~ 1000 pb~ l ): full combined information from Inner Detector and Muon 
Spectrometer is used and also Calorimeter based cuts are applied to select isolated tracks. 

In each scenario both the ID- and MS-Probe methods have been studied. The efficiency dependence on 
(j) and tj is determined by the Muon Spectrometer layout. A pj cut of 20 GeV has been applied on the 
probe tracks to test the system in its plateau region. The efficiency as a function of pr has been also 
estimated from data in the high luminosity scenario. 

11.2.1 Low luminosity measurements 

The relative efficiency as a function of tj, measured at each trigger level, is shown in Fig. 31 using the 
ID-probe. LI acceptance losses are related to an incomplete coverage of the trigger detectors due to 
the presence of support and access structures. The L2 efficiency, with respect to the LI selection, is 
about 96% in the barrel region with a small decrease in the endcap, an improvement is expected due to 
optimization of the TGC cabling in next software releases. The EF shows an tj efficiency distribution, 
with respect to L2, close to 99% in the barrel region and a very small decrease from \r\\ > 2.0. In the 
region 1.05 < | tj \ < 1.3 the absence of the some MDT chambers in the ATLAS initial layout resulted in 
an efficiency loss of about 10%. 2 

2 The missing chambers are scheduled to be installed by the end of 2009. 
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Figure 31: The muon trigger efficiency for each trigger level (left) and fractional efficiency difference 
(right) as a function of rj in the low luminosity scenario using the ID-Probe. The efficiencies determined 
with the Tag and Probe method are compared to those calculated in a Monte Carlo truth-based analysis. 




Figure 32: The muon trigger efficiency for each trigger level (left) and fractional efficiency difference 
(right) as a function of 77 in the low luminosity scenario using the MS -Probe. The efficiencies determined 
with the Tag and Probe method are compared to those calculated in a Monte Carlo truth-based analysis. 



The observed agreement with the Monte Carlo truth-based analysis is very good. In order to quantita- 
tively estimate the bin-by-bin differences the "fractional efficiency difference" 

^Tag&Probe ~ ?MC ^ 

£mc 

has been computed. This quantity is shown for each trigger level in Fig. 31 as a function of 77. 
The agreement between Tag and Probe method and Monte Carlo analysis is very high, more than 99% 
over all the trigger coverage. The only observed deviations, at the level of 2%, are found in the central 
crack at tj = 0 and in the transitions from barrel to endcap at \rj\ = 1.05. The results obtained by the 
application of the Tag and Probe method using the standalone MS -Probe are reported in Fig. 32 as a 
functions of rj. With respect to the values shown in Fig. 31 inefficiencies due to LI acceptance cracks 
are partially factorized in the offline muon reconstruction efficiency of the MS-Probe (e.g. the rj = 0 
region.). The same effect is clearly evident at the EF level for the efficiency loss at 1.05 < \rj \ < 1.3 
visible in Fig. 31. 

Table 15 shows the uncertainties on the overall trigger efficiency in each case, calculated also only 
in barrel and endcap regions. The statistical uncertainty is reported together with expected systematic 
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Detector region 


Barrel 


Endcap 


Overall 


= 


(|tj<1.05|) (1.05 < |tj 1 < 2.4) 


(0 < ?7 < 2.4) 


— ow uminosi y 


ID probe ( J J^dt 






Trigger Efficiency 


71.65 


83.59 


T7T8 

• 


Statistical Uncertainty 


0.42 


0.36 


0.28 


\£truth - £tp\ 


0.23 


0.40 


0.10 


Expected Background Contribution 


0.57 


0.17 


0.40 


Overall Systematic Uncertainty 


0.61 


0.43 


0.41 


Low luminosity 


MS probe (f&dt 


= 50 pb- 1 ) 




Trigger Efficiency 


76.94 


87.83 


82.13 


Statistical Uncertainty 


0.41 


0.34 


0.27 


\£truth - £tp\ 


0.17 


0.64 


0.33 


Expected Background Contribution 


0.01 


0.00 


0.01 


Overall Systematic Uncertainty 


0.17 


0.64 


0.33 



Table 15: Estimated uncertainties of in-situ determined muon overall trigger efficiency for the low lu- 
minosity scenario, using an ID- and an MS-Probe track. Systematic uncertainties are reported for back- 
ground contribution and absolute difference with Monte Carlo truth-based analysis. 



Detector region 


Barrel 


Endcap 


Overall 




(|tj< 1.05|) (1.05 < |tj| < 2.4) 


(0<|ij|<2.4) 


High luminosity 


ID probe (fJfdt 


= 1000 pb~ l ) 




Trigger Efficiency 


73.24 


86.31 


79.73 


Statistical Uncertainty 


0.10 


0.08 


0.06 


\£truth - £tp\ 


0.02 


0.72 


0.58 


Expected Background Contribution 


0.05 


0.01 


0.03 


Overall Systematic Uncertainty 


0.05 


0.72 


0.58 



Table 16: Estimated uncertainties of in-situ determined muon overall trigger efficiency for the high lu- 
minosity scenario using the ID-Probe. Systematic uncertainties are reported for background contribution 
and absolute difference with Monte Carlo truth-based analysis. 

errors. Two sources of systematic uncertainties are considered: the absolute difference with respect to 
the value measured in a Monte Carlo truth-based analysis and the background contribution, evaluated 
by comparing the efficiency calculated with Tag and Probe method using only the signal sample and 
using a cross-section weighted sum of all processes. Both systematics are less than 0.5%. A greater 
background contribution is observed when using ID-Probe, since the isolation is based only on Inner 
Detector quantities. 

11.2.2 High luminosity measurements 

After early data is collected and analyzed, a better understanding of the detector, in terms of calibration 
and alignment, will allow to use all the available information such as Calorimeter quantities for track iso- 
lation and combination of Inner Detector and Muon Spectrometer tracks. The trigger efficiency measure- 
ments from data in this scenario are reported using the high luminosity dataset of / ^£dt = 1000 pb~ l . 

As in the low luminosity case the measured differences between the Tag and Probe and Monte Carlo 
analysis are compatible with zero. Small deviations at the level of 1 to 2% are observed in the endcap 
for the L2 trigger efficiency in the \r]\ > 1.5 region. These effects are expected to be reduced by the 
L2 algorithm optimization. Results are shown in Table 16. With the addition of the calorimeter-based 
isolation, the background systematic contribution is reduced by a factor of 10 with respect to the low 
luminosity scenario. The dependence of the trigger efficiency on the px shows the typical shape of a 
turn-on curve. The sharpness of the curve is related to the finite pj resolution, ~ 30% at LI, ~ 5% at 
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Figure 33: Muon trigger efficiency turn-on curve after each trigger level determined by the Tag and Probe 
method and by the Monte Carlo truth-based analysis in the high luminosity scenario using the ID-Probe 
(left). In the right plot the fractional efficiency difference is shown. 



L2, and ~ 3% at the EF. Turn-on curves are shown in Fig. 33 using the ID-probe and similar results are 
obtained with the MS -probe. The turn-on point and the plateau values are correctly reproduced from data. 
The fractional efficiency difference is shown for each trigger level. The disagreement near the threshold 
is within 5%, due mainly to resolution effects, while in the plateau region the observed difference is less 
than 1%. 



12 High pr Dimuon final states 

In principle, the high mass dilepton/diphoton resonance search should have a fairly straightforward trig- 
ger strategy as there are very high energy leptons in the event. However, there are several questions that 
remain: what trigger requirements are optimal for the analysis? What pj thresholds and object quality 
selection should be applied? How can one estimate the trigger efficiency from data for such rare (or 
non-existent) events? Are the same object quality requirements that are appropriate for lower pj objects 
appropriate for very high energy objects? 

This Section addresses these questions, evaluates the trigger efficiency for the signal samples of in- 
terest, and discusses the trigger strategy for the earliest data taking periods. It is expected that during both 
low and high luminosity periods there will be an unprescaled single muon trigger without an isolation 
requirement. The 20 or 40 GeV threshold are expected to be highly efficient for a high mass resonance 
decaying into two muons. 

12.1 Efficiency estimate 

The muon trigger efficiency is estimated using several methods. The first method is to rely on simulation; 
while this is the simplest and most direct method it is believed to be somewhat more optimistic (better 
resolution, higher efficiency). Therefore, the trigger efficiency is also estimated using methods which 
can be applied to real data. 

The trigger efficiencies are calculated with respect to the offline event selection. Two combined 
muons are required to satisfy the cuts: |tj | < 2.7, pj > 30 GeV, track fit j^-p < 10 and Inner Detector 
and Muon Spectrometer track match j^-p < 10. The trigger efficiencies for the dimuon heavy resonance 
Monte Carlo samples are shown in Table 17. The efficiency as a function of pj has been fit to 
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Sample 


LI % 


L2% 


EF % 


Total Trigger Efficiency % 


400 GeV pr/cor 


97.6 ±0.10 


98.8 ± 0.07 


99.5 ± 0.05 


96.0 ±0.13 


600 GeV pr/cor 


98.1 ±0.08 


98.5 ± 0.08 


99.2 ± 0.06 


95.9 ±0.13 


800 GeV pr/cor 


97.6 ±0.10 


98.7 ± 0.07 


99.2 ± 0.05 


95.6 ±0.13 


lTeVp T /(OT 


97.6 ± 0.09 


98.7 ± 0.07 


99.2 ± 0.05 


95.6 ±0.12 


1 TeV Z' (E6) 


97.8 ± 0.09 


98.9 ± 0.06 


99.5 ± 0.04 


96.3 ±0.1 


2 TeV Z' (SSM) 


97.6 ±0.14 


98.7 ±0.11 


98.9 ±0.10 


95.3 ± 0.2 



Table 17: Trigger efficiencies of dimuon resonance samples. For the meaning of E6 and SSM see [12]. 



Trigger Level 


A 0 


Ai 


A 2 


LI 


12.5 ± 0.3 


3.7 ± 0.4 


0.845 ± 0.02 


L2 


19.6 ± 0.2 


1.59 ±0.19 


0.976 ± 0.02 


EF 


19.5 ± 0.4 


1.56 ±0.3 


0.931 ± 0.01 



Table 18: Fitted parameter for the LI, L2, and EF of the trigger p T turn on curves 



f(p T ) = 0.5 -A 2 • (\.0 + erf(^=— °)) (6) 
v2-Ai 

where erf is the error function, An, A\, and A 2 are the fit parameters which represent the pj value at 
which the efficiency reaches half its maximum value, the slope of the turn-on curve, and the maximum 
efficiency in the plateau region, respectively. 

There are several methods to evaluate the trigger efficiency from the data itself. A possible one is 
to look at the trigger efficiency for a known experimentally clean signature that is similar to the final 
state of interest; Z — > jJ. + jJ.~ is one of such signatures. Since the Z is light compared to the total center 
of mass energy, it can be produced with a significant pj distribution. The trigger efficiency on the 
Z can be measured and extrapolated to high pj. The advantage of this method is that it uses data to 
measure the trigger efficiency which is the most accurate method of measuring the Z trigger efficiency. 
A disadvantage is that the muon trigger efficiency is being extrapolated to a pj by a factor of 10 higher 
than the mean pj of the muons from the Z decay. 

The strategy of evaluating the trigger efficiency from data is as follows. It is first necessary to use 
one of several methods to estimate the muon trigger efficiency as a function of the muon pj and its 
uncertainty. The single object trigger efficiency allow the construction of the probability for an event 
with N objects to pass the trigger. This probability can be written as: 

P= l-Y\{l-Pi) (7) 

(=i 

where Pi is the probability for the i-th object to pass the trigger. 

Two common methods that have been used extensively at the Tevatron are the selection by orthogonal 
triggers and the 'Tag and Probe' method using Z — > fiji decay. The 'Tag and Probe' requires two offline 
muons to have an invariant mass within 12 GeV 2 of 91.1 GeV 2 . The turn on curves as a function of the 
offline muon pj, obtained using this method, is fit to equation 6. This procedure was repeated for all 
three trigger levels and the results are summarized in Table 18. 

A second possible method of evaluating the trigger efficiency with data is by the method of orthog- 
onal triggers. To obtain a sample of unbiased events we select events that pass one of the calorimeter 
based triggers, the single 20 GeV jet trigger. We then perform the offline analysis and require that we 
have a dimuon pair using identical event selection to the 'Tag and Probe' analysis. From this sample we 
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Sample | LlMu20 Efficiency % | L2Mu20 Efficiency % | EFMu20 Efficiency | Total Efficiency | 



Z' 1 TeV (SSM) 


97.7 ±0.11 


99.0 ± 0.07 


99.6 ± 0.04 


96.3 ± 0.01 




97.83 ± 0.04 


98.86 ± 0.03 


99.52 ± 0.02 


96.26 ± 0.05 



Table 19: LlMu20 trigger efficiencies at LI, L2, and Event Filter w.r.t offline reconstruction using 
orthogonal trigger selection to record events 



simply check the fraction of events that pass the LI, L2, and EF trigger conditions for the 20 GeV muon 
trigger. The results are shown in Table 19 and are in good agreement with the 'Tag and Probe' method 
and direct emulation of the trigger on the Monte Carlo sample. Unfortunately, in the real experiment a 
single jet trigger with a threshold of 20 GeV would be very highly prescaled and hence will suffer from 
poor statistics. Events that passed any calorimeter trigger could be used for this study if biases in the 
event topology were taken into account, however, such a study is beyond the scope of this note. 

We have developed two methods that could be used to evaluate the trigger efficiency from data. 
Extraction of the muon trigger efficiency as a function of the reconstructed muon kinematics via a tag 
and probe method and an orthogonal trigger method agree well with the simulated trigger efficiency. 
These methods will allow us to more accurately estimate the trigger efficiency for LHC data. 

13 Summary 

In this paper Muon trigger baseline performance and rates for initial and standard LHC operation have 
been presented. Trigger efficiency has been studied in detail in a wide energy range using single muon 
simulated samples. From efficiencies the muon rates have been evaluated. It should be noted that due 
to the uncertainties of the inclusive muons cross-sections, rates could vary significantly and different 
threshold cuts could be adopted. A further rate reduction should come from dedicated strategies to reject 
muon from in-flight decays of K and K\ in this paper a preliminary analysis is presented at Event Filter. 
It is demonstrated that a good rejection can be achieved with contained losses of prompt muons. 

The possibility to select at the ATLAS second level trigger with high efficiency isolated muons 
from W and Z decays reducing the ones from heavy quark decays has been studied in depth. Although 
electronic readout and pileup noise have been simulated, no cavern background has been yet included. A 
factor of ten reduction on high-pj muons from heavy-quark decays has been obtained while maintaining 
a 95% efficiency on Z — > jJ. + jJ.~ final state. Next step will be to investigate how much the use of the 
longitudinal granularity of the calorimeters and inner tracker detector will increase the muon isolation 
rejection power. 

The overall performance of the TileCal muon tagging algorithm has been presented, using MC sam- 
ples of single muons and inclusive B -Physics processes, including minimum-bias pileup at low luminos- 
ity. 

We finally addressed the question of how the muon trigger efficiency can be measured with Z — ► 
jU + jU~ and Z' — > jUjU using the tag and probe method. This technique shows a very good agreement with 
results based on Monte Carlo studies. 
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Abstract 

The ability to trigger on b-jets improves the flexibility and physics performance 
of the High Level Trigger (HLT), especially for topologies containing more 
than one b-jet. It will be shown that the acceptance for b-jets can be increased 
and background reduced by lowering jet transverse energy thresholds and ap- 
plying ^-tagging selections based on the impact parameter of tracks in jets. 
This note reviews the £>-jet selection in the HLT and discusses its integration 
into the ATLAS trigger menu. 

1 Introduction 

Final states containing 6-jets have been proposed as signatures with substantial discovery potential in 
a variety of physics channels. The ability to separate ft-jets from light-quark and gluon jets is thus an 
important ingredient of the online selection strategy in ATLAS. 

One of the most interesting physics cases addressed by such a b-jet trigger selection involves events 
with final states containing four b-jets. This event class is relevant for Higgs bosons search in the low 
mass range, m# < 130 GeV. The most promising channels are the H — > bb decay, where the Standard 
Model Higgs boson is produced by way of the associated production channel ttH and, in supersymmetric 
theories, the channels bbH, bbA with H /A —>■ bb or H —>■ hh —>■ bbbb. 

The selection of b-jets at the trigger level is mainly meant to improve the flexibility of the HLT, 
extending its physics performance for the topologies described above. This is achieved by increasing the 
acceptance for signal events, while concurrently reducing the background. 

The b-jet selection relies on tracking information which is only available starting with the Second 
Level Trigger (L2). Therefore, the acceptance for signal can only be increased by simultaneously low- 
ering LI jet thresholds and applying a more discriminating b-jet selection in the High Level Trigger (L2 
and EF). High rejection power from the b-jet trigger is required to compensate for less rejection due to 
lower LI thresholds and thereby to comply with L2 and EF output rate limitations. 

2 Monte Carlo samples 

The ^-tagging performance on single jets, presented in this note, is evaluated on b-jets from H — > bb 
decays, where the Higgs boson has a mass of 120 GeV and is produced in association with a W decaying 
leptonically. The standard background for single -jet studies are the corresponding w-jets, obtained by 
artificially replacing the b-quarks from the Higgs decay with w-quarks. While these events imprecisely 
model the real background from light-flavour jets they can be seen as a worst case scenario since the 
kinematical properties of signal and background are very similar. 

Even in this very simple situation, the association between Regions of Interest (Rol), identified by 
the LI trigger, and jets is not uniquely defined: a generic x-quark in the final state of an interaction or a 
decay can radiate gluons and, therefore, change its direction. An Rol from H — > bb or H —>■ uu is labeled 
as x-jet (x = b, u) if an x-quark from the original hard process points, after final state radiation, along the 
Rol direction within an angular distance of AR = ^At] 2 +A<p 2 < 0.1. 

In order to evaluate the rate of the b-jet trigger menu, the rejection power must be evaluated on a 
more representative background sample. As for many of the trigger studies in ATLAS, dijet samples 
are chosen for this purpose since they correctly include all contributions to the ^-tagging background, 
including c-quarks and taus. 
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All data samples studied in this note have been generated without pile-up, leaving the influence 
of pile-up for further studies. The activity due to underlying event is taken into account since it is 
automatically included in the PYTHIA [1] event generation. 

3 HLT 6-jet selection 

3.1 LI configuration 

The HLT reconstruction starts from the Rols selected by the LI trigger [2]. In particular, the ft-jet trigger 
starts from a LI jet-RoI AT7 x A(j) = 0.8 x 0.8 and performs track and vertex reconstruction in a smaller 
Rol Atj x A(j) = 0.4 x 0.4 in order to reduce data access and consequently processing time. 

3.2 b-jet trigger feature extraction algorithms 

The first step in the b-jet trigger chain, both at L2 and EF, is the reconstruction of the relevant quantities 
needed to perform the selection. The b-jet Rols can be separated from light jet Rols using the impact 
parameters of the charged tracks, the properties of reconstructed secondary vertices, or soft leptons; all 
these quantities are related to the ft-quark lifetime and to its decay properties. 

The present b-jet trigger implementation relies only on the impact parameters of charged tracks. 
Primary vertex reconstruction is performed only in the z direction while its coordinates in the transverse 
plane are assumed to be compatible with the origin. 

Track reconstruction algorithms are described, together with their performance, in [3]. The two Inner 
Detector tracking algorithms available at L2 show equivalent performance when operating on jet sam- 
ples [3]. Thus to avoid unnecessary comparisons, the results obtained with the SiTrack algorithm are 
presented. For EF track reconstruction, the algorithm corresponding to that used for offline reconstruc- 
tion has been adopted (NewTracking). 

3.2.1 Primary vertex reconstruction 

Along the z direction no a priori knowledge of primary vertex z v tx is available; consequently, this has to 
be reconstructed, starting from the tracks available in the Rol. This information is needed for the correct 
evaluation of the longitudinal parameter of each track with respect to the primary interaction position. 

The adopted algorithm, a simple histogramming method based on a sliding window, yields an effi- 
ciency of 98(99)% and a resolution on z vt x of about 120(100)jum at L2(EF) as illustrated in Fig. 1. 

3.3 Tagging variables 

The HLT ft-jet tagging methods are based on the transverse and longitudinal impact parameters of the 
reconstructed tracks. Since the methods are the same for L2 and EF they will be described using L2 
variables only. 

3.3.1 Transverse impact parameter 

The most natural choice is to build the ^-tagging discriminant variable from the transverse impact pa- 
rameter do of the reconstructed tracks. Since the hadrons containing ^-quarks have a finite lifetime 
(t ~ 1.6 ps), tracks from their decays are characterized by large do values, while tracks from w-jets come 
dominantly from the primary vertex (d vtx = 0). 

In particular, the significance of the transverse impact parameter S = do/o(do) is used, where c(do) 
is the error on the impact parameter. The error on the transverse impact parameter at L2 is parametrized 
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Figure 1: The distribution of the difference between the true and the reconstructed z primary vertex 
coordinates at L2 (full line) and EF (dashed line). The widths as determined by a fit to the distributions 
are 120 jim and 100 /xm, respectively. 



as a function of reconstructed pras: 



o(d 0 ) = 




where po is the asymptotic term, p\ is the term due to multiple scattering, and p2 is the exponent of the 
multiple scattering contribution (close to two). Although L2 tracking algorithms have recently reached a 
good level of precision, the above error parametrization is still applicable at L2 during early data, while 
for the EF the reconstructed error is used. 

Figure 2 shows the distributions of the impact parameter significance do/a(do) for b-jets and light 
jets at L2. The significance has been rescaled according to the function f(x) = log{\ + \x\) in order 
to have a reasonably uniform bin population along the x axis. From these plots it can be guessed that 
the impact parameter significance is a promising choice for the discriminant variable, since the two 
distribution are very well separated. 

3.3.2 Longitudinal impact parameter 

The longitudinal impact parameter (z 0 ), i.e. the track's z-intercept, can be adopted, as well as the trans- 
verse impact parameter, to discriminate between &-jets and light jets. After the primary vertex position 
has been reconstructed, the Szo = zo — z v tx variable can be used to form a discriminant which can then 
be used for b-]et selection. Figure 3 shows the distributions of the longitudinal impact parameter signif- 
icance (5zo/a(zo)) of &-jets and light jets at L2. The significance has been rescaled as described above 
for the transverse impact parameter. 

As for Fig. 2, the signal and background distributions are different although much less so than for 
the transverse impact parameter significance. From this comparison, it is clear that most of the discrim- 
ination will be provided by the measured transverse impact parameter significance. The resolution of 
the longitudinal impact parameter significance is not as good due to the coarser resolution of the silicon 
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Figure 2: Distribution of the rescaled function (de- 
scribed in the text) of the transverse impact pa- 
rameter significance for tracks coming from ft-jets 
(solid line) and light jets (dashed line) at L2. 



Figure 3: Distribution of the rescaled function (de- 
scribed in the text) of the longitudinal impact pa- 
rameter significance for tracks coming from &-jets 
(solid line) and light jets (dashed line) at L2. 



tracking detectors along the z-direction, bigger extrapolation distance from innermost silicon layer hit to 
primary vertex at high 77, and to the resolution of the reconstructed primary vertex. 

3.4 HLT &-jet tagging methods 

In this Section, HLT ^-tagging methods are described. The likelihood ratio method is quite general and 
can be applied to different variables while the % 2 method is essentially designed to test the compatibility 
of the tracks with respect to the primary vertex using the transverse impact parameter. 

The likelihood ratio, using information on the signal and background shape that have to be estimated 
on real data, is both more powerful and more difficult to tune than the x 2 method. 

3.4.1 The likelihood-ratio method 

The likelihood-ratio method is a statistical tool used to separate two or more event classes, and is based 
on a set of characteristic variables. The likelihood-ratio variable W is evaluated, for a given event, as 
the ratio between the probability distributions for two alternative hypotheses. In its application to b-jet 
selection, the likelihood-ratio variable is defined as 

W = S(s)/S(b), 

where S(s) and S(b) are the probability densities for the signal, the b-jets, and the background, rep- 
resented in this case by the w-jets. This variable is widely used to obtain the best possible separation 
between signal and background, in terms of a single variable, in fits aimed at extracting the fraction of 
signal events in a given sample. The same variable can be also directly used, as in the b-jet selection 
case, to select signal events, for example by applying a cut on the likelihood-ratio variable itself. 

The probability density distributions used in the ^-tagging application can be functions of some 
parameter of each track (e.g. the transverse impact parameter do) or of some collective property of the 
jet (e.g. its track multiplicity). In the first case, these distributions take the form 

s(par\ , par 2 , par 3 par n ), 
b(par\,par2,par^,. . .,par n ), 
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where the 1, . . . ,n indices identify each track belonging to the jet. The corresponding likelihood-ratio 
variable is thus defined as 

w = sjpan , pari , par 3 par n ) 
b(par\,par2,par3,. . . ,par n ) 
Exact evaluation of the s and b functions is very difficult, since it would require an almost infinite amount 
of simulated data; for example, in order to reasonably populate an ^-dimensional cube, about 100 entries 
are needed for each dimension, corresponding to n m) tracks; even worse, the number of tracks in a jet is 
not fixed. However, if we assume that the variables corresponding to different tracks are independent, the 
ratio between the overall probability densities reduces to the product of the ratios of the single probability 
densities: 

f { b{pan) ' 

which is much easier to evaluate. In the ^-tagging case, track parameters have complex correlations 
which depend on the proper time for the B hadron and on its decay kinematics. Nevertheless, it can be 
proven that, neglecting these correlations does not invalidate this variable, but results only in a slight 
reduction of its discriminating power. 

The W variable, can take any value between 0 (for the background) and +oo (for the signal). For 
practical reasons, it is useful to handle a variable defined on a finite interval; to achieve this, W is usually 
replaced by another variable 

x- w 
T+w' 

which can only range between 0 and 1. 

As an illustration of the method, Fig. 4 and Fig. 5 show the distributions of the discriminant variable 
X which is based on the combination of the transverse and longitudinal impact parameter for b-jets and 
light jets respectively at L2 and EF. It can be seen that signal events (&-jets) accumulate near X = 1, while 
the background (light jets) tends to have X close to 0. 





Figure 4: Distribution of the discriminant variable 
X based on the combination of the transverse and 
longitudinal impact parameter significances for b- 
jets and w-jets (dark area) at L2. 



Figure 5: Distribution of the discriminant variable 
X based on the combination of the transverse and 
longitudinal impact parameter significances for b- 
jets and w-jets (dark area) at EF. 



Contrary to the offline ^-tagging methods based on likelihood ratio, the sign of the impact parameters 
is currently not used at HLT since the Rol direction does not give a precise estimation of the b-jet 
direction. Future studies will use the impact parameter sign determination respect to a track based cone 
jet described in the next Section. 
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3.4.2 x 2 method 

The x 2 method offers an alternative tagging approach that is more robust since it is less dependent on 
the details of the impact parameter distributions. This method has been studied and characterized only at 
L2. 

The X 2 method computes the probability for a jet to originate from the primary vertex, based on 
the signed transverse impact parameter significance of tracks pointing to the jet. This technique was 
originally developed by the ALEPH collaboration and extensively used at LEP and Tevatron experiments 
[6] [7] [8]. One of the main advantages of this method is that it only relies on the transverse impact 
parameter significance distribution of prompt tracks in multi-jet events, which can be easily derived 
completely from real data. On the other hand the performance of this method is limited due to the fact 
that tracks from beauty and charm particles produce significant tails with negative impact parameters. 
These negative tails originate from the differences of the direction of the estimated jet and B-hadrons and 
also in the differences of the direction of B-hadrons and charmed hadrons in the cascade decays. 

The definition of the sign of the transverse impact parameter is based on the angle between the jet 
axis and the line between the primary vertex and the point of closest approach of the track, such that it 
is negative when the track appears to originate behind the primary vertex (i.e. when the angle is greater 
than n/2) as illustrated in Fig. 6. 




d 0 = \8\ 



do = -\S\ 

Figure 6: Definition of the sign of the transverse impact parameter. When the angle between the jet axis 
and the line between the primary vertex and the point of closest approach of the track is lower (greater) 
than n/2 the sign is positive (negative). 

Tracks from light-quark jets have equal probability to have positive or negative transverse impact 
parameters, and the width of the signed transverse impact parameter distribution depends on the tracking 
detector resolution and multiple-scattering effects. The signed transverse impact parameter distribution 
of tracks from displaced ft-jets, on the other hand, has a large positive asymmetry due to the fact all that 
most of the long-lived particles from Miadron decays are produced with positive impact parameters. 

Good jet angular resolution is a key to achieving a good Might-quark jet discrimination since the 
direction of the jet axis enters into the calculation of the sign of the impact parameter. Poor jet angular 
resolution results in frequent mis-assignment, particularly for tracks with small angle with respect to the 
jet direction. 

In order to improve the resolution of the azimuthal angle (0) of the jet, a track-based simple cone jet 
reconstruction algorithm is used instead of the jet-RoI 0 direction. Figure 7 shows that the 0 resolution 
improves by more than a factor of two when tracks are used to compute the jet direction. The effect of 
jet <j) resolution on Might quark jet discrimination can be seen in Fig. 8, which shows the distribution 
of signed transverse impact parameter significance for b-jet tracks when 0 is computed using truth, Rol, 
and track-jet 0 directions. 

The negative transverse impact parameter significance is computed using a parameterization for the 
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Figure 7: A0 between the true ft-quark direction 
and its estimate using Rols (solid line) and track- 
based jets (dashed line), as a function of the track- 
jet multiplicity. 
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Figure 8: Distribution of the signed impact param- 
eter significance for &-jets when the jet 0 direction 
is computed using truth, track-jet, and Rol infor- 
mation. 
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Figure 9: Negative transverse impact parameter 
significance and the resolution function R(S) for 
tracks with less than 7 hits in the Silicon detectors. 



Figure 10: Negative transverse impact parameter 
significance and the resolution function R(S) for 
tracks with at least 7 hits in the Silicon detectors. 



transverse impact parameter error as a function of p scat = psind 3 / 2 and the number of hits in the pixel 
detector. A double-Gaussian fit to the distribution of negative transverse impact parameter significance 
(R(S)) is used to define p tr k(S), the probability for a track to originate from a primary vertex: 

where only tracks with positive impact parameter are used in the calculation. The distribution of negative 
transverse impact parameter significance and the resolution function R(S) is shown in Fig. 9 and Fig. 10 
for tracks with less and more than 7 Silicon detectors hits. 

The definition of p tr k ensures a uniform distribution between 0 and 1 for tracks originating from the 
primary vertex. Tracks from displaced B decays result in p tr i ~ 0. 

A x 2 jet probability is defined by considering the probabilities of all tracks with positive transverse 
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impact parameter in a jet [6] : 



Pjet 



(-io g ny 



(2) 



where n = 111 Art (5). 

Figure 1 1 shows the p tr k distribution for light and ft-quark jets. The spike at small probability for 
light quarks tracks is due to tracks from V° decays, which have positive transverse impact parameter. 

Figure 12 shows the pj et distribution for light and ft-quark jets. Jets are tagged as b, if the pj et is 
below some value, typically between 0.5% and 5%. 




Trackx 2 probability 




Figure 11: Track % 2 probability (p trk ) for ft-jets 
(full histogram) and light jets (shaded histogram). 



Figure 12: Jet % 2 probability (pj et ) for 6-jets (full 
histogram) and light jets (shaded histogram). 



4 HLT &-jet selection performance on single jet-RoIs 

Every tagging method will be characterized by the curve showing the light-jet rejection versus the effi- 
ciency to select £>-jets (£&). The light-jet rejection is defined as the inverse of the efficiency of selecting 
w-jets (R u = l/e«) where we have assumed that u-jets are representative of light jets in general. 



4.1 Likelihood ratio method using impact parameters 

Figures 13 and 14 show, respectively, the ^-tagging performance for L2 and EF when the transverse 
impact parameter significance is used in defining the discriminant variable X, while figures 15 and 16 
show the ^-tagging performance curves for L2 and EF, when the significance of the longitudinal impact 
parameter with respect to the primary vertex is used instead. 

Figures 17 and 18 show the ^-tagging performance curves for L2 and EF when the likelihood ratio 
method is built on the combination of the transverse and longitudinal impact parameter significances. 

4.2 x 2 method 

The performance of the % 2 ^-tagging algorithm, evaluated as a function of the % 2 cut is shown in Fig. 19. 
The limited efficiency of the method is due to the request of at least two reconstructed tracks to define 
the track-jet. Clearly, an effort should be made to include Rols having only a single track. Nonetheless, 
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Figure 13: Performance of the b-jet selection 
based on the do significance discriminant variable 
atL2. 




b-jet efficiency 



Figure 14: Performance of the b-jet selection 
based on the do significance discriminant variable 
atEF 



2. *i 
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b-jet efficiency 



Figure 15: Performance of the b-jet selection 
based on the 8zo significance discriminant vari- 
able at L2. 




b-jet efficiency 

Figure 17: Performance of the b-jet selection 
based on the combination of the transverse and 
longitudinal impact parameter significances at L2. 



Figure 16: Performance of the b-jet selection 
based on the Szo significance discriminant vari- 
able at EF 




b-jet efficiency 



Figure 18: Performance of the b-jet selection 
based on the combination of the transverse and 
longitudinal impact parameter significances. 



we note that the strength of the method lies in its impact intrinsic robustness and this advantage must 
also be considered when comparing its performance with that of the likelihood method. 



691 



Trigger - HLT b- Tagging Performance and Strategies 



'0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 
b-jet efficiency 



Figure 19: Performance of the 6-tagging selection based on the jet % 2 probability variable. 



4.3 Comparison with the offline selection 

To tune the online working points so as to ensure the attainment of the overall (i.e. including offline cuts) 
efficiency goal of 60% for b-jet tagging and avoid biases, it is crucial to evaluate the correlation between 
the online and offline algorithms. 

The performance of the L2 and EF trigger algorithms based on impact parameters in the transverse 
plane has been compared to that obtained with the corresponding offline algorithm. This choice is mo- 
tivated by the wish to perform a coherent comparison; more exhaustive comparison studies will be per- 
formed on specific physics selections. 

Figure 20 demonstrates that the L2, EF and Offline selections are well correlated. In particular it is 
always possible to recover the full offline performance at a given b-jet efficiency if the L2 and EF working 
points are set at an appropriate higher efficiency. In particular for the trigger menu studies shown in the 
following, a working point of about 80% efficiency at L2 and about 70% at EF have been chosen in order 
to ensure full acceptance for the standard offline working point (60%). 

Future studies will address remaining differences between EF and offline algorithms. 

4.4 Execution time at L2 and EF 

The execution time needed to reconstruct relevant quantities described in this note and to perform b-jet 
selection was evaluated both at L2 and EF. Results highlight that the timing performance fits design 
requirements and that the overall time spent is dominated by data preparation and track reconstruction 
algorithms. Further details are given in [3]. 



5 Magging trigger strategy 

After having defined and characterized the b-jet selection algorithm on single b-jet Rols the b-jet trigger 
menu has to be built. Figure 21 illustrates the online b-jet selection algorithm's performance as evaluated 
using high statistics samples. The performance of the L2 algorithm is indicated along with the perfor- 
mance of the EF algorithm on events which are selected by L2 (at the nominal working point of 80% 
efficiency). Since the ^-tagging cut is fixed, the ^-tagging efficiencies vary with isj-fhreshold. Typical 
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Figure 20: The correlation between L2, EF and offline taggers 



L2(EF) ^-tagging efficiencies are e b = 76(61)% at E T = 18 GeV and e h = 80(73)% at E T = 70 GeV. 
This variation of the working point as a function of ^compensates for the effect of the worsening of the 
^-tagging performance at low Ej, so the rate reduction does not change significantly with Ej. 

It is clear that &-jet selection can play an important role especially for events with multiple ft-jets 
because the selective filtering of ft-jets can produce very high rejection and thereby allow a significant 
decrease of the LI thresholds while keeping the jet-RoI output rate of L2 and EF almost constant. 



5.1 £-jet trigger menu 

The possible &-jet signatures initiated by multi-jet LI signatures with given Ej thresholds can be repre- 
sented in general as nb£y_mLlJ£y, where n indicates the number of ^-tagged jets required out of m 
LI jets with transverse energy greater than Ej. The HLT ^-tagging working point is the one describe in 
section 4.3. The rate reduction as a function of the available LI thresholds is shown in Fig. 22. The EF 
output rates of different multi ft-jet signatures at the luminosity of 10 31 cm _2 s _1 are given in Table 1. 
The rates and uncertainties of these rates have been computed for dijet samples using the relations 



R 



(3) 
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Figure 21: 6-jet performance based on the combination of the transverse and longitudinal impact param- 
eter (EF selection starts from the chosen L2 working point). 
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Figure 22: Rate reduction achieved with HLT &-jet as a function of the LI £yfhreshold. 



where N' EF and N l Total are respectively the number of events selected at the end of the trigger chain and the 
total number of events in the sample , a, is the cross-section of the sample /; and Jzf is the luminosity. 
The uncertainties in the tables indicate that at high transverse energy, the rate computation is not very 



694 



Trigger - HLT b- Tagging Performance and Strategies 



precise. Nevertheless, with the requirement of keeping the EF output rate at a few Hz for each multi b-]et 
signature, trigger menus for different luminosities can be chosen as: 

• luminosity 10 31 cm^s" 1 : 3b23_3LlJ23, 3bl8_4LlJ18 

• luminosity 10 32 cm V 1 : 2b42^LlJ42, 3b35 3L1J35, 3b23 4L1J23, 4bl8 4L1J18 

• luminosity 10 33 cm V 1 : 2b70_3LlJ70, 3b42_3LlJ42, 3b35_4LlJ35, 4b23_4LlJ23 



It can be noticed that as the luminosity increases, requiring more ^-tagged jets is a viable alternative to 
increasing £y thresholds. 



Transverse energy 




Signature rate [Hz] 




£ r [GeV] 


2bE T _3LUE T 


3bE T _3LUE T 


3bE T ALUE T 


AbE T AL\JE T 


18 


47 ±11 


1.5±0.4 


1.0±0.3 


0.2 ±0.1 


23 


18±7 


0.5 ±0.2 


0.4±0.2 


0.004 ±0.002 


35 


1.0 ±0.2 


0.04 ±0.01 


0.02 ±0.01 


0.0007 ±0.00006 


42 


0.4±0.1 


0.02 ±0.01 


0.01 ±0.01 


0.0007 ±0.00006 


70 


0.01 ±0.02 


0.0008 ±0.0006 


0.0007 ±0.0006 


0.0007 ±0.00006 



Table 1: EF output rates for the different multiple 6-jet signatures at 10 31 cm 2 s l . 

The strategy behind the evolution of the b-]et signatures is to select more aggressively as luminosity 
increases and HLT tracking becomes better understood. Before the b-]et trigger achieves full perfor- 
mance, a good online resolution of track impact parameters must be achieved. In turn, this requires 
adequate knowledge of the inner detector alignment and sufficient understanding of the overall detector 
performance. 

5.2 Prospects for measuring efficiency and correlation with offline on real data 

The HLT ^-tagging group is working closely with the offline ^-tagging group to develop a method to 
measure the b-]et efficiency with real data. For an explanation of the method and a discussion of its 
performance we refer to the ^-tagging note on dijets [5]. 

In addition to the "physics" triggers listed in the previous Section , the b-]et group has introduced 
several "technical" triggers in order to study rate and correlation of the online and offline algorithms: 

• single b-]et signatures: M8, b23, b35, b42, blO: prescaled to limit their contribution to the EF 
output to few Hz; 

• each multi jet item is duplicated with an identical signature which selects, independently of the 
HLT £>-jet result, one over n events (where n is presently set at 1000 but will be tuned according to 
the rate allocated to b-]et triggers). 

6 Summary and conclusions 

The £>-jet selection at the L2 and EF stages of the ATLAS High Level Trigger has been described and 
characterized. An HLT ^-tagging trigger menu has been implemented which demonstrates the feasibility 
of increasing the acceptance of events with more than one b-]et by decreasing LI jet £ythresholds while 
controlling the output rate by introducing a b-]et selection at the HLT. 
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Overview and Performance Studies of Jet Identification in the 
Trigger System 

Abstract 

This note describes in detail the algorithms used to identify jets in the ATLAS 
trigger system. Results from performance studies of these jet algorithms are 
presented. An initial trigger menu using jets and proposed strategy to adapt to 
increases in luminosity are also discussed. 

1 Introduction 

A critical component of the ATLAS trigger system is the ability to efficiently identify hadronic jets in an 
event. The performance of the jet reconstruction depends on the trigger jet energy resolution and scale. 
In this note we discuss the algorithms at the different trigger levels and evaluate their performance. 

2 LI jet trigger algorithm 

A detailed description of the LI jet trigger algorithm can be found in [1]; however, for completeness, a 
summary of the relevant features of this algorithm is presented below. 

The ATLAS electromagnetic and hadronic calorimeters are segmented into approximately 7200 trig- 
ger towers, with granularity of approximately 0.1 x 0.1 in r\ x <j> space. The granularity varies slightly in 
different sub-detector systems, for further details see [1]. Analog signals from these trigger towers are 
transmitted directly to the LI system. The LI hardware digitises the trigger tower signals, associates them 
with a bunch crossing and performs pedestal subtraction. The LI system also applies a noise suppression 
threshold and transverse energy calibration. The electromagnetic tower Ej response is calibrated at the 
EM scale and the hadronic tower Ej response is calibrated for jets. 

The LI trigger constructs "jet elements" made of the sum of 2 x 2 trigger towers in the electro- 
magnetic (EM) calorimeter added to 2 x 2 trigger towers in the hadronic calorimeter which gives a 
granularity of 0.2 x 0.2 in tj x 0 space. The jet reconstruction algorithm consists of a sliding window of 
programmable size that could be either 2 x 2, 3 x 3 or 4 x 4 jet elements. A jet is reconstructed if the 
total transverse (EM+Hadronic) energy within the window is above a given threshold. The step size for 
the sliding window is 0.2 in both tj and 0 which implies significant overlap of the window in neighbour- 
ing positions. To prevent the LI algorithm from identifying overlapping jets, the transverse energy of a 
cluster, defined as a region spanned by 2 x 2 jet elements, is required to be a local maximum within ±0.4 
units in tj and in 0. The LI jet algorithm identifies jets within the region of |tj| < 3.2. Figure 1 shows a 
schematic diagram of the jet reconstruction algorithm at LI. 

In contrast to the other calorimeters, the LI forward calorimeter (FCAL) trigger towers have a granu- 
larity of approximately 0.4 x 0.4 in tj and 0. The forward jet trigger electronics was originally designed 
to only be used for the calculation of missing Ej at LI, and not to identify in addition jets in the forward 
regions of the detector. As a consequence, limited granularity of the FCAL data is available at LI. A jet 
element in the FCAL is formed by summing calorimeter towers in tj . Therefore, the FCAL jet elements 
have a <j) granularity of 0.4 with only a single tj bin at each end. This has an impact on how the HLT 
forward jet reconstruction algorithm is implemented, as discussed in Section 5. 

In the tentative menu proposed for early data taking, a sliding window size of 4 x 4 jet elements has 
been chosen for almost all thresholds; the exception is the algorithm used to reconstruct b-jets which 
has a proposed transverse energy threshold of 5 GeV and uses a window size of 2 x 2 to minimise the 
identification of fake jets associated with possible calorimeter noise. 
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Figure 1: Schematic diagram of the LI jet algorithm showing a window of 4 x 4 jet elements spanning 
the electromagnetic and hadronic calorimeter in depth, and a local maximum transverse energy cluster 
of 2 x 2 jet elements. 



Table 1: Simulated dijet event samples used to study the performance of the jet trigger algorithms. The 
last row gives a summary of the cuts applied on the hard scatter parton in the event. 



Event Sample 


JO 


Jl 


J2 


J3 


J4 


J5 


J6 


J7 


J8 


Cross-section (mb) 


17.6 


1.4 


9.3E-2 


5.9E-3 


3.14E-4 


1.3E-5 


3.6E-7 


5.3E-9 


2.22E-11 


E T Range (GeV) 


8-17 


17-35 


35-70 


70-140 


140-280 


280-560 


560-1120 


1120-2240 


>2240 



3 LI performance 

Dijet events were used to study the performance of the LI jet trigger algorithm. Table 1 provides a 
summary of the simulated data samples used along with their respective cuts on the hard scatter parton 
and cross-section. These simulated dijet event samples, together, span the whole Ej jet spectrum relevant 
for jet identification in the trigger. 

The transverse energy scale of LI jets is defined as the ratio between the transverse energy measured 
in LI divided by the truth jet Ej. Each jet identified by the LI trigger was matched to a truth jet found 
using the cone algorithm with R = 0.4. The matching criterion consists in searching for the closest truth 
jet in the tj — 0 plane, where the distance is defined as AR = \J — r\R eC o) 2 + (0li — <l>Reco) 2 - The LI jet 
transverse energy scale as function of truth jet Ej is shown in Fig. 2(a). The transverse energy scale at 
LI varies from about 70% to 90%, increasing with the transverse energy of the jet. There are several 
effects that contribute to this behaviour. The first one is the noise suppression mentioned in Section 2. 
The lower the transverse energy of the jet, the larger is the number of trigger towers that do not satisfy this 
noise suppression threshold leading to an underestimation of the jet transverse energy. In addition, each 
LI trigger tower signal must be associated with a particular Bunch Crossing Identification number. The 
efficiency of this requirement is approximately 50% for 1 GeV trigger tower signals, reaching nearly 
100% for 3 GeV trigger tower signals. Again, this inefficiency particularly affects lower transverse 
energy jets that will tend to contain a larger fraction of trigger towers with low energy signals. The 
most important reason for the lower jet transverse energy scale with respect to truth jets comes from the 
different e/n response of EM and hadronic towers. For presentation purposes, a common conversion 
factor is used to translate the number of counts measured in an EM or hadronic tower into units of 
transverse energy. This results in an apparent lower jet transverse energy scale compared to truth jets. It 
is important to note that thresholds in the LI are applied in terms of the internal LI quantity "counts" 
and not in units of transverse energy. In Fig. 2(b), the LI jet transverse energy scale is also shown as 
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Figure 2: The LI jet transverse energy scale as function of truth jet transverse energy (a) and pseudo- 
rapidity (b). 




Figure 3: LI jet transverse energy resolution as a function of truth jet transverse energy (a) and the LI jet 
trigger efficiency as function of the offline reconstructed jet Ej for different LI energy thresholds (b). 



a function of the pseudo-rapidity (T]) of the truth jet. The response of the different calorimeter sub- 
detectors can be identified. Figure 3(a) shows the LI transverse energy resolution as function of truth jet 
Ej. The transverse energy resolution is defined as the width of the E\ x —Ej uth distribution in each Ej uth 
bin, divided by the Ej uth of that bin. 

Figure 3(b) shows the LI jet trigger efficiency as function of offline reconstructed jet Ej for different 
LI jet trigger thresholds. The limited jet transverse energy resolution of the LI system particularly affects 
the higher LI energy thresholds. 

The effect of pile-up on the performance of the LI jet trigger reconstruction was also studied. At a 
luminosity of 10 33 s _1 cm -2 , an average of approximately 2 inelastic collisions are expected per bunch 
crossing. Simulated dijet events including this level of pile-up were generated. The effect of increased 
occupancy in the calorimeters will have some impact on the reconstructed transverse energy of the jet. 
Figure 4 shows the impact of this pile-up on the LI jet transverse energy scale and jet trigger efficiency. 
The transverse energy scale increases due to the additional energy deposited in the calorimeter from pile- 
up events. The LI transverse energy resolution also worsens as seen on Fig. 4(b). The effect is clearly 
more important for low energy jets where the contribution of pile-up energy can be of the same order of 
magnitude as the jet energy itself. 



699 



Trigger - Overview and Performance Studies of Jet Identification in the . 



++ ++ t A 




OfflinejetE^GeV] 



Figure 4: The LI jet transverse energy scale as function of truth jet transverse energy (a) for simulated 
dijet events with pile-up. The LI jet trigger efficiency as function of offline reconstructed jets (b) for a 
35 GeV LI trigger threshold, with and without taking into account pile-up. 



4 HLT jet algorithms 

The HLT object reconstruction is guided by the result of the LI system. HLT algorithms typically only 
access data from a limited region of the detector in the vicinity of an Rol provided by LI. The position 
of this Rol is successively updated and refined by the HLT algorithms. 
HLT algorithms are classified in two types: 

• "Feature Extraction algorithms": Algorithms that retrieve and unpack detector data and create 
simple classes composed of useful physics observables. These algorithms consume most of the 
available time. 

• "Hypothesis algorithms": Algorithms that retrieve the physics information produced in the preced- 
ing Feature Extraction algorithms, and validate a specific hypothesis (e.g. Ej threshold). These 
algorithms have a very fast execution time. 

This separation between Feature Extraction and Hypothesis algorithms optimises the overall execu- 
tion time since the data retrieved by a single Feature Extraction algorithm can be used to provide input 
to several fast Hypothesis algorithms to test various physics signatures, and hence avoids multiple data 
access and unpacking. 

Figure 5 shows a schematic diagram of the sequence of algorithms used to reconstruct jets in the HLT. 
The Hypothesis algorithms for jets reconstructed at L2 and EF compare the energy of the jet candidates 
to some predefined Ej thresholds. The next few sections describe in detail each Feature Extraction 
algorithm appearing in Fig. 5. 

5 L2 jet algorithm 

Standard L2 jets are defined within the \r\ \ < 3.2 region and are reconstructed using the electromagnetic 
and hadronic calorimeter data. Forward jets are defined within the range 3.2 < \r\ \ < 5 and are recon- 
structed using data from the forward calorimeters. As described below, the forward jet reconstruction is 
different than the standard jet algorithm due to the limited tj position resolution of LI jets. 

The output of the L2 Feature Extraction algorithm is a reconstructed jet with a given energy and 
position in T7 and <j). The algorithm contains three distinct parts described below: data preparation, jet 
finding and calibration. 
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Figure 5: Schematic diagram of the sequence of algorithms used to reconstruct jets in the HLT The ovals 
and diamonds represent Feature Extraction algorithms and Hypothesis algorithms, respectively. 



5.1 L2 jet data preparation 

The data preparation for the L2 jet trigger is a critical part of the algorithm chain. It provides the col- 
lection of data from the detector readout drivers (ROD) to the L2 processing units and the conversion 
from the raw data into bytestream files readable by the HLT algorithms. The RODs receive data from 
the calorimeters front-end boards (FEB) via optical fibres. The FEBs are installed on the detector and 
contain the electronics for amplifying, shaping, sampling, pipelining, and digitising the signals [2,3]. 

The ATLAS calorimeters consist of more than 10 5 individual readout channels; therefore, in order 
to meet the L2 timing performance goals of 40 ms total processing time per event, the amount of data 
unpacked must be kept to a minimum while simultaneously maximising the physics performance of the 
algorithm. 

The L2 jet trigger algorithm accesses calorimeter data that lies in a rectangular region centred around 
the LI jet Rol position with a width AT7 and A(j) that can be defined to have any size. The widths A 77 and 
A0 are parameters that are specified at trigger configuration time. Figure 6 shows a schematic diagram 
of the L2 jet reconstruction algorithm. 

The position and transverse energy of each detector element that falls into the chosen (A 77, A<j>) 
window is read out by the algorithm. As a result the calorimeter read out region can be regarded as 
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Figure 6: Schematic diagram summarising the L2 jet algorithm. The data unpacking step reads in the 
necessary calorimeter data within a predefined window size and defines a grid of calorimeter elements 
each with an associated energy and position. The size of grid elements depends on the calorimeter data 
unpacking method used. The dark (red) boxes in the diagram represent grid elements with substantial 
energy deposition. The algorithm is seeded by the LI position as shown on the left. The final jet is found 
after a given number of iterations. The position of the jet is calculated as the energy weighted average of 
the grid elements position within a given cone size. The energy of the jet is calculated as the sum of the 
energy of each grid elements falling within the given cone size. 



partitioned into a grid of elements with associated energies and (r},(j>) -coordinates as shown in Fig. 6. 
Therefore, the amount of data accessed is equivalent to the number of grid elements. 

Two different data unpacking approaches are implemented and described below. One of the methods, 
the cell-based approach, has finer granularity and hence produces more accurate energy and transverse 
energy reconstruction, but is more time consuming. The other method, the front-end board approach, is 
faster but the coarser granularity produces a less precise reconstruction. Both methods are, as we will 
see in section 6, reasonably within the L2 time budget limits. The final decision of what approach should 
be used will be made depending on the final High Level Trigger setup. 

5.1.1 The cell-based approach 

This method uses the full granularity of the calorimeters [1]. Each grid element in Fig. 6 corresponds 
to a calorimeter cell with a given transverse energy and (tj, 0) -coordinate provided by the ROD. In the 
following discussions, jets reconstructed using this data unpacking approach are referred to as "cell-based 
jets". 

5.1.2 The front-end board approach 

This method uses a coarser granularity than the cell-based data unpacking. Instead of reading out ev- 
ery cell over a specified region of the liquid argon (LAr) calorimeters, only information from the LAr 
calorimeters front-end boards is used. There are 128 readout channels per FEB and two FEBs are con- 
nected to one ROD. For the tile calorimeter, the full cell granularity is still used. 

In the RODs, the sums of the Cartesian components of the cell energies are calculated for each FEB: 



COS0i 

coshr/i 
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E z = ££itanhTji (3) 

where E- x is the energy and T7; the position of each cell. This sum is computed per FEB and runs over 
all channels with an energy E\>2 - a no i se - The noise cut value, which is a configuration parameter, has 
been determined from performance studies with QCD dijet events, such as to give the similar jet energy 
scale as obtained with the cell-based approach, which will be presented in the next section. The total 
energy of the cells connected to a FEB is then obtained from the quadratic sum of the three components, 
£tot = yJ^l + E^ + El. The corresponding r\ and 0 coordinates are calculated as 

7] =atanh(£ z /£ to t), 
0 =atan2(£ y /£ x ), 

where the function atan2 returns the angle (j) G [— k,k]. The resulting values for the energy and the (tj, 
0)-coordinate are then used to define the grid of elements in a region around the LI Rol position, as 
is illustrated in Fig. 6. In the following text, jets reconstructed using this data unpacking method are 
referred to as "FEB-based jets". 



5.2 L2 jet finding algorithm 

Jets are defined as a cone-shaped object in the (tj, 0)-space with a given radius AR = y / ATj 2 + A0 2 . The 
value AR is a parameter of the algorithm that is defined at trigger configuration time. The jet energy and 
position are found through an iterative procedure with the following steps: 

• The LI jet Rol is used as a seed for the algorithm. A reference jet is created, labeled jo, defined 
by the LI jet Rol position with the pre-defined cone radius AR (see left-hand side of Fig. 6). Note 
that the possible positions of the reference jet jo are discreet due to the LI granularity. 

• Grid elements that fall within the (tj, 0)-region encompassed by the reference jet jo are used to 
recalculate the jet energy and position according to: 



(4) 
(5) 



The sum runs over the k grid elements that are contained in the cone defined by the reference jet 
jo. A grid element is included in the sum if its centre falls within the region spanned by the cone 
radius AR. The total energy and coordinates (rjjj, (j>0, computed in Equations (4) and (5), are used 
to define the new reference jet j\. 

• The previous step is repeated with jo replaced by ji in Equations (4) and (5), which results in 
updated coordinates Tjj 2 and 0j 2 to define the updated jet j 2 . This algorithm can be repeated N 
times to create jet j N . 
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• A predetermined fixed number of iterations are executed. The energy of the final jet is calculated 
as the sum of the energy of all the grid elements falling within the cone radius. The position of 
the jet is obtained as the energy weighted average of the position of each grid elements within the 
cone, as shown in Fig. 6. 

The outcome of this algorithm is a jet defined by its (77, 0) position and total energy. The calorimeter 
energy scale, at this point, is set at the electromagnetic scale which does not provide an accurate measure 
of the jet energy. The next Section describes the calibration weights applied to correct the total jet energy 
for non-electromagnetic shower components. 

5.3 L2 jet calibration 

The ATLAS calorimeter response to the electromagnetic component of a hadron shower is not equal to 
the response to the non-electromagnetic component. In general, the hadronic response (h) is smaller than 
the electromagnetic response (e), 

e/h>\. 

This effect is mainly due to the energy lost in the breakup of nuclei or in nuclear excitation. In order 
to correct for it, a weight is applied to each element that makes up a jet. Depending on the calibration 
method used, a jet can be regarded as composed, for example, of individual calorimeter cells or of energy 
deposits in calorimeter samplings. The calibrated jet energy can then be written as: 

E^ = p Vi E i (6) 

where the sum runs over the n constituents of the jet. 

The weights w\ in Equation (6) are extracted using simulated event samples by minimising the func- 
tion: 

S.gp^] 2 (7) 

The sum runs over all the jets in the events. The true energy of the m-th jet in the event is labeled 
£"^ uth and is obtained from the Monte Carlo (MC) truth information, running a cone jet algorithm with 
radius AR = 0.4 over the MC truth particles and finding the truth jet that is closest to the m-th jet. A 
truth jet is made up of all particles generated, excluding neutrinos and muons which have their own 
observables, missing transverse energy for neutrinos and reconstructed tracks for muons. By minimising 
5 with respect to the true jet energy, an improvement of the jet energy scale and resolution is obtained. 

Different calibration methods can be applied, which differ in the partitioning of the jet energy into 
calorimeter components (e.g. cells, layers). The calibration method used here is the so-called sampling 
technique [4]. In this method, individual weights can be applied to the energy deposited in each of the 
electromagnetic (EM) and hadronic (HAD) calorimeter sampling layers. The energy dependence of the 
weights is chosen to be: 

Wi = a + blog(E). (8) 

In the implementation used for L2 jets, only two weights are calculated and applied to calibrate 
the reconstructed jet energy; one weight for the total energy deposited in the EM calorimeter, and one 
weight for the total energy deposited in the hadronic calorimeter. Furthermore, the 77 range is split in 32 
bins with a size of 0.1 in the region 0 < |t]| < 3.2. This procedure assumes an azimuthally symmetric 
response. This assumption will need to be validated with real data. A fit to Equation (8) is performed 
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Table 2: Parameters used in the L2 jets reconstruction. The value of the cone size radius is defined for 
each iteration. 



Parameter 


Standard Jets 


Forward Jets 


Number of iterations 


3 


3 


Cone radius 


0.4/0.4/0.4 


1/0.7/0.4 


Atj window size 


1.4 


3<|T]|<5 


A(j) window size 


1.4 


1.4 



using the MINUIT2 package [5]. It yields the values of the two parameters of Equation (8) for each 
17 -bin and for the electromagnetic and hadronic calorimeter sampling. The computed weights are then 
stored in a configuration file, which serves as input to the L2 jet algorithm that applies the weights to 
each identified jet. 

5.4 L2 jet parameters 

In the configuration of the L2 jet trigger algorithm, the values of the following parameters need to be set: 

• the data unpacking method: cell-based or FEB-based, 

• the calorimeter window size AT7 and A(j) , where data need to be unpacked, 

• the radius of the cone AR used in the jet finding algorithm, 

• the number of iterations used in the jet finding algorithm, 

• the calibration constants used. 

A set of parameter values which yields the optimal balance between short execution times and ade- 
quate physics performance must be chosen. 

6 L2 performance 

The performance of the L2 jet algorithm was studied using the simulated dijet event samples described 
in Table 1. 

The parameters used in the reconstruction of the L2 jets are summarised in Table 2. These parameters 
were found to be a good compromise between the physics performance (e.g., energy resolution and scale) 
and the algorithm execution time. 

The first parameter studied was the number of iterations of the jet finding algorithm. The variation in 
the jet (T7 , 0) coordinate and transverse energy after each iteration of the jet reconstruction algorithm is 
shown in Fig. 7. The largest variation in Ej , X] and (j) happens after the first iteration and thus it has the 
largest impact on the measurement precision. This suggests that the number of iterations could possibly 
even be reduced to 2 without losing too much precision on this measurement. 

In order to study the effect of the window size used to unpack calorimeter data, the total time spent 
by the jet algorithm running with different sizes was studied. Windows with dimensions of 1.0 x 1.0 (in 
T7 x (j)) and 1 .4 x 1 .4 were studied. A window size of 1 .0 x 1 .0 is only slightly larger than the diameter of 
a jet with AR = 0.4. The maximum initial displacement of a jet with respect to a truth jet is approximately 
0.2 in AT7 or A0, as can be seen in Fig. 7. Therefore, a window size of 1.0 x 1.0 should be adequate for 
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Figure 7: Variation in the (a) transverse energy, (b) 0 position and (c) 77 position of jets as function 
of number of iterations performed by the L2 jet reconstruction algorithm. The area of the boxes is 
proportional to the number of entries in each bin. 
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Figure 8: Time spent by the L2 jet reconstruction algorithm for two different window sizes: 1.4 x 1.4 
(dashed line) and 1.0 x 1.0 (solid line). 



the jet reconstruction. The total time spent by the L2 jet algorithm for two different window size and 
using three iterations is shown in Fig. 8. A reduction of the window size from 1.4 x 1.4 to 1.0 x 1.0 
results in a considerable reduction (of order 30%) in the processing time. As shown in the next section, 
the energy scale and resolution is not significantly affected by this reduction of the size of the window. 

6.1 Performance for cell-based reconstruction 

The calibration constants used to reconstruct cell-based jets in L2 were extracted using dijet events 
simulated with PYTHIA [6] and following the approach described in Section 5.3. The L2 jet transverse 
energy scale and resolution for cell-based jets after calibration are presented in Fig. 9. The transverse 
energy scale is defined as the L2 jet transverse energy divided by the truth jet Ej. Truth jets are identified 
by applying the cone algorithm with R con e = 0.4 on the collection of truth final state particles. The 
transverse energy scale is close to unity for all the X] coverage of the L2 jet trigger and all the transverse 
energies studied, demonstrating that the transverse energy is correctly measured within 2%. The jet 
transverse energy resolution decreases from 12% for the lowest transverse energies to 4% for transverse 
energies above 1000 GeV. The resolution curves were fitted with the following expression that includes 
a stochastic term convoluted with a constant term: 

Table 3 shows the result of the fits for all the tj bins, before and after calibration. A few percent improve- 
ment in the resolution is obtained with the current calibration method. A further improvement can be 
achieved in the future by exploiting the correlation between the fraction of electromagnetic energy and 
the calibration weights [7]. 

In another study, two different window sizes were used in order to study the effect on the jet energy 
scale and resolution. L2 jets were reconstructed using a window size of 0.7 x 0.7 (in J] X (j>) which was 
chosen to be slightly smaller than the jet cone diameter (2 x AR = 2 x 0.4 = 0.8) such that some of the 
energy of the jet may lie outside the window considered. Results were compared with the performance 
obtained using the window size dimension of 1.4 x 1.4. The jet energy calibration constants were calcu- 
lated independently in both cases and the resulting jet energy scale and resolutions were compared. In 
both cases, the jet energy scale was found to be within 2% of unity. The resolution of the jets was also 
found to be similar for both window sizes. This means that the calibration algorithm can adequately cor- 
rect for a small fraction of the jet energy lost outside the window considered. Therefore, using a window 
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Figure 9: Jet energy scale for the L2 jets as a function of the truth jet Ej (a), for four different bins in Tj. 
Jet energy resolution as a function of the truth energy of the jet (b), for four different bins in 77. These 
results are obtained after calibration. 



Table 3: Results of the jet energy resolution fit as a function of the truth jet energy, before and after 
applying the calibration. The fit was done using Equation (9). 



tj region 


Before calibration 
A B 


After calibration 
A B 


(0.0,0.7) 
(0.7,1.5) 
(1.5,2.5) 
(2.5,3.2) 


1.03 ±0.03 0.059 ±0.001 
1.28 ±0.03 0.064 ±0.001 
1.53 ±0.04 0.046 ±0.001 
1.86 ±0.13 0.063 ±0.003 


0.96 ± 0.02 0.039 ± 0.001 
1.18 ±0.03 0.041 ±0.001 
1.37 ±0.03 0.025 ±0.002 
1.46 ±0.08 0.040 ±0.003 
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Figure 10: Comparison of the L2 jet energy scale and resolution obtained for two different MC genera- 
tors, PYTHIA ( blue circles) and HERWIG (red triangles), for jets in the region 0 < r\ < 0.7. 

size of 1 .0 x 1.0 will reduce processing time while keeping essentially the same physics performance as 
the larger window (1.4 x 1.4). 

The jet energy measured in the calorimeter may be sensitive to the shower development and the 
hadronisation mechanism that was introduced in the MC simulation. In order to test the sensitivity of the 
calibration procedure to the simulation, a set of dijet event samples generated with HERWIG [8] was used. 
Calibration constants were extracted with the PYTHIA dijet samples and used in the reconstruction of the 
HERWIG dijet events. The jet energy scale and resolution obtained in this way with the HERWIG data 
sample was compared, in different Tj regions and Ej values, with the one obtained for PYTHIA. Figure 10 
shows, as an example, such a comparison in one particular Vj region. Differences between the two 
generators were found to be smaller than 2%, for all regions of r\ and jet Ej, suggesting that the L2 jet 
energy scale is relatively insensitive to the hadronisation model used in MC generators. 

Figure 11 shows the L2 trigger efficiency as function of offline jet Ej after calibration has been 
applied for four different L2 thresholds. The LI threshold of 15 GeV was chosen to be significantly 
smaller than that for L2 in order to avoid a mixture of resolution and jet energy scale effects from the two 
different trigger levels. The limited sharpness of the curves shown on Figure 1 1 is, therefore, dominated 
by the resolution of the L2 jet energy. 

Initially, the detector simulation is not expected to provide an exact model of the real detector. The 
energy measurement will be mainly affected by an incomplete knowledge of the dead material distribu- 
tion in front of the calorimeters. To study the effect that the limited knowledge of the detector geometry 
may have on the performance of the trigger and reconstruction algorithms, dedicated MC dijet event 
samples were produced. They were reconstructed with a geometry where detectors were slightly dis- 
placed from their nominal positions and extra dead material was added. This accounted for 7-10% of a 
radiation length in the inner detector and a few percent of one radiation length in front of the calorimeter. 
The knowledge of the dead material distribution is assumed to be worst at the interface between different 
calorimeter subsystems. 

The calibration constants obtained assuming a perfect geometry were used to identify L2 jets in 
the misaligned samples. Hence, the resulting jet transverse energy scale and resolution can serve as 
an estimation of how much the performance of the L2 jet reconstruction may be degraded due to the 
limited knowledge of the detector geometry and dead material distribution at the beginning of the data 
taking period. Figure 12 shows the jet transverse energy scale and resolution obtained for dijet events 
reconstructed with this mismatch of calibration constants. For most of the pseudo-rapidity region, the 
linearity with energy is within 3-4%, slightly worse than before. In the region (1.5 < |tj| < 2.5) the jet 
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Figure 11: Trigger efficiency as function of offline jet transverse energy for L2 jets after calibration, for 
four different thresholds (35 GeV, 42 GeV, 70 GeV and 100 GeV). The statistical uncertainty on each 
point is smaller than the symbols. 
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Figure 12: Jet transverse energy scale for L2 jets as a function of the truth jet transverse energy (a), 
for four different tj regions. Jet transverse energy resolution as a function of the truth jet transverse 
energy (b), for four different X] regions. Both plots were obtained using dijet event samples reconstructed 
assuming a limited knowledge of the detector's dead material distribution. 

energy scale drops to about 94% due to the extra dead material. The transverse energy resolution is also 
degraded, as shown in Table 4. The performance of the calibration at the beginning of data taking can 
be improved using in-situ calibration procedures used to extract the calibration constants directly from 
the data or to correct the detector response in the MC. Several different procedures are currently under 
study. 

The reconstructed jet position is another parameter used to measure the algorithm performance. Cur- 
rently no selection on jet position is made at the trigger level, but this may prove useful in the future 
for some physics channels. The position resolution is shown in Fig. 13. This figure is also included for 
completeness to compare with the FEB unpacking approach that uses a coarser granularity of data. 

6.2 Performance for FEB-based reconstruction 

Studies were done to evaluate the performance of the L2 jet reconstruction using the FEB-based method 
of data unpacking. For comparison with results presented in Fig. 12, dijet event samples that assume 
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Table 4: Results of the jet energy resolution fit as a function of the truth jet energy for the event samples 
where a limited knowledge of the detector's dead material is assumed. The fit was done assuming 
Equation (9). 



77 region 


After calibration 
A B 


(0.0,0.7) 
(0.7,1.5) 
(1.5,2.5) 
(2.5,3.2) 


1.04 ±0.02 0.038 ±0.002 
1.24 ±0.03 0.055 ±0.001 
1.95 ±0.04 0.018 ±0.004 
1.66 ±0.09 0.039 ±0.002 
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Figure 14: (a) Jet energy scale for the L2 FEB-based jets as a function of the truth jet Et for four different 
bins in rj. (b) Jet energy resolution as a function of the truth energy of the jet for four different bins in rj. 
Both plots were obtained using dijet event samples reconstructed assuming a limited knowledge of the 
detector's dead material distribution. 

a limited knowledge of the detector dead material were used to study the transverse energy scale and 
resolution of FEB-based jets. The transverse energy of jets reconstructed using the FEB-based data 
unpacking approach was weighted using the default L2 calibration constants obtained using the cell- 
based method. 

Figure 14 shows the jet transverse energy scale and resolution. The energy scale stays within 5% of 
unity for most of the pseudorapidity range. The transverse energy resolution distribution was fitted using 
Equation (9) and the results are presented in Table 5. The transverse energy resolution of FEB-based jets 
is comparable to that of cell-based jets presented in Table 3. The transverse energy scale and resolution 
of the FEB-based jets depend strongly on the energy cut-off introduced in Equation (3). It was found 
that using a cut-off at 2 • <7 no i S e gave the best result in terms of transverse energy scale and resolution. 

Figure 15 shows the 77 and <j> resolution. These distributions were obtained from the difference 
between a L2 FEB-based jet and its nearest MC truth jet. These results are comparable with those 
obtained using the cell-based method as shown in Fig. 13. 

The L2 trigger efficiency for different thresholds as function of reconstructed jet transverse energy 
for FEB-based jets is presented in Fig. 16. The initial slope of these efficiency curves is similar to 
the results obtained with the cell-based method shown in Fig. 11. This indicates that FEB-based jets 
have a selection efficiency that is comparable to the cell-based one. The FEB-based jet reconstruction 
significantly reduces the amount of data to be unpacked at L2 compared to the cell-based approach. The 
impact of the unpacking choice on the processing time of the jet reconstruction algorithm is presented in 
Section 6.4. 

6.3 Forward jets 

The implementation of L2 forward jet reconstruction must consider constraints on the precision of the 
LI forward jet Rol position. Since each LI FCAL trigger tower spans the entire 77 range of the FCAL 
detector, a dedicated optimisation of the jet reconstruction algorithm is required. In order to properly 
account for the overlap region between the FCAL and endcap calorimeters, a window size of 3.0 < 
I tj I < 5 was chosen. The cone size (AR) must also be modified in order to remove any bias from the 
initial LI Rol seed position. Similar to the standard jet reconstruction, three iterations of the jet finding 
algorithm are performed. An initial cone size of AR = 1 is used to collect sufficient data to remove the 
LI bias. In the second and third iteration of the jet finding algorithm, the cone size is reduced to 0.7 
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Figure 15: The Tj-resolution (a) and 0 -resolution (b) of L2 FEB-based jets with respect to the truth jet 
energy. The mean and standard deviation of a Gaussian fit of (a) is 0.002 and 0.03, while the mean and 
standard deviation of a Gaussian fit of (b) is -0.00007 and 0.009. 



Table 5: Results of the jet energy resolution fit as a function of the truth jet energy for FEB-based jets 
after applying the default calibration constants obtained using the cell-based data unpacking method. 
The fit was performed assuming Equation (9). 



tj region 


After ca 
A 


ibration 
B 


(0.0,0.7) 
(0.7,1.5) 
(1.5,2.5) 
(2.5,3.2) 


0.93 ± 0.05 
1.18 ±0.05 
1.56 ±0.04 
1.93 ± 0.01 


0.02± 0.01 
0.03± 0.01 
0.05± 0.01 
0.01± 0.01 




0.6 - -•- E^ 2 > 35 GeV 

0.4 L ^ 2 >42GeV 
— E^ 2 >70GeV 
02 f ^ ^ -^E^ 2 >100GeV 



0cT'""5 l fi"3 l 8 , I ?3 T T §0^10 0 ' 120 140 160 180 ' 200 

Offline jet E t [GeV] 



Figure 16: L2 trigger efficiency as function of reconstructed jet transverse energy for FEB-based jets 
after calibration, for four different thresholds (35 GeV, 42 GeV, 70 GeV and 100 GeV). 
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Table 6: L2 trigger efficiency for L2 forward jets with respect to truth jets with E T > 25 GeV. 



Tagged Objects 


Data unpac 
cell-based jets (%) 


dng method 
FEB-basedjets (%) 


Highest p l ™ th forward jets 
All forward jets 


98±1 
97±1 


98±1 
91±1 



and subsequently to 0.4. Calibration constants for forward jets are derived using the method described 
in Section 5.3. 

The L2 forward jet trigger efficiency was determined with respect to truth jet. Reconstructed jets 
were matched to truth particle jets using the requirement that AR = ^/Ar^+A^ < 0.2. In addition, 
truth jets were required to satisfy pr>25GeV. Results from this study are presented in Table 6. 

6.4 L2 jet timing 

The time budget of approximately 40 ms per event and the strong rejection power needed at the L2 trigger 
impose strong constraints on the L2 algorithms speed and physics performance. 

During the commissioning of the ATLAS Trigger/DAQ system, dedicated Technical Runs are per- 
formed in order to test HLT algorithm performance. In these runs all the detector Read Out Systems 
(ROS) and the full Trigger/DAQ infrastructure are dedicated to exercising the data acquisition system 
and the trigger. Bytestream files 1 containing a mixture of events close to that expected in LHC collisions 
(mainly QCD jets, mixed with a few Z and W decays to leptons, tt events, etc.) are preloaded into the 
ROS. These events, which contain the Rols obtained from the LI simulation, are then processed by the 
HLT system. The measurements presented here were made during the Technical Run that took place in 
November 2007. 

In order to compare the difference between the cell-based method and the FEB -based method, two 
data-collection runs were recorded, one with L2 jets reconstructed using the cell-based approach, and 
one with L2 jets reconstructed using the FEB-based method. The timing distributions obtained from the 
cell and the FEB-based methods are shown in Fig. 17(a). Both distributions have a similar shape with 
peaks corresponding to the number of Rols per event. The FEB-based approach is however almost 50% 
faster than the cell-based method. 

The distribution of the total processing time shown in Fig. 17(a) includes the data unpacking, jet 
finding algorithm and calibration, as well as the data collection time from the detector Read Out System 
(ROS) to the L2 processors, which is shown in Fig. 17(b). The data collection time contributes sig- 
nificantly to the total L2 processing time, about 30% for the cell-based method and about 50% for the 
FEB-based approach. A detailed description of the data preparation methods and performance can be 
found in [9]. 

The processing time distribution per Rol for the different steps involved in the L2 jet reconstruction 
are shown in Fig. 18(a) and (b) for the cell-based and FEB-based method, respectively. These mea- 
surements show that the algorithm execution time is dominated by the data unpacking step. The small 
features observed in the unpacking time distributions outside the main peak come from Rols that point 
in regions where less data needs to be unpacked. 

Timing measurements of the L2 forward jet algorithm were also performed. A comparison of the 
reconstruction time between cell-based and FEB-based jets is presented in Table 7. Due to the limited 
time available during dedicated Technical Runs, these measurements were obtained by running the L2 jet 

'Files with the same format as the raw data that will come out of the ATLAS detector. 
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Figure 17: The total processing time per event (a) for the L2 jet algorithm. The solid line is the processing 
time measured using the FEB-based method (mean 13 ms). The dashed line is the processing time 
measured using the cell-based method (mean 22 ms). The total data collection time per Rol (b) for both 
the cell-based and FEB-based data unpacking methods. 




Figure 18: L2 jet algorithm processing time per jet Rol for the cell-based method (a) and FEB-based 
method (b). The total processing time is shown together with the execution time of the individual steps 
involved in the L2 jet reconstruction. 
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Table 7: Comparison of the average L2 jet reconstruction time between the standard and forward jet 
algorithms in units of ms. Different data unpacking methods are also compared. 



Tagged Objects 


Data unpac 
cell-based jets 


dng method 
FEB-based jets 


Forward Jets 
Standard Jets 


0.9 
6.1 


0.5 
2.1 



algorithms on offline computing resources rather than making use of the full Trigger/DAQ infrastructure. 
Although the absolute values of reconstruction time should only be taken as an approximate indication 
of the performance to be expected on the online trigger system, the relative comparison of each measure- 
ment is nevertheless informative. For example, although the L2 forward jet algorithm uses a larger r\ x 0 
window than the standard L2 jet algorithm, it requires less than 25% of the total time used to reconstruct 
standard L2 jets. This is due to the larger granularity of the forward calorimeter, as compared to the 
liquid Argon barrel and endcap calorimeters, which results in a smaller amount of data being unpacked 
on average. 

7 The event filter jet algorithms 

The jet reconstruction in the Event Filter (EF) can be divided into two tasks: the input data preparation 
and the jet finding. These two tasks are carried out by algorithms used in the offline reconstruction 
but adapted to run in the online trigger environment. Once an EF jet has been identified, it is passed 
on to a Hypothesis algorithm which validates whether or not the reconstructed jet satisfies a predefined 
transverse energy threshold. 

7.1 Input data preparation 

There are three calorimeter data preparation algorithms which, respectively, unpack the calorimeter cell 
information, build trigger towers and construct calorimeter clusters. The implementation of these data 
preparation algorithms in the EF has been adapted to allow the running of a subset of the algorithms 
thereby reducing computing time. 

The first data preparation algorithm unpacks calorimeter cell information in a predefined window 
around the position of the jet found at L2. The size of the window where data needs to be unpacked is a 
parameter that can be adjusted and is configured at trigger initialisation. The energy of each calorimeter 
cell, at this point, is set at the EM scale [2,3]. Although not currently used, this unpacking algorithm 
provides the ability to apply calibration weights to the energy of individual calorimeter cells. 

The second data preparation algorithm performs the calorimeter tower reconstruction. A calorimeter 
tower is an array of calorimeter cells within an (At],A(^) = (0.1,0.1) region. The EF calorimeter trigger 
towers granularity is four times finer than that of the LI. Calorimeter trigger towers are the inputs to the 
jet finding algorithm described in the next Section. 

A third data preparation algorithm, currently being studied, is also available in the EF. This algorithm 
constructs three-dimensional calorimeter clusters instead of towers. These clusters can alternatively be 
used as input to the jet finding algorithm instead of calorimeter towers. 
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Parameter 


Value 


Window size (t] x <j>) 
Input objects 
Proto-jet Ej cut 
Jet finding algorithm 
Jet finding parameter 
Calibration scheme 
Final jet Ej cut 


1.6 x 1.6 
towers 
E T > 2 GeV 

cone 
R cone = 0.7 
"Energy density"-based cell calibration 
E T > 10 GeV 



Table 8: Parameter values for the EF jet reconstruction. 



7.2 Jet finding algorithm 

The EF jet reconstruction uses the offline reconstruction algorithms [10] adapted for the EF. It takes as 
input any E T ordered list of calorimeter objects (cells, towers or clusters). The jet reconstruction consists 
of the following steps: 

• Removal of negative energy towers (or clusters) by combining them with adjacent ones. A list of 
proto-jets is also constructed using a simple pre-clustering algorithm. 

• Removal of proto-jets with transverse energy smaller than a given threshold. 

• Running of a jet finding algorithm (cone or fast Kt algorithm [11]). 

• Jet calibration 

• Removal of reconstructed jets below a given transverse energy 

Many different parameters can be modified as part of the jet reconstruction. The parameters used 
for the configuration of the EF jet reconstruction are summarized in Table 8. Unless specified explicitly, 
performance studies described subsequently use this configuration. 

All calibration methods available for the offline jet reconstruction can be used by the EF jet algorithm. 
The default method used in the EF is an "energy density"-based cell calibration described in details 
in [12]. 

8 Event filter performance 

The performance and parameter optimisation of the EF jet reconstruction algorithms were made using 
the simulated dijet event samples summarised in Table 1. 

The window size defining how much data is unpacked was chosen by studying the position differ- 
ence between offline reconstructed jets and jets found by the L2 system. A distribution of the distance 
AR = \ZJr\L2 - ^offline) 2 + (0L2 - ^offline) 2 is shown in Fig. 19. Most L2 jet Rol are well within AR < 0.2 
of offline reconstructed jets using a cone algorithm with AR = 0.7. A window size of 1.6 x 1.6 in r\ x 0 
was therefore chosen to ensure that data is unpacked at the EF in a calorimeter region of sufficient size to 
reconstruct correctly a jet similar to an offline jet while ensuring complete overlap with the LI Rol. Ad- 
ditional studies could include the optimization of a variable window size as function of the jet transverse 
energy as measured by L2. 

The EF jet energy scale and resolution were assessed using the same technique as described in Sec- 
tion 6.1. The results are shown in Fig. 20 as a function of the truth jet E T . In general, the energy scale 
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Figure 19: Distance AR = \f (r)^ — ^offline) 2 + (0L2 — ^offline) 2 between L2 and offline reconstructed jets 
using a cone algorithm of R cone = 0.7. 



improves with increasing truth jet Ej. For truth jet energies larger than 200 GeV, the EF jet energy scale 
is within 2% of unity. The limited window size within which the jet reconstruction takes place in the EF 
results in some energy leakage that is not corrected for by the offline calibration used in the EF. These 
results nevertheless suggest that offline calibration are adequate for use in the EF jet reconstruction. 

jr 1 '°T (a) 

0.95- 

ATLAS 

0 - S ' 2 3 " L' : 

° Truth jet [GeV] 10 Truth jet ^ [GeV] 

Figure 20: The EF jet transverse energy scale (a) and resolution (b) as function of truth jet Ej. 



(b) 




ATLAS 



The effect of pile-up corresponding to a luminosity of 10 33 s _1 cm -2 on the jet energy scale and 
resolution was also studied. Figure 21 shows a comparison of (a) the EF jet transverse energy scale and 
(b) the EF resolution for simulated dijet event samples with and without pile-up. The effect of pile-up 
on the overall energy scale is observed to be insignificant, however, a noticeably worse resolution is 
observed for events with pile-up. The effect is clearly more important for low jet energies where the 
contribution of pile-up energy can be comparable to the total jet energy. 

The EF jet trigger efficiency as function of offline jet energy is shown in Fig. 22 for two different 
signatures. For an EF trigger threshold close to the LI threshold, the trigger selection performance is 
limited by the LI energy resolution. For EF trigger thresholds significantly higher than the LI threshold, 
the excellent EF energy scale improves slightly the performance of the trigger selection as compared to 
the current L2 capability. These plots emphasize the importance of correctly optimizing the jet trigger 
thresholds used for each jet signature. 



718 



Trigger - Overview and Performance Studies of Jet Identification in the . 



■ no pile-up 
* pile-up 



h ( b ) 






■ no pile-up 






* pile-up 




f ATLAS 





Figure 21: The EF jet transverse (a) energy scale (b) and resolution as a function of truth jet Ej for 
simulated dijet event samples with and without pile-up. 
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Figure 22: EF trigger efficiency as function of offline jet transverse energy for two different signatures 
consisting of a set of LI, L2 and EF trigger thresholds: (a) E^ 1 > 10 GeV, E\ 2 > 30 GeV, £f F > 50 GeV; 
and (b) E\ l > 70 GeV, E^ 1 > 150 GeV, E^ F > 255 GeV 



8.1 Event filter jet algorithm timing 

The timing performance of the EF jet reconstruction algorithm has been measured in the November 2007 
Technical run described in Section 6.4. Figure 23(a) shows the execution time per Rol for the two EF 
data preparation steps: the unpacking of calorimeter cell data and the construction of calorimeter towers 
used by the jet algorithm. Figure 23(b) shows the execution time of the different steps involved in the jet 
reconstruction described in Section 7.2. The EF jet reconstruction total time per Rol is of order 100 ms. 
Assuming approximately 4 to 5 jet Rol per event implies a total processing time of order half a second 
per event which falls within the design budget of approximately one second per event. 



9 Jet trigger menu 

Jet triggers will be used to record many different types of events. Single and multi-jet signatures will 
identify useful events for various Standard Model QCD measurements. This event sample will also be 
valuable to study the properties of background events for other analyses and to measure the misidentifica- 
tion efficiency of different offline reconstruction algorithms. Jet trigger requirements are also important 
when combined with other trigger criteria to identify rare signal events with well-defined topologies. 
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Trigger threshold 


J5 


no 


J18 


J23 


J35 


J42 


J70 


J 120 


3J10 | 3J18 


4J10 


4J18 


4.123 


Prescale factor 


300000 


42000 


6000 


2000 


500 


100 


15 


1 


150 1 


30 


1 


1 


Level 1 rate (Hz) 


1 


4 


1 


1 


1 


4 


4 


8 


40 140 


40 


20 


8 



Table 9: Summary of the LI single-jet and multi-jet triggers optimised for the initial data taking period. 
The LI prescale factors and expected rate at a luminosity of 10 31 cm _2 s _1 for each threshold are also 
presented. 



The trigger strategy adopted is to define a set of single and multi-jet signatures that, together, will 
select events with approximately uniform rates over the entire jet energy spectrum. Table 9 shows the 
set of LI single and multi-jet signatures and associated LI prescale factors optimised for this purpose, 
assuming an initial luminosity of 10 31 cm~ 2 s _1 . The trigger rates were calculated using close to 7 mil- 
lion simulated non-diffractive inelastic events, with an estimated cross-section of 70 mb. These large 
simulated event samples result in a statistical uncertainty of approximately 0. 1 Hz for the rates of un- 
prescaled triggers. Note that a maximum of 8 different jet thresholds can be defined in the LI system. 
The LI thresholds and prescale factors for the single-jet signatures were chosen to significantly limit the 
LI output event rate thereby initially avoiding the need to use the HLT at the beginning of data-taking. 
The LI jet trigger menu was also designed to be compatible with a luminosity of up to 10 32 cm _2 s _1 
without any changes. Figure 24 shows the differential distribution for the number of events selected by 
the LI trigger menu as function of offline reconstructed jet transverse energy for 1 fb _1 of recorded data. 

For commissioning purposes, it is foreseen that the L2 and EF jet algorithms will be initially run for 
the single-jet signatures in so-called pass-through mode where the result of the HLT selection is recorded 
with the event data but no events are rejected based on the result of the algorithm selection. The rate of 
the multi-jet signatures listed in Table 9 will be reduced by using L2 and EF jet requirements. Table 10 
summaries the expected rate out of the HLT for all jet signatures foreseen to be run at the beginning of 
data-taking. In addition to the single and multi-jet signatures discussed above, forward jets signatures 
are also foreseen. These are also shown in Table 10. 

It should be noted that, due to the very large prescales used for the lower threshold jet triggers, the 
overlap between individual triggers is greatly reduced. This implies that the cumulative rates will rapidly 
grow, as can be observed in Table 10. The total rate of the entire jet menu has been approximated to be 
37 Hz, which represents a little over 18% of the overall trigger output rate budget of 200 Hz. 
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Figure 24: Differential number of events selected as function of the offline reconstructed transverse 
energy of the leading jet in the event for 1 fb _1 of data. The dashed and solid lines show the expected 
distributions before and after applying the LI jet trigger menu criteria described in Table 9. 



Trigger 


Overall 
Prescale 


Rate (Hz) 


Cumulative 
Rate (Hz) 


4j23 
4jl8 
4jl0 
3jl8 
3jl0 


1 

100 
300 
100 
1500 


6.9 (± 0.8) 
0.14 (±0.01) 

0.045 (± 0.004) 
0.92 (± 0.03) 

0.061 (± 0.002) 


6.9 (± 0.8) 
7.0 (± 0.6) 
7.0 (± 0.6) 
7.9 (± 0.3) 
7.9 (± 0.2) 


Total Multi-Jets 




7.9 (± 0.2) 




jl20 

jvo 

J42 
j35 
J23 
J18 

jio 

J5 


1 

15 

100 

500 

2000 

6000 

42000 

300000 


8.7 (± 0.9) 
4.2 (± 0.2) 
3.73 (± 0.06) 
1.37 (± 0.02) 
1.37 (± 0.008) 
1.02 (± 0.004) 
3.9 (± 0.003) 
0.9470 (± 0.0004) 


15.3 (± 0.5) 

18.7 (± 0.5) 
22.3 (± 0.3) 
23.6 (± 0.3) 
24.9 (± 0.2) 
26.0 (±0.1) 
29.9 (± 0.02) 

30.8 (± 0.01) 


Total Single- Jets 




24.40 (± 0.01) 




2f]70 
2f]35 
2fjl8 


1 
1 

100 


0 

1.7 (±0.4) 
0.94 (± 0.03) 


30.8 (± 0.01) 
32.5 (± 0.01) 
33.4 (± 0.01) 


Total Multi-Fjets 




2.65 (± 0.09) 




f]120 
f]70 
fj35 
fjl8 


1 

20 
700 
7000 


0.9 (± 0.3) 
1.16 (±0.08) 
0.68 (± 0.01) 
1.04 (± 0.004) 


34.1 (± 0.01) 

35.2 (± 0.01) 

35.8 (± 0.01) 

36.9 (± 0.02) 


Total Single-Fjets 




3.74 (± 0.01) 





Table 10: Estimated trigger rates for jet signatures foreseen at the beginning of data-taking with a lumi- 
nosity of 10 31 cm _2 s _1 . 
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10 Summary 

A detailed description of the algorithms used to reconstruct jets in the LI, L2 and Event Filter was 
presented. The performance of each of these algorithms was shown. The physics performance and 
algorithm timing has been shown to be within the design targets. Nevertheless, optimisation studies 
to further improve the performance of these algorithms are ongoing. An example trigger menu for jet 
signatures proposed for the beginning of data-taking was presented. The overall trigger strategy for 
selecting events based on jet signatures was also described. 
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Abstract 

This note describes the methods developed for the measurement of the prop- 
erties of minimum bias events during low luminosity running using the 
ATLAS detector. These methods aim to reconstruct the inclusive pseudo- 
rapidity and transverse momentum distributions of charged particles with 
Pt > 150 MeV produced from 14 TeV pp collisions. The triggers used 
to record minimum bias events are described and their acceptances evalu- 
ated. An analysis to measure the inclusive distributions is presented and the 
systematic uncertainties discussed. Finally, there is an overview of future 
physics studies with minimum bias samples. 

1 Introduction 

This note focuses on how the ATLAS detector [1] can be used to measure the central pseudorapidity 
and transverse momentum distributions of charged particles produced in inelastic proton-proton (pp) 
collisions during early running at the LHC. Measuring the characteristics of these collisions allows 
an understanding of the physics behind these processes to be developed, particularly their energy de- 
pendence. The minimum bias events allow the soft-part of the underlying event in high-p^ collisions 
to be characterised. Studies of inclusive particle distributions in minimum bias events in pp collisions 
are important to provide the baseline for measurements in heavy-ion collisions, such as allowing dif- 
ferences in the number of particles to be attributed to QCD effects rather than the simple scaling of 
the number of nucleons. Finally these interactions will be a major background during low luminosity 
running (10 33 cm _2 s _1 ) and high luminosity running (10 34 cm _2 s _1 ), where the average number of 
such interactions per beam crossing is ~ 2 and ~ 18, respectively. 

The total proton-proton cross-section can be divided into elastic and inelastic components, and 
the inelastic component can be further divided into: non-diffractive, single diffractive and double 
diffractive components [2]. The total cross-section (o tot ) can then be written as: 

Gtot = &elas + ®sd + Gdd + Grid 

where these cross-sections are elastic (o e i as ), single diffractive (o^), double diffractive (Odd) and 
non-diffractive (a„j), respectively. The cross-sections for the inelastic subprocesses determined using 
PYTHIA [3], which was used to generate the event samples for this study, are given in Table 1. For 
comparison, the cross-sections predicted by PHOJET [4] are also shown in Table 1. The PHOJET 
predictions include central diffraction, however this hard proton-pomeron interaction only contributes 
to the cross-section at the few per cent level. As this is not simulated in PYTHIA, it was not considered 
further in this note. 

The acceptance of inelastic events is defined by the trigger, which is usually known as a minimum 
bias trigger. It is designed to avoid bias in the sample, such as selecting high-p r events by triggering 
on high-^y objects. However, some bias is usually introduced due to effects such as the geometrical 
acceptance of or minimum energy thresholds in the trigger detector. It is therefore not unusual to 
find different definitions for minimum bias events in the literature. Historically, the minimum bias 
triggering used in hadron collider experiments [5-11] often used triggers based on forward-backward 
coincidences that favoured the detection of non-single diffractive inelastic events (NSD), i.e. o ns d = 
Otot — Oelas — Csd- Thus, NSD events have often been classified as 'minimum bias events'. In this 
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note, results from simulated events selected using the minimum bias triggers are presented. These 
results have been corrected for detector reconstruction and acceptance effects. The results for the non- 
single diffractive sample are also presented to allow comparison with previous results from hadron- 
collider experiments. However, this requires correcting for the trigger acceptance for each of inelastic 
processes, which depends on the physics model used to generate the different processes. 

Minimum bias interactions have previously been studied at a range of different energies at the 
CERN ISR [5], CERN SppS [6-8] and Fermilab's Tevatron [9-11] and RHIC colliders. Based on 
these results, Monte Carlo models have been tuned to generate predictions for LHC multiplicities [12]. 
Figure 1 shows a comparison of model predictions for the central charged particle density in NSD pp 
events for a wide range of centre-of-mass energies (a/?). The data points shown are corrected for 
detector and trigger effects and efficiencies back to the particle level. Figure 1 compares predictions 
generated with two different tunings of PYTHIA [3] (ATLAS and CDF tune- A [12]) with the central 
charged particle density generated with PHOJET [4, 13]. It is clear from this figure that there is a large 
uncertainty in the predicted central particle density of non-single diffractive interactions at the LHC 
energy even though the models have been tuned to agree with data at lower energies. This uncertainty 
arises because the energy dependence in a variety of different models for low-p^ hadronic processes 
is not well understood. Measuring the central particle density at the LHC will thus be crucial to 
determining the energy dependence of the central particle density and constraining models of inelastic 
events. 




Figure 1 : Central charged particle density for non-single diffractive inelastic events as a func- 
tion of energy. The lines show predictions from PYTHIA using the ATLAS tune and CDF 
tune-A, and from PHOJET. The data points are from UA5 and CDF pp data. 



This paper is organized as follows: In section 2 we briefly describe the experimental setup and 
the trigger strategies that have been developed to accept inelastic collisions with minimum bias. The 
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analysis procedure, corrections, as well as the systematic effects involved in this study are discussed 
in section 3. Section 4 presents an overview of future work planned to be done with ATLAS minimum 
bias measurements. Finally, in section 5 we present our conclusions. 





Cross-section (mb) 


Process 


PHOJET 


PYTHIA 


non-diff. 


69 


55 


single diff. 


11 


14 


double diff. 


4 


10 


central diff. 


1 




total inelastic 


85 


79 


elastic 


35 


23 


total 


120 


102 



Table 1: Cross-section predictions for pp interactions at y/s = 14 TeV from PYTHIA and PHOJET. 



1.1 Characteristics of inelastic events 

The pseudorapidity (17) and transverse momentum (pr) distributions of charged particles generated 
using PYTHIA [3], with the parameter set defined in Ref. [14] and PHOJET [4, 13] with default 
parameters are shown in Figs. 2(a) and 2(b). These distributions correspond to non-diffractive, single- 
and double-diffractive inelastic pp interactions at y/s = 14 TeV, respectively. They clearly show large 
uncertainties in model predictions for the LHC and that the events are dominated by low-pr particles 
with the highest densities found in the central region \ t] \ < 3.0. Much of this central region is covered 
by the ATLAS inner detector. The charged particle distributions will be reconstructed from tracks 
which are measured by the inner detector for \ r\ \ < 2.5, with p T greater than 150 MeV. 

1.2 Backgrounds 

The main backgrounds in minimum bias events, particularly during early running, will be beam-gas 
collisions within the beampipe over the length of ATLAS, and beam-halo from interactions in the 
tertiary collimators in the accelerator. These backgrounds can provide spurious triggers that must be 
removed from the inelastic event sample to prevent distortion of its characteristics. During early low 
luminosity running a large fraction of bunch crossings will have no pp interaction. Using a trigger 
based only on bunch-crossings would result in a large number of empty events, which only contain 
detector noise, being recorded. Therefore, the trigger must be able to reject such events in order 
to optimise the use of the trigger bandwidth. In this paper we will present results which include 
discussions on beam-gas events and empty events but not on beam-halo. 

2 Simulation and experimental setup 

The samples of single-, double-, and non-diffractive inelastic events used in this study were gener- 
ated with PYTHIA version 6.403 [3]. The PYTHIA event generator was configured according to an 
underlying event tuning fit [14] from previous experiments and is expected to provide a reasonable 
description of the non-diffractive part of minimum bias events. 
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Figure 2: Pseudorapidity (a) and transverse momentum distribution (b) of stable charged particles 
from simulated 14 TeV pp inelastic collisions generated using PYTHIA and PHOJET event generators. 
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2.1 The ATLAS inner detector 

The pseudorapidity and transverse momentum distributions of charged particles are measured using 
the ATLAS inner detector. The inner detector is described in detail elsewhere [1]. It consists of, in 
order of increasing radius, a silicon pixel system, a silicon microstrip system (SCT) and a gas-based 
transition radiation detector (TRT). The inner detector is mounted inside a solenoid magnet which 
provides a 2 T magnetic field. 

The inner detector sub-detectors are designed as independent but also complementary systems. 
The pixels cover radii of 50.5 mm-149.6 mm, the SCT covers 299 mm-560 mm and the TRT covers 
563 mm-1066 mm. The precision tracking detectors (pixel and microstrip) cover |tj| < 2.5 and are 
divided into barrel (|T7 1 < 1.4) and endcaps (1.4 < |t] | < 2.5). 

The ATLAS inner detector will provide hermetic and robust pattern recognition, excellent mo- 
mentum resolution and both primary and secondary vertex measurements for charged tracks above a 
Pt threshold, which is nominally 500 MeV but can be as low as 100 MeV within \r} \ < 2.5. The 
nominal pr-cut of 500 MeV corresponds to tracks traversing the full inner detector, however as dis- 
cussed in section 1.1a measurement of the properties of minimum bias events requires a py-cut of < 
200 MeV. A p T -cut of 150 MeV, used in this paper, corresponds to tracks that traverse the precision 
Si tracker (pixels and SCT) allowing low-pr tracks to be well reconstructed. 

2.2 The minimum bias trigger scintillators 

The Minimum Bias Trigger Scintillators (MBTS) [15] are mounted on the inner surface of the liq- 
uid argon endcap cryostats and cover a pseudorapidity range of 2.12 < |r/| < 3.85. The MBTS is 
constructed from 2 cm polystyrene-based scintillator counters. Each side of the MBTS is made up 
of 16 counters each of which is split into two regions of equal pseudorapidity (2.12 < \r]\ < 2.83, 
2.83 < |tj| < 3.85) and covering § in azimuth. The MBTS is read out through the tile calorimeter 
electronics providing a fast LI signal, which is discriminated above a voltage threshold, relative to the 
bunch-crossing signal. 

3 Minimum bias trigger scenarios 

A minimum bias trigger should select inelastic collisions with as little bias as possible, precluding the 
use of the standard high-/?7 triggers. Ideally, the LI random trigger [16] with beam pickup would 
be used to accept events with zero bias, and inelastic collisions would be selected offline. However, 
during early running when the luminosity is expected to be < 10 30 cm _2 s _1 , the random trigger will 
be very inefficient since the probability of an interaction during a bunch crossing is < 1%. Therefore, 
validation of the LI random trigger is required in the high-level-trigger (HLT). The use of tracking in 
the HLT to validate the random trigger and a dedicated LI trigger based on the minimum bias trigger 
scintillators (MBTS) has been studied. A minimum bias trigger stream consisting of the random 
trigger, track trigger and the MBTS is shown in Figure. 3. The random-based track trigger and the 
MBTS triggers are discussed in sections 3.1 and 3.2, respectively. In addition, the LI MBTS could be 
used in conjunction with the pixel and SCT spacepoints reconstructed at L2 and the Event Filter (EF) 
track trigger. However, this was not studied in this note. 

As the luminosity increases from ~ 10 31 cm _2 s _1 to 10 32 cm _2 s _1 and higher values, the mean 
number of interactions per bunch crossing will be around 1. The random trigger will then record 
interactions for each crossing without requiring further triggers at L2 or EF to reject empty bunch 
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events. This will allow a zero bias sample to be efficiently accepted by ATLAS using the random 
trigger. 

For the purposes of this note, a luminosity of 10 31 cm _2 s _1 with a bunch spacing of 75 ns has 
been assumed. The rate of inelastic events under those conditions is 792 kHz, and the mean number 
of events per bunch crossing is 0.06. Trigger efficiencies were calculated using around 10 5 events of 



Space Point 



Figure 3: Minimum bias trigger slice. 



simulated hydrogen beam-gas, single diffractive, double diffractive, non-diffractive and empty events. 
Each of these simulated samples were reconstructed and passed through the trigger logic and the 
trigger efficiency was then calculated from the number of events satisfying the trigger logic. 

3.1 Random-based track trigger 

The selection of minimum bias events by a track trigger is performed by the high-level trigger. The 
aim is to reject empty bunch and beam-gas events. After the random event selection at LI, the L2 
trigger rejects empty events by requiring a minimum number of spacepoints in the pixel and SCT 
detector. The trigger efficiency curves for different types of events (ddiff: double diffractive, ndiff: 
non-diffractive and sdiff: single diffractive) are shown in Figure 4. The efficiency of empty bunch 
events as a function of spacepoints clearly shows that a modest constraint on either the number of 
pixel or SCT spacepoints rejects events containing only random noise. The thresholds for the number 
of pixel and SCT spacepoints were determined from the requirement to have a signal to background 
for non-diffractive to empty bunch events of 100:1. Given that the beam conditions assumed in this 
paper correspond to a probability of a pp non-diffractive inelastic interaction of 0.05, this corresponds 
to constraining the efficiency of empty bunch events to be less than 5 x 10~ 4 . This requires setting the 
threshold on the number of spacepoints to be 12 and 3 in the pixel and SCT respectively. 

While the spacepoint (SP) trigger reduces the number of beam-gas events accepted, the rate can 
be further reduced by requiring the presence of reconstructed tracks within a small z-region around 
the nominal interaction point. A full track reconstruction scan with a minimum pj of 200 MeV is 
carried out in the EF on the events that pass the SP trigger. These tracks are then used to select events 
using the cuts defined in Table 2. The trigger efficiency is plotted with respect to the number of 
reconstructed tracks in Figure. 5. The empty events are rejected by the cut on the number of pixel and 
SCT spacepoints resulting in an efficiency of zero. 
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Number of tracks 
Track zO 


>2 

< 200 mm 



Table 2: Track cuts for EF track trigger. 



Figure 5: Trigger efficiency as a function of the number of reconstructed tracks for events satisfying 
the L2 spacepoint requirement. 
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3.2 Minimum bias triggers using the MBTS 

Two simple MBTS LI trigger strategies were considered for minimum bias event selection: MBTS_1 _1 
and MBTS_2. MBTS 11 is defined as at least one MBTS counter above threshold on each side, where 
the threshold of 40 mV was chosen from measurements of cosmic commissioning data. MBTS_2 is 
defined as two or more MBTS counters above threshold anywhere in the MBTS system. The trigger 
efficiencies as a function of MBTS counter threshold for inelastic non-diffractive, double-diffractive, 
single-diffractive, beam-gas and empty bunch events are given in Fig. 6. 



MBTS Lvl 1 Threshold (mV) MBTS Lvl 1 Threshold (mV) 

(a) (b) 

Figure 6: MBTS LI trigger threshold scans for the two trigger configurations: (a) MBTS_1_1 (b) and 
MBTS_2. 



3.3 Trigger efficiency 

A summary of the trigger efficiencies for the track-based and MBTS triggers are given in Table 3. The 
efficiency of diffractive events is around half of that of non-diffractive events. This is primarily due to 
the low particle multiplicity in diffractive events and because the triggers cover the central region and 
are therefore only sensitive to a fraction of the diffractive cross-section corresponding to high mass 
diffractive states. 

The acceptances for the different processes i.e. the efficiencies weighted by the fraction of the 
inelastic cross-section are given in table 4. This shows that due to lower efficiencies and smaller 
cross-sections the acceptance of diffractive events is significantly suppressed. The acceptance of 
the proposed minimum bias triggers is around 85-92% and this will consist of around 80% of non- 
diffractive events and roughly equal numbers of single and double diffractive events. 



4 Analysis 

The goal of the analysis is to reconstruct the number of primary charged particles per unit of pseudora- 
pidity and per unit of transverse momentum for inelastic pp interactions taken with the minimum bias 
trigger. Primary particles are defined as particles produced in the pp collision, but excluding secondary 
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MBTS_1_1 


MBTS_2 


SP 


SP & EF Tracks 


Non-diffractive 


99% 


100% 


100% 


100% 


Double-diffractive 


54% 


83% 


66% 


65% 


Single-diffractive 


45% 


69% 


57% 


57% 


Beam-gas 


40% 


54% 


47% 


40% 



Table 3: MBTS and random-track trigger efficiencies. 





MBTS_1_1 


MBTS^ 


SP 


SP & EF Tracks 


Non-diffractive 


69% 


70% 


70% 


70% 


Double-diffractive 


7% 


10% 


8% 


8% 


Single-diffractive 


8% 


12% 


10% 


10% 



Table 4: MBTS and random-track trigger acceptances. 



particles from weak decays of strange hadrons or from electromagnetic or hadronic interactions in the 
detector material. 

A sample of ~150,000 inelastic events, with ND, SD and DD events mixed in the proportion given 
by the PYTHIA cross-sections, was reconstructed and used in this study (MB sample). A typical non- 
diffractive event contains around 45 reconstructed tracks of which 37 are from primary particles and 8 
are from secondary particles. The efficiencies and corrections were derived from half the sample and 
used in the analysis of the other half of the sample. 

4.1 Event selection 

Two selection criteria are applied to the set of reconstructed events: the event must be triggered by the 
minimum bias trigger and it must contain at least one reconstructed vertex. The sample of inelastic 
events selected by the MBTS_2 trigger, described in Section 3.2, was used for this analysis. No beam- 
gas or pileup events were included in the sample studied here. An additional criterion should be 
added to exclude events with multiple pp interactions in a bunch-crossing but this was not included 
in this analysis as the samples of simulated data used for this study contained events with a single pp 
interaction in a bunch-crossing. 

4.2 Track selection 

Tracks were reconstructed using tracking with the minimum pj set to 100 MeV rather than the default 
500 MeV. To avoid threshold effects, tracks with pj > 150 MeV were used in the analysis. The relative 
pj -resolution, o(—)pt, was found to be around 1.5% for \r}\ < 0.8 and 4.3% for |tj| > 1.6. 

Cuts on the measured track parameters were used to select a set of well reconstructed tracks for 
the analysis. These cuts are listed in Table 5. The requirement of a hit in the inner pixel layer,the 
b-layer, removes a sizable portion of fake tracks and tracks associated to secondary particles since 
they frequently do not leave a hit in the first layer of the pixel detector. The cut on the number of pixel 
and SCT hits is a standard track scoring cut applied during reconstruction and is listed here only for 
completeness. The values of the cuts were chosen by fitting each of the distributions with a gaussian 
and cutting at ~ 3a. 
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Quality cuts 


No. of b-layer hits > 1 
No. of Silicon hits > 5 


Resolution cuts 


\a do \ < 1.6 mm 
\a ZQ \ < 6.0 mm 
a 0 | <0.03 
\o e \ < 0.015 

\a q/pT \< 0.0003 (GeV)" 1 


Track-to-vertex cut 


No <3 



Table 5: Track selection cuts used in this analysis. The resolutions <rJ o , <7^, o^, Oq and o^/^ 
are the five diagonal elements in the track parameter covariance matrix. For more information 
on the tracking Event Data Model (EDM) see [17]. 



The track-to-vertex cut is the selection that most effectively cuts away tracks from secondary 
particles while accepting tracks from primaries. It is made by cutting on the distance of closest 
approach to the nearest reconstructed vertex, normalized by the error. The normalized distance to the 
vertex is defined as 




(1) 



where Ada/Od 0 and Azo/o ZQ are the normalized distances in the transverse and longitudinal directions, 
respectively. The number of o to the vertex as a function of AR (for a two-dimensional gaussian) is 
given by 

N a = v^err^l-e-^ 2 / 2 ). (2) 

Note that the above formula assumes no correlations between the resolutions in the transverse and 
longitudinal direction. 

To determine the effects of the cuts and evaluate efficiencies and acceptances, the reconstructed 
tracks are matched to the generated primary and secondary particles. The track that has the highest 
percentage of hits that overlap with the trajectory of a generated particle is matched to that generated 
particle. A track is well matched to a particle if greater than 50% of its hits overlap with the generated 
particle's trajectory. A primary track is one matched to a primary particle and similarly a secondary 
track is one matched to a secondary particle. If a track is not matched to any generated particle, it is 
classified as a fake. 

Table 6 shows the influence of the track cuts on the reconstructed sample. Each row shows the 
percentage of tracks that did not pass each of the specified cuts. Note that a single track can fail several 
cuts and therefore give counts in several rows. Three of the rows show the percentage of tracks that fail 
any of the quality cuts ('Quality'), any of the resolution cuts ('Resolution') and any of the quality or 
resolution cuts ('Q 1 1 R'). The strongest and most effective cut is the track-to-vertex cut, which rejects 
about 88% of the secondaries. The second last row shows the percentage of reconstructed tracks that 
are outside the pseudorapidity and pj range used in the analysis (|t]| > 2.5, pj < 150 MeV). 
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Cut 


% Cut All 


% Cut Primary tracks 


% Cut Secondary tracks 


b-layer hit 


15.9 


8.5 


46.8 


COVrff) 


11.5 


6.0 


34.2 


cov z0 


9.4 


5.0 


27.4 


COV0 


8.9 


5.1 


24.3 


cov e 


4.9 


4.2 


8.2 


COV q/ P T 


6.4 


4.3 


14.9 


Quality 


15.9 


8.5 


46.8 


Resolution 


16.7 


10.9 


40.4 


Q II R 


24.6 


15.6 


62.1 


Track-to-Vtx 


30.7 


16.9 


87.8 


n II pt 


1.2 


1.3 


0.9 


Total 


38.6 


24.6 


96.5 



Table 6: Fraction of tracks cut away by the selection cuts. 



4.3 Procedure & corrections 

The dN c h/dri and dN c h/dpj distributions are obtained by starting with the measured number of se- 
lected tracks in selected events and applying the following three corrections: 

• Track-to-particle correction 

• Vertex reconstruction correction 

• Trigger bias correction 

The first correction accounts for the difference between the number of measured tracks and the 
number of primary charged particles. This is essentially a correction for the inner detector accep- 
tance and efficiency of the tracking software [18]. The second correction takes into account the vertex 
reconstruction efficiency and corrects for events that have no reconstructed vertex. These two correc- 
tions account for detector dependent effects and produce the minimum bias sample, in which the data 
have been corrected for detector reconstruction and acceptance but have not been corrected for the 
trigger acceptance. 

The trigger bias correction depends both on the detector simulation and on the physics model 
used in the event generator to simulate inelastic pp interactions. Different corrections for the trigger 
can be applied to correct the measured distributions back to different physics processes, i.e. inelastic, 
non-single-diffractive or non-diffractive interactions. 

The track-to-particle correction is applied only at the track level. The vertex correction and the 
trigger bias correction are applied at the track and event level. The track-level vertex and trigger bias 
corrections are needed to compensate for any bias on the measured track distribution due to vertex 
and trigger requirements. All track-level corrections are determined as a function of pseudorapidity 
(tj), p T and the z-position of the collision vertex, v z . The event level corrections are determined as a 
function of v z and the number of reconstructed tracks in the event. 

The corrections are 3-D and 2-D histograms but the principal corrections shown in the figures 
below are plotted as projections of pj tj z-position or the number of reconstructed tracks as appropriate 
for clarity. 



734 



Standard Model - A Study of Minimum Bias Events 



4.3.1 Track-to-particle correction 

The number of selected reconstructed tracks differs slightly from the number of primary charged 
particles due to a number of different effects: the acceptance of the detector, the detector and track 
reconstruction efficiency, the contribution from secondaries and fakes, and the acceptance of the track 
selection cuts. The track-to-particle correction takes all these effects into account and is calculated as 
No. of primary charged particles 
No. of selected reconstructed tracks ' 
The numerator and denominator are calculated for the same events taken through the full detector 
simulation and reconstruction. 

Figure 7 shows the projection of the 3-dimensional track-to-particle correction onto the 77 and 
Pt axes. The correction is significant and changes rapidly with pj below 200 MeV where the recon- 
struction efficiency is lower due to a lower number of hits on a track and the effects of material and 
multiple scattering. Above 200 MeV the correction is small and has a small dependence on pr- 



Ctrkin ,v z , pt) 



(3) 



P T [GeV] 



Figure 7: The track-to-particle correction C tr u as a function of (a) 77 and (b) pj. 



4.3.2 Vertex reconstruction correction 

The vertex correction takes into account the bias introduced by events that are not counted because 
their vertex was not found by the vertex reconstruction algorithm. The event-level correction is calcu- 
lated as a function of v z and number of reconstructed tracks in the event: 

£ t v N \ = No. of triggered events ^ 

vtx z ' No. of triggered events with > 1 reconstructed vertex 

The track-level vertex correction is calculated as a function of 77, v z and pr' 
^ _ No. of tracks in all triggered events 

vtx i v ziPt ^ o q £ tmc ^ s m triggered events with > 1 reconstructed vertex' 



735 



Standard Model - A Study of Minimum Bias Events 



Although the vertex correction is essentially a detector correction, some model dependence is 
present since only triggered collisions are taken into account to compute the correction factors. How- 
ever, the use of only triggered events has the advantage that the correction can be computed directly 
from the data, without relying on any simulation. Once data are available it will also be possible 
to compare the properties of the triggered events with no reconstructed vertex to the corresponding 
events in the simulation. This may help minimize and better estimate any systematic uncertainties. 

Figure 8 shows the event-level vertex reconstruction correction as a function of the number of re- 
constructed tracks, N, and vertex z-position. A vertex can be defined using a single well reconstructed 
track. As the number of tracks increases, the correction approaches unity. For events with N > 10, a 
vertex is always found and the correction is no longer required. 

Figure 9 shows projections of the 3-dimensional track-level vertex correction onto the v z and 
Pt axes. These corrections are found to be at the level of a few percent. 




ATLAS 
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v z [mm] N 

(a) (b) 

Figure 8: The event-level vertex correction as a function of (a) v z and (b) the number of recon- 
structed tracks (N). 



4.3.3 Trigger bias correction 

The trigger bias correction accounts for the difference between the MB sample selected by the mini- 
mum bias trigger and the physics process of interest. This correction depends on the trigger detector 
simulation and on the models of the pp interactions used in the event generator. Therefore, this cor- 
rection is dependent on the relative cross-sections of the inelastic processes used and on the modelling 
of particle production for the different inelastic processes. 

After an initial model-independent measurement is made, the MB sample, the trigger bias correc- 
tion can be used to obtain distributions for other samples of interest: non-diffractive (ND), non-single 
diffractive (NSD) or inelastic (INEL) collisions. Each of these samples corresponds to a different 
trigger correction and each correction can be applied to yield the final distributions for the different 
collision samples. In general the correction from MB to NSD collisions is the smallest since these 
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Figure 9: The track-level vertex correction as a function of (a) v z and (b) pj. 



two event samples are almost identical, while the correction from MB to INEL is on the order of 1.2 
(integrated over N and v z ). 

The trigger bias correction is determined on event level as a function of v z and multiplicity, defined 
as in the vertex correction: 



C tHg (v z ,N) = 
and on track-level as a function of 77 , 

Origin, v z ,p T ) = 



No. of interactions in sample of interest 



No. of triggered events 
and pj: 

No. of tracks in sample of interest 



(6) 



(7) 



No. of tracks in triggered events 
In this analysis it is assumed that there is only one interaction per event since the events have been 
simulated in this way. This will need to be verified in the real data for any events with more than one 
reconstructed vertex by looking at the multiplicity of each of the vertices and the distance between 
the vertices. As in the vertex correction, all tracks are counted since the track-to-vertex distance is 
undefined for events with no reconstructed vertex. 

Figure 10 shows the event-level trigger bias correction for the NSD sample as a function of multi- 
plicity and vertex z-position. Using the MBTS_2 trigger, the correction is only important for N < 25; 
for higher multiplicities the correction factor is not needed. It is largest for N < 5 which are prin- 
cipally diffractive events that have a lower trigger efficiency relative to the non-diffractive events as 
discussed in section 3. 

Figure 11 shows projections of the 3-dimensional track-level trigger bias correction for the NSD 
sample to two of the three 2-dimensional planes and to the v z and pj axes. 

4.4 Corrected distributions 

An n-Vz-PT (3-D) histogram is filled for each track in each event and weighted with the values of 
the track-level corrections described above. A v z -N (2-D) histogram is also filled for each event and 
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Figure 10: The event-level trigger bias correction for the NSD sample as a function of (a) v z 
and (b) number of reconstructed tracks (N). 




v 2 [mm] P T [GeV/c] 



(a) (b) 

Figure 1 1 : The track-level trigger bias correction for the NSD sample as a function of (a) v z 
and (b) pj. 
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weighted with the values of the event-level corrections. This histogram is needed to provide the proper 
normalization factor for the analysis. 

After filling the histograms, a vertex range is chosen and the v z variable is integrated out. In 
the case of dN c h/dr\, the pj variable is integrated for pj > 150 MeV; in the case of dN c h/dpj, the 
77 variable is integrated over -2.5 < r\ < 2.5. Each 77 -bin or p T -bin is then divided by the total number 
of events, calculated by integrating the weighted event distribution histogram over N and v z . Finally, 
the distribution is normalized by the inverse width of the bins. No correction was made for the effect 
of the pj cutoff in the reconstruction. However, the stability of the results will be tested by varying 
the p T cutoff in data. 

The distributions are presented for the sample events accepted by the minimum bias trigger. For 
this analysis MBTS_2 is used, and are labelled as MB. The MB distributions have only been corrected 
for the acceptance and efficiencies associated with reconstructing tracks and vertices. To obtain the 
NSD distributions to allow comparisons with other results, the MB distributions are corrected for the 
trigger bias. 

The correction procedure can be expressed mathematically for one bin, corresponding to a certain 
region of phase space, in the following way. The number of particles P and number of interactions / 
are calculated as: 

p(v,vz,pt) = £ Y,&Tk(n,v z ,PT)-c^(n,v z ,pT)-Qri g (n,v z ,p T )), (8) 

(9) 



«Vz,N) = I (C vtx (v z ,N)-C trig (v z ,N)). 



Tracks are weighted by the track-to-particle correction C tr t(ri,v z ,pT), by the track-level vertex cor- 
rection C vtx (ri,v z ,pT) and the track-level trigger bias correction C m - g (r] ,v z ,pr). Events are weighted 
by the event-level vertex correction C vtx (v z ,N) and the event-level trigger bias correction C tr i g (v z ,N). 
For a given vertex range [Vi, V2] the dN c h/dr\ and dN c h/dpr are then calculated as: 



dN ch \ _ Sv?jP(ri , ,v z ,pT)dpTdv l 



dn |„ =fJ , $JI(v z ,N)dNdv z 
JN ch \ _ Sy?JP(ri,v l ,p , T )dTidv l 



d PT \ PT=p , T $JI( Vz ,N)dNdv z 



(10) 
(11) 



To exercise the analysis chain, the complete reconstructed sample was divided in half: one part 
was used to calculate the corrections and the other used as 'data' input to the analysis. 

Figures 12 and 13 show the corrected pseudorapidity distribution (pj > 150 MeV) and the cor- 
rected transverse momentum spectrum (\t]\ < 2.5) for the MB and NSD event samples. Statistical 
errors on the corrected distributions are shown. The aim was to achieve statistical errors on the cor- 
rections of less than 2% per bin. The errors are mostly negligible since sufficient statistics were used 
to determine the correction factor. Data in regions where small statistical errors could not be achieved 
(i.e. near the edges of the acceptance) are not included in the analysis. 



4.5 Systematic uncertainties 

The systematic uncertainties have been estimated by changing the parameters in the generation, re- 
construction or analysis and then re-evaluating the corrections and applying them to the analysis input 
sample. Although many of the systematic uncertainties are correlated, the different effects are studied 
here independently. The systematic uncertainties investigated include: 
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• Track selection: Based on the estimated track resolutions (a) for their distance from the vertex 
of the interaction, do, tracks with do > 3a are removed from the track sample. The error asso- 
ciated with this cut was estimated by varying the cut value by ±0.5(7 and observing the change 
in the number of accepted tracks. A mis-estimate of ±0.5(7 leads to a change in the number of 
accepted tracks of less than 2%. 

• Secondaries: Before track selection cuts are applied, the number of tracks originating from 
secondaries is about 25% of the number of tracks originating from primaries. After the track 
selection the total number of accepted secondary tracks is about 2.2% of the number of the 
accepted tracks. The error on the number of secondaries was estimated by changing the number 
of secondaries in the reconstructed sample by ±50%. The total number of accepted secondary 
tracks was found to change by 1.1%. The effect is strongest in the low-pr region and vanishes 
only for pr>5 GeV. The systematic uncertainty was therefore estimated to be 1.5%. 

• Vertex reconstruction bias: Since the z-position of the reconstructed vertex is integrated out, 
it is possible that a bias from the reconstruction of the vertex could introduce a systematic 
effect on the measurement. Bias due to the vertex reconstructed was evaluated by repeating 
the dN c h/dr\ analysis using the generated v z instead of the reconstructed v z . The observed 
difference was found to be 0.1%. 

• Misalignment: The systematic effect from misalignment was estimated by re-running the re- 
construction with a different geometry that corresponded to a misaligned and distorted detector, 
and comparing the dN c h/dr\ produced with that produced for a detector that is ideally aligned. 
The misaligned geometry was misaligned both globally and locally. However, the local mis- 
alignments had been corrected for using the alignment procedure. The two distributions agreed 
at the level of 5-6% across the barrel and forward regions. This is the exepected level that 
can achieved with cosmics and early data. A systematic uncertainty of 6% was attributed to 
misalignment. 

• Beam-gas interactions: The presence of background events coming from beam-gas would result 
in additional systematics on the measurement. 

The rate of beam-gas interactions is highly dependent on the particular beam conditions during 
startup such as the number of protons per bunch and the beam current. Studies by the LHC 
group have estimated the rate of beam-gas collisions within ATLAS to be 100 Hz [19]. As- 
suming a pp collision rate during early running of 8 x 10 5 Hz this corresponds to approximately 
1 beam-gas event per 8000 pp events. Preliminary studies with simulated beam-gas events 
have also shown that only a very small fraction (<1%) of beam-gas events passed the vertex 
reconstruction requirement. Therefore the effect of beam-gas events is expected to be small. 
The systematic uncertainty due to beam-gas events was estimated to be ~ 1%. 

• Particle composition: The track-to-particle correction is calculated from events generated with 
PYTHIA [14]. Although 7t ± , K ± , and p ± compose over 98% of the charged particle multi- 
plicity in these events, the efficiency for detecting each of these is considerably different for 
Pt < 500 MeV. This model dependent effect introduces an additional systematic error on the 
measurement since relative abundances in PYTHIA could differ from reality. 

Relative abundances of charged pions, kaons and protons and anti-protons were enhanced and 
reduced by ±50% to estimate the systematic uncertainty due to variations in the particle com- 
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position. The systematic uncertainty obtained by this method is estimated to be around 2%. 
This is highly dependent on the low-pr cut used in track reconstruction. 

• Relative process frequency: The trigger bias and vertex reconstruction corrections are calculated 
from a sample of inelastic events where the relative cross-sections are as predicted by PYTHIA 
[3]. Since the trigger and vertex reconstruction efficiencies are different for each of the inelastic 
components (non-diffractive, single diffractive, and double-diffractive), these corrections are 
dependent on the relative cross sections of the different components. 

The relative cross sections between non-diffractive, single-diffractive and double-diffractive 
components were enhanced and reduced to estimate the systematic due to the uncertainty from 
the PYTHIA event generator. The corrections have been calculated by changing the diffractive 
cross sections by ±50% of the PYTHIA prediction, i.e. 7.1 mb <o SD < 21.4 mb and 5.1 mb < 
Odd < 15.3 mb, which, as can be seen in Table 1, covers the difference in the predicted relative 
cross sections between PYTHIA and PHOJET. 

Changing the diffractive cross sections by ±50% changes the result of the analysis by about 4% 
on the final NSD sample. For the minimum bias sample in which no trigger bias correction is 
applied, this systematic error is approximately zero. 



Name 


Level 


Estimated Uncertainty 


Track selection cuts 


Analysis 


2% 


Mis-estimate of secondaries 


Analysis 


1.5% 


Vertex reconstruction bias 


Reconstruction 


0.1% 


Misalignment 


Reconstruction 


6% 


Beam-gas and pileup 


Offline Trigger 


1% 


Particle composition 


Generation/Simulation 


2% 


Diffractive cross sections (NSD sample) 


Generation 


4% 


Total 


8% 



Table 7: Summary of the various systematic uncertainties and the level at which they are in- 
troduced. The total systematic uncertainty assumes each of the individual uncertainties are 
independent. 

A summary of the systematic uncertainties is given in Table 7 along with the step in the full 
chain at which they are introduced. The uncertainties in the track selection, vertex reconstruction bias 
and mis-estimate of secondaries are dominated by the low-pr tracks (<500 MeV). Any uncertainties 
at the generator level are due to uncertainties in predictions of physics models at LHC-energy and 
are unavoidable in the final analysis. However, the principal error in the reconstruction arises from 
alignment of the inner detector. The estimate of 6% is for initial running and is likely to improve. 

5 Future work 

5.1 Work required for first data 

A number of issues have arisen during the work for this note that need to be studied further. Many of 
these studies are already in progress and should be completed before the first pp collisions from the 
LHC are recorded. The main topics currently under investigation are: 
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• Studies with other Monte Carlo generators: The studies of pp interactions in this note have been 
carried out using PYTHIA [3]. The biases introduced by the trigger will certainly depend on 
the particular physics models employed by PYTHIA. 

The generator PHOJET [4, 13] is being used to simulate samples of LHC inelastic pp collisions 
(non-diffractive, single- and double-diffractive). The physics studies discussed previously in 
the analysis section are going to be repeated with PHOJET samples and the results compared 
with those shown in this note. Comparing the results from analyses in different samples will 
provide us indications on the systematic uncertainties due to different physics models. 

• Vertexing: During initial running at low luminosities, there will still be periods where there will 
be more than one event per bunch crossing. It is therefore important to be confident that an 
event contains only one interaction by identifying events with a single vertex. Further work is 
required to look at vertex reconstruction with low levels of additional events to evaluate how 
well events with several vertices can be reconstructed. 

• MBTS readout: The MBTS LI trigger readout is currently being upgraded as part of the tile 
calorimeter refurbishment. As a part of this refurbishment the 3-in-l cards connected to the 
MBTS readout are being switched from low gain to high gain. This modification means that 
the signal to noise should be 5-6 times better than the values used in Section 3.2. In addition 
to the LI improvements, the MBTS will be readout at L2 to allow verification of the Lltrigger 
decision using the precision readout of the Tile Calorimeter electronics. Work is ongoing to 
implement software to match these requirements. Once the software has been validated, the 
MBTS LI and L2 trigger performance will be reevaluated. 

• Beam-gas and beam halo estimate from commissioning runs: Recent efforts to produce more 
accurate predictions for beam gas rates will allow a re -run of studies on how best to optimize the 
MBTS and the track-trigger to reduce this background in the measured minimum bias sample. 
For both cases, strategies on how to estimate the rates of beam-gas and halo events from data 
during commissioning runs with a single proton beam are beginning to be discussed. 

• ATLAS Beam Conditions Monitor The Beam Conditions Monitor [20] is designed to distin- 
guish collisions from background through time-of-flight measurement and can improve our 
understanding of beam-gas and halo rates. 

5.2 Physics studies beyond first data 

Following the first physics studies with minimum bias data a series of applications can be foreseen. 
Among them we would highlight: 

• Retuning Monte Carlo generators: LHC predictions for minimum bias events generated with 
tuned MC generators PYTHIA and PHOJET indicate that although these tuned models give 
comparable descriptions of lower energy data, they disagree typically by ~ 30% for minimum 
bias multiplicity distributions [12]. 

Measuring the global properties of minimum bias interactions as presented in this paper will 
provide enough information to revisit the generator predictions and retune event generators 
to describe interactions at the LHC energy. This will not only allow us to distinguish which 
physics model and assumptions better agree with the data, but will also allow the reassessment 
of systematic uncertainties in various channels. 
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• Multiparton interactions: Charged particle multiplicity distributions [21] have been widely used 
as important tools for studying multiple particle production in inelastic hadronic events [5,6, 
10, 11]. They are particularly useful when displayed in terms of multiplicity scaled variables 
known as KNO variables [21]. Plotted as a function of KNO variables, the charged multiplicity 
distributions provide a clearer display of fluctuations seen for both very low (less than half of the 
average multiplicity) or very high multiplicity (more than the double the average) events. The 
rise of the high multiplicity tail in these distributions has been interpreted as an effect caused by 
multiparton interactions [10, 1 1] and the LHC measurement of this effect will contribute greatly 
to a better understanding of the underlying physics in inelastic collisions. 

• Contribution to the measurement of the total cross section: A measurement of the total inelastic 
rate by combining the NSD analysis with a measurement of single diffractive cross section by 
the ALFA detector should be possible. Simultaneously measuring the elastic t-spectrum, which 
can then be extrapolated to t —>■ 0, will enable us to determine the total cross section (G tot ) in 
a luminosity-independent way or to calibrate the absolute luminosity in c tot -independent way, 
using the optical theorem. 

• Baseline for heavy-ion studies: A measurement of the particle yield from a single minimum 
bias event can and will be used to cross-check components and assumptions made by models 
for heavy-ion collisions which will then be tuned according to the LHC data. This procedure is 
expected to help in generating more reliable predictions for the heavy-ion runs. 

6 Conclusions 

We have investigated methods for measuring the characteristics of inelastic collisions with the ATLAS 
detector during early LHC running. The main goal of our study was to reconstruct the normalised 
central pseudorapidity (j^-j^) and transverse momentum (77-3^) distributions of charged primaries 
in pp interactions selected by a minimum bias trigger. Triggering strategies for minimum bias event 
selection have been developed and an analysis to determine the charged particle distributions were 
presented. 

Aiming at selecting inelastic collisions with as little bias as possible, two scenarios were proposed 
for triggering: a random-based track trigger and using the minimum bias trigger scintillators. The 
random-based track trigger combines random event selection at LI with the use of information from 
the ID in the HLT; spacepoints from the pixel detector and the SCT at L2 and tracks reconstructed in 
the EF. Minimum bias selection with the MBTS is performed by recording events that pass LI MBTS 
hit requirements. It has been shown that both triggers are highly efficient at selecting non-diffractive 
inelastic samples. However, the acceptance of the diffractive component is highly suppressed relative 
to the non-diffractive component, and the acceptance of the single-diffractive sample is similar to that 
of the double-diffractive sample. This means that the ATLAS minimum bias triggers do not select a 
NSD sample as previous experiments have and therefore model dependent corrections will be required 
to compare ATLAS measurements to results from previous experiments. 

Results were presented for two samples: the MB sample, which has only been corrected for 
detector effects, and the NSD sample where the MB sample has been further corrected for the trigger 
acceptance, which is model dependent. The analysis was performed on a sample of ~ 75,000 events, 
corresponding to a luminosity of 10~ 6 pb _1 . The uncertainty on -U ^ for both samples is dominated 
by the systematic uncertainty from alignment and there is an additional uncertainty on the NSD sample 
due to the model dependence of the relative cross-sections. The systematic uncertainty on ^- ^ for 
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the MB sample was estimated to be 6% and 8% for the NSD sample. This will be sufficient to 
distinguish between different models of minimum bias events. 
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Abstract 

This report summarises the ATLAS prospects for the measurement of W 
and Z production cross-section at the LHC. The electron and muon decay 
channels are considered. Focusing on the early data taking phase, strategies 
are presented that allow a fast and robust extraction of the signals. An over- 
all uncertainty of about 5% can be achieved with 50 pb~Hn the W channels, 
where the background uncertainty dominates (the luminosity measurement 
uncertainty is not discussed here). In the Z channels, the expected precision 
is 3%, the main contribution coming from the lepton selection efficiency un- 
certainty. Extrapolating to 1 fb _1 , the uncertainties shrink to incompressible 
values of 1-2%, depending on the final state. This irreducible uncertainty is 
essentially driven by strong interaction effects, notably parton distribution 
uncertainties and non-perturbative effects, affecting the W and Z rapidity 
and transverse momentum distributions. These effects can be constrained 
by measuring these distributions. Algorithms allowing the extraction of the 
Z differential cross-section are presented accordingly. 

1 W and Z cross-section measurements at the LHC 

The study of the production of W and Z events at the LHC is fundamental in several respects. First, 
the calculation of higher order corrections to these simple, colour singlet final states is very advanced, 
with a residual theoretical uncertainty smaller than 1% [1]. Such precision makes W and Z production 
a stringent test of QCD. 

Secondly, more specifically for Z production, the clean and fully reconstructed leptonic final states 
will allow a precise measurement of the transverse momentum and rapidity distributions, respectively 
do I dp j and do/dy. The transverse momentum distribution will provide more constraints on QCD, 
most significantly on non-perturbative aspects related to the resummation of initial parton emissions, 
while the rapidity distribution is a direct probe of the parton density functions (PDFs) of the pro- 
ton. The high expected counting rates will bring significant improvement on these aspects, and this 
improvement translates to virtually all physics at the LHC, where strong interaction and PDF uncer- 
tainties are a common factor. 

From the experimental point of view, the precisely measured properties of the Z boson provide strong 
constraints on the detector performance. Its mass, width and leptonic decays can be exploited to mea- 
sure the detector energy and momentum scale, its resolution, and lepton identification efficiency very 
precisely. 

Finally, a number of fundamental electroweak parameters can be accessed through W and Z final 
states {M\y, through the W boson decay distributions; sin 2 %, via the Z forward-backward asymme- 
try; lepton universality, by comparing electron and muon cross-sections). These measurements are 
long term applications where the understanding of the hadronic environment at the LHC is crucial, 
and to which the above-mentioned measurements are necessary inputs. 
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The present note summarises the ATLAS preparations for W and Z cross-section measurements, in 
the context of the early running of ATLAS and the LHC. The electron and muon decay channels are 
considered. A baseline integrated luminosity of Jz? = 50 pb~ 1 is assumed for the total cross-section 
analyses; based on these results, we estimate our prospects for Jz? = 1 fb _1 . Anticipating that the mea- 
surement precision will soon be limited by the above-mentioned theoretical uncertainties, differential 
cross-section analyses are presented in the second part of this work. We consider the Drell-Yan mass 
spectrum below the Z peak, and the y and pj distributions on the Z resonance. 

The note is organised as follows. Section 2 gives technical details about cross-section measurements, 
lists the simulation samples used in the analyses, and reviews the main reconstruction aspects. Sec- 
tions 3 and 4 describe the selections that allow extraction of the W and Z signals, give the expected 
statistics, and discuss the uncertainties on the background rates, as these are specific to each channel. 
Common systematic uncertainties affecting the cross-section determination are discussed in Section 5. 
Section 6 then presents the expected performance for total cross-section measurement. Differential 
cross-sections are discussed in Section 7, and Section 8 summarizes our results. 

2 General discussion 

This section describes the general procedure used to extract physical cross-sections, and the simulation 
samples used to evaluate the expected performance. 

2.1 Total cross-section measurements 

The number of events N passing a given set of selections is expressed as follows: 



N = ^ oAe + B (1) 

where Jz? is the integrated luminosity; a the signal cross-section; A the acceptance of the signal, 
defined as the fraction of the signal that passes the kinematic and angular cuts; e is the reconstruction 
efficiency of the signal within the fiducial acceptance; B is the number of background events. In the 
above, e is to be understood as averaged over the phase space accepted by the selections. Conversely, 
the measured total cross-section is expressed as: 

and the overall measurement uncertainty gets contributions from the different terms as below: 
So SN®SB SJz? SA 8s 

Above, 8N ~ \/N is of purely statistical origin, and the relative uncertainty decreases with increasing 
luminosity following 8N/N ~ 1/VJz?. The terms SB, 8A and 8e are of both theoretical and experi- 
mental origin. They are considered as systematic uncertainties in the cross-section measurements, but 
can be constrained via auxiliary measurements. We thus expect these terms to improve over time, pro- 
vided the auxiliary measurements have statistically dominated uncertainties. The machine luminosity 
Jzf will be measured with different methods. The uncertainty on this parameter, <5Jzf, is expected to 
decrease through improved understanding of the LHC beam parameters and of the ATLAS luminosity 
detector response [2]. 
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2.2 Signal and background samples. Benchmark cross-sections 

Our main signals, W and Z events decaying into electrons and muons, are generated using PYTHIA [3]. 
The analysis described in Section 7, a measurement of the low-mass Drell-Yan cross-section, exploits 
samples produced using HERWIG [4]. 

The W and Z samples are filtered at generation-level, requiring at least one lepton within the fiducial 
acceptance. The electron channels require \r\ e \ < 2.7 and p e T > 10 GeV; the muon channels require 
\r\p\ < 2.8 and Pj>5 GeV. These filters have an efficiency of about 85% for Z events, and about 
65% for W events. In the case of the Z sample, the available energy for the hard process is limited 
by \f§ > 60 GeV. The low-mass Drell-Yan samples have the same fiducial cuts on both leptons, but 
require 8 < < 60 GeV. The W and Z cross-sections are normalised to the NNLO cross-sections as 
provided by the FEWZ program [1]. 

The backgrounds considered in the analyses originate from W and Z events decaying to T-leptons, 
with subsequent leptonic T decays; it events involving at least one semileptonic decay, and from in- 
clusive jet events filtered to favour the presence of real leptons or hadrons misidentified as leptons. 
Low-mass Drell-Yan events, analysed only in the electron channel (y* — >• ee), also account for back- 
grounds from boson pair production. 

The W — »■ TV T and Z — ► TT events are produced as the signal samples (generators and filters). The 
it samples are generated inclusively, using MC@NL0 [5] to provide both the final states and the cross- 
section. They are filtered for the presence of at least one electron or muon, without kinematic con- 
straints. Diboson events, one of the main background in the low-mass Drell-Yan analysis, are gener- 
ated using MC0NL0. For the WW process only W — ► ev decays are considered, while the ZZ and the 
WZ are generated inclusively. No filters are applied. 

The jet events are produced using PYTHIA. The transverse momentum defined in the rest frame of 
the hard interaction is required to be above 15 GeV for jet samples used in W and Z analyses, whilst 
no such pr cut is required for the sample used in low-mass Drell-Yan analysis. Jet backgrounds for 
the muon channels are generated as inclusive jets, then requiring a final state with one (W analysis; 
Pt(Li) > 15 GeV) or two (Z analysis; prill) > 5 and 15 GeV) muons from b-hadron decays with 
(tj^I < 2.5. Background events from hadron punch-through and from decays in flight of long lived 
particle are found negligible. Background events from cosmic muons can be eliminated in a very 
efficient way with timing cuts. Relying on the Tevatron results [6] , this background is neglected. This 
appears as a safe approximation for the ATLAS experiment, since the Tevatron experiments are built 
close to the surface, while the ATLAS detector is ~ 100 m underground. In the electron channels, fake 
electrons are an important issue. Therefore, rather than requiring a true electron in the final state, the 
events are required to contain at least one narrow cluster of energetic final state particles. In practice, 
there should exist a tower of size Atj x A0 = 0.12 x 0.12 containing a total transverse energy greater 
than 17 GeV for use in W and Z analyses and 6 GeV for low-mass Drell-Yan analysis. Events passing 
this filter are considered likely to produce fake electrons and passed through the simulation step. 

All samples are interfaced to the CTEQ6L1 or CTEQ6M parton density sets [7] depending if the 
generator uses a leading or next to leading order calculation, respectively. The events are processed 
through full simulation using Geant 6.4 and a special misaligned geometry, as described in Ref. [8]. 
While summarised here, the cross-sections used are described and justified, together with their uncer- 
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tainties, in Ref. [9]. Tables 1 and 2 summarise the signal and background samples and their properties. 
The first column in these Tables indicates explicitely the cases in which a specific leptonic decay have 
been required at the generation level. For this reason, the second column represents in some case (like 
for the tt dataset) the total production cross section and in other cases the total cross-section multi- 
plied by the leptonic branching ratio(s). The third column indicates the efficiency of the filter which 
is applied on the generated final states described in the first column. 



Channel 


a(x B r ) 


Bfilter 


Nevt (Xl0 3 ) 


JSf (pb- 1 ) 


W^ev 


20510 pb 


0.63 


140 


11 


y/Z ^ee,yfs> 60 GeV 


2015 pb 


0.86 


399 


230 


y/Z -> ee, V~s < 60 GeV 


9220 pb 


0.022 


197 


969 




20510 pb 


0.20 


32 


8 


Z^TT 


2015 pb 


0.05 


13 


129 


tt 


833 pb 


0.54 


382 


850 


Inclusive jets {pj >6 GeV) 


70 mb 


0.058 


2480 


0.0006 


Inclusive jets (p T >17 GeV) 


2333 jUb 


0.09 


3725 


0.02 


WW^ («V)(«V) 


1.275 pb 


1. 


20 


15608 


zz 


14.8 pb 


1. 


43 


2922 


wz 


29.4 pb 


1. 


50 


1699 



Table 1: Signals and background samples in the electron channels. W and Z cross-sections are nor- 
malised to the NNLO prediction; the tt cross-section is computed at NLO; the jet cross-section is the 
LO result. The filters are described in the text. The number of simulated events and the corresponding 
integrated luminosity are also indicated. 



Channel 


a(x B r ) 


BfUter 


Nevt (X10 J ) 


J^(pb-') 


W -^IXV 


20510 pb 


0.69 


190 


13 


y/Z — > jJ.jJ.,y/$ > 60 GeV 


2015 pb 


0.89 


446 


249 


W ^TV T 


20510 pb 


0.20 


32 


8 


Z^TT 


2015 pb 


0.05 


13 


129 


tt 


833 pb 


0.54 


382 


850 




766 jUb 


2.1 x 10~ 4 


110 


0.67 


bb -> lipi+X 


25 /lb 


1.6 x 10" 4 


140 


35 



Table 2: Signals and background samples in the muon channels. W and Z cross-sections are nor- 
malised to the NNLO prediction; the tt cross-section is computed at NLO; the bb cross-section is the 
LO result. The filters are described in the text. The number of simulated events and the corresponding 
integrated luminosity are also indicated. 



2.3 Common selection aspects 

As already mentioned, W and Z boson final states are selected through their decays into electrons 
and muons. The reconstruction of electrons is based on a cluster measured in the electromagnetic 
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calorimeter, geometrically matching a track reconstructed in the Inner Detector. The identification of 
isolated high-pr electrons is then based on the shapes of the electromagnetic showers, and on track 
reconstruction information. Three sets of identification criteria have been defined. The Loose crite- 
rion consists of simple shower-shape cuts; the Medium criterion adds further cuts on shower-shape 
and on track quality; the Tight criterion tightens the track-matching requirement, adds a cut on the 
energy-momentum ratio and further selections based on the vertexing-layer hits and on the Transition 
Radiation Tracker. Electron reconstruction and its performance are described in Ref. [10]. 

The muon reconstruction is done with the Muon Spectrometer, possibly completed by the Inner De- 
tector. Stand-alone muons are defined as consisting of a reconstructed track in the spectrometer only, 
and combined muons are the subset of the above that include a matching track in the Inner Detector. 
Muon reconstruction is documented in Ref. [11]. 

The measurement of missing energy in the transverse plane (tfj) is an important requirement for W 
boson cross-section measurements, as significant tyj reflects the presence of at least one neutrino in 
the final state. The algorithm exploits the energy deposits in the calorimeter cells, the reconstructed 
muon tracks, and an estimate of the energy lost in the cryostat. The calorimeter cells are calibrated 
according to the physical object they represent (electrons or photons, taus, jets and muons). Cells 
corresponding to electrons, photons and muons are calibrated at the electromagnetic scale, whereas 
all other cells are calibrated at the hadronic scale. The #j value is then computed as the vector sum 
of the cell transverse energies. If muons are reconstructed in the event, their transverse momentum 
is added to the calorimetric sum. A complete description of the reconstruction can be found in 
Ref. [12]. 

Jets are reconstructed from calorimeter cells. The Cone algorithm is used, where the jet size parame- 
ter, A/? = ^(AT7) 2 + (A0) 2 , is set to AR = 0.7. 

At low luminosity, Jz? = 10 31 cm~ 2 s _1 , the relevant trigger items require at least one electron or muon 
with pr > 10 GeV, at least two electrons with pr > 5 GeV, or two muons with pr > 4 GeV. No iso- 
lation criteria are imposed on the leptons. As the LHC luminosity ramps up towards its design value, 
tighter selections will be needed to control the rates. The thresholds are raised, and isolation criteria 
are imposed on the electrons. The trigger items relevant for W boson selection require at least one 
isolated electron with pj > 22 GeV, or one muon with pr > 20 GeV. For Z production, two isolated 
electrons with pj > 12 GeV or two muons with pj > 10 GeV can be required in addition to the 
above. The trigger items described above are summarised in Table 3. Many more trigger items exist; 
a complete description can be found in Ref. [13, 14]. 

The reconstruction efficiency for electrons and muons, and the resolution of the Tf.j reconstruction al- 
gorithm are illustrated in Fig. 1. For electrons the Medium identification efficiency is illustrated, and 
for muons the combined reconstruction efficiency is shown. With looser criteria, higher efficiency and 
weaker r\ dependence are obtained, at the cost of larger backgrounds. A complete description of the 
ATLAS detector and its performance can be found in Ref. [15]. 
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Trigger item 


Description 


elO, e20 


One electron, p T > 10,20 GeV 


mulO, mu20 


One muon, p T > 10,20 GeV 


2e5 


Two electrons, pj > 5 GeV 


2mu4 


Two muons, pj > 4 GeV 


e22i 


One isolated electron, pr > 22 GeV 


mu20 


One isolated muon, pj > 20 GeV 


2el2i 


Two isolated electrons, pj > 12 GeV 


2mul0 


Two isolated muons, pj > 10 GeV 



Table 3: Main trigger items relevant for the selection of W and Z boson final states. The first group of 
trigger items is relevant to the Jzf = 10 31 cm _2 s _1 trigger menu; the second group is relevant for the 
Jz? = 10 33 cnrV 1 trigger menu. 




Figure 1: From left to right : Medium identification efficiency for electrons vs. r\; muon combined 
reconstruction efficiency vs. Tj ; resolution. The efficiencies are obtained from Z boson events, and 
integrated over pj > 20 GeV. The resolution is obtained from W — > ev events. 
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Selection W -> ev jets W — ► TV Z —> ee 

Trigger 37.01± 0.09 835 ±18 1.73 ±0.02 6.07 ±0.01 

E T >25GeV, \r]\ < 2.4 30.84±0.09 383±12 1.03 ±0.01 3.23±0.01 

Electron ID 26.77 ±0.09 110 ±6 0.91 ±0.01 2.95 ±0.01 

#r>25GeV 22.06 ±0.09 4.6 ±0.7 0.55 ±0.01 0.06 ±0.01 

M r >40GeV 21.71 ±0.08 1.5 ±0.4 0.43 ±0.01 0.04±0.01 

Table 4: Number of expected signal and background events (xlO 4 ) in the W —>■ ev channel after 
all selections, for an integrated luminosity of 50 pb _1 . The quoted uncertainties refer to the finite 
Monte-Carlo statistics only; systematic uncertainties are discussed in the text. 



3 Electron final states 

This section describes the event selections in the electron final states, the expected event rates, and 
estimations of the uncertainties on the remaining backgrounds. 

3.1 W -> ev 

Event selection. The selection of W —> ev events proceeds as follows. First, the e20 trigger item of 
the 10 31 trigger menu should be passed. Then, exactly one electromagnetic (EM) cluster, matched with 
a track and such that E T > 25 GeV, | 7] |< 1.37 or 1.52 <| 7] |< 2.4, should be present in the event. 
This object should satisfy the Medium electron identification criterion. Finally, the reconstructed 
missing transverse energy, reflecting the missing final state neutrino, should satisfy > 25 GeV, and 
the transverse mass of the (I, v) system should satisfy M T > 40 GeV. 

Due to the high di-jet cross-section and the high rejection power of the selections, the available Monte 
Carlo statistics is not sufficient to evaluate this background directly. To overcome this difficulty, the 
jet background has been estimated by applying the trigger and electron identification selections only, 
and correcting the result with a factor obtained by computing the rejection power due to the $t and 
Mj cuts only. 

The number of signal and background events after the successive cuts are given in Table 4 for an 
integrated luminosity of 50 pb _1 . The statistical uncertainty on the expected number of events corre- 
sponding to 50 pb _1 is Ajv s = 0.04 • 10 4 . The resulting transverse mass distribution is shown in Fig. 2. 

Background estimation. As can be seen from Table 4, jet events constitute the largest background 
component. In addition, the jet production cross-section and fragmentation properties at the LHC 
are largely unknown and induce a significant uncertainty on the magnitude of this background; an 
uncertainty of about a factor 3 is estimated. It is therefore important to develop methods allowing to 
monitor the jet background using the data. An attempt is presented below. 

The principle of the method is to measure the normalisation and shape of the jet background ahead 
of the $t cut, in a sufficiently pure jet sample. It is thus needed to find a jet sub-sample that is free 
of signal events, but exhibits a transverse mass distribution and jet multiplicity close to that of the 
jet background to W —> ev at this level of the selection. This sub-sample is then used to evaluate the 
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Figure 2: Transverse mass distribution in the W — > ev channel, for signal and background after all 
selections, for Jf = 50 pb _1 after all selections except Mr cut. 

rejection of fa cut, allowing a realistic estimation of the jet background in the W — > ev selection. 

In this approach, the signal sample is obtained by applying the same trigger, kinematics and electron 
identification selection as described before and removing in addition events with a second high-p^ 
electromagnetic cluster giving an invariant mass, together with first selected electron, close to the Z 
boson mass (65 < M u < 130 GeV). 

The jet background control sample is selected using a single photon trigger with Ej > 20 GeV, and 
subsequent photon identification using the same calorimetric variables as the electron identification. 
The photon cluster should also satisfy the same kinematics cuts of the electron candidate in the signal 
sample. There should be no Inner Detector track matching the photon cluster, to reject events with 
true electrons (e.g. W events) contaminating this photon sample. Simulation studies show that these 
selections provide a sample essentially composed of jet events, even at high values of fa, and that the 
shape of the fa distribution is identical, within the statistical precision, to that of the jet background in 
the W —> ev sample (see Fig. 3). Above fa > 10 GeV, the slope can be described with the convolution 
of an exponential and a second degree polynomial function. 

After the subtraction of the estimated background to the signal sample, the analysis then proceeds 
applying the fa selection mentioned above. This data-driven estimation yields a jet background 
fraction of (0 + 4 — 0)%. The uncertainty corresponds to a number of events, 8B = 0.92 x 10 4 events. 
Besides, a relative uncertainty of 3% is assumed on the W — ► TV background, as estimated from 
the experimental uncertainties on the W and % branching fractions. This process thus contributes 
SB = 0.01 x 10 4 events. 

3.2 Z -> ee 

Event selection. This analysis relies on the e 10 trigger. Events are further preselected by requiring 
two EM clusters with Ej > 15 GeV and \r\ \ < 2.4. The presence of two electrons in the final state al- 
lows application of the Loose electron identification criteria, which we briefly describe below. Three 
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Figure 3: Left: ratio of the distributions in jet background events and in the control sample. Right: 
comparison of the jet background (points with error bars) and the fitted background (rectangles), for 
an integrated luminosity of 50 pb _1 . 




Ethad1/Et37 Width(S1) 



Figure 4: The electron identification criteria described in the text: the cluster hadronic to EM energy 
ratio (left); the cluster width in the first calorimeter sampling (right). The distributions are normalised 
to the number of background entries. 

discriminant variables are used to separate EM clusters, deposited by electrons, from the hadronic 
background. 

The first one is based on the longitudinal shower shape, and represents the ratio of the transverse en- 
ergy deposited in the first compartment of the hadronic calorimeter divided by the transverse energy 
of the EM cluster. This ratio is expected to be small for EM objects, and large for hadronic clusters. 

The second and third one are based on the shower width measured in the EM calorimeter. In the 
second compartment of the EM calorimeter, the width is computed from the ratio of the shower en- 
ergy deposited in a region of size Arj x A(j) = 0.075 x 0.175, divided by the energy deposited within 
Ar) x A(j) = 0.175 x 0.175 around the cluster barycenter. In the first compartment, the cluster spread 
is used, computed as the root-mean-square (RMS) of the cluster energy distribution. These two vari- 
ables discriminate the narrow EM clusters from the wider hadronic clusters. Distributions of the three 
discriminators for electrons and hadrons are shown in Fig. 4 and in Fig. 5. 
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Figure 5: The electron identification criteria described in the text: cluster width in the second 
calorimeter sampling (left) and electron isolation variable (right). The distributions are normalised 
to the number of background enuies. 



Selection 


Z —* ee 


jets 


Trigger 


6.70 ± 0.01 


3110 ±40 


E T > 15 GeV, \r\\ < 2.4, 80 GeV < M ee < lOOGeV 


2.76 ± 0.01 


11.1 ±0.8 


Electron ID 


2.64 ± 0.01 


0.8 ± 0.2 


Isolation 


2.48 ± 0.01 


0.2 ±0.1 



Table 5: Number of expected signal and background events (x 10 4 ) in the Z — > ee channel after all 
selections, for an integrated luminosity of 50 pb _1 . The quoted uncertainties refer to the finite Monte- 
Carlo statistics only; systematic uncertainties are discussed in the text. 



Electrons identified as above are then required to be isolated. The isolation variable is computed from 
the total measured energy in a cone of size AR = 0.45 around and excluding the electron, divided by 
the electron energy. Electrons are isolated if this ratio is smaller than 0.2. Distributions of this variable 
for the signal and the background are shown in Fig. 5. 

The number of signal and background events after the successive cuts are given in Table 5 for an inte- 
grated luminosity of 50 pb _1 . The expected signal counting rate isN = (2.48 ±0.02) x 10 4 events.The 
uncertainty quoted here is obtained by scaling the statistics of the Monte-Carlo sample to 50 pb 1 . The 
resulting di-electron invariant mass distribution is shown in Fig. 6. 

Background estimation. As in the W —> ev analysis, the simulation-based jet background estimate 
of Table 5 is replaced by a data-driven estimate. In this analysis, the signal and background fractions 
are estimated simultaneously, via a fit to both conuibutions. The signal is described by the convolution 
of a Breit-Wigner and a Gaussian resolution function, and the background, completely dominated by 
jet events, by an exponential function. 

At the preselection level (just ahead of the electron identification and without the M ee cut), the back- 
ground largely dominates the signal and allows to determine the exponential slope. After the identifi- 
cation and isolation cuts, the fit yields a background fraction of (8.5 ± 1.5)%, or B = (0.23 ±0.04) x 
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Figure 6: Di-electron invariant mass distribution in the Z — > ee channel, for signal and background, 
for 50 pb _1 , after all selection cuts, except M ee cut. 

10 4 events. The uncertainty on the background fraction derives from the modelling of the signal and 
background shapes. 

The relatively important background rate is explained by the rather loose identification cuts. The 
present selections are chosen to illustrate the robustness of the signal extraction, and to exemplify the 
background extraction method. 

4 Muon final states 

4.1 W -> Aiv 

Event selection. The W — > }XV signal is selected as follows. The events should contain exactly one 
muon track candidate, passing the mu20 trigger item and satisfying |tj | < 2.5 and pj > 25 GeV. The 
energy deposited in the calorimeter around the muon track, within a cone of radius AR = 0.4, is re- 
quired to be lower than 5 GeV. The event missing transverse energy should satisfy tyj > 25 GeV, and 
M T > 40 GeV is required. 

For the initial luminosity the pj cut of the trigger on the muon track is expected to be 20 GeV. Having 
a higher pj threshold, however, can further reduce the backgrounds in particular from heavy flavour 
hadron decays and from decays in flight of long lived particles. The isolation, $t and Mj cuts are 
also effective to reduce those backgrounds. 

After all selections, the overall efficiency for the signal is expected to be close to 80%, with very large 
rejection factors for bb and it events. The number of events that are expected to pass the selection 
criteria for an initial integrated luminosity of 50 pb _1 are shown in table 6. The expected background 
level corresponds to a fraction of ~ 7%. Figure 7 shows the corresponding W transverse mass distri- 
bution before the transverse mass cut (last cut of table 6). 



Background estimation. In contrast to the electron channels, the jet background is less important 
here and does not dominate the overall background. Muons from heavy flavour decays are rejected 
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Selection W -> pv W — > TV Z^pp bb -> pX ft 

Trigger 44.44 ± 0.07 1.53 ±0.01 2.03 ± 0.01 83.34 ± 0.09 0.53±0.07 

p T >25GeV, |tj| < 2.5 35.55 ± 0.06 1.22 ±0.01 1.62 ±0.01 68.27 ± 0.08 0.42±0.06 

Isolation 34.80 ±0.06 1.20 ±0.01 1.59 ±0.01 9.67 ± 0.03 0.35±0.06 

#r>25GeV 28.59 ± 0.05 0.72 ± 0.01 1.10 ±0.01 1.00 ±0.01 0.30±0.05 

M r >40GeV 28.03 ±0.05 0.57 ± 0.01 1.10 ±0.01 0.10 ±0.01 0.24 ±0.05 

Table 6: Number of expected signal and background events (x 10 4 ) in the W —> /IV channel, for an 
integrated luminosity of 50 pb _1 . The quoted uncertainties refer to the finite Monte-Carlo statistics. 




Figure 7: Transverse mass distribution in the W — ► /IV channel, for signal and background, for 
50 pb _1 after all selections except Mr cut. 

using the pj and the isolation cuts, and muons from decays in flight of long lived particles could be 
further rejected using loose impact parameter cuts. The tt background and its uncertainty are small. 

As can be seen in Table 6, the dominant backgrounds are expected from W — > TV and Z ^ pip: events. 
These processes are well understood theoretically, in particular with respect to the W — ► piv signal, 
and can be safely estimated based on simulation. A relative uncertainty of 3% is assumed on the 
W — > TV background, as estimated from the experimental uncertainties on the W and T branching 
fractions. Exploiting the CTEQ6.5 eigenvector sets (cf. Section 5), an uncertainty of 2% is assumed 
on the Z event rate passing the selections. 

The jet background (mostly muons from b-hadron decays) is theoretically not well known. An uncer- 
tainty of 100% is assumed on this background component. 

A theoretical uncertainty of about 15% on the tt cross-section is assumed. In addition, an uncertainty 
of 10% is considered on the rejection obtained from the isolation cut. This leads to a total uncertainty 
of about 20% on the tt background rate. 

4.2 Z ^ /l/i 

Event selection. The Z — ► pip: analysis uses the 10 GeV single muon trigger. The triggered data 
sample is further reduced by requiring at least two reconstructed muon tracks. The present analysis 
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Figure 8: Distributions of the muon isolation variables for signal and backgrounds after all selection 
cuts. Left : track multiplicity within AR = 0.5 around the muon. Right : total transverse momentum 
of these tracks. 



relies on the muon spectrometer only. The reconstructed muon tracks should satisfy |tj| < 2.5 and 
Pt > 20 GeV. The reconstructed charges must be opposite, and the invariant mass of the muon pair is 
required to fulfil |91.2 GeV-M^| <20 GeV. 

Muons in jet events tend to be produced within a decay cascade of further particles, and should there- 
fore not appear isolated in the detector, in contrast to the leptonic decays of Z and W bosons. To 
quantify the isolation of the muons, the number of Inner Detector tracks within a cone around the 
candidate muon, as well as the total transverse momentum of these tracks are used. The cone size is 
AR = 0.5, and the muon track itself is excluded from the calculation. 

The distributions of the isolation variables for signal and background processes normalised to their 
cross sections is shown in Fig. 8, after the above-mentioned cuts. 

The isolation and pj cuts are chosen to minimise the statistical uncertainty on the cross-section mea- 
surement. The expected number of events after each cut are shown in Table 7. The chosen cuts select 
about 70% of the Z — >• \x\x events with muons in the detector acceptance. The residual background 
fraction of this selection is 0.004 ± 0.001 (stat). The corresponding invariant mass distribution is 
shown in Fig. 9. 



Background uncertainty. In this channel, the dominant background originates from tt events. Be- 
sides a theoretical uncertainty of about 15% on the cross-section, an uncertainty of 10% is assumed 
on the rejection obtained from the isolation cuts. This leads to a total uncertainty of about 20% on the 
tt background rate. 

The jet background (mostly muons from b-hadron decays) is expected to be smaller, but is theoreti- 
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Selection Z ->■ jiji bb -> jj,jj,X W -> ^iV Z — > TT « 

Trigger 3.76±0.01 10.08 ±0.04 36.7±0.1 0.09 ±0.01 0.69 ±0.01 

2 muons + 

opp. charge 3.33±0.01 3.00±0.04 1.14 ± 0.02 0.04 ±0.01 0.35±0.01 

M w cut 3.04±0.01 0.26±0.01 0.04±0.01 (14±4)xl0~ 4 0.02±0.01 

p r cut 2.76±0.01 0.125± 0.001 0.004 ±0.001 (ll±4)xl0- 4 (134±8)xl0- 4 

Isolation 2.56±0.01 (18±5)xlQ- 4 (9±5)xl0~ 4 (ll±4)x!Q- 4 (66±4)xlQ- 4 

Table 7: Number of expected signal and background events (x 10 4 ) in the Z — > jiji channel, for an 
integrated luminosity of 50 pb _1 . The quoted uncertainties refer to the finite Monte-Carlo statistics. 
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Figure 9: Di-muon invariant mass distribution in the Z — > jj,/J, channel, for signal and background, for 
50 pb -1 , after all cuts, except the isolation and cuts. 
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cally not well known. An uncertainty of 100% is assumed on this background component. 
The other backgrounds are smaller, theoretically well known in comparison to the above, and con- 
tribute negligibly to the overall background uncertainty. 

5 Common systematic uncertainties 

5.1 Trigger and reconstruction efficiency 

As has been seen in Sections 3 and 4, the selection of leptonic Z boson decays provides clean signals 
with low backgrounds. This allows determination of the lepton trigger [13, 14] and reconstruction 
efficiencies [10, 1 1] using the well-known tag-and-probe method, which is briefly outlined below. 

Selecting Z —>■ ee and Z — > jiji events as in Sections 3 and 4, i.e. requiring a single lepton trigger 
and two reconstructed leptons, the efficiency of a given trigger item is defined as the fraction of the 
selected events where the second reconstructed lepton passes this trigger item. 

The off-line reconstruction efficiency can be determined in a similar way. Requiring one reconstructed 
lepton satisfying tight identification criteria, and requiring a second isolated, high-py object such that 
the invariant mass of the pair is close to the Z boson mass, provides a sufficiently pure Z — >• 11 sample; 
the efficiency of a given identification criterion is then defined as the fraction of events where the 
second object is indeed identified. Conversely, the efficiency of the isolation cuts can be determined 
by requiring the second object to be identified, and counting the fraction of events where the isolation 
cut is passed. 

The above methods are exact in the limit where backgrounds vanish. For tight trigger and off-line 
cuts, this is the case in practice : the background magnitude and uncertainty have a negligible im- 
pact on the efficiency determination. Backgrounds are larger when assessing looser identification 
and isolation cuts, and lower trigger thresholds. In this case, interpreting the observed dilepton mass 
spectrum as a sum of signal and background contributions (described by the convolution of a Breit- 
Wigner resonance and a Gaussian resolution function, and by an exponential or polynomial function, 
respectively) allows to extract the background fraction and correct the computation accordingly. This 
procedure was performed and shown to provide efficiency estimates that are unbiased within the sta- 
tistical precision expected for «Sf = 50 pb _1 . 

Figures 10 and 11 illustrate this discussion, on the examples of the e20 trigger item and Medium 
electron identification cut, and of the mu20 trigger item and combined muon reconstruction. For the 
selections used in the analyses of Section 3 and 4, the overall efficiency can be reconstructed with a 
precision of 8e/e = 0.02 for electrons and muons. 

The overall reconstruction efficiencies in Fig. 10, 11 reflect the performance of the reconstruction 
software at the time of writing this note. Improved performance and results are presented in [10, 11, 
13,14]. 

5.2 Theoretical systematic uncertainties 

This section presents comparisons on the acceptance for W — >• lv and Z — >• 11 events, as obtained from 
the Pythia, Herwig and MC@NL0. The purpose of this study is to determine the contribution of the 
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Figure 10: Electron detection efficiency vs. tj, as measured from the tag-and-probe method and 
compared to the truth, for 50 pb _1 . The product of the e20 trigger efficiency, and Medium electron 
identification efficiency is represented. 




Figure 11: Muon detection efficiency vs. Tj, as measured from the tag-and-probe method and com- 
pared to the truth, for 50 pb _1 . The mu20 trigger efficiency, and the combined muon reconstruction 
efficiency are represented. 
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uncertainties on the acceptance to the overall systematic uncertainty on the cross-section. 

The kinematic cuts described in Sections 3 and 4 are applied to the generator-level particles for each 
of the above generators. For W events, the acceptance varies by 2.5% from one program to the other. 
For Z events the observed variation is of order 3.2%. The sources which could explain the observed 
differences are the Initial State Radiation (ISR), the intrinsic k T for the incoming partons, the Under- 
lying Event (UE), final state photon radiation, Parton Density Functions (PDFs) and matrix element 
corrections applied to the parton shower (ME). 

To quantify the impact of the individual sources, samples are generated with ISR, kj, UE and ME all 
switched off. The impact of each effect is then studied by switching on this effect individually. For 
the sake of clarity, the discussion is given explicitly for W —>■ ev production only; at the end of the 
section results are given for both W and Z production. 

The effects of electroweak corrections on the acceptance have been studied using PHOTOS. By running 
alternatively with and without PHOTOS, one obtains an effect of 1.8% for W events. 
Switching on ISR, or changing the intrinsic kj of the incoming partons has an important impact on the 
lepton r\ and p T distributions. Specifically, ISR introduces a difference of 10.2% on the acceptance 
for W events. Similarly, turning on and off the kj, ME and UE, the following differences on the W 
acceptance are obtained: 1.9% for the intrinsic kj, 1.0% for the UE and no effect for the ME. 

For these sources, the systematic uncertainty is estimated as 20% of the above numbers, which 
amounts to assuming that the models describing the above are correct within 20%. One thus ob- 
tains the following uncertainties for W ev: 2.0% (ISR), 0.4% (kT) and 0.2% (UE). In the case of 
PHOTOS, one has an uncertainty of 0.3%. 

The PDFs are an important source of differences in the acceptances. The uncertainty is determined 
using the CTEQ6.5 PDF uncertainty sets. An uncertainty of 0.9% is found. 

To simplify the procedure, we assume no correlations between the different sources and calculate 
the total uncertainty from the quadratic sum of all numbers, finding 8A/A = 2.3% for W events. 
Repeating the same exercise with Z events gives a very comparable systematic uncertainty. While 
these uncertainties will be significantly reduced with the analysis of the LHC data, we assume these 
figures hold for our initial cross-section measurements. 

6 Total cross-section results 

Cross-section results for Jzf = 50 pb 1 are presented first. At the end of the section, the performance 
is extrapolated to higher luminosity. 

6.1 Results for ^ = 50 pb 1 

We gather below the results of the analyses performed in Sections 3 and 4, and of the discussion of 
systematic uncertainties of Section 5. Table 8 contains our estimations of statistical and systematic un- 
certainties, the cross-section values and their uncertainties computed according to Equations 2 and 3. 
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Process 


7V(xl0 4 ) 


B(xl0 4 ) 


Axe 


8 A/ A 


Se/e 


<r(pb) 


W -^ev 


22.67 ±0.04 


0.61 ±0.92 


0.215 


0.023 


0.02 


20520 ±40 ±1060 


W -^tiv 


30.04 ±0.05 


2.01 ±0.12 


0.273 


0.023 


0.02 


20530 ± 40 ± 630 


Z — > ee 


2.71 ±0.02 


0.23 ±0.04 


0.246 


0.023 


0.03 


2016 ± 16 ± 83 




2.57 ±0.02 


0.010 ±0.002 


0.254 


0.023 


0.03 


2016 ± 16 ± 76 



Table 8: Measured cross-sections, their uncertainties and overall selection efficiency Axe, for an 
integrated luminosity of 50 pb~ 1 . The uncertainty on N is statistical, the other sources are systematic. 
The quoted cross-section uncertainties include the mentioned statistical and systematic contributions 
but not an overall luminosity uncertainty. 



As can be seen from Table 8, the results are dominated by the systematic error, even for «£? = 50 pb~ 1 . 
The luminosity uncertainty is common to all cross-sections, and vanishes in cross-section ratios, e.g 
Ow/oz- In the W channels, the systematic uncertainty is dominated by the background uncertainty. 
This can be expected given the important fraction of jet events. This background could be further 
reduced, notably by requiring the absence of jets, but this would jeopardize the inclusive nature of 
the cross-section measurement. The efficiency and acceptance uncertainties give a slightly smaller 
contribution. 

The Z channels benefit from smaller backgrounds, due to the presence of two decay leptons. For 
the same reason, the efficiency uncertainty is also larger than in the W channels. Given the smaller 
acceptance uncertainty, the efficiency uncertainty is the largest source of uncertainty. 

6.2 Prospects for jSf = 1 fb~ 1 

For higher integrated luminosity, the statistical uncertainty on the counting (N) becomes negligible, 
and the efficiency uncertainty, which is determined from measurement and also of statistical nature, 
strongly decreases. 

With increased luminosity, a number of modifications will have to be applied to the analyses. Most 
prominently, in the electron channels, the single electron trigger threshold will be increased to 22 GeV, 
and the Tight electron identification is expected to be used. In Z — >• /x/x channel, spectrometer muons 
are replaced by combined muons, and the muon isolation cuts are refined by exploiting calorimetric 
information in addition to the track-based isolation used in the low-luminosity analysis. The W — ► /XV 
analysis is unchanged. 

Table 9 summarizes the expected signal yields and the cross-section determination in this case. On 
this timescale, Jz? might be measured with improved precision, exploiting elastic proton scattering at 
very small angles [2]. Compared to the low-luminosity analysis, the systematic uncertainties from 
backgrounds and efficiency are expected to scale with statistics. Without further input, the acceptance 
uncertainty does not decrease and dominates the result. 
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Process 


7V(xl0 3 ) 


B(xl0 5 ) 


Axe 


8A/A 


Se/e 


a(pb) 




W ^ev 


45.34 ±0.02 


1.22 ±0.41 


0.215 


0.023 


0.004 


20520 ± 


9±516 


W 


60.08 ±0.02 


4.02 ±0.05 


0.273 


0.023 


0.004 


20535 ± 


7 ±480 


Z — > ee 


5.42±0.01 


0.46 ±0.02 


0.246 


0.023 


0.007 


2016 ± 


4± 49 


Z^nn 


5.14±0.01 


0.02 ±0.001 


0.254 


0.023 


0.007 


2016 ± 


4± 49 



Table 9: Measured cross-sections, their uncertainties and overall selection efficiency Axe, for an 
integrated luminosity of 1 fb -1 . The uncertainty on N is statistical, the other sources are systematic. 
The quoted cross-section uncertainties include the mentioned statistical and systematic contributions 
but not an overall luminosity uncertainty. 



7 Differential cross-sections 

As it is clear from the previous sections, total cross-section measurements are dominated by the sys- 
tematic uncertainty even for modest integrated luminosity. The main cross-section uncertainty is 
related to the acceptance uncertainty, which in turn comes from our limited knowledge of the under- 
lying physics (notably non-perturbative mechanisms and PDFs). It is therefore important to measure 
the distributions, which will help to constrain these uncertainties. Three examples are given below, 
namely the measurement of the Drell-Yan invariant mass spectrum at low mass, and the rapidity and 
transverse momentum distributions for Z events. 

Compared to the inclusive analyses, the differential measurements require larger statistics. For this 
reason the differential distributions shown in this section refer to an integrated luminosity of 200 pb 1 , 
which corresponds to the available statistics of the Monte Carlo signal samples. 

7.1 Low-mass Drell-Yan production 

Event selection. This study relies on a low threshold single electron trigger, elO, possible at low 
luminosity. At the reconstruction level, exactly two oppositely charged electrons are required, each 
satisfying the tight identification criteria, and with pj > 10 GeV. Offline electron reconstruction is 
limited to |tj| < 2.5. After these preselections, the reconstructed missing transverse energy in the 
event, Ef' ss , should be smaller than 30 GeV. Finally, the di-electron invariant mass, m ee , is required to 
be in the range 20 < m ee < 60 GeV. Table 10 shows the impact of these cuts on signal and background. 
Only the main backgrounds except the one arising from QCD dijets are considered at this level; the 
dijet background is discussed separately. 

The kinematical acceptance, given by the final state requirement of two electrons satisfying pr > 
10 GeV and |tj| < 2.5, is 17% in the range 20 < m ee < 60 GeV For 40 < m ee < 60 GeV, the accep- 
tance reaches 34%. The elO trigger channel provides an efficiency of about 89% for signal events 
within the kinematical acceptance. The trigger efficiency is about 74% for di-electron pair masses of 
m ee = 20 GeV, and reaches 97% at m ee = 60 GeV The offline electron selection efficiency is 36% [10], 
and the further kinematic selections have an efficiency of 83%. 

Dijet background estimation. The background contributions listed in Table 10 are estimated from 
simulation, by counting the events that pass the selection criteria described above. This method can 
not be applied to the dijet background which, due to very high rejection factors, can not be simulated 
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cut y* — > ee TT ft di-boson 



Preselections 2632±12 48±3 218±3 342±3 
£™'™<30GeV 2604±12 38±3 28±1 164±2 
20<m ee <60GeV 2189±11 30±3 7±1 7±1 



Table 10: Signal and background event rates, following the selections described in the text. For all 
samples, the normalization corresponds to 50 pb _1 . The y* /Z —>■ ee signal with m ee > 60 GeV is 
excluded from these numbers, at all levels of the selection. The dominant dijet background is not 
displayed here, and discussed separately below. 

in sufficient amounts. 

An alternative estimation, based on single electron rejection factors, is used. This method relies on the 
probability for an event to display a single fake electron. On the inclusive jet sample, this probability 
is found to be (1.5 ±0.1) x 10~ 4 . Assuming no correlations, the probability to find a fake di-electron 
pair with opposite charge is then estimated as one-half of the square of the single fake probability. 
The invariant mass distribution of fake electron pairs is estimated using Monte Carlo information. 

The invariant mass distribution of the signal and backgrounds are displayed in Figure 12 (left). The 
estimated background represents about one third of the selected sample. It is dominantly composed 
of inclusive jets (96%), the other backgrounds being negligible. 

Two thirds of the dijet events that contain one tight electron candidate are true electrons from heavy 
quark decays, while the remainder are due to light hadrons mis-identified as electrons and to photon 
conversions. It may be possible to reduce the jet background further by optimizing the selection crite- 
ria for these low Ej electrons. Additional event variables, like hadronic activity estimators, could also 
be used to reduce or constrain the size of the dijet background. However, the large fraction of elec- 
trons from heavy quark decays indicates that the uncertainty on the background composition related 
to different rejection factors for heavy flavour and light jets under the the Drell-Yan signal is large and 
requires further study. 

Measurement of the cross-section. The raw measured differential cross-section may be corrected 
bin by bin in di-electron mass, as follows: 



where ./V, is the number of signal events in mass bin i, B, is the expected number of background events, 
A, is the kinematical and angular acceptance efficiency, e, is the overall efficiency accounting for trig- 
ger, identification and further selections. Am/ is the width of bin i and ££ represents the integrated 
luminosity. 

In practice, the corrections can be considered to be of two types: those correcting for experimental 
effects (backgrounds and efficiencies), and those correcting for the kinematic acceptance of the elec- 
tron pr and tj requirements. Figure 12, right, illustrates the two corrections. The agreement between 
the corrected distribution and the expected distribution from theory provides a technical consistency 




(4) 
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m ee [GeV] m ee [GeV] 



Figure 12: Left: Mass distributions of the measured raw events (solid line), Drell-Yan signal events 
(dashed line) and background contributions arising from inclusive jets, tt, tt, di-boson in decreasing 
numerical importance. Right: invariant mass disuibutions of the selected sample before any correc- 
tions (rectangles with error bars), the theoretical signal within the fiducial acceptance (dashed line), 
the corrected Drell-Yan signal sample (unshaded rectangles with error bars) and the complete theoret- 
ical distribution (solid line). 

check of the method. 

The integrated cross-section can be calculated by integrating the corrected histogram; the statistical 
error on the cross-section is estimated by adding the error from each bin in quadrature. The total 
Drell-Yan cross section in the electron channel, for 20 < m ee < 60 GeV is Gdy = 1 -07 nb. The corre- 
sponding statistical uncertainty is 4% for Jzf = 50 pb~ : and 1% for «Sf = 1 fb _1 . This can be compared 
with the PDF contribution to the cross-section uncertainty, estimated to be 6.9% using CTEQ6 . 1M [7]. 

The statistical sensitivity provides a natural target for the different contributions to the systematic un- 
certainty: the knowledge of the background level, and of the efficiency and acceptance corrections 
should match the statistical sensitivity. The efficiency can be determined with sufficient precision 
using the methods discussed in Section 5. The acceptance corrections are mostly affected by the 
transverse momentum distribution of the Drell-Yan pairs in this mass range; this has large theoretical 
uncertainties, and will have to be measured from data. The invariant mass resolution, about 1 GeV, is 
found to have no significant effect on the Drell-Yan spectrum. As discussed above, the most serious 
challenge is posed by the understanding of the jet background, where the current uncertainties are 
much larger than the expected statistical precision. Further study is needed to assess the precision 
of techniques to constrain this background directly from data; this is beyond the scope of the current 
work. 



7.2 Z differential cross-section : bin by bin correction method 
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Category i 


Definition 




1 


All events 


398 750 


2 


Fiducial and kinematics (generation) 


172 544 


3 


Trigger and off-line (fiducial, kinematics and ID) 


49 754 


4 


Intersection of categories 2 and 3 


48 436 



Table 1 1 : Event categories used for the extraction of detector smearing corrections, geometric accep- 
tance and event selection efficiency in the electron channel. 



Electron channel. In the electron channel, only events that pass the 2el2i trigger condition are 
considered. Furthermore, it is required that they contain exactly two, oppositely charged electrons, 
each of them satisfying \r\\ < 2.5 and Pj > 20 GeV. Both electrons are required to pass the Tight 
electron identification criteria. 

The background is dominated by hadrons misidentified as electrons in inclusive jet events. Taking 
into account that with the Tight identification criteria the expected rate of hadrons misidentified as 
electrons is very low (see [10]), the background to the di-electron signal is neglected in the following. 

After all selections, the following event categories are defined. In the following, n\ denotes the total 
sample size; «2 is the number of events having two generator-level electrons satisfying \r\\ < 2.5, 
pj > 20 GeV, and 75 GeV < M ee < 105 GeV. The number of events satisfying these conditions at the 
reconstruction level is noted «3 ; finally, TI4 counts the events passing these criteria on both generation 
and reconstruction levels. Table 1 1 summarises these definitions and contains values for the n,-, di- 
rectly counted from the simulated signal sample. 



Muon channel. In the muon channel, only events that pass the mu20 trigger condition are consid- 
ered. The events should further contain exactly two oppositely charged muons, each of them satisfying 
1 77 1 < 2.5. Both muons should be reconstructed in the Inner Detector and in the Muon Spectrometer. 
The most energetic muon should satisfy pj > 20 GeV; the second one should have pj > 15 GeV. The 
muon pair invariant mass should lie between 76 and 106 GeV. 

Contamination from W — > }XV and tt events effectively disappears after the selection process, but 
bb —> iXjl contamination is still about 3.5% of the signal. To minimise the bb background, two isola- 
tion quantities are studied. The first one is the number of tracks in a cone of size AR =0.45 around the 
muon track ; the second one is the total calorimeter transverse energy in the same cone. The distribu- 
tions of these two quantities for the four samples, corresponding to 40 pb _1 of integrated luminosity, 
are shown in Fig. 13. A muon track is accepted if the first isolation variable is less than six and the 
second isolation variable is less than 20 GeV. These cuts are applied to both muons in the event. 

The isolation cut efficiency for the signal sample is larger than 98%, and the residual contamination is 
less than 0.5%. The sample is pure enough at this point that we can neglect the background contami- 
nation in the differential cross-section plots. 

As in the electron channel, four event categories are defined to allow the computation of the differen- 
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Figure 13: Number of tracks (top) and calorimeter Ej in a cone R = 0.45 (bottom). Distributions for 
the muon with the higher value of the quantity is shown on the left, and distributions for the muon with 
the lower value of the quantity on the right. Black line: Z — > red (or dark grey line): W — ► /IV, 
dashed line: bb — > fijj,, green (or light grey line): it. 
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Category i Definition 


"i 


1 All events 


445650 


2 Fiducial and kinematics (generation) 


234610 


3 Trigger and off-line (fiducial, kinematics and ID) 


181652 


4 Intersection of categories 2 and 3 


180260 



Table 12: Event categories used for the extraction of detector smearing corrections, geometric accep- 
tance and event selection efficiency in the muon channel. 

tial cross-section. The categories and their sizes are given in Table 12. 



Extraction of dOz/dprdy The Z boson phase space is sliced in rapidity and transverse momentum 
regions, or bins. In each region, labeled a, the differential cross-section is obtained from the raw 
event count using the usual expression: 

S a d a -b a 

where S a , e a and A a respectively represent the detector smearing correction (correcting for event 
migration to and from bin a, due to resolution effects), overall event selection efficiency and geometric 
acceptance in region a; d a is the observed event count and b a the estimated background in this region, 
and ££* is the integrated luminosity. In terms of the definitions in Tables 1 1 and 12, we have: 

See = — , £« = — , Aa = e/,7 ?er — , d a = n 3 ,a, (6) 
"4,a "2,a n l,a 

where the a are computed in each bin a of the Z phase space. The acceptance values account for 
the generator-level filtering efficiency, as described in Section 2.2. 



Differential cross-section results In the electron channel, the Z boson phase space was divided in 
50 p T bins and 9 rapidity bins. The p T bins have a width of 2 GeV, in the range 0 < p T < 100 GeV. 
The rapidity bins have a width of 0.3, in the range 0 < \y\ < 2.7. A good agreement is found between 
the reconstructed cross-sections and the true distributions, obtained from a statistically independent 
sample. Figure 14 shows the Z boson differential cross-section in the electron channel in rapidity and 
transverse momentum bins. 



In the muon channel, the Z boson phase space is divided as in the electron channel. A good agreement 
is again found between the reconstructed cross-sections and the true distributions. Figure 15 shows 
the Z boson differential cross-section in the dimuon channel in rapidity and transverse momentum 
bins. 



The plots have been normalised to the total NNLO cross-section of 2015 pb times a global accep- 
tance factor of 0.73. Since the correction factors and measurement were both extracted from the same 
dataset, the good agreement is a check of consistency of the method. 
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Figure 14: Left: doz/dy, integrated over pj. Right: doz/dpr, integrated over —2.7 < y < 2.7. 
Distributions obtained in the electron channel, with a precision corresponding to an integrated lumi- 
nosity of 200 pb _1 . The black line histograms correspond to the generated cross-section, the dashed 
histograms to the measured cross-section before corrections are applied, while the crosses show the 
measured cross-section after all corrections have been applied. 




|Z rapidityl Z P T [GeV] 



Figure 15: Left: dOz/dy, integrated over pj. Right: doz/dpr, integrated over —2.7 < y < 2.7. 
Distributions obtained in the muon channel, with a precision corresponding to an integrated lumi- 
nosity of 200 pb _1 . The black line histograms correspond to the generated cross-section, the dashed 
histograms to the measured cross-section before corrections are applied, while the crosses show the 
measured cross-section after all corrections have been applied. . 
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7.3 Z differential cross-section : alternative method 

The method presented here attempts to fully exploit the phase space of the Z boson and its decay 
products. Writing the cross-section in terms of the complete phase space allows to extract, in addition 
to the Z boson distributions, possible p T , TJ or 0 dependencies of the lepton selection efficiency. 

Method The events are classified in bins both for the Z and for the decay leptons. We define N y z 
bins along y z , and N pt z bins along pj . As before, the Z boson phase space intervals are labelled a. In 
addition, we define Nst intervals in the lepton transverse energy distribution, and intervals for the 
leptons pseudorapidity. The lepton phase space is labelled i, j (one index for each lepton). 

For each a, we measure Nfj, which is the number of lepton pairs reconstructed with one lepton in 
bin i and one lepton in bin j. The following relation holds, in the practical absence of background, as 
justified in the previous sections: 

Nfj = s i s j P l ( *£'Ao a , (7) 

where Pjj is the probability, computed on Monte Carlo, that a Z boson produced in bin a decays into 
two leptons in bins / and j; is the lepton reconstruction efficiency in bin i; is the integrated 
luminosity, and Aa a is the Z production cross-section in bin a. 

Resolution effects, primarily on the lepton Ej, are accounted for as follows. When the Pfj histograms 
are filled, the lepton Ej is first smeared according to its expected, Ej and tj dependent resolution. The 
smeared quantities are then used to compute the Z variables (p t , y). In this way the above equation is 
unchanged and all detector effects can be incorporated in the Pfj factors. Writing the above for all a, 
i, j provides an over-constrained system whose unknowns are the efficiencies and cross-sections. We 
can then compute the £; in each bin a, up to a factor related to Jz? Aa a . 

The system can be solved analytically, using for example the singular value decomposition method, 
or SVD. A drawback of this method is that it is based on least squares; it is thus not valid in the case 
of low statistics. In particular, at low luminosity, the statistics are such that several bins contain only 
a few events. To avoid bias in the efficiency determination, a likelihood using Poisson probabilities is 
constructed and used to fit the efficiencies numerically. In order to help the fit to converge, we first 
solve the system using the SVD method, and we use the results as initial parameters of the fit. Since 
the £,• are expected not to depend on a, we can compute their weighted average over the bins a. 

In case of low statistics, the bin sizes should be large enough to integrate a sufficient statistics in each 
bin. If the lepton reconstruction efficiency is not constant within each bin, the hypothesis that the 
efficiency does not depend on the Z boson phase space might be violated. To avoid such effects, the 
lepton binning is chosen such that the efficiency is a priori constant within each bin. This results in 
bins with variable width, which does not affect the method. 

In any given Z phase space interval a, we can write for each lepton bin (i,j) the following relation: 

^Ao a = ^ -, (8) 

<ei><ej>PPj , 
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Figure 16: Left: JzfAa versus y z , integrated over p z . Right: JzfAa versus p z , integrated over y z . 



where < > and < £y > are the average efficiencies computed at the previous step. Finally, JzfAa" 
is computed by averaging over all 



"Classical limit" of the method. As discussed above, the method proposed here might need sig- 
nificant integrated luminosity to be applied safely. The classical method is reached by simply setting 
Nei = A^rj = 1 , and accordingly computing the acceptance and efficiencies from Monte-Carlo in each 
Z boson phase space interval a. 



Resolution effects are taken into account as before, by smearing the Monte Carlo input before deter- 
mining the acceptance. Once the acceptance and efficiency are determined in each bin a, just counting 
the number of events N a in the bin and allows to deduce the differential cross-section using the usual 
cross-section expression in the absence of background: 

^Aa« = -^-. (9) 



Results. The complete method has been tested on the Z — > jiji. samples described in Section 2.2. 
Due to the limited statistics, the Z phase space was mapped using N pt z =10 for 0 < p z < 60 GeV, N y z 
=5 for —2.5 <y z < 2.5. The muon reconstruction efficiency has no pr -dependence above 10 GeV; 
this allows to set Afe =1. The definition of the muon tj intervals is dictated by the detector geometry 
which affects the efficiency as a function of T7; we set =7, with the intervals [-2.7,-1.6], [-1.6,-1.4], 
[-1.4,-0.1], [-0.1,0.1], [0.1,1.4], [1.4,1.6], [1.6,2.7]. 

The results are illustrated in Fig. 16. The Z boson rapidity and pj distributions are correctly recon- 
structed. Measured and true distributions agree within the statistical precision, which varies from 3% 
in regions where the differential cross-section is high, to about 20% in the tails of the Z boson phase 
space (y z > 1.5). In addition, the X] dependence of the reconstruction efficiency can be measured 
accurately. Given the interval definition above and the size of our sample, a precision of about 2% is 
obtained for each point. This is competitive with the tag-and-probe determination described in Sec- 
tion 5, and illustrated in Fig. 17. 
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Figure 17: Muon reconstruction efficiency versus T7, measured simultaneously with the differential 
cross-section. 

For cross-checks, the classical limit of the method has been tested using the Z^ee samples. The inter- 
vals are defined as before, except N y z =10 and N pt z =20. The following selection criteria are applied: 
two reconstructed electrons of opposite charge are required in the detector acceptance (|tj| > 2.5), 
with 20 GeV < p t < 80 GeV. Both electrons should pass the Tight identification criterion. Z events 
are selected around the mass peak (87 GeV < Mz < 95 GeV), with p z < 60 GeV . The acceptance 
cuts on the electrons imply y z < 2.5. 

The results of the differential cross-section do/dy z , integrated over p z , and do/dp?, integrated 
over y z are shown in Fig. 18. The squares represent the raw distribution, the dots represent the mea- 
surements, corrected by the acceptance and the efficiency factors. The shows the input value. The 
measurements are consistent with the input, which proves the consistency of the method. 



8 Summary and perspectives 

This work presents the ATLAS prospects for the measurement of W and Z boson cross-sections at the 
LHC. In the four considered channels (W — > ev, Z — > ee, W — > /IV, Z — > jUjU), the analyses confirm 
the high purity of the samples after fairly usual selections (high-^lepton identification, isolation, and 
#t in the W final states). The jet background is poorly predicted, and dedicated studies are needed to 
monitor its magnitude using real data. Data-driven methods are presented that seem to have sufficient 
sensitivity to keep the jet background at a level where it does not prevent a precise cross-section mea- 
surement. 

With 50 pb _1 , the background and signal acceptance uncertainties contribute similarly to the mea- 
sured cross-section uncertainty, at the level of 2-4% depending on the channel. The uncertainty on 
the integrated luminosity is not included in the present discussion. Extrapolating to 1 fb _1 , all un- 
certainties are expected to scale with statistics, except the acceptance uncertainty. This leads to the 
conclusion (see also, for example, [16, 17]) that the W and Z cross-sections can not be measured to a 
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Figure 18: Left: JzfAa versus y z , integrated over p z . Right: JzfAa versus pj , integrated over y z . 
precision better than about 2 %. 

This argument however ignores the additional input from differential cross-section measurements. 
In contrast to total cross-sections, the differential ones benefit from small acceptance uncertainties, 
and have the potential to constrain the uncertainties that affect total cross-sections. The examples 
of the dilepton mass spectrum below the Z peak, and of the Z boson rapidity and pj distributions 
are studied here. The methods presented are shown to provide correct estimations of the differential 
cross-sections. The natural next step of these analyses, i.e. quantify their physical implications, is 
beyond the scope of this note and reserved for the real data. 
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Abstract 

We simulate a measurement of the inclusive Z(^e + e^/jj, + jj,^) + jets cross- 
section with the ATLAS experiment in pp collisions at 14 TeV for an in- 
tegrated luminosity of 1 fb _1 using fully-simulated signal and background 
Monte Carlo data sets. The reconstruction of leptons and of missing trans- 
verse energy becomes more complex in the presence of a multi-jet final 
state. We quantify the reconstruction differences with respect to those ob- 
served for inclusive Z production. We derive statistical and systematic lim- 
itations in terms of probing the perturbative QCD predictions and discrimi- 
nating between predictions of different event generators. 

1 Introduction 

The production ofW/Z + jets in pp collisions at 14 TeV is an important part of the physics program at 
ATLAS. The processes are interesting in their own right as tests of perturbative QCD at the LHC, as 
well as forming important backgrounds for both Standard Model and Beyond Standard Model physics 
processes. The results of the measurements can be compared directly with fixed-order predictions at 
leading order (LO) and next-to-leading order (NLO) in QCD and the gauge boson mass provides a 
large scale for the pertubative calculations. In addition, the measurements can be used to test the 
performance of Monte Carlo event generators that will also be used to simulate the backgrounds for 
other physics processes. 

In this analysis we present a feasibility study of the cross-section measurements for data corre- 
sponding to an integrated luminosity of 1 fb _1 performed with fully-simulated signal and background 
Monte Carlo samples. The goal of the analysis is to test the performance of the lepton and jet trig- 
gering and reconstruction algorithms in high jet multiplicity events, to develop the necessary analysis 
techniques (unfolding, background subtraction) and to evaluate the statistical and systematic limita- 
tions of the data, in terms of probing the fixed-order QCD predictions and of discriminating between 
predictions of different Monte Carlo event generators. The primary end-result of the analysis with 
real ATLAS data will be hadron-level cross-sections. In this note we will concentrate on Z + jets in 
final states with electrons and muons, but with techniques applicable to the case of W + jets as well. 
Much of the effort on the triggering and reconstruction of leptons is in common with the inclusive 
W/Z note [1]; thus, we do not reproduce all of the details from that note, but rather comment on the 
impact of a multi-jet environment on these issues. 

2 Reference Cross-Sections and Monte Carlo Datasets 

Reference cross-sections are collected in [2], but we briefly discuss here the cross-sections relevant for 
this note. NLO is the first order at which the Z + jets cross-sections have a realistic normalization (and 
realistic shape for some kinematic distributions) [3]. The current state of the art for NLO calculations 
is for Z + 2 jets, although there is ongoing work for the calculation of 3 jet final states. Cross-sections 
for Z + 0,1 and 2 (3) jet final states can be conveniently calculated at LO and NLO (LO only) using 
the parton-level MCFM [4] program (version 5.1. interfaced with LHAPDF 5.2.3 [5]), and it is from 
this program that we determine our reference cross-sections. We use the CTEQ6.1 partem distribution 
functions (PDFs) [6] and a dynamic renormalization/factorization scale of m| + p\ z . We apply similar 
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kinematic cuts on the leptons and jets and the same jet algorithm on the partons as will be described 
in Section 3. The error on the cross-section stemming from the PDF uncertainty is calculated using 
the complete set of error PDFs in the CTEQ6.1 set. 

2.1 Monte Carlo Datasets 

The most important Monte Carlo data sets for the signal processes (Z — »■ e + e~ and Z — »■ \l + \l~ ) used 
in these studies are generated with ALPGEN [7] (v 2.05), interfaced with HERWIG [8] and using 
the leading order PDF set CTEQ6LL [9]. (Hereafter, when we refer to ALPGEN it is understood 
that it is interfaced with HERWIG.) The generation is done with a renormalization/factorization scale 
of m| + p^ z and a MLM [10] matching cut at pj = 20 GeV (jets below this cut are generated by 
the partem shower and not by the matrix element) and \r\\ < 6. A discussion of the uncertainty in 
predictions for Z + jets final states using different matrix element + Monte Carlo calculations and 
different matching cuts is beyond the scope of this study, but is given in Ref. [10]. 

The final Monte Carlo data sets are obtained following the standard prescription [10], by merging 
the samples of Z + n partons (where n=0-5), each sample weighted with the product of the respective 
sample cross-section, the MLM matching efficiency and the efficiency of the generator-level filter. 
All but the highest jet multiplicity sample are exclusive, i.e. events are only kept if all jets with 
Pt > 20 GeV and 1 77 1 < 6 are matched to a matrix element partem. The highest multiplicity sample, 
Z + 5 partons, is inclusive; events with additional jets softer than the partons from the matrix element 
are not discarded. Thus, there can be more than 5 jets in this sample. The di-lepton mass is required to 
be larger then 40 GeV and lower th an 200 GeV. A generator-level filter requires one seeded-cone jet, 
with a radius of R = v / At] 2 + A0 2 = 0.4, with p T > 20 GeV and 1 77 | < 5.0, and two electrons/muons 
with p T > 10 GeV and | r\ \ < 2.7 in the event. 

For the comparison with the fixed-order theoretical predictions, the merged data sets are normal- 
ized to the NLO inclusive Z — ► e + e~ and Z — > jJ. + jJ.~ cross-sections. Because of the jet filter used 
in their generation, the fully-simulated data sets can not be used to derive the global normalization 
factor. For this purpose, we use additional Z — ► e + e~ and Z — >• }X + }X~ ALPGEN data sets which are 
produced with the same conditions, but without the generator-level filter applied. 

PYTHIA [11] signal and background samples are generated with version 6.323 (Z —>■ e + e~, 
Z — ► H + H~ , Z — ► TT, and W — ► ev) or 6.403 (tt, filtered QCD multi-jet) using the corresponding 
ATLAS underlying-event tune [2]. PYTHIA Z — > e + e~ and W — > ev events are preselected with 
a generator-level filter requiring one electron with pj > 10 GeV and \r\ \ < 2.7. The filter for the 
corresponding processes with muon final states requires one muon with pj > 5 GeV and |tj| < 2.8. 
PYTHIA Z — > TT events are generated with a filter requiring two electrons/muons with pr > 5 GeV 
and 1 77 1 < 2.8. For each of the Z — > U samples, the di-electron (di-muon, di-tau) mass is required to 
be larger than 60 GeV. The jet background for the electron channel is simulated with a PYTHIA QCD 
multi-jet sample with a minimum hard-scattering transverse momentum of 15 GeV. A generator-level 
filter requires a jet of pj = 17 GeV clustered in a narrow region of A77 /A<f> = 0.06, a size similar to an 
electron cluster. The QCD multi-jet background for the muon channel is estimated with a PYTHIA 
bb sample. Two muons with p T > 4 GeV and 6 GeV, respectively, are required in the final state. 

2.2 Corrections from Parton to Hadron Level 

Comparisons of data cross-section measurements and LO/NLO predictions are to be made at the 
hadron level (particle level). Hence, the MCFM predictions for the observables have to be corrected 
with respect to the non-perturbative effects resulting from jet fragmentation and from the underlying 
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event. The impact of the underlying event correction is to add energy to the MCFM jets, while the jet 
fragmentation correction subtracts energy. Both corrections are expected to decrease with increasing 
jet pj. The non-perturbative corrections are determined from the current ATLAS PYTHIA tune by 
comparing the multiplicity and the pj distribution of jets with a cone radius of 0.4 clustered on the 
final-state particles in Z — >• ]X + ]X^ Monte Carlo samples generated with PYTHIA 6.403 (a) using 
the standard ATLAS PYTHIA tune [2] and (b) with fragmentation and multiple-particle interactions 
switched off. To the extent to which the two partons that can comprise a jet in MCFM mimic the effects 
of the parton shower in PYTHIA, the corrections derived from the above procedure can be applied to 
the MCFM output [3]. For jets with cone radius 0.4 with pj > 40 GeV, the effects of fragmentation 
and underlying event cancel up to a residual correction at the percent level, which is then applied 
to the MCFM predictions. These corrections are expected to be re-done with the underlying event 
measurements determined from ATLAS data. 

3 Particle Identification and Trigger 

We adopt as much as possible definitions and cuts in common with the other analyses, and in particular 
with the inclusive W/Z study [1]. 

3.1 Particle Identification 

The electron candidates are required to have pj > 25 GeV, and to lie in the range \r\ \ < 2.4, excluding 
the barrel-to-endcap calorimeter crack region (1.37< |tj | <1.52). The electrons are required to fulfill 
the medium electron-identification signature [12], which consists of requirements on the calorimeter 
shower-shape and the matched track. The Z selection requires two electron candidates with an invari- 
ant mass of 81 < m ee < 101 GeV and AR > 0.2 between the electrons. No calorimeter isolation cuts 
are applied for this analysis, although they will be applied for actual data analysis. There is an implicit 
isolation cut, however, present in the trigger [13]. 

A muon candidate requires the combined reconstruction of an inner detector track and a track 
in the muon spectrometer [14]. Muons are required to have pr > 15 GeV and \t]\ < 2.4, with 
the range 1.2 < \r]\ < 1.3 being excluded. Isolation is applied by requiring the energy deposition 
in the calorimeter to be less than 15 GeV in a cone of AR = 0.2 around the extrapolation of the 
muon track. The Z selection requires that there be two muon candidates with an invariant mass of 
81 <m w < 101 GeV. 

For the analyses in this study, we use jets clustered with the standard ATLAS seeded-cone algo- 
rithm with a radius of R = 0.4, built from either calorimeter towers (Z —> e + e~ analysis) or topological 
clusters [15] (Z — ► jJ. + jJ.~ analysis), and calibrated to the hadron level. The lepton and jet candidates 
must be separated by ARij > 0.4. It is required that the jet transverse momentum be larger than 
40 GeV and that the jet be in the range \r}\ < 3.0. 

3.2 Trigger Selection 

The trigger selection used here is the same as that used in the inclusive analyses [1]. In the electron 
channel, Z — > e + e~ + jets events are required to pass the isolated di-electron trigger or the isolated 
single-electron trigger. In the muon channel, Z — > jJ. + jJ.~ + jets events are required to pass the isolated 
di-muon trigger. The trigger efficiencies at the first, second and event filter levels are evaluated as 
a function of the jet multiplicity. The trigger efficiency is also studied as a function of the overall 
hadronic activity, the pj of the leading jet and the Z transverse momentum. For this purpose, the 
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generated Monte Carlo information and the data driven tag-and-probe method are compared. Good 
agreement between the two methods is found. The efficiency for an electron to pass the isolated 
single-electron trigger is found to decrease with increasing jet multiplicity, pj of the leading jet and 
with decreasing distance to the closest jet. 

4 Measurement of Z + jets Cross-Sections 

We study the comparison of theory and measurement for quantities suited to compare with a fixed- 
order NLO calulation: the inclusive cross-section for Z — >• it with at least 1 jet, 2 jets and 3 jets and 
the differential cross-sections with respect to the p T of the leading and the next-to-leading jets. 

4.1 Lepton Reconstruction in a Multi-jet Environment 

The presence of additional jets in the event has an impact on the kinematics of both the leptons and 
jets: the leptons are more boosted (larger pj and lower A(j) between leptons) in events with jets and 
the distance between leptons and jets becomes smaller in high-multiplicity events. The average jet 
Pt increases with the number of jets. Due to the combination of the single electron and di-electron 
trigger channels used in this analysis and due to the boost of the electrons with large jet activity, the 
efficiency loss of the isolated electron triggers for large jet multiplicities has only a negligible impact. 
The total Z reconstruction efficiency (offline+trigger) is stable with respect to both the jet multiplicity 
and the transverse momentum of the leading jet. 

Muon reconstruction efficiencies and rejections for QCD multi-jet background are investigated 
for different isolation requirements. The isolation requirement for this analysis (see Section 3.1) is 
chosen such that it presents no significant bias for events with large jet multiplicities and large jet pj 
while at the same time providing a sufficiently large rejection for the QCD multi-jet background. 

4.2 Background Estimation 

For the evaluation of backgrounds to the Z — > e + e~ + jets signal we consider processes with real elec- 
trons (tt, W —>■ ev, Z —>■ T + T~) and QCD multi-jet production. Statistics of the multi-jet background 
sample are increased by applying a very loose electron selection and then reweighting the events with 
the rejection from the final electron identification cuts. For the Z — > jJ. + jJ.~ analysis the background 
is dominated by processes with real muons (tt, W —>■ /J.V, and QCD multi-jets). QCD multi-jet back- 
grounds for isolated highly-energetic muons result mainly from decays of bb mesons. We thus use a 
bb(— > jU + /0 sample to evaluate this background. For both analyses, all backgrounds are estimated 
from fully-simulated Monte Carlo samples, generated with PYTHIA. They are compared with the 
signal distributions, derived from the respective ALPGEN Z + jets data sets. 

Table 1 provides an overview of the accepted cross-section and the corresponding signal and 
background fractions within the selected event sample, for several jet multiplicities and for the electron 
and the muon channel respectively. The uncertainties displayed in the table are of statistical nature 
only. The total background is at the level of 5-15 % depending on the jet multplicity. With increasing 
jet multiplicity, tt replaces QCD multi-jet production as the dominant background source in both 
analysis channels. Due to the larger acceptance and the larger lepton reconstruction efficiency, we 
obtain more than twice as many signal events in the muon channel. Since the dominant (tt) background 
also contains two real leptons, the signal-to-background ratio is comparable in both analyses. 

Figures l(a+b) show the combined distribution of the di-muon mass and the jet multiplicity for 
events with at least one jet for signal and background events in the Z —> jJ. + jJ.^ + jets channel. Fig- 
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Process 


Z -> > ljet 
a (fb) | fraction (%) 


Z ££± > 2jets 
a (fb) | fraction (%) 


Z U+ > 3jets 
o (fb) | fraction (%) 






z — 


+jets analysis 






2 — > 

QCD jets 
tt 

W -^ev 
Z -► T+T- 


23520±145 
1545±89 
496±28 
(28±13) 
3.2±1.2 


91.9±0.8 
6.0±0.4 
1.9±0.1 
(0.1 ±0.05) 
0.01±0.01 


4894±45 
336±42 
333±23 
(5.9±2.6) 

(0.67±0.25) 


87.9±1.3 
6.0±0.8 
6.0±0.4 
(0.1 ±0.05) 

(0.01±0.01) 


900±15 
78±20 
146±15 
(1.1±0.5) 
(0.1 ±0.05) 


80.0±2.4 
6.9±1.8 
13.0±1.4 
(0.1 ±0.05) 
(0.01±0.01) 








jU + ;U +jets analysis 






Z^Ai+Ai- 
QCD(66) 
tt 


59400±650 
1230±550 
1140±110 
Oil 80 


96.2±1.0 
2.0±0.9 
1.8±1.8 
0.0±0.3 


12600±300 
600±300 
790±90 
0±30 


90.1±1.9 
4.3±2.2 
5.7±0.2 
0.0±0.2 


2450±100 
0±110 
275±50 
0±5 


89.7±3.7 
0.0±4.0 

10.2±1.9 
0.0±0.2 



Table 1: The accepted cross-sections (a, in fb) and the corresponding fraction of the total sample 
(in %) for signal and for the background channels in the Z — »■ e + e~ ± jets and the Z — > jJ. + jJ.~ + jets 
analyses, after applying the cuts outlined in Section 3. The numbers in brackets are extrapolated from 
results obtained for a lower jet multiplicity. Errors shown are statistical only. 



ures l(c+d) show the distribution of signal and backgrounds for the p T of the leading and the next- 
to-leading jet in the Z —> e + e~ ± jets channel. The jets from the QCD multi-jet background have a 
similar pj distribution as the jets from the signal events, while the jets from the tt background tend 
to be harder. 

4.2.1 Background Subtraction 

The Z — ► t + t~, tt and W — > ev backgrounds are subtracted using the Monte Carlo estimates. The 
systematic uncertainty from the limited background statistics is propagated into the systematic uncer- 
tainty of the cross-section measurement. Special care will be needed in validating against data the 
differential cross-section for the tt process, since it is the dominant background for large jet multiplic- 
ities. The QCD multi-jet background is expected to be determined with data-driven methods. From 
the simulations we expect a multi-jet background fraction independent of the jet pj such that it can 
be subtracted by applying a global factor. We assume in the following an uncertainty of 20% on the 
measurement of the QCD multi-jet fraction. The error is propagated into the systematic error on the 
measured cross-section. 

4.3 Unfolding of Detector Effects 

The reconstructed data have to be unfolded from the detector level to the hadron level, correcting for 
efficiency, resolution and non-linearities in electron and jet reconstruction. In this study, the individual 
unfolding corrections are assumed to factorize in leading approximation, and the individual contribu- 
tions are investigated and corrected for separately. The corrections are detailed in the following for 
the case of the Z — ► e + e~ channel. Unfolding of the Z — > jl + ix~ final state is done in a similar way. 
All corrections are derived with fully-simulated ALPGEN Monte Carlo samples. 

The dominant correction on the inclusive cross-section for the Z —> e + e~ channel stems from the 
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(a) (b) 




P T leading jet (GeV) P T 2nd leading jet (GeV) 

Figure 1: The distribution of the di-muon mass (a) and the inclusive jet multiplicity (b) for signal 
and backgrounds in the muon channel. In order to provide higher statistics for the background deter- 
mination, background events in an invariant mass window of 51 — 131 GeV, are scaled down for an 
invariant mass window of 81 — 101 GeV. Also shown is the distribution of pj of the leading (c) and 
the next-to-leading (d) jet in the electron channel for / Ldt = 1 fb _1 . The vertical lines in (a), (c) and 
(d) indicate the kinematic cuts applied in the analysis. 



electron reconstruction. For each of the two electrons, the cross-section is corrected for the electron 
reconstruction efficiency, given as a function of the electron pseudo-rapidity and transverse momen- 
tum. The cross-section is also corrected for the trigger efficiency with respect to the offline selection. 
Corrections from jet reconstruction have a comparably small impact on the overall cross-section but 
bias the jet pj spectrum since, in general, the detector effects are greater for low pj jets. The recon- 
structed jet pj is corrected for the non-linearity of the jet energy scale, and for each jet in the required 
selection, the cross-section is corrected for the reconstruction efficiency and for the effect of the jet 
energy resolution. The uncertainties on deriving these corrections, stemming from the limited Monte 
Carlo statistics, are taken into account as systematic uncertainties on the cross-section measurement. 

Figure 2 compares the distributions of the pj of the leading jet and the next-to-leading jet, in differ- 
ent unfolding stages, with the pj distribution of the original (hadron-level) Monte Carlo jets. Within 
the statistical and systematic errors, the pj distributions of the Monte Carlo jets and the corrected 
reconstructed jets are in agreement, thus providing a consistency check for the unfolding corrections. 
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Figure 2: Comparison of the distribution of the pj of the leading jet (a) and the next-to-leading jet 
(b) for the generated (hadron-level) Monte Carlo and for the reconstructed quantities without any cor- 
rection, after the corrections for electron triggering and reconstruction and after applying in addition 
the jet-releated corrections. 



4.4 Comparison of Event Generator and MCFM Predictions at the Hadron Level 

One of the goals of our study is to evaluate the statistical and systematic precision of the Z + jets 
cross-section measurement and to compare this precision with the size of the uncertainties that are 
expected in the first inverse femotbarn of data. Since this study deals with the measurements from 
the first data, we compare the precision of the measurement with the differences in the predictions of 
our LO and NLO QCD calculations and with the predictions from matrix element and parton-shower 
generators. 

4.4.1 Generator Comparisons 

In this section we compare the prediction for the inclusive jet cross-section from the generators 
PYTHIA and ALPGEN with those from the MCFM partonic level event generator. In order to sep- 
arate the reconstruction from the generation effects we use only Monte Carlo hadron-level generator 
information. 

Figures 3(a)-(c) show the comparison of the distribution of the jet multiplicities and the pj of 
the leading and next-to leading jets for ALPGEN and PYTHIA Z — ► fl~ + jets samples with the 
NLO (LO) calculations from MCFM. The errors on the generator distributions are purely Monte Carlo 
statistics whereas the errors on the MCFM cross-section correspond to the PDF uncertainties and to 
the error from the unfolding to the hadron level. MCFM predictions are corrected to the hadron level 
as specified in section 2.2. The two Monte Carlo samples are normalized to the inclusive NLO Z 
cross-section, as determined in MCFM. 

The NLO MCFM predictions for the Z + 1 jet and Z + 2 jets cross-sections are, in general, greater 
than the LO predictions by 20 to 30%. PYTHIA predicts a larger Z + 1 jet cross-section than ALP- 
GEN, but also predicts a lower average jet multiplicity. Both Monte Carlo generators predict a lower 
cross-section than the NLO MCFM calculation for final states with more then one jet. The differ- 
ence between the predictions of PYTHIA and ALPGEN, and between both generators and MCFM, 
amounts to 10-60% depending on the jet multiplicity. A comparison of the differential cross-section 
as a function of the jet pj indicates that the inclusive cross-sections shown in Figure 3(a) depend very 
much on the minimum jet pj required by the selection. PYTHIA predicts larger cross-sections than 
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Figure 3: Comparison of the inclusive jet cross-section (a) and the pj of the leading jet (b) and of the 
next-to-leading jet (c) for the Z -> Ll + Li~ + jets channel from PYTHIA and ALPGEN Monte Carlo 
with NLO (LO) MCFM predictions. The MCFM predictions have been corrected to the hadron level. 



even NLO MCFM for low jet pj. But, while the shape of the jet pj distribution predicted by ALP- 
GEN agrees well with the NLO MCFM prediction, PYTHIA generates a clearly softer pj spectrum, 
as expected. 

4.4.2 Statistical and Systematic Errors 

In order to determine the expected precision of the analysis, the cross-section measurement is per- 
formed on the fully-simulated ALPGEN Z + jets data sets, which are corrected to the hadron level, 
following the prescription of Section 4.3. Systematic errors from the corrections are included. In the 
next step we evaluate the impact of the uncertainties expected for real ATLAS data taking. ATLAS ex- 
pects a limited precision of the jet energy scale in the first years, starting from uncertainties at the level 
of 10% and converging eventually towards 1%. We obtain two benchmark scenarios by propagating 
jet energy scale uncertainties of 5% and 10% into the measured cross-sections. Several backgrounds 
will be estimated with data-driven methods, introducing additional systematic errors. We account for 
that in a first approach by adding an error of 20% on the fraction of the multi-jet background for each 
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jet multiplicity. The statistical uncertainties in the samples are scaled to the number of events expected 
to be selected for an integrated luminosity of 1 fb -1 . 

Figure 4 compares, for the Z — > e + e~ + jets channel, the inclusive jet multiplicity (a) and the pj 
of the leading jet (b) for MCFM and the fully-simulated corrected ALPGEN sample. The errors on 
the MCFM predictions result from the PDF uncertainty and from the errors from the correction for 
the non-perturbative effects. The errors on the ALPGEN Monte Carlo data include all the statistical 
and systematic uncertainties described in this section with the jet energy scale uncertainty set to 5%. 

An additional systematic uncertainty is introduced on the unfolding correction for the jet resolu- 
tion due to the uncertainty on the jet resolution measurement and to the uncertainty on the shape of 
the pj distribution which we use to derive the corrections. Using corrections from different event 
generators and varying the jet resolution within its uncertainty results in a systematic error on the 
cross section at the percent level. 




0 1 2 3 4 5 50 100 150 200 250 300 

> N jets (P T (jet)>40 GeV) p t , leading jet (GeV) 



Figure 4: The inclusive jet cross-section (a) and the distribution of the pj of the leading jet (b), 
as predicted by NLO (LO) MCFM (corrected to the hadron-level) and by ALPGEN for the Z -> 
e + e~ +jets process. 



The uncertainty on the theoretical predictions and on the measured cross section, as shown in 
Figure 4, are propagated on the data/theory ratio. Figure 5 shows the resulting uncertainty on a ratio 
of 1 for the inclusive cross-section and for the pj of the leading jet. The systematic uncertainty on the 
inclusive cross-section from a jet energy scale uncertainty of 5% is twice as large as as the sum of all 
the other statistical and systematic uncertainties. In this case, the overall precision on the data/theory 
ratio expected with the first fb -1 of data is at the level of 8-15% for topologies with 1-3 jets. A 
jet energy scale uncertainty of 10% results in the dominant error on the cross-section. In this case, 
the total uncertainty on the cross section is at the level of 15-30%, which is at the same order as the 
typical differences expected between LO and NLO predictions, or between predictions from PYTHIA, 
ALPGEN and MCFM. Statistical limitations become sizable for large jet p T (> 200 GeV). 

5 Conclusions 

Final states containing Z + jets will serve as one of the Standard Model benchmarks for physics 
analyses at the LHC. We have simulated cross-section measurements for theoretically well-defined 
quantities such as the inclusive Z + jets cross-section, and the jet transverse momentum for the leading 
and next- to-leading jets. An unfolding technique from the detector to the hadron level has been 
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Figure 5: Uncertainty on the ratio of measurement and theory for the inclusive jet cross-section (a) 
and the pj of the leading jet (b) for the Z — »• e + e~ + jets process. 



developed, and results are presented at the hadron level. Theoretical corrections from parton to hadron 
level, necessary for comparisons of data to parton level predictions, are determined. 

The main background sources are found to be QCD multi-jet processes for low jet multiplicities 
and tt for large jet multiplicities and amount to the level of 5-20%, depending on the jet multiplic- 
ity. Predictions from the Monte Carlo generators PYTHIA and ALPGEN have been compared with 
MCFM NLO (LO) calculations. The inclusive cross-section predictions differ by 10-60%, with larger 
discrepancies for the PYTHIA parton shower prediction (with respect to MCFM and ALPGEN) with 
increasing jet pj. Statistical and systematic uncertainties on the ratio data/theory have been deter- 
mined. A jet energy scale uncertainty of 5% would be the dominant systematic uncertainty on the 
measured cross section, resulting in a total uncertainty of 8-15% for final states with 1-3 jets. A 
jet energy scale uncertainty of 10% results in an overall precision at the level of 15-30% which is 
at the same order as the typical differences expected between LO and NLO predictions or between 
predictions from PYTHIA, ALPGEN and MCFM. 
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Abstract 

We present new methods for measuring the W mass at ATLAS, and show 
their performance on simulated data. The experimental systematic uncer- 
taintiess and their impact on the mw measurement are evaluated, with sam- 
ples downscaled to 15 pb _1 . The electron transverse momentum analysis 
yields a precision of 8mw = 120 (stat) © 117 (syst) MeV. The systematic 
uncertainty is dominated by the energy scale. In the muon channel, the 
transverse mass analysis gives Smw = 57 (stat) © 231 (syst) MeV, where 
the dominant contribution comes from the recoil calibration. PDF uncer- 
tainties contributes Smw = 25 MeV. Other theoretical uncertainties were 
not explicitely considered, but expected to be small in comparison to exper- 
imental uncertainties in early mw measurements. 

1 Introduction to the W mass measurement at the LHC 

The mass of the W boson is currently measured to be mw = 80.399 ±0.025 GeV [1]. Since the W 
boson mass and the top quark mass are the largest sources of uncertainty in the indirect determination 
of the Higgs boson mass, improved precision is desirable. The expected total inclusive W cross sec- 
tion at the LHC is about 20.5 nb for each lepton channel [2]. In 10 ftr 1 of data, around 30 x 10 6 W 
events will be selected in each leptonic decay channel (W — > ev,juv), providing a combined statistical 
sensitivity of about 2 MeV. Systematic errors will need to be controlled to comparable precision. 

The present note is however concerned with the early ATLAS data (Jz? ~ 10 — 20 pb _1 ), with an 
expected statistical precision of about 100 MeV. In this context, the main sources of systematic uncer- 
tainty are of experimental origin (energy and momentum scales, resolution, efficiency). Other sources, 
related to the theoretical description of W production, play a less significant role. 

The aim of the following is to establish an unbiased W mass fit with templates, demonstrate that using 
Z events for calibration is valid, and show that the experimental sources of systematic uncertainties 
can be controlled to the level of the statistical sensitivity. In doing so, we test the readiness for the W 
mass measurement in the first years of data taking. 

This note is structured as follows. First a general discussion in Section 2 gives the outline of the 
analysis, the ingredients involved in a W mass measurement, and the challenges in controlling them. 
Template based W mass fits are presented in Sections 3 to 5, exploiting the electron and muon chan- 
nels. In a first step, all ingredients entering the templates are supposed perfectly known. We then 
evaluate the dependence of the fit results on each of them. Section 6 describes the calibration of the 
absolute lepton energy scale and resolution using Z decays. This section also discusses the effect of 
the lepton reconstruction efficiencies. Background uncertainties are treated in Section 7. Section 8 
quantifies residual systematic uncertainties after in situ calibration. Finally, Section 9 summarizes the 
analysis and concludes. 
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2 Outline and strategy of the analysis 
2.1 Simulation and data sets 

The simulated W and Z samples on which this note is based are generated using the PYTHIA event gen- 
erator [3]. Photon radiation is earned out by PHOTOS [4], and X decays are handled by TAUOLA [5]. 
Detector simulation is done using GEANT4 version 4.0 [6], and reconstruction with Athena ver- 
sion 12.0.6. These fully simulated events are reconstructed using the ATLAS software, and used 
as real data ("pseudo-data"). For the production of templates, we use the generator-level information 
only, together with estimated detector smearing corrections. Simulated statistics of our main signal 
samples are shown in Table 1. Our W signal samples correspond to roughly 15 pb -1 . In the following, 
all results will be normalized to this luminosity, i.e. downscaling results from the larger Z samples. 



Channel 


Nb. of events 


Cross section [pb] 


£ of filter 


Corresponding JS? [pb 1 ] 


W -^ev 


170143 


20510 


0.624 


13.3 




189903 


20510 


0.686 


13.5 


Z — > ee 


377745 


2015 


0.857 


218.7 




150650 


2015 


0.896 


83.4 



Table 1: Number of events, cross sections (at NNLO [2]), and corresponding luminosity of the simu- 
lated W and Z signal samples used in the analysis. 



2.2 Event selection 

Since the dijet cross section at hadron colliders is several orders of magnitude larger than the W bo- 
son cross section, the hadronic decay modes of W and Z bosons are not usable. Therefore, only the 
leptonic decay modes W —> i\ and Z — ► it (£ = e,n) are considered. While the W and Z bosons are 
produced with a small transverse momentum on average (see Figure 2(a)), their longitudinal boost can 
be large. However, this boost leaves the lepton transverse momenta unchanged, which subsequently 
is a good distribution to study W boson decays. 

W events are required to have one isolated lepton with pj above 20 GeV and missing transverse energy 
(#t) in excess of 20 GeV. Z events are required to have two isolated leptons with pj above 20 GeV 
of opposite charge (see Figure 1). The triggers providing these events are an isolated 15 GeV electron 
trigger and a 20 GeV muon trigger. The electrons are required to pass tight identification criterion [7], 
and only combined muons (with reconstructed tracks in both the inner detector and the muon spec- 
trometer [8]) are used. Both electrons and muons are required to lie within the tracking range |tj | < 2.5 
(see Figure 2(b)). In addition, the calorimeter barrel-endcap transition range 1.3 < \r\ \ < 1.6 is ex- 
cluded for electrons. 

In addition to this basic selection, some other requirements apply. To reject backgrounds from jet and 
it events, the signal events are required not to have large hadronic transverse activity. A summary of 
the requirements can be found in Table 2. 

The expected numbers of events in 15 pb -1 from the above mentioned event selection are summarized 
in Table 3. Though the expected number of reconstructed Z events is an order of magnitude smaller 
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Figure 1 : Transverse view of a W — > i\ (a) and aZ^ 11 (b) event. The combined transverse momen- 
tum of the recoil u, which should match that of the boson, is used to estimate the momentum of the 
undetected neutrino in the W — > Iv decay. The dotted ellipses represent the uncertainties. 



Requirement 


W^ev W->nv 


Reconstructed lepton 


p T >20GeV, \r\\ < 2.5 p T >20GeV, |tj| < 2.5 


Isolation 


E T 0De /E T < 0.2 


Missing energy 


$ T > 20 GeV $ T > 20 GeV 


Crack region 


Remove 1.30 < |t]| < 1.60 


Recoil momentum 


p T < 50 GeV 


Requirement 


Z^ee Z^\l\i 


Reconstructed leptons 


p T > 20 GeV, |tj| < 2.5 p T > 20 GeV, |t]| < 2.5 


Isolation 


E T one /E T < 0.2 


Crack region 


Remove 1.30 < |t]| < 1.60 


Recoil momentum 


p T < 50 GeV 



Table 2: Selection criteria for the W and Z decays. See text for details. 
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Figure 2: (a) Reconstructed transverse momentum of W and Z bosons. The larger resolution in the 
W events causes the wider peak at low pj, while at higher pj the Z spectrum slightly dominates, (b) 
Distribution in X] of reconstructed muons from W and Z events. 



than that of W events, the fact that Z events are fully reconstructed and thus have much better mass 
resolution compensates for this deficit. 



Channel 


W -^ev 


W ^juv 


Z^ee 




Detector acceptance [%] 


44.3 


45.4 


42.4 


39.9 


Reconstruction efficiency [%] 


21.7 


39.1 


10.4 


33.4 


Nb. of events for 15 pb~ ! [10 3 events] 


66.7 


120.2 


3.2 


10.1 



Table 3: Acceptances, total reconstruction efficiencies, and resulting statistics for 15 pb _1 of data. 
The W cross sections are inclusive, while the Z cross sections are for invariant masses above 60 GeV. 
Both are at NNLO and contain the relevant branching fractions. The acceptance is the fraction of 
events which lies within the detector acceptance, while the total reconstruction efficiency is the overall 
efficiency for an event to pass all selection criteria including the acceptance. 



2.3 Input to W mass fit 

While the Z decay can be fully reconstructed, and its mass calculated from the invariant mass of 
the decay leptons, this is not the case for W decays, where the neutrino goes undetected. From the 
momentum imbalance one can infer the missing energy, but with limited precision and only in the 
transverse direction. This means that the invariant mass can not be determined, and one is forced to 
consider other variables sensitive to the W mass. In principle there are three sensitive variables: 

• The lepton transverse momentum, p ( T . 

• The missing transverse momentum, pj = If.?. 

• The W transverse mass, defined as: nij = yJlpjPj{ \ — cos(0 £ — <j) v )). 

The lepton transverse momentum is measured with an accuracy of about 2% for electrons and muons 
[7,8] in the momentum range of interest (see Section 6.2). This is an order of magnitude better com- 
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pared to the accuracy of the missing transverse energy determination, which has a resolution of about 
20-30% [9]. Finally, the W transverse mass combines the two momenta along with the azimuthal 
angle between them. 

All of the above distributions have a Jacobian peak either at my/ 12 (p T and p T ) or mw (nij), which is 
sensitive to the W mass. The sharpness of the peak is affected both by the resolution and the boson 
pj. While the lepton pj has a very good resolution, the pj of the boson smears this Jacobian edge. 
On the contrary, is to first order insensitive to the pj of the boson, but here the edge is smeared 
by the poor resolution of the missing transverse energy (see Figure 3). Finally, p T suffers from both 
effects, and is therefore the poorest candidate for a fitting variable. Since mj' is formed from p e T and 
p T , it is of course statistically con-elated with p T . However, the statistical correlation between and 
p l T is only about 30%, and since they have different systematic errors, combining the measurements 
based on these observables could improve the sensitivity. 




P T [GeV] m T [GeV] 



Figure 3: Fitted distributions of p e T (a) and (b), showing the Jacobian peak, and the effects of 
finite detector resolution (i.e. smearing) and recoil (i.e. pj of the W). While p l T is more sensitive to 
the recoil than to the resolution, the converse is true for the distribution. 



2.4 Fitting the W mass with templates 

The lepton transverse momentum and W transverse mass distributions, p T and shown in Fig- 
ure 3, are the result of several non trivial effects. For this reason no analytical expression describes 
the distributions in detail, and one is forced to use numerical methods. One method for fitting these 
distributions is template fitting [10, 11]. Templates of the p l T and rrij distributions are produced with 
varying mw values, and compared to the corresponding distribution observed in data (see Figure 6). 
The comparison is based on a binned % 2 method. 

To estimate the impact of a given effect on the W mass determination, templates unaware of the effect 
under consideration are produced and subsequently fitted to data, which includes this effect. Assuming 
an unbiased fit, when the effect is not included in the data (see Sections 3 and 4), the resulting shift 
in fit value measures the systematic error on the W mass from not including the effect. By gradually 
changing the size of an effect, the systematic error on the W mass as a function of this effect can be 
determined. As most effects are small, the dependencies are approximately linear. They are in general 
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different for the p e T and rrij fits. If an effect can be characterized by one parameter a, the systematic 
eiTors (dmw) can be calculated from the derivative dmw/da times the size of the uncertainty da. If 
more parameters are required, the systematic uncertainty is calculated from all parameters a, and their 
co variances Cov, j. 



dmw c 5. 2 v dmw dmw „ 

Omw = oa (Single parameter) Omw = > ^ , (Multiparameter) 

da "ri da{ daj 

In the following, the da are to be understood as relative uncertainties, and the derivatives are with 
respect to these relative uncertainties. Our results for dmw/da are normalized in MeV/%. 



2.5 Calibration procedure 

The calibration of the absolute energy /momentum 1 lepton scale plays a central role, as it is the largest 
systematic uncertainty and the starting point of all other calibrations. 



2.5.1 Average calibration 

To first order, a single average lepton scale factor, defined as Ofc = E KC /E tme , independently of r\ 
and pr, can be obtained by demanding that the reconstructed Z peak matches its known mass. While 
this assures the correct lepton scale and resolution for Z events, the energy scale obtained in this way 
might not apply to W events. Because of non linearities and non uniformities, the different pj and r\ 
distributions in W and Z events can possibly introduce significant bias. 



2.5.2 Differential calibration 

If needed, an upgrade to a differential calibration can be performed, which contrary to the average 
calibration includes variations of scale and resolution with energy/momentum, 77, and/or (f>. The three 
key ingredients to such a calibration are the precise knowledge of the Z mass, width, and decay kine- 
matics, the non zero transverse momentum of the Z bosons, and the very large sample of Z bosons 
produced at the LHC. 

The calibration uses a large sample of reconstructed Z — > ii events along with corresponding tem- 
plates, representing our knowledge of the Z lineshape, which we assume known. Through a compar- 
ison of the two in bins of the variables of interest (lepton pj, r\, and 0) one can extract the scale and 
resolution in each of these bins [12]. 

Once the absolute scale is set for the leptons, the hadronic recoil scale can be determined by com- 
paring it to the transverse momentum of the leptons from the Z. Again, this calibration can be done 
differentially, measuring the missing momentum response as a function of pj (Z) , the recoil size, and 
the hadronic transverse activity (y^£'J. adloIls ). While this calibration is naturally not as accurate as that 
of the lepton scale, due to the poorer resolution of the hadronic recoil, it fortunately turns out that the 
mass fit is less sensitive to the recoil scale (see Section 4.3). 

'These two terms cover the same aspect, but will generally be used about electrons and muons, respectively. Angular 
uncertainties are omitted throughout this note, as they play an insignificant role. 
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2.5.3 Efficiency determination 

The efficiency is determined using the "tag and probe" methods [13], again applied to Z —> it events. 
The different spectra in W and Z events is accounted for by determining the efficiency as a function 
of pr and Tj, which has been found to yield efficiencies compatible between W and Z [7, 8]. 

The following sections attempts to quantify the above outline. 

3 Fitting the W mass with templates - electron channel 

3.1 Modelling templates for W mass fit 

As no analytical expression matches the lepton transverse momentum and W transverse mass distri- 
butions p ( j and rrij, we fit the W mass using the template method (see Section 2.4). For the templates 
of varying W mass to match the measured distribution well, all effects influencing these distributions 
must be included when producing the templates. The principle influences are scale, resolution, non 
gaussian tails, efficiency and background effects, which are (with the exception of background) pri- 
marily obtained from the similar but more constrained Z — > il events, as discussed in Section 6. 

Two assumptions have to be validated first, namely the lack of bias of the fit in itself, and the porta- 
bility of the calibration from the Z to the W. The lack of bias is tested by assuming perfectly known 
physics and detector response. In practice, the detector response is determined at this stage from direct 
comparisons of the lepton reconstruction to the generator level kinematics, using the W sample that 
is used in the mass fits. Then the fit is repeated using templates with the detector response estimated 
from the Z sample, still comparing reconstructed to simulated kinematics. An unbiased result vali- 
dates the portability, i.e. that detector parameters can indeed be ported from Z to W events, justifying 
an in situ determination of these parameters using Z events. 

In addition it has to be tested that the template components can be included without biasing the fit, 
and thus that a subsequent calibration, which matches the truth, will yield unbiased templates. This is 
tested in the following. The statistical sensitivity of our W — > ev sample, corresponding to 15 pb -1 of 
data, is about 120 MeV, and we provide an estimate of the required precision on the detector response 
parameters to keep the systematic uncertainty within this limit. 

3.2 Fits to myy using templates: Validation of the method 

In this section, the detector parameters are determined from fits to E rec /E true , the ratio of reconstructed 
to true energy of the decay electrons. The fits are done using the so-called "Crystal Ball" PDF [14], 
which aims at describing the result of calorimetric resolution together with upstream energy loss in 
a single function. It has four parameters and displays a gaussian core, and a power-law tail at low 
energy. Its expression is, up to normalization factors: 



where x = Erec/^true, 0Ce is the position of the peak, Oe the gaussian width; n gives, in units of Oe, the 
point of transition between the gaussian and power-law descriptions, and j8 is the exponent control- 
ling the tails. The relative normalization of the two components preserves continuity at Gfc — uOe, up 



CB(x) 




(1) 
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EreJE true ....... . . . ^ /E ^ 



Figure 4: Examples of detector response functions fitted to E rec /E tme , for 30 < p e T < 40 GeV. From 
upper left to lower right: 0.4 < |tj| < 0.5 (a), 0.8 < \r] \ < 0.9 (b), 1.3 < \r\\ < 1.4(c), and 1.9 < |rj| < 
2.0 (d). 



to the first derivative. While not fully satisfactory from a theoretical point of view (the combination 
of resolution effects and radiation should in principle be given by a proper convolution), it is very 
effective in describing the observed response. 

The fits are performed vs. T\ and pj. The angular range 0 < |tj| < 2.5 is divided in intervals of size 
Arj =0.1. In each interval, fits are done for 10 < pr < 70 GeV, in intervals Apr = 10 GeV. Figure 4 
shows a number of examples, at different values of X] and pj. The X] dependence of the parameters, 
for 30 < pr < 40 GeV, are displayed in Figure 5, where the shaded regions correspond to the excluded 
1 7] | region described in Section 2.2. 

In our fits the /3 parameter was constrained to the range 0 < j3 < 5. As the examples in Figure 4 
illustrate, the j3 parameter appears to systematically choose values close to its upper bound, while sat- 
isfactory fits are still obtained. Therefore, we fix /3 = 5 in the remaining of the analysis, and treat the 
response functions in terms of Gfc, Oe and n only. The pj spectrum templates are produced from gen- 
erator level W — > ev events, where the electrons are smeared using the above function and parameters 
according to their kinematic variables. Three example template distributions are shown in Figure 6(a), 
corresponding to three values of mw- The number of events used to produce the templates is increased 
by repeatedly smearing generator level particles. Although this limits the impact of statistical fluctu- 
ations in the templates on the result, template fluctuations are still visible. Distributions on Figures 6 
are plotted on a wide range to assure that the entire pj spectrum is under control. The fitting range 
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is chosen to be between 30 and 60 GeV to avoid edge effects and to reject backgrounds (see Section 7). 

The mass fit is performed using binned % 2 comparisons between the pseudo-data and the template 
histograms. Given that all pj bins contain at least several hundred events, the x 2 of a given compari- 
son can be defined as: 

2 _ V" 1 ( n i,data ~ n i,template) 2 
1=1 °i,data + ° template 

where the sum is over the histogram bins, and n and (7 are the bin contents and their errors, re- 
spectively. Computed as a function of a m = mw/m 1 ™, the X 2 follows the parabola illustrated in 
Figure 7, which can be used to determine the m^f and its error. The obtained parabola is satisfactory 
despite the fluctuations of the x 2 points with respect to the fitted curve, which are due to the finite 
statistics of the templates. We obtain mt^ = 80.468 ±0.1 17 GeV, to be compared to the input value 
m 1 ™ = 80.405 GeV. The best fit template is shown in Figure 6(b). The stability of this result is ver- 
ified by repeating this exercise a number of times, with the detector smearing applied independently 
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160. 



1.01 



Figure 7: % 2 vs. a m = mw/m'™, for the comparisons of pseudo-data and templates described in the 
text. 



in each exercise (i.e, producing independent sets of templates). The distribution of has a spread 
well compatible with the estimated fit uncertainty. 

We thus conclude that within the statistical sensitivity of the W —> ev sample, the current procedure 
provides an unbiased estimate of m^. We now proceed to relax our main assumptions and quantify 
the dependence of the fit on the detector response parameters. 

3.3 Sensitivity of to the template components 

This section quantifies the stability of inff under variations of the assumptions used to produce the 
templates. As stated earlier, we leave phenomenological considerations aside and concentrate on 
experimental effects. We study explicitly the effect of non gaussian tails in the detector response 
function, reconstruction and identification efficiency, and backgrounds. The dependence of on 
the detector scale and resolution was studied in [15], and reviewed here. The bias 8mw = niyj — mw 
as a function of the fractional error on the lepton scale (cfe) and resolution (erg) is found to be: 



The impact of non gaussian tails is studied as follows. Starting from the detector response parametriza- 
tion decribed in Section 3.2, we suppress the tails of the distribution and assume a pure gaussian 
response. The parameters describing scale and resolution are kept to their previous value. We then 
produce templates and perform the fit as above. The procedure is illustrated in Figure 8. The response 
distribution can be compared to Figure 4 to assess the impact of neglecting the non gaussian part of 
the distribution. As can be seen, the corresponding templates are biased towards higher pj ; we thus 
anticipate, as otherwise expected, that an underestimation of the tails should imply a negative dmw- 

We obtain a bias 8mw = —555 MeV, corresponding to an underestimation of the non gaussian tails 
by 100%. Denoting X the non gaussian fraction of the response function, we thus estimate the bias as 
a function of the relative error on the tails as: 



dm w /da E = 800 MeV/%, dm w /da E = 0.8 MeV/%. 



(3) 



dm w /dz = -5.5MeV/%. 



(4) 
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Figure 9: Electron reconstruction efficiency as a function of pj, for different regions in r\. 



Distortions in the pj distribution can also be caused by the lepton reconstruction efficiency, as soon 
at it has a non trivial pj dependence, i.e. Ee = £i>{pt)- This is the case in the electron channel, as 
illustrated in Figure 9. 

As above, we quantify the impact of this pj dependence by taking the pseudo-data as they are, but 
assuming a flat efficiency in the templates. Since Ei(pr) is an increasing function of pj, we expect 
that the templates will be biased towards lower pj values, inducing a positive shift in m§j . Perform- 
ing the mass fit indeed yields Smw = 360 MeV (this bias corresponds to a perfectly flat efficiency 
assumption). We estimate the bias per percent relative error on the pj dependence of Eg to be: 

dm w /d£ { = 3.6 MeV/%. (5) 

Note that the present analysis only relies on the pj dependence of £<? and not on its absolute value. 
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Figure 10: Ratio of the fitted values of a£, Oe and n, between W and Z events. Each histogram 
represents the r\ dependence of the parameter Cfe (a), Oe (b) and n (c), for 30 < /?f- < 40 GeV. 



3.4 Comparison of W and Z events 

Before explicitly calibrating detector parameters from Z events and applying them in the mw fit, we 
verify that this procedure is indeed justified. To this end, we perform the detector response fits as 
described in Section 3.2 on our Z — > ee sample, obtaining a map of the response parameters Gfc, Oe 
and n as a function of X] and pj of the electrons. 

A first check is to compare the obtained values to those extracted from the W sample. This is illustrated 
in Figure 10. For all parameters, agreement is found within the statistical sensitivity throughout the 
analysed electron phase space, except for the resolution parameter in the shaded X] region, excluded in 
the study as explained in Section 2.2. We thus expect that templates produced using detector response 
to Z events will provide an adequate description of W events. 

A mw fit is performed next. Templates are produced from generator level W — ► ev events, smeared 
according to detector performance found on Z events. The resulting distributions are shown in Fig- 
ure 11(a) together with their ratio in Figure 11(b); good agreement is observed. Fitted with a straight 
line, Figure 1 1(b) shows a slope of (3 ± 2) 10~ 4 , compatible with 0, but which yields a small bias to- 
wards higher masses. Accordingly, the result of the fit is = 80.567 ± 0. 1 18 GeV, to be compared 
to ruff = 80.468 ±0.117 GeV obtained with detector performance found on W events. The result is 
compatible with the input value m' w lie = 80.405 GeV. 



4 Fitting the W mass with templates - muon channel 

This section repeats the discussion of Section 3 for the muons channel. In addition, a template fit of 
the nij distribution is described. 

4.1 Template fits to the muon transverse momentum 

Repeating the analysis from Section 3 for the muon channel yields niyj = 80.538 ±0.106 GeV, ob- 
tained with detector performance found on W events, and = 80.508 ± 0.106 GeV, with detector 
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P T [GeV] P T [GeV] 

Figure 11: (a) Templates obtained for a m = mw/m'™ = 1, with detector performance obtained from 
W events (full line) and from Z events (dashes), (b) Ratio of the previous histograms, fitted with a 
straight line. 



performance found on Z events. The differences with the electron channel are described in the fol- 
lowing. 

The dependence of the template fit on the scale and resolution is the same in the muon channel as in 
the electron channel. The muon momentum resolution is generally slightly worse, and whereas the 
electron resolution improves with pj, the converse is true for the muon resolution. Figure 12 shows 
four examples of the momentum ratio distributions py c /p'™ e for muons - two from W events and 
two from Z events. As can be seen, the shapes can be modeled well with a core gaussian distribution 
describing the general muon bias and resolution complemented by an outlier gaussian distribution 
accounting for the muons, which are encountering parts of the detector with poor muon spectrometer 
coverage and increased material, resulting in a slightly degraded resolution. 

To check the portability, the fitted constants are again compared between the W and the Z events, as 
can be seen in Figure 13. The correspondance between the fitted parameters is satisfactory. According 
to Figures 13(b) and 13(d), the resolution parameters are systematically smaller in W events. The dif- 
ference between the resolutions in W and Z events averaged over T7 is 3.6%, which combined to Eq. 3 
leads to a bias of 3 MeV. As will be seen in Section 6, this bias can be neglected given the precision 
of the in situ scale and resolution determination. 

Unlike the electron case, the muon reconstruction efficiency does not vary significanly over the mo- 
mentum range of interest. As can be seen from Figure 14, the efficiency is quite constant above 
10 GeV, varying only slightly between barrel (e = 95.8%) and endcap (e = 94.3%) region, due to 
increase in material. As the reconstructed muons are required to have a momentum above 20 GeV, 
the efficiency is essentially constant. 

The flatness of the efficiency is not expected to change until at several hundred GeV, where radiative 
losses grow larger than those due to ionization [1]. Using the "tag-and-probe" method, the hypothesis 
of a flat muon efficiency will be tested. The uncertainty in the linear fit translates for 15 pb~ ! into a 
systematic error of 16 MeV, which is much less than in the electron channel, as expected. 
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Figure 12: Distributions of transverse momentum ratios p r j Q / p l ™ e for muons from W ((a) and (c)) and 
Z ((b) and (d)) decays at |rj | < 0.36 ((a) and (b)) and 1 .79 < | T] | < 2. 14 ((c) and (d)) in the momentum 
range 35-40 GeV. 
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Figure 13: Comparison of fitted mean (a), core resolution (b), outlier resolution (c), and out- 
lier fraction (d) between W (circles) and Z (squares) events for muons in the momentum range 
30 < pr < 35 GeV in seven bins of |t]|. The markers have been artificially shifted, left (W) and 
right (Z), in |t] I to increase readability. 
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Figure 14: (a) Muon efficiency as a function of pj for W (circles) and Z (squares) events. Insert shows 
a zoomed view on the range of interest along with a constant fit to each of the graphs. The hypothesis 
of a flat probability in the range 10-70 GeV has been tested to be valid, (b) Muon efficiency ratio 
between W and Z events. No pj dependence is seen, and a linear fit yields a slope of (6.3 ± 6.5) x 
10~ 5 , consistent with zero. 
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4.2 Fitting the transverse W mass 

Having tested the template fitting of the p ( T distribution, we now move to the rrij distribution. In ad- 
dition to the lepton residuals, this fit requires residuals for the missing momentum. Since the missing 
momentum is a transverse quantity, it does not depend on the r\ of the lepton(s) in the event. However, 
the detector response depends on the total transverse hadronic activity LEj and the recoil momentum 
perpendicular to the direction of the leptons (cf. Figure 1). 

To describe the response, ££"7- is divided in 10 bins in the range [0,200] GeV and recoil momen- 
tum into 10 bins in the range [0,40] GeV, each with an additional overflow bin. In each bin, the 
distributions are well described by two gaussian distributions with a common mean, as the missing 
momentum residuals are not expected to have any asymmetric tails. 

To study the response, it is useful to project the Jf.j momentum components onto two axes, defined 
event by event in the transverse plane. The parallel axis is defined such that the azimuthal angles 
between the leptons (I and v) and this axis are minimized and equal, and the perpendicular axis is de- 
fined as perpendicular to the parallel axis. This reference system adapts better to the individual event 
topology than fixed axes. Examples of the residual distributions projected on these axes are shown in 
Figure 15. 




Missing momentum residual [GeV] Missing momentum residual [GeV] 

Figure 15: Distribution of Jf.j residuals ^ r rec — $ T tme for W decays. The $j momentum compo- 
nents along the parallel (a) and perpendicular (b) axes are shown, for a recoil mometum in the range 
[8,12] GeV, and LEj in the range [20,30] GeV. The fitting function describes the distributions well. 

Using the above modelling of the missing momentum response, we produce rrij templates and test 
if the fit is unbiased or not. Unlike the lepton case, no additional efficiency curve has to be in- 
cluded, as the missing momentum is calculated for every event. As can be seen from Figure 16, the 
rrij templates match the reconstructed distribution, and the fit is unbiased, giving a fitted value of 
80.421 ± 0.059 GeV compared to an input value of 80.405 GeV. 
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Transverse Mass [GeV] Scale 



Figure 16: (a) Reconstructed mj' distribution (middle curve) along with templates produced with 
the W mass hypothesis 78.792 GeV (left curve) and 82.008 GeV (right curve), before any kinematic 
selections, (b) % 2 value of fitting templates to the reconstructed distribution as a function of the 
template's (fraction of) W mass hypothesis (compared to the nominal mass). The fit yields 80.421 ± 
0.059 GeV in agreement with the input value of 80.405 GeV. 

4.3 Fitting the transverse W mass using the Z events for calibration 

The dependence of the m$ on the relative recoil scale and resolution uncertainty was determined to 
be [15]: 

dm w /da recoU = -200 MeV/%, dm w /da recoU = -25 MeV/%. (6) 

These parameters can again be measured on Z events. To test the portability from the W to the Z of 
the rrij fit, we model the missing momentum using the Z events and compare this to the one obtained 
from the W events. However, unlike the lepton case, the detector response is not exactly the same for 
W and Z events. The difference is caused by the #t reconstruction algorithm, which in its current 
state does not correctly subtract lepton calorimetric signals from the hadronic recoil. This results in a 
difference between W events where only one lepton is present, and Z events containing two leptons. 

To illustrate this point, consider again the parallel and perpendicular axes defined in the previous sec- 
tion. The residuals of the recoil momentum components are projected on both axes, and the response 
in W and Z events is compared, cf. Figure 17. Along the parallel axis, the average difference be- 
tween the residuals is Aw-z = 17 ± 35 MeV. Along the perpendicular axis, the difference is larger, 
Aw-z = 1964 ± 35 MeV. Using a Z-based calibration in the W mass fit is thus expected to be biased; 
performing this exercise indeed yields m^f = 79.752 ±0.062 GeV compared to the input value of 
80.405 GeV. The Z based recoil calibration is thus not exploitable at present. 

Instead, we assume that the needed improvements to the tfj reconstruction algorithm will be done 
in time for the measurement, providing equal response between W and Z events. The statistical 
sensitivity based on the Z-based calibration is about 50 MeV for 15 pb _1 , serving as a lower bound. 
Given the present uncertainties and the performance of similar analysis [16], we assume that the in situ 
calibration can be performed with a precision of 1 % with an associated uncertainty Smw = 200 MeV, 
according to Equation 6. The effect of pile-up on the missing momentum has not been studied. 
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Missing energy residual [GeV] Missing energy residual [GeV] 



Figure 17: Distribution of tfj residuals $r rec — $T tme parallel (a) and perpendicular (b) to the lepton 
axis for W and Z decays. 

5 Statistical uncertainty as a function of fitting range 

As previously stated, the sensitivity to the W mass comes from the Jacobian edge in the fitting distri- 
bution. Generally the Jacobian edge is slightly sharper for the mj' distribution (see Figure 3), yielding 
a smaller statistical uncertainty. To test the influence of the fitting range, three different fitting ranges 
has been tested for the p c T and distributions for the W — > jltv sample. Since the typical values 
are twice as large as the p e T values, the range size has been chosen accordingly. The result can be seen 
in Table 4. 



Transverse lepton momentum, pj Transverse W mass, ntj 

Fitting range [GeV] a stat [MeV] Fitting range [GeV] a stat [MeV] 
KT80 87 2TM60 54 

20-70 93 40-140 55 

30-60 106 60-120 57 



Table 4: Statistical uncertainty as a function of fitting range for p e T and fits. The uncertainties in 
the pj fit are larger, because the Jacobian edge is less sharp than in the rrij fit (see Figure 3). 



As can be seen from Table 4, the p L T statistical uncertainty changes of about 30% with fitting range, 
while the irij statistical uncertainty is essentially insensitive to the range, and generally somewhat 
lower as expected. Considering that most systematic effects (such as backgrounds, electron calibration 
and efficiency, etc.) are largest at low momenta, a loss in pj statistical uncertainty will be countered 
by a gain in systematic uncertainty. For the fit a narrow range is surely preferable. However, it is 
not possible to quantify this gain until all systematic uncertainties have been calculated. 

6 Lepton performance determination in situ 

In this section we review algorithms to calibrate the lepton response using Z events, and feed back the 
results to the mw fit. 
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m ee [GeV] 

Figure 18: Fully simulated Z pseudo-data (dots with error bars), compared to an example Z resonance 
template with Cfe = 1, Oe = 0.02, and to the best fit. 



6.1 Average scale and resolution 

We first perform a global scale analysis, to verify whether neglecting possible non linearities in the 
response can be expected to induce a significant bias. We restrain ourselves to the electron channel. 
Fixing the non gaussian tail parameters to n = 0.8 and j8 = 5, as expected from the studies performed 
in Section 3.3, we produce templates of the Z resonance by varying the electron scale and resolution. 
These response parameters are applied to generator level electrons as before. 

The templates are then fitted to the fully simulated Z peak. A very good fit is obtained, as shown in 
Figure 18. An average scaling factor of Cfe = 0.9958 ±0.0003, and an average relative resolution of 
Oe = 0.0207 ± 0.0003 provide a statisfactory description of the resonance. The precision of the fit 
corresponds to the complete Z event sample, i.e. j£? = 200 pb~ ! . Scaling to our default luminosity 
JSf = 15 pb -1 , the precision becomes 8a E = 0.0013 and 8o E = 0.0013. 



6.2 Differential scale and resolution 

The calibration uses a (large) sample of reconstructed Z — > tl events and a corresponding simulated 
sample (representing our knowledge of the Z lineshape). For each event the two leptons are assigned 
to bins i and j (choosing i > j) according to energy/momentum, Tj , and/or (j). Based on the lepton 
bins, events are divided into categories as shown in Figure 19. 

For each category the reconstructed sample is compared to the known Z lineshape (obtained 

from the corresponding simulated sample), and a Z mass resolution function Rjj is obtained from 
requiring that its convolution with the theoretical lineshape matches the reconstructed distribution 
(see Equation 7). Each of these Z mass resolutions is the direct result of combining two lepton 
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Mapping of leptons to bins: 
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Figure 19: Illustration of scheme to categorize Z — > it events. Each lepton is assigned a bin according 
to its pr and |tj |, here eight pj bins and two \r\ \ bins (barrel (B) and endcap (E)) thus 8x2=16 bins 
total, as demonstrated (left). Z —> il events from data/simulation (middle box) are then divided into 
categories (squares right) according to the reconstructed/truth pj and | Vj | bin of both leptons. 





Total: /Vx(7V+l)/2 



Total: N x(N+l)/2 



Total: N 



Figure 20: For each category a Z mass resolution function (middle) is determined from folding it 
with the simulated distribution to match the reconstructed one (left). The lepton bias and resolution 
parameters are determined for each of the 16 lepton bins, by globally fitting thel6xl7/2 = 136Z 
mass resolutions, which each is a result of two individual lepton resolutions (right). 



momentum resolutions Rj and Rj\ 

f(mz)fr = f{m z )]^®R ih Rij = R { ®Rj (7) 

The complicated lepton scale and resolution calibration can thus be split into two parts, which both 
saves computing time and allows for intermediate checks and changes. 

Given TV lepton bins and thus lepton resolution functions to determine, there are N x (N + l)/2 Z mass 
resolution functions, and thus the overconstrained system can be solved by a global x 2 fit. This cali- 
bration procedure is illustrated in Figure 20, and allows for a determination of the detector response 
for all combinations of pj and r\ . 
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Figure 21: Lepton scale constants for electrons (a) and muons (b) as obtained from simplified cali- 
bration to the Z peak (squares) and from truth (circles) using full Z samples (see Table 1). The first 
eight bins are scale constants for leptons of increasing pj reconstructed in the barrel, while the last 
eight are for those in the endcap (see text). The result is in good agreement with average scale of 
0.9958 ±0.0003 (indicated by the line in plot (a)) found in Section 6.1. 



A simplified version of the above analysis has been performed, with the aim of obtaining only the 
scales. In this simplified procedure, the resolution functions R in Equation 7 reduces to calibration 
constants. The result of the calibration using the full Z samples (see Table 1) is shown for both elec- 
trons and muons in Figure 21, along with the scales obtained from the truth information. 

As can be seen from the figure, the simplified calibration yields the correct behaviour of the scales in 
general. Some fluctuations around the expected values are observed induced by the above simplifica- 
tions. Scaled to 15 pb _1 , the uncertainties are to be increased by a factor 3.8 and 2.4 for electrons and 
muons, respectively. 



6.3 Lepton reconstruction efficiency 

The pj and rrij distributions of W events are also influenced by any pj and Tj dependence of the 
lepton reconstruction efficiency. Any difference between the data and the simulation used to produce 
the templates will induce a difference in the distribution and cause a bias in the W mass fit. 

Though the method to obtain the (differential) efficiency is conceptually the same for electrons and 
muons, the two cases are different in that electron reconstruction is generally more dependent on pj 
and X] than the muon reconstruction. For this reason, we have chosen to consider the electron recon- 
struction efficiency in the following. The electron reconstruction efficiency can be determined from 
the data with Z events, using the so-called "tag and probe" method [10], which we briefly summarize 
here. Events are selected with one well-identified electron, and an additional high pj object. The 
invariant mass of these two objects is required to be within 10 GeV of the nominal Z boson mass. 
Assuming that this selects Z events with enough purity, the identification efficiency is then simply ob- 
tained by computing the fraction of events where the second object is indeed identified as an electron. 
The efficiency of the isolation criterion is obtained in a similar way. 
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The studies reported in [17] indicate that for an integrated luminosity of 100 pb _1 , the electron effi- 
ciency can be reconstructed with a precision of 1.5% in the range 20 < p e T < 70 GeV. The uncertainty 
is statistically dominated. Scaling this number down to Jzf = 15 pb -1 , we anticipate a precision of 
3.9%. 

7 Background uncertainties 

The leptonic W channel does not suffer from large backgrounds, due to the high cross section and 
the cleanness of the signal. The backgrounds are mostly from similar vector boson decays, such 
as W —> x(—> £w)v, Z — ► ££ (missing one lepton), and Z — > x(— ► £vv)x. With the good particle 
identification capabilities of the ATLAS detector [13], jet events will despite their large cross section 
not be dominant. The backgrounds from tt and W + VK~ events are negligible. The backgrounds and 
the uncertainties in their sizes are estimated below. 

W — > t(— > £vv)v events: A large background (largest in the electron channel) is from W — ► TV 
events, where the x decays into a lepton. This background is irreducible, as the final state is identical 
to the signal; however, its p e T and distributions are generally below the fitting range, as both the 
lepton and missing momentum are much reduced, leaving only a tail into the fitting range. Though a 
quite significant background, its uncertainty is small, as only the x — ► £X branching ratio (1.0%) and 
the acceptance relative to the signal (2.5%) enter. 

Z — > ££ events: Another large background (largest in the muon channel) comes from Z —>■ 11 events, 
where one lepton is either undetected or not identified. A loose lepton identification for the second 
lepton can reduce this background, and possibly further reduction can be obtained with a Z veto, 
should the associated uncertainty still be significant. We do not apply this veto in the present analysis. 
The Z — ► ££ background extends significantly into the p L T and ntj distributions, except in the ntj 
electron channel, where the missed electron cluster still reduces the missing momentum, effectively 
lowering the apparent mf. The size of this background has uncertainties from the W to Z cross section 
ratio R W z (1.8%), Z veto efficiency (2.0%), and the acceptance (2.5%). 

Z — > t(— > £vv)x events: A small background origins from Z — > xx events, where one x decays 
leptonically, while the other is not identified. While the cross section for such a process is small, it 
can fake missing momentum. The largest uncertainty in the size of this background comes from the x 
detector response (5.0%), along with cross section ratio Rwz (1.8%), and acceptance (2.5%). 

Jet events: This background is not studied here, because it can not be evaluated reliably using 
simulation only. Relying on Tevatron experience [16], we will assume it is small and it will not be 
discussed in this note. 



Impact of backgrounds: If the size and shape of the backgrounds were perfectly known, then they 
would not affect the W mass measurement, as they could be included in the templates. It is thus the 
uncertainty on the size and shape of the backgrounds, in the fitting range of the p l T and rrij spectra, 
which gives rise to systematic errors. The uncertainties on the size of backgrounds arise from uncer- 
tainties relative to those of the signal events in cross sections, branching ratios and acceptances. These 
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Electron channel 




Muon channel 


Process 


Evts/ 15pb 1 


Fraction [%] 


Process 


Evts / 1 5ptT 1 Fraction [%] 


W^ev 


45468 


97.8 


W ^ixv 


83263 93.9 


W ^ZV 


666 


1.4 


W -^ZV 


1238 1.4 


Z — ► ee 


305 


0.7 


Z^^At 


3483 3.9 


Z — > TT 


30 


0.1 


Z — > zz 


153 0.2 



Table 5: Signal and expected backgrounds in 15 pb 1 after the event selection described in Section 2.2 
and in the p f T range [30,60] GeV. 



are obtained from PDG [1] and efficiency studies (see Section 6.3). 

The background shapes are determined from simulation. They are essentially unaffected by variations 
in the production, decay and resolution model. For jet events background, which was ignored in the 
present study, both normalisation and shape will have to be measured directly from the data. 

We first assess the overall impact of the backgrounds. The backgrounds remaining after the selection 
described in Section 2.2 are given in Table 5, and the pj spectrum of each process is shown in Fig- 
ure 22. 




Figure 22: pj distribution for signal and backgrounds for electrons (a) and muons (b) after the selec- 
tions described in Section 2.2. The pj range used in the mass fit is 30 < pj < 60 GeV. 



Ignoring the background altogether in the templates leads to a bias Smw = — 10 MeV. This is however 
the result of a conspiracy : the W — > TV background alone gives a bias of —80 MeV, while the Z—*££ 
background gives a bias of +70 MeV; both sources of background can vary independently within the 
uncertainties given above. The other backgrounds have negligible impact. We thus estimate the bias 
per percent relative error on the background normalization (checked to scale linearly with the size of 
the background) to be: 

dm w /dN rv - bkg = -0.8MeV/%, (8) 
dm w /dN U - bH = 0.7MeV/%. (9) 
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We remind that the above does not include the jet events study. 

8 Summary of uncertainties for Jzf = 15 pb~ 1 

The response parameters determined in situ using Z events (cf. Section 6.1) are used to produce tem- 
plates of the p ( T spectrum in W events, as shown in Figure 23, for the electron channel. The resulting 
fit yields mw = 80.466 ±0.1 10 GeV, with no bias with respect to the true value. This results shows 
that for Jz? = 15 pb -1 , propagating a global scale determined on Z events does not induce a significant 
bias in the analysis. Given that the global scale calibration had a precision of 0.13% and Equation 3, 
the scale-induced systematic uncertainty is 8iiiw{(Xe) = 110 MeV. Likewise, the resolution uncer- 
tainty contributes 5mw(<JE) = 5 MeV. 




P T [GeV] 



Figure 23: (a) /^spectrum from fully simulated W decays (dots with error bars), and /^template 
obtained assuming the Z based scale and resolution and the true value of my/, (b) Template fit to my/. 

We did not attempt to control the non gaussian tails in situ. We assume that this contribution can be 
determined to 5% with 15 pb~', yielding a contribution of 28 MeV to the systematic uncertainty. As 
can be seen by comparing Figure 4 and Figure 12, the effect is smaller in the muon channel. 

The expected precision of the in situ efficiency measurement (Section 6.3) and Equation 5 imply a 
systematic uncertainty on the fit result of about Smy/ = 14 MeV. This result holds for the electron 
channel. Given the flatness of the muon reconstruction efficiency, the systematic uncertainty in the 
muon channel is expected to be much smaller. 

It was not attempted to determine the recoil calibration in situ. We assume that this calibration can be 
performed to 1%, yielding a systematic contribution of 200 MeV from Equation 6. This contribution 
affects the transverse mass analysis only. 

The background uncertainty is discussed in the electron channel, and assumed identical in the muon 
channel. The Z — > TT background is negligible. Given the current knowledge of the W and T branch- 
ing ratios [1], a realistic estimate of the % background uncertainty is 2.5%. Injecting the nominal 
background in the templates, then varying them within this range, we find a systematic uncertainty 
8mw (bk») = 2 MeV. Similarly, the contribution from the Z — > ee background is found to be 2 MeV. 
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The total background contribution to the systematic uncertainty is thus expected to be 3 MeV. 

A detailed discussion of theoretical uncertainties is provided in [15]. At the start-up of LHC, the 
dominant theoretical contribution comes from the proton parton density functions, which contribute 
25 MeV. 

In summary, for an integrated luminosity of 15 pb _1 , we found that the analysis of the p e T spectrum 
gives a statistical sensitivity of about 8myy = 110 MeV per channel, while the transverse mass pro- 
vides Smw = 60 MeV. These numbers have experimental systematic uncertainties of 1 14 MeV in the 
Pj analysis, and 230 MeV in the transverse mass analysis. The uncertainty on the lepton and recoil 
scales dominate these numbers. Finally, the uncertainty from PDFs is about 25 MeV and compara- 
tively small. Our numbers are summarized in Table 6. 



Method 


PT (e) [MeV] 


PT (H) [MeV] 


M T (e) [MeV] 


M-rOi) [MeV] 


8mw (stat) 


120 


106 


61 


57 


8m w (a E ) 


110 


110 


110 


110 


Sm w (a E ) 


5 


5 


5 


5 


8mw (tails) 


28 


<28 


28 


<28 


8m w (£) 


14 




14 




8m\y (recoil) 






200 


200 


8m w (bkg) 


3 


3 


3 


3 


8m w (exp) 


114 


114 


230 


230 


8m w (PDF) 


25 


25 


25 


25 


Total 


167 


158 


239 


238 



Table 6: Summary of contributions to 8mw, for the different fitting methods described in the text. 
From top to bottom: statistical uncertainty; systematic uncertainties related to the absolute scale, 
resolution, non gaussian tails, reconstruction efficiency, recoil calibration, and backgrounds; total 
experimental systematic uncertainty; uncertainty from PDF; total systematic uncertainty. 



9 Conclusions 

This note presents a first attempt towards niw fits, confronting distributions obtained from fully simu- 
lated W — ► ev,jXV events to templates produced using a simplified detector model. 

The simplified detector model used here relies on empirical functions to describe the energy and mo- 
mentum scale, resolution, and non-gaussian tails, as well as the lepton selection efficiency. These 
functions are first determined, as a function of pj and 7], using simulated W events, i.e. the signal 
itself; the resulting fit, unbiased, provides a technical validation of the method. 

In a second step, the model parameters are used as extracted in situ, from the analysis of Z events. 
The Z peak provides the detector scale and resolution; the tag-and-probe method [10], applied to its 
decay leptons, is used to estimate the efficiency. A fit of the Z-based templates to the W data again 
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shows no significant bias. 

Our current approach can thus be considered valid, within a statistical sensitivity of about 60 MeV, 
for the transverse mass-based fit in the muon channel, and of 120 MeV for the transverse momentum- 
based fit in the electron channel. 

This note has focused on experimental issues. Phenomenological uncertainties, related to QED, QCD 
and partem density functions are discussed in detail in [15]. 
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Forward-Backward Asymmetry in pp —> Z/fX —> e + e X 
Events 

Abstract 

This paper describes a study on the measurement of the forward-backward 
asymmetry in pp —>■ Z/y*X — »■ e + e~X events with the ATLAS detector for 
an integrated luminosity of 100 fb _1 . Such a measurement can be used to 
determine the effective weak mixing angle, sin 2 d'fj^ . We will demonstrate 
that a very high accuracy on the weak mixing angle, 8 sin 2 Q^'f = (1.5 (star) ± 
0.3(exp) ±2.4 (PDF)) x 10~ 4 , can be reached. This is possible due to the large 
cross-section for the production of Z bosons at the LHC and by using electron 
reconstruction in the forward calorimeters of ATLAS (2.5 < |tj| < 4.9). 

1 Introduction 

The forward-backward asymmetry, Afb, measurement is one of the important precision measurements 
that can be done at the Large Hadron Collider (LHC). It will improve the knowledge of Standard Model 
parameters and test the existence of physics beyond the Standard Model. 

The Z boson events in pp collisions originate from the annihilation of valence quarks with sea anti- 
quarks or from the annihilation of sea quarks with sea antiquarks. Since the valence quarks carry on 
average a larger momentum fraction than the sea quarks, the boost direction of the dilepton system 
can indicate the quark direction. However, dilepton events which originate from the annihilation of sea 
quarks with sea antiquarks do not contribute to the observed asymmetry. 

Afb measurements with quarks and leptons at the Z peak provide a precise determination of the weak 
mixing angle sin 2 6 e J'j-. The weak mixing angle is an important parameter in the electroweak theory that 
describes the mixing between weak and electromagnetic interactions. In the global fit of the Standard 
Model, the weak mixing angle has an impact on indirect constraints on Higgs mass. 

With the experimental capabilities of the ATLAS experiment at the LHC and an expected integrated 
luminosity of up to 100 fb _1 it becomes interesting to perform a study on the measurement of the asym- 
metry, Afb- In order to improve the measurement precision, it will be necessary to detect leptons in the 
very forward pseudo-rapidity region, which favors electrons over muons at ATLAS. In 100 fb -1 data 
around 1.5 xlO 8 Z events will be produced, of which ~5xl0 6 decay to an electron-positron pair, pro- 
viding the measurement of Afb (and sin 2 O^f) with a competitive precision to the current world average 
value [1], 8sm 2 d' e J f =0.00016. 

This paper is organized as follows. In section 2 we present a short introduction of the theoretical 
aspects of the measurement. An overview of the ATLAS detector is given in section 3. In section 
4 the electron measurement in ATLAS is presented, both in the central and forward regions the latter 
being necessary for a precise measurement of the forward-backward asymmetry. We then present the 
simulated data samples used in section 5 and the definition of the electron polar angle in section 6. The 
event selection cuts are discussed in section 7. The charge misidentification is given in section 8. The 
pile-up effect is discussed in section 9. The systematic uncertainties are summarized in section 10. The 
expected precision on the asymmetry around the Z mass is shown in section 11. 
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2 Forward-backward asymmetry 

In proton-proton collisions, e + e~ pairs are predominantly produced via the annihilation of a quark q and 
an antiquark q. In the Standard Model, quark-antiquark annihilations proceed via an intermediate [2, 3] 
7* at low invariant mass M( e + e -) or via a 7* /Z interference at M( e + e -) around the Z mass. The elec- 
troweak neutral current in the Standard Model lagrangian violates parity (due to the presence of vector 
and axial-vector couplings of the quarks and leptons to the Z-boson) and leads to an asymmetry in the 
polar emission angle of the electron in the rest frame of the electron-positron pair. The asymmetry can 
only be extracted with respect to the boost direction of the di-electron system since the quark direction 
is not known (see Sec. 6). 

The differential cross-section for the production of an electron-positron pair in qq annihilation via 
the s-channel can be written, in the electron-positron rest frame [4, 5], as: 

= N c [(\+cos 2 e)F 0 (s)+2cos6F 1 (s)] (1) 

where 6 is the emission angle of the e~ relative to the momentum vector of the quark in the e + e~ rest 
frame and s is the center-of-mass energy squared. F 0 (s), F\ (s) are form factors and N c (1/3) is the colour 
factor: 



with 



F 0 (s) = — (q 2 q qj+2Rex(s)q q q,C v C v + | Z (,)|2( (C ^) 2 + (d) 2 )((4) 2 + (C^) 2 )) (2) 
Fx Cs) = 7 ^ 2 (2/?^(,)^C^ + | Z (,)| 2 2C^42C|ci), (3) 

%{S) = s-Ml + isT z /M z (4) 
where q q j is the electric charge of the quark or lepton and Cy, Ca are the vector and axial- vector coupling 
to the Z. 

The angular dependence of the various terms is either cos0 or (l+cos 2 0). Only the (1 +cos 2 0) 
term contributes to the total cross-section, as the cos 8 term integrates to zero. However, it induces a 
forward-backward asymmetry. Thus, the differential cross-section can be written in a simple expression: 

where o = §Fo(j) x N c and 

A FB (s) = -[a(cos0 >O)-a(cos0 < 0)] (6) 

= ^ (7) 

a < 3 2C v C q A 2C l v C l A 

A FB {s = M z ) ~ i(c , )2 + (cI)2(4)2 + (c / )2 W 

The forward-backward asymmetry is commonly defined using the number of forward produced 
(cos 6 > 0) events (Np) and backward-produced (cos 6 < 0) events (Nb) 

ApB = AT ,AT (9) 

N F + N B 
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60 80 100 120 140 



Figure 1: SM prediction of the forward-backward charge asymmetry Afb in the electron pair channel 
versus the di-electron invariant mass M e + e - for \y e + e -\ >1 and for at least one electron in the central 
region (| tj | < 2.5). 



, f l da f° do 

' = / j ^dcosd, B= / -da 

Jo a cost^ J -i d cos 8 



(10) 



2. 1 Dependence of A FB on M e + e - 

In Figure 1 we show the Standard Model prediction, using MRST PDF [6], of the forward-backward 
charge asymmetry as function of the di-electron invariant mass. At the Z-pole we see a small asymmetry 
(as expected) which is dominated by a small vector coupling in Z — ► e + e~ with the dominant axial 
coupling. Around the Z mass the asymmetry is linear with the weak mixing angle sin 2 Q 1 /// ■ It was 
estimated that (in a good approximation) the weak mixing angle can be determined from the measurement 
of the forward-backward asymmetry (this is the raw forward backward asymmetry measured at detector 
level) when averaged over the rapidity of the electron pair as follow [7, 8, 9]: 

A FB = b(a- sin 2 d[f f ) (11) 

the parameters a and b depend on the parton distribution functions (PDFs). 

At large invariant mass, A FB is dominated by the properties of the interference between the propaga- 
tors of the y* and the Z and is almost constant at a large positive value, close to 0.6, independent of the 
invariant mass. 



3 Detector overview 

The ATLAS detector has been described in detail in [10]. It consists of an inner tracking system, with 
pseudo-rapidity coverage of \r\ \ < 2.5, inside a 2 T solenoidal magnetic field, followed by the calorime- 
ters, and an outer muon spectrometer, with pseudo-rapidity coverage of \r} \ < 2.7, installed in a large 
toroidal magnet system. We briefly describe the parts of the detector relevant to this analysis. 

Figure 2 shows a schematic transversal view of the ATLAS calorimeter system. It has been designed 
to be hermetic to | r\ \ < 4.9 with a fine lateral and longitudinal segmentation. A liquid argon (LAr) sam- 
pling calorimeter in a barrel-endcap geometry provides the electromagnetic and hadronic calorimetry. At 
1 77 1 < 1 .2 the hadronic calorimetry is completed by a tile (Iron/scintillator) hadronic calorimeter. 
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Figure 2: Schematic transversal view (r-z view) of the calorimeters in the ATLAS detector. The cylin- 
drical coordinate system is used with the z axis along the proton-beam direction and r is the transverse 
coordinate. 



The electromagnetic barrel calorimeter, covering a pseudo-rapidity range of \r}\ < 1.475, shares its 
cryostat with the superconducting solenoid, the calorimeter being behind the solenoid. Each electromag- 
netic calorimeter end-cap (EMEC) covers the pseudo-rapidity range 1.4-3.2. To correct for the energy 
lost in the material in front of the calorimeters, both end-caps and barrel are preceded with pre-sampler 
detectors in the region \r] \ < 1.8. The performance expected gives for the energy resolution a sampling 
term of 10%/ v / £(GeV) and a constant term better than 0.7%. The hadronic end-cap calorimeter (HEC) 
and the forward calorimeter (FCal) share the same cryostat with the EMEC and cover the pseudo-rapidity 
range 1.5-3.2 and 3. 1-4.9 respectively. The design of the FCal was constrained by the high radiation level 
in the very forward region. It consists of three consecutive modules along the beam line: one electro- 
magnetic module (FCall) and two hadronic modules (FCal2 and FCal3). To optimize the resolution, 
copper was chosen as the absorber for FCall, while tungsten was used in FCal2 and FCal3, to provide 
containment and minimize the lateral spread of hadronic showers. 

The electron energy can be measured either in the central electromagnetic calorimeter (\r]\ <2.5) 
and/or in the forward calorimeters (2.5 < | r\ \ < 4.9). The |fj | < 2.5 region corresponds to the EM barrel 
and the EMEC outer wheel. 

4 Electron identification 

4.1 Central electrons 

An electron candidate is reconstructed in the central calorimeters if there is a cluster with energy E and 
a charged track matched to the cluster with the condition of E/p < 1 and |At]| < 0.05 and |A0| < 0.1. 
Atj (A0) is the difference between the tj (0) position of the track and the position of the cluster. The 
track is then extrapolated to the cluster. The energy of the electron is determined by the total energy 
it deposits in the EM calorimeter. The transverse momentum (pj) of the electron is determined by the 
energy measured in the calorimeter and by the track angle at the nominal interaction point. The charge 
of the electron (q e ) is determined from the sign of the curvature of the track. 

The development of electromagnetic and hadronic showers is quite different so that shower shape 
information can be used to differentiate between electrons and hadrons. Electrons deposit almost all 
their energy in the electromagnetic section of the calorimeter, while hadrons are typically much more 
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Figure 3: Central Electron ID efficiency vs 
electron p T for events with 1 7] | < 1 .3 and 1 7] | > 
1.6. 



Figure 4: Central Electron ID efficiency vs 
electron Tl. The efficiency is integrated over 
p T > 20 GeV. 



penetrating. To obtain the best discrimination against hadrons, we use both longitudinal and transverse 
shower shapes. For example, for electrons passing tight identification criteria, the hadronic leakage and 
the lateral shower shape in the first and second sampling of the calorimeter are used. After the selections 
based on calorimeter information, we also take into account the inner detector information. We ask for 
a good track pointing to the calorimeter and fulfilling the consistency of the spatial and energy matching 
between the calorimeter and the inner detector. 



Figure 3 and Figure 4 show the electron identification efficiency, 62.0±1.0 % on average, as a func- 
tion of the electron transverse momentum and pseudo-rapidity tj using electrons (and positrons) from 
Z — > e + e~ events (the samples used are described in section 5). Tight cuts have been applied to iden- 
tify the central electrons. We observe an expected increase in the efficiency as function of pj. This 
dependence (in particular at low p T ) is due to the lower performance for low p T electrons. The drop in 
the efficiency versus r\ happens at the transition region between barrel and end-cap calorimeters, 1.37 
< |t]| <1.52. A more detailed description of the electron identification in the central calorimeters is 
given elsewhere [11]. 



4.2 Forward electrons 

In contrast to the central region, forward electron reconstruction can use only calorimeter information 
as the tracking system is limited to the central region (|tj| < 2.5). In this case we can not distinguish 
between an electron, positron or photon. The electron candidate in the forward calorimeter 1 is recon- 
structed if there is a cluster with Ej > 20 GeV. The direction of the electron is defined by the barycenter 
of the cells belonging to the cluster in the calorimeter. 

To discriminate between electron and hadron a multivariate analysis [12] is used. Variables are 
defined using cluster moments or a combination of them. 

The cluster moment of degree n for a variable x is defined as: 

(x") = ^— x££ ; *?, ( 12 ) 

where i^orm =L/^' Ei is th e cei l energy and i is the cell index of the cluster. 

'the EM forward calorimeters are the EMEC inner wheel (2.5 < |tj| < 3.2) and the front compartment of the FCal. These 
EM calorimeters are completed by the HEC and the last 2 compartments of the FCal (3.2 <\rj\< 4.9). 
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We use the following five discriminants in the electron ID: 

• n = \ (xi — c) xu\: 

Electromagnetic and hadronic showers have different shapes in both the transverse and the lon- 
gitudinal directions. To evaluate the shower shapes in the transverse direction we use the second 
moment of the distance r,- of each cell i (x,) to the shower axis (c). u is the shower axis. 

• lat 2 /{lat 2 + lat max ): 

A modified lateral moment is derived as above. It takes into account the two most energetic cells 
(shower core). So, lat 2 is the second moment of the variable n for which we impose that the 
distance r = 0 for the two most energetic cells. For lat max we impose that r = 4 cm for the two 
most energetic cells and r = 0 for the remaining cells. 

• long 2 /{long 2 + long max ): 

To evaluate the shower shapes in the longitudinal direction we use an equivalent variable to the 
previous one where the distance of the each cell to the shower centre is used. 

where Vi is the volume of the cell /. The electromagnetic shower is narrow and deposits energy 
more locally than a hadronic shower. 

• fnu* 

Fraction of the energy in the most energetic cell of the cluster. By measuring the energy fractions 
deposited in the cells of a segmented calorimeter it is usually possible to distinguish incident 
hadrons from electrons and photons. 

In order to combine the electron identification information from the various discriminants into a 
single quantity that provides optimal discrimination power, we calculate the likelihood for each discrim- 
inant based on probability density functions. For each discriminant, the signal likelihood (p s ) and the 
background likelihood (ps) are separately calculated and combined using the likelihood ratio: 

ft- „ (13, 

where j runs over each discriminant. Figure 5 shows the distribution of Rl using all the discriminants 
described above. Both electrons (signal) and background are shown. As can be seen, good separation is 
achieved with this variable. The electron identification efficiency determined using Z — > e + e~ events as 
function of the electron p T and r\ as shown in Figures 6 and 7. The drop in the efficiency plot versus f] 
happens at the transition region between inner wheel of the electromagnetic end-cap and the FCal. 
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Figure 5: The likelihood ratio Rl for the signal (yellow) and background (solid line). Due to the fact that 
the likelihood ratio is strongly peaked at 0 and 1 , in the plot, a transformation is applied that zooms into 
the peaks. 



Figure 6: Forward Electron ID efficiency \ 
electron p T for events with 2.5 < |tj| < 4.9. 



Figure 7: Forward Electron ID efficiency \ 
electron Vj for events with pj > 20 GeV. 



5 Monte- Carlo samples 

The Monte Carlo events are generated at a centre of mass of yfs = 14 TeV using PYTHIA [13] (version 
6.3) for the signal pp — >■ Z/y* — > e + e~ and the background pp — > jj with the CTEQ6LL [14] parton 
distribution functions. Table 1 shows the summary of the cross-sections, number of events and the 
equivalent luminosity for the signal and background simulated events. 

5.1 Signal 

The pp — > Z/y*X — > e + e~X events generated are filtered requiring at least one electron with |tj| < 2.7 
and with transverse momentum pj > 10 GeV and a dilepton mass, m, greater than 60 GeV. Figures 8 and 
9 show the pj distribution of electrons and the invariant mass of the electron pairs in the signal events. 

5.2 Background 

The dominant sources of background to the signal process pp — ► Z/y*X — > e + e~X are: 
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electron P T (GeV) 

Figure 8: The electron Ej distribution normal- 
ized to unit for events with |tj| < 1.3 and \r[ \ > 
1.6. 



Figure 9: Di-electron invariant mass distribu- 
tion for events with |tj| < 1.3 and |t]| > 1.6. 



• dijet production: this is the largest background when two jets fake an electron. The cross-section 
of this process is greater by several orders of magnitude than the signal one, and dominates at low 
transverse momentum. 

• pp—* ttX — > e + e~X: The top quark decays into the W boson and a b quark, followed by the W 
decay into electron and neutrino (t —> Wb, W — »• ev). It has the same signature as the signal one as 
the two electrons of the final state can simulate the two electrons from Z. 



' PP- 



* W + X — > ev e +X: where X is a photon or a jet misidentified as an electron 



These events are passed through a detector simulation program to model detector response. Two 
detector simulation tools are used: a full detector simulation tool, based on GEANT4 [15], and a fast 
simulation program, ATLFAST [16] which simulates the detector response using efficiencies and smear- 
ing parameters measured from detailed simulation data. ATLFAST was used due to the high di-jet 
cross-section, the high rejection power of the selections and the available Monte Carlo statistics of fully 
simulated events which is not sufficient to evaluate the background. The Monte Carlo samples processed 
with the fast detector simulation were used only for estimation of the background contributions. 



Physics process 


Cross section (nb) 


Sfilter 


Number of events 


Equivalent luminosity (pb 1 ) 


Z -» e + e- 


2.015 


0.126 


5.10 5 


248.14 


Inclusive jets 


21. 10 4 


0.09 


3.10 7 


0.15 


tt 


0.833 




6.10 5 


720.3 


W + X 


38.2 




1.10 6 


26.2 



Table 1 : List of signal and background samples. 



Ref. [17] discusses expectation for electron triggers at different luminosities. The Z/7* events used 
in this measurement are selected by high transverse momentum triggers on one or two electrons. This 
selection requires at least one isolated electron with pj > 20 GeV or at least two isolated electrons with 
Pt > 12 GeV. At Level 1, electrons are selected by the presence of an energy deposition in an EM 
calorimeter tower. The Level 2 decision, which is based on the result of the Level 1 trigger, can take into 
account the information from all ATLAS subdetector systems in the regions of interest (ROIs) and the 
event filter (EF) performs its task only after the complete event has been assembled in the event builder 
(EB). 
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6 Angular distribution 

The Z production can be either from the annihilation of valence quarks with sea antiquarks or from the 
annihilation of sea quarks with sea antiquarks. Since the original quark direction is unknown in proton- 
proton collisions (it can originate with equal probability from either proton), the sign of cos# is not 
directly measurable. 

As the valence quarks have, on average, a much larger momentum than the sea quarks, in valence-sea 
quark collisions, the longitudinal motion of the dilepton system approximates the quark direction and the 
angle between the lepton and the quark in the e + e~ rest frame can be extracted with respect to the beam 
axis. A lepton asymmetry can thus be expected with respect to the boost direction. For sea-sea quark 
collisions no such asymmetry is expected, diluting the overall effect. 

To minimize the effect of the unknown transverse momenta of the incoming quarks in the measure- 
ment of the forward and backward cross-sections, we use the Collins-Soper reference frame [18]. This 
reference frame reduces the uncertainty in electron polar angle due to the finite transverse momentum 
of the incoming quarks. The particle four-vectors are transformed to the e + e~ rest frame and the polar 
angle 6* is measured with respect to the axis, which bisects the two quark momentum vectors. 

cose* = 2 ^[p + (e-)p-(e + )-p + (e + )p-(e-)} (14) 

m(e+e-)^/m^e+e-)+p 2 T (e + e-) 

where p ± = ^(E ± p z ), E is the energy and p z is the longitudinal component of the momentum. To 
take into account our supposition that the Z boost and the quark direction are the same we add the sign 
of the Z boost to the definition: 



COS0* = 




Figure 10 shows the resulting distributions of events for which th e + e~ invariant mass is close to the 
Z mass. We should note that the detector itself doesn't introduce false asymmetries with the geometry 
used in the MC samples. 



aulas' 




Figure 10: Distribution in cos d* for reconstructed events in the Z pole region. 
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7 Analysis method 

7.1 Event selection 

In this analysis we search for events with two electrons. An electron must fulfill certain quality and 
kinematic requirements. It must first satisfy the tight electron criteria outlined in Section 4.1. In addition 
the following cuts were applied: 

• CI 

We require an electron transverse momentum higher than 20 GeV (jpj > 20 GeV) to emulate the 
energy threshold of the electron trigger 

• C2 

We require the e + e~ invariant mass, M^ e+e -y to be within a window of 12 GeV around the Z mass, 
85.2 GeV < M (e+e -) < 97.2 GeV (Z pole) 

• C3 \y e+e - \ > 1 

In contrast to Tevatron pp collisions, sea quark effects dominate at the LHC. At central rapidity, 
y e + e - , the probability that the valence quark direction and the dielectron boost coincide is lower 
due the smallness of the valence quark distribution. This reduces the forward backward asymme- 
try. Since the valence quark dominates at high values of x, the events where one parton carries 
a large fraction of the proton momentum are more sensitive and they give a large rapidity to the 
dilepton system. In this region the most significant measurements can be performed, as shown in 
Figure 11. A purer, though smaller, signal sample can thus be obtained by introducing a rapidity 
cut. For the following studies we will impose a \y e + e - \ > 1 cut. 

• C4 Ef ss < 20 GeV 

We require the missing transverse momentum to be less than 20 GeV. This cut rejects efficiently 
the background coming from pp —>■ ttX channel where both top quarks decay semileptonically. 

7.2 Analysis cases 

A study using fast simulated events [19] showed that the use of forward electrons improves the precision 
of the forward-backward asymmetry measurement (and the weak mixing angle). As can be seen in Figure 
12, a very high electron identification performance in the forward calorimeters is not needed, as already 
with a rejection of 100 against jets the influence of the remaining QCD background is negligible. Figure 
12 shows only the statistical uncertainty on Ap b versus the jet rejection in the forward calorimeters (with 
a fixed electron efficiency of 50%). Thus, it is required that one of the two electrons lies in the central 
region (|tj| < 2.5), while the other electron may be either in the central region or in the forward region 
up to \r}\ =4.9: 

• | tj | < 2.5 for both electrons (C-C), or 

• | tj | < 2.5 for one of the two electrons and | tj | < 4.9 for the other (C-F). 
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Figure 1 1 : Di-electron rapidity distribution for 
all events (upper line) and for the events with 
correct quark direction (lower line). 



Figure 12: Forward-Backward asymmetry sta- 
tistical accuracy versus the forward jet rejec- 
tion in the events where at least one electron is 
in the central region and keeping the efficiency 
of the electron in the forward region at 50%. 



In the region 2.5 < |t] | < 3.2 the calorimeters used are the EMEC and the HEC and for 1 17 1 > 3.2 the 
forward calorimeter (FCal) is used. Note that we can not reconstruct the electron track in the forward 
region (2.5 < |t]| < 4.9) as the tracking system of ATLAS is limited to the region |tj| < 2.5. In addition, 
the forward calorimeters have a coarser granularity than the central ones (a factor 2 in both eta and phi 
directions). Figure 13 shows the the Ey distribution of electron for events with at least one electron in 
the central region. 




electron E T (GeV) 



Figure 13: The electron E7- distribution for the 
central-forward events, normalized to unity. 




5 % e . e ,<G 6 e 0 V) 



Figure 14: Di-electron pj distribution for the 
central-central events (C-C) and the central- 
forward events (C-F). 



Using the forward electrons we gain about 30% in the statistics as shown in figure 14, where we 
compare the di-electron pj distribution in the two analysis cases, C-C and C-F. Furthermore the value of 
the asymmetry is higher in the C-F events. The expected numbers of the signal and background events 
of each process are shown in table 2 in the two analysis regions, C-C and C-F. 

Event statistics corresponding to a 100 fb _I would be required for this analysis to reach a very 
high precision on the determination of the weak mixing angle from the forward-backward asymmetry 
measurement. Nevertheless the measurement can be also made at low luminosity in order to test the 
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consistency with the Standard Model. This can be done as function of M e + e - to enhance sensitivity to 
possible deviations at high mass scales. 



7.3 A FB calculation 

The forward-backward asymmetry is calculated according to Eqs 9 and 10. 

If we assume that the distribution of Nf and Nb follows a binomial distribution, then the uncertainty 
on the two quantities can be written as follows (which is valid only for small asymmetries): On f = <Jn b = 

yjN F N B / \/N F +N B , and the A FB uncertainty is o Afb = \j l ~^ FB • 



Selection cut Signal dijet W+X ft 


C-C 


CI 


2.39802x10' 


4741 


73312 


315178 


C2 


1. 87808x10' 


404 


6048 


28204 


C3 


7.31282xl0 6 


144 


2212 


9937 


C4 


7.30496 xlO 6 


142 


205 


741 


C-F 


CI 


3.09742 xlO 7 


447408 


2764790 


336714 


C2 


2.41535 xlO 7 


35920 


172321 


29824 


C3 


1.26855 xlO 7 


35340 


168486 


11556 


C4 


1.25967 xlO 7 


35128 


15868 


864 



Table 2: Summary of expected number of C-C and C-F signal and background events for 100 fb 1 of 
integrated luminosity. Results are given after the application of cuts C1-C4. 



8 Electron charge identification 

Any misidentification of the electron charge dilutes the asymmetry. Where possible, the charge of both 
electrons is measured and it is required that the signs differ. When both electrons are in the central re- 
gion both charges can be measured and the condition that they have opposite charges removes the events 
where the charge of one of the two electrons is misidentified. In this case the forward-backward asym- 
metry is not significantly affected by charge misidentification. 

The situation is different when one electron is forward and the other central. Only the charge of the 
central electron can be measured. A misidentification of the charge of the electron changes the sign of 
cos 6* and a forward event may be taken as a backward one. This effect can be corrected for if the charge 
misidentification fraction is known. 

To measure the charge misidentification we use two methods: 
• MC method 

We use the two electrons coming from Z and we count the number of events where the generated 
charge (q gen ) and reconstructed charge (q reco ) of the individual electrons differ. We define the 
charge misidentification fraction as follow: 

r = N(q reco / q gen ) 
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• Tag and probe method 

This method is used with the tag electron e\ having the tight electron requirements, py >20 GeV 
and 1 77 1 < 1 .5. The tag and probe electron pair must have a dielectron mass within ±6 GeV of M z . 
The rate at which the second electron <?2 has the same charge gives an estimation of the charge 
misidentification fraction. 

The true asymmetry can be deduced from the raw asymmetry using the relation: 

A FB ,., = A f'-'+ + '- . (17) 
1 — r + — r_ 

where r_ is the fraction of true e~ misidentified as e + and r + is the fraction of true e + misidentified as 

Figure 15 shows the misidentification fraction as function of the electron pseudo-rapidity obtained 
with these two methods. The misidentification fraction is small. The difference between the misidentifi- 
cation fraction for e + and e~ is shown in figure 16. 




Figure 15: Electron charge misidentification 
fraction versus the electron pseudo-rapidity: 
Data method of tag-and-probe measuring the 
charge misidentification fraction (open points), 
and Monte Carlo driven charge misidentifica- 
tion fraction (full points). 



Figure 16: The difference in charge misidenti- 
fication fraction of positrons and electrons ver- 
sus the electron pseudo-rapidity. 



9 Effect of pile-up 

Pile-up originates from the fact that several pp collisions can occur during the same bunch crossing. This 
causes extra activity in the detector and therefore influences the event selection. The effect of pile-up is 
assessed by superimposing additional events on top of the signal when events are simulated; hits from 
several bunch crossings are overlaid. In this study, pile-up events corresponding to 10 33 cmT 2 s _1 were 
investigated. 

The pile-up affects the electron selection efficiency due to the presence of additional activity in the 
events. Figure 17 displays the electron identification efficiency with and without pile-up. We can see 
also in Table 3 the summary of the results for the C-C and C-F events, after all selection cuts. The results 
show a loss of about 3% of signal events. 
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C-C 


C-F 


No pile-up 


7,304,960 


1. 25967 xlO 7 


With pile-up 


7,026,653 


1.217696xl0 7 



Table 3: Number of expected events, at lOOfb -1 , after all cuts, for the signal, and for C-C and C-F events. 
The numbers are shown for the two cases: with and without pile-up. The reference pile-up luminosity 
used is 10 33 cm" 2 s _1 . 



s 1 E ~^77 

1 0.9 ;-%ir 




Figure 17: Effect of pile-up on the electron identification efficiency. The reference pile-up luminosity 
used is 10 33 cm~ 2 s _1 . 



10 Systematic Uncertainties 

We have considered several sources of systematic uncertainties. There are PDF uncertainties and experi- 
mental ones including the energy scale, energy resolution, reconstruction efficiency and the background 
estimation. 

10.1 Parton Distribution Functions 

Since the vector and axial vector couplings of the u and d quarks to the Z boson are different, the forward- 
backward asymmetry is expected to depend on the ratio of the u and d quark parton distribution function. 
Thus, the choice of the parton distribution functions (PDFs) will affect the measured lepton forward- 
backward asymmetry. 

The MRST PDF parametrization [6] is used to assess the uncertainty arising from the PDFs. Thirty 
eigenvectors are used in this parametrization to indicate the effect of ±la variations. Figure 18 shows 
the e + e~ asymmetry for each eigenvector. The deviation of Aps from the central value is of the same 
order of magnitude as the statistical accuracy. 

To study the effect of the PDFs on the asymmetry, a PDF reweighting technique is used to reduce the 
need for Monte Carlo generations and simulation. Thus, we generate the events with the best fit central 
value (PDFq) and then we reweight to PDFi of the set i by weighting each event via: 

fpp Fl {xi , Q 2 ,flavi).fpDF,(x2, Q 2 ,flav 2 ) (18) 
fpDFi (x\,Q 2 ,flavi) -fpDF\ (x2,Q 2 ,flav 2 ) 

where xfpjjf(x,Q 2 ,flav) is the parton momentum distribution for flavour, flav, at scale, Q 2 , and mo- 
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mentum fraction, x. 



ATLAS 




Figure 18: The forward-backward asymmetry for each MRST eigenvector; the red star is the central 
value. 



10.2 Background subtraction 

The determination of Apg (with real data) requires knowledge of the number of background events and 
the forward-backward charge asymmetry of the background events. Thus the central value of Apg we 
showed above can be obtained from the data events by subtracting the background events in the forward 
and backward regions separately from the raw forward-backward asymmetry. The number of back- 
ground events estimated thus gives rise to a source of systematic uncertainties on Aps- 

The systematic uncertainty is evaluated by varying the estimated numbers of background events by 
30%. The uncertainty value is taken as the shift in Af B . The largest shift is less than 0.01%. 

10.3 Detector performance 

Various effects due to uncertainties on the knowledge of the detector performance have to be taken into 
account: 

Energy scale The electron energy scale uncertainty, which arises from calorimeter calibration uncer- 
tainties, affects the forward-backward asymmetry by causing a shift in the e + e~ invariant mass over 
which we integrate Afb- The effect is significant in the Z-pole region as can be seen in Figure 1. To take 
these effects into account, the central calorimeter scale is varied by 0. 1 % and the forward calorimeter 
scale is varied by 0.5% to estimate the systematic uncertainties. The positive and negative variations are 
considered separately and the largest shift is taken as the uncertainty. 

Reconstruction efficiency The impact of the uncertainties on the electron reconstruction in the Afb 
measurement can be estimated by removing 0.2% and 0.5% fraction of the reconstructed electrons in the 
central region and forward region respectively. These values used here are based on our study on the 
forward calorimeters reconstruction. 
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Energy resolution The forward backward asymmetry is sensitive to the variations in the energy res- 
olution. These variations are mainly due to the dead material distribution, known with an insufficient 
accuracy. To evaluate the impact of this contribution the central electron energy is degraded by an extra- 
term 0.01 E T and the forward calorimeter energy by 0.05 E T . 

Charge identification A systematic variation in charge misidentification fraction would directly trans- 
late into uncertainties of the forward-backward asymmetry. The systematic uncertainty is estimated by 
comparing the measurement as performed as in the data (tag and probe method), to that using the true 
Monte Carlo. As it is shown in figure 15 a difference of about 0.1% is observed. 

Table 4 reports all the systematic uncertainties at 100 fb _1 . As expected, the dominant contribution 
to the overall uncertainty is from the PDF uncertainties. 



Source 


8A FB (abs) 


8 sin 2 Ogff (abs) 


Energy scale 


2.7 xlO" 5 


1.5xl0- 5 


Reco. Eff. 


3.4xl0- 5 


1.9xl0- 5 


Energy resol. 


1.9xl0" 6 


l.lxlO- 6 


Charge ID 


2.6 xlO" 5 


1.4xl0" 5 


Background subtraction. 


< 10~ 3 


< 10~ 5 


PDFs 




+ 1.3x10-4 


a and b parameters 




3 x 10" 5 


Statistical error 


2.7xl0- 4 


1.5 xlO" 4 



Table 4: Summary of the systematic and statistical uncertainties on Ap b and sin 2 d'^'j- for events with at 
least one electron in the central region (C-F). The uncertainty on sin 2 6'jJ'jr is determined using Eqs. 11 
and a and b parameters from figure 21. 



11 Results 

Figure 19 shows the dielectron invariant mass spectrum expected in terms of the number of events per 
GeV for an integrated luminosity of 100 fb -1 after the application of cuts CI, C3 and C4. The yellow 
histogram corresponds to the signal contribution. The light blue displays the contributions from dijet 
events. The green and violet histograms display the contribution of W + X and tt backgrounds respec- 
tively. The background contributions were obtained with MC samples with a fast detector simulation. 
The signal contribution was obtained with a full detector simulation. 

Table 2 shows the expected number of background and signal events for 100 fb -1 of integrated 
luminosity after the application of cuts C1-C4. It indicates that the contribution from background events 
is at the level of 0.2%. 

In Figure 20 we display the variation of the charge asymmetry versus the rapidity of the two elec- 
trons. It is observed that the asymmetry increases when allowing the second electron to be up to \r]\ = 
4.9. Using \y e + e - | >1 and going from C-C to C-F events, the integrated asymmetry increases from 1.3% 
to 2.7%. The precision on the forward-backward asymmetry improves from 3.7 x 10~ 4 to 2.7 x 10~ 4 . 

Using the parameters a = 0.23 ±0.03 and b = 1.832 ±0.255, of Eqs. 11, derived from the linear fit 
of the figure 21, we can estimate the error expected for sin 2 6 e J'f from a measurement of the forward- 
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Figure 19: Dielectron invariant mass distribution for the signal and background events normalized to 100 
fb -1 , with the final selection except that the cut C2 is removed. 




0.5 1 1.5 2 2.5 3 3.5 

y„ 



Figure 20: Forward-backward asymmetry versus dielectron rapidity in the C-C events (open points) and 
in the C-F events (full points). 



backward asymmetry at the Z pole, for an integrated luminosity of 100 fb 1 : 

Ssm 2 d' e J f = (1.5(stat) ±0.3(exp) ±2.4(PDF)) x 10~ 4 (19) 



12 Conclusion 

We report on a detailed study of the forward-backward charge asymmetry Apb, at LHC with the AT- 
LAS detector, of electron pairs resulting from the process pp —>■ Z/y*X —> e + e~X. This measure- 
ment provides a test of the Standard Model. In the vicinity of the Z-pole this measurement can be 
used to determine the effective weak mixing angle sin 2 6^. The precision on sin 2 6^ obtained is 
8sm 2 d' e J f = (1.5(stat) ±0.3(exp) ±2.4(PDF)) x 10~ 4 for a 12 GeV mass window around the Z mass, 
and 100 fb -1 of integrated luminosity. Electron identification in the forward region (2.5 < |t]| < 4.9) of 
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Figure 21 : Forward backward asymmetry Afb versus the weak mixing angle sin 2 offf at the Z pole. The 
straight line is a x 2 fit to the points shown. Fast simulated events are used. 

the ATLAS detector is very important for the measurement. In this region an electron ID efficiency of 
80% is achieved with less than 3% QCD background. 

The main systematic effects relevant for forward-backward asymmetry measurements with 100 fb _1 
of data are addressed, including systematic uncertainties on detector effects and MRST PDF uncertainty. 
An advanced technique was developed and used to estimate the error due to the PDF uncertainties. This 
study showed that the uncertainty in the weak mixing angle determination due to PDF's is of the same 
order as the measurement statistical error, which means that the precision on the weak mixing angle 
at LHC can be competitive to the current world average. In addition we expect that in the future the 
knowledge of the PDF's will improve from the constraints imposed by Tevatron, HERA and first LHC 
measurements (e.g. using W asymmetry), and the systematic uncertainty due the uncertainty in the 
PDF's should decrease by the time ATLAS high luminosity data is available. If this is not the case, the 
asymmetry measurements can be used, conversely, to constrain the parton distribution functions. 
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Abstract 

This note presents studies of the sensitivity of the ATLAS experiment to Stan- 
dard Model diboson (W + W~, W ± Z, ZZ, W ± y, and Zy) production in pp 
collisions at = 14 TeV, using final states containing electrons, muons and 
photons. The studies use ATLAS simulated data, which include trigger in- 
formation and detector calibration and alignment corrections. The influence of 
backgrounds on diboson detection is assessed using large samples of fully sim- 
ulated background events. The cross-section measurement uncertainties (both 
statistical and systematic) are estimated as a function of integrated luminos- 
ity (from 0.1 to 30 ftr 1 ). The studies show that the Standard Model W + W~, 
W ± Z, W ± y, and Zy signals can be established with significance better than 
5a for the first 0.1 ftr 1 of integrated luminosity, and the ZZ signal can be 
established with 1 ftr 1 of integrated luminosity. The ATLAS experiment's 
sensitivity to anomalous triple gauge boson couplings is also estimated. The 
anomalous triple gauge boson coupling sensitivities can be significantly im- 
proved, even with 0.1 ftr 1 of data, over the results from the Tevatron that use 
1 ftr 1 of data. 

1 Introduction 

This paper presents studies of the ATLAS experiment's sensitivity to diboson (W + W~, W ± Z, ZZ, W ± y, 
Zy) production using lepton and photon final states, and the corresponding ability to set limits on anoma- 
lous triple gauge boson couplings (TGC). The analysis of diboson production at the LHC provides an 
important test of the high energy behavior of electroweak interactions. Vector boson self-couplings 
are fundamental predictions of the Standard Model [1], resulting from the non-Abelian nature of the 
SU(2)l xU(1)y gauge symmetry theory, which was demonstrated by precision measurements of 
and ZZ pair production at LEP II [2]. 

Any theory predicting physics beyond the Standard Model while maintaining the Standard Model 
as a low-energy limit may introduce deviations in the gauge couplings at some high energy scale. Pre- 
cise measurements of the couplings will not only provide stringent tests of the Standard Model, but will 
also probe for new physics in the bosonic sector. These tests will provide complementary information 
to other direct searches for new physics at the LHC. Many models predict deviations of vector boson 
self-couplings from the Standard Model at the 10~ 3 — 10~ 4 level [3]. Experiments that can reach this 
sensitivity could provide powerful constraints on these models. The signature for such anomalous cou- 
plings is enhanced diboson production cross-sections, particularly at high transverse momentum (p T ) of 
the bosons. Experimental limits on non-Standard Model TGC's can be obtained by comparing the shape 
of the measured pj or mass distributions (or transverse mass, Mj, for final states involving W) with 
predictions, provided that the signal is not overwhelmed by background. 

The analysis uses over 30 million fully simulated and reconstructed events, with a detector layout 
and trigger system that reflects the ATLAS experiment as it will operate at LHC turn-on at 14 TeV 
center of mass energy, thus providing a realistic understanding of the detection of these diboson final 
states. A Boosted Decision Tree [4] technique is applied to selected channels, significantly enhancing 
measurement sensitivities. These are among the ways this study improves on our understanding and the 
results of the previous ATLAS diboson studies [5]- [10]. 
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1.1 Diboson production cross-sections 

Tree-level Feynman diagrams for electroweak diboson production at hadron colliders are shown in Fig- 
ure 1. The s-channel diagram contains the vector-boson self-interaction vertices of interest here. The 
cross-sections are calculated to next-to-leading-order (NLO) in [11]- [13]. The Standard Model diboson 
production cross-sections are listed in Table 1. 

q v 2 

>~^^ILtGC vertex 

q Vi 
s-channel 

Figure 1: The generic Standard Model tree-level Feynman diagrams for diboson production at hadron 
colliders; V,Vi,V2 = {W,Z,y}. The ^-channel diagram contains the trilinear gauge boson vertex. In the 
Standard Model, only WWy and WWZ vertices are allowed. 



Table 1: The Standard Model diboson production total cross-sections, calculated to the NLO, at the 
Tevatron (y/s = 1.96 TeV) and the LHC (y/s = 14 TeV). The references in the first column indicate the 
MC generators used for the calculations, with parton density function (PDF) CTEQ6M and the elec- 
troweak parameters [14]. The theoretical uncertainty from the PDF and the QCD scale factor is typically 
5%. 



Diboson mode 


Conditions 


yfs= 1.96 TeV 


y/s=U TeV 






a[pb] 


a[ P b] 


W + W- [15] 


IV-boson width included 


12.4 


111.6 


W^Z [15] 


Z and W on mass shell 


3.7 


47.8 


ZZ [15] 


Z's on mass shell 


1.43 


14.8 


W ± y [16] 


E y T > 7 GeV, AR(£, y) > 0.7 


19.3 


451 


Zy [17] 


E y T >l GeV, AR(£, y) > 0.7 


4.74 


219 



The LHC diboson production rates will exceed those of the Tevatron by at least a factor 100 (10 
times higher in cross-sections and at least 10 times higher in luminosity). Furthermore, because the 
energy reach at the LHC will be 7 times higher than at the Tevatron, the LHC sensitivity to anomalous 
TGC's is expected to be improved by orders of magnitude over that which can be reached at the Tevatron 
or LEP. 

1.2 Effective Lagrangian for charged TGC's 

The most general effective Lagrangian, that conserves C and P separately, for charged triple gauge boson 
interactions is [19]: 
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where V refers to the neutral vector-bosons, Z or y, X^ v = d^X v — d v X^ and the overall coupling constants 
gwwv are given by gwwy = —e, gwwz = — e cotdw, with e the positive electron charge and dw the weak 
mixing angle. The Standard Model triple gauge boson vertices are recovered by letting g\ = K v = 1 
and X v = 0. Experimentally, deviations from the Standard Model couplings is searched for; thus the 
anomalous coupling parameters are defined as 

Agf = gf - 1 , Ak 7 = K y - 1 , Ak z = k z — 1 , A r , and X z . 

Note that electromagnetic gauge invariance requires g\ = 1 or Agf = 0. 

Studies of three different diboson final states, W ± Z and W ± y will provide complementary 

sensitivities to the charged anomalous TGC's [17]. For example, the Axy terms in production are 

proportional to s, defined as the square of invariant mass of the vector-boson pair, whereas these terms 
are only proportional to y/i in W ± Z and W ± 7 production. production is thus expected to be more 

sensitive to Akv than W ± Z and W ± 7 production. Conversely, W ± Z production is expected to be more 
sensitive to Agf than production because terms in Agf are proportional to s in W ± Z production. 

The A -type anomalous couplings have an s dependence in all three cases, thus the sensitivities will be 
enhanced at the high center-of-mass energy of the LHC. 

With non-Standard Model coupling parameters, the amplitudes for gauge boson pair production grow 
with energy, eventually violating tree-level unitarity. The unitarity violation is avoided by introducing an 
effective cutoff scale, A [18]. The anomalous couplings take a form, for example, 

Ap(s) = (iTW 

where Afc is the coupling value in the low energy limit. The scale A is physically interpreted as the mass 
scale where the new phenomenon, which is responsible for the anomalous couplings, would be directly 
observable. The value of n = 2 is used for charged anomalous TGC, and n = 3 for neutral anomalous 
TGC. 



1.3 Effective Lagrangian for neutral TGC's 

In the Standard Model, neutral boson pairs, ZZ and Zy, are produced via the ^-channel diagrams shown 
in Figure 1. While the Standard Model ZZZ and ZZy triple gauge boson couplings are zero at tree level, 
anomalous couplings may contribute. This study considers the effect of anomalous couplings on the pro- 
duction of pairs of on-shell Z bosons only. In this case, the most general form of the Z a (q 1 )Z^ (#2 ) V^ 1 (P) 
(V = Z, 7) vertex function which respects Lorentz invariance and electromagnetic gauge invariance may 
be written as [20] 

8zzv r a zz y = e^j^i ift{F»4* +P p 8 n + ifIe^( qi - q2 ) P ] 

where M z is the Z-boson mass and e is the positive electron charge; qi,q 2 and P are the 4-momenta of the 
two on-shell Z bosons and the ^-channel propagator, respectively. The effective Lagrangian generating 
the g zzv vertex function is 

L=-^[fnd,V^)Z a (d a Z p )+g(d-V a ,)Z^Z p ], 

where V^ v = d^V v — d v V^ and Z^ = ^e^ vpa Z pa . The couplings fj (i = 4, 5) are dimensionless com- 
plex functions of qf, q\ and P 2 and are zero at tree level. All couplings are C odd; CP invariance forbids 
fj, while parity conservation requires that / 5 y vanishes. Because ff and fj are CP-odd, contributions 
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to the helicity amplitudes proportional to these couplings will not interfere with the Standard Model 
terms, and hence ZZ production is not sensitive to the sign of these couplings. The CP conserving cou- 
plings / 5 y contribute to the Standard Model cross-section at the one-loop level, but this contribution is 

^(i<r 4 ) [20]. 

1.4 Current Tevatron results on diboson physics 

Diboson production measurements and studies of anomalous TGC's have been performed at the Tevatron 
with the CDF and DO experiments, using up to 2 fb -1 of integrated pp luminosity. Diboson cross-section 
measurements using el\i decay modes, and their event statistics, measurement precision, and background 
events are summarized in Table 2. The measurments from Tevatron experiments are consistent with the 
Standard Model predictions based upon NLO matrix element calculations. 



Table 2: Summary of Tevatron pp — > diboson cross-sections. For the W + W , W ± Z, and ZZ channels 
total production cross-sections are quoted. 



Process 


Source 


L 


observed 


background 


cr(data) [pb] 


a(theory) 






fb- 1 


events 


events 


± (stat)±(sys)±(lum) 


[pb] 


w+w~ 


CDF [21] 


0.83 


95 


38±5 


13.6±2.3±1.6± 1.2 


12.4±0.8 


(ee, Hfi, efl) 


DO [22] 


0.25 


25 


8.1±.5 


13.8±4.1±1.1 ±0.9 




W±Z 


CDF [23] 


1.1 


16 


2.7±0.4 


5.0^}j±0.4 


3.7±0.3 


(£ ± ve + £-) 


DO [24] 


1.0 


13 


4.5±0.6 


2.7 +1.7-1.3 (total) 




Zy 


CDF [25] 


0.2 


72 


4.9±1.1 


4.6 ±0.6 (sta+sys) ± 0.3 


4.5±0.3 


(£ + £-y) 


DO [26] 


1.0 


968 


117±12 


4.96 ±0.3 (sta+sys) ± 0.3 


4.7±0.2 


W ± y 


CDF [25] 


0.2 


323 


114±21 


18.1 ±3.1 (sta+sys) ± 1.2 


19.3±1.4 


(£±vy) 


DO [27] 


0.16 


273 


132±7 


14.8 ±1.9 (sta+sys) ± 1.0 


16.0±0.4 


ZZ 


CDF [28] 


1.9 


2 


0.014 


1.4±^±0.6 


1.5±0.2 


(£+£-£+£-) 


DO [29] 


1.0 


1 


0.13 


<4.4 





The Tevatron's pp collisions produce the charged states of W ± Z and W ± y. These states can be used 
to study the W + W~Z and the W+W - 7 couplings independently, which is in contrast with the anomalous 
TGC measurements made at LEP [32] from the final state, where certain assumptions relating the 

W + W~Z and the /couplings were made. The Tevatron limits for the WWy and WWZ anomalous 

TGC's are summarized in Table 3. These limits will improve significantly by combining the constraints 
from the W ± y, W ± Z and channels, and by increasing the datasets using the expected integrated 

luminosity of 6 fb -1 at the end of the Tevatron running. 

2 Signal and background modeling and simulated data samples 
2.1 MC generators used to produce fully simulated events 

The diboson physics analyses focus on leptonic decay channels of the boson pairs (W + W~, W ± Z, ZZ, 
W ± y, and Zy) produced in the proton-proton collisions at the LHC. 

Diboson productions of the W ± Z, and ZZ final states, as well as the subsequent pure lep- 

tonic decays, are modeled by the MC@NLO [15] Monte Carlo generator. The generator incorporates the 
next-to-leading-order (NLO) QCD matrix elements into the partem shower by interfacing to the Her- 
WIG/Jimmy [33] programs. A branching ratio of 0.0336 for Z £ + £~ and 0.108 for W + -> l+v is used 
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Table 3: Anomalous gauge coupling limits (95% C.L.) for WWy and WWZ from the Tevatron experi- 
ments, with A = 2 TeV. 



Coupling 


Source 


Mfb- 1 ) 




Akz 


Ak 7 




WWy from W ± y 


DO [27] 


0.16 






[-0.88, 0.96] 


[-0.2, 0.2] 


WWZ from W ± Z 
WWZ from W±Z 


DO [24] 
CDF 


1.0 
1.9 


[-0.17,0.21] 
[-0.13,0.14] 


[-0.12, 0.29] 
[-0.82, 1.27] 






WWZ = WWy 
from W+W- 


DO [30] 


0.25 


[-0.31,0.33] 


[-0.36, 0.33] 






from W + W-,W ± Z 


CDF [31] 


0.35 


[-0.18, 0.17] 


[-0.46, 0.39] 







for each lepton flavor (e, n, t). The gauge-boson decays into tau leptons are included in the MC event 
generator and these tau leptons decay to all the possible final states. Hard emission is treated as in NLO 
computations and soft/collinear emission is treated as in a regular parton shower MC. The matching be- 
tween these two regions is smooth (no double-counting). W -boson width and spin-spin correlations are 
included in the generator. However, 'zero-width' approximations are used in W ± Z and ZZ calculations, 
and no Z/y* interference terms are included. MC@NLO does not include anomalous triple gauge boson 
couplings. The process of production via gluon-gluon fusion and the leptonic decays of the W, 

gg -> ->• £v£'v, is modeled by the MC generator gg2ww [34]. The W ± y and the Zy produc- 

tion processes and subsequent leptonic decays of the W ± and the Z are modeled by the Pythia MC 
generator [35], which only incorporates the leading-order (LO) QCD matrix elements into the parton 
shower. To include the off-shell Z and y* into the ZZ analysis, we have also used PYTHIA to generate 
the Z/y* +Z/y* -> £ + £~ events. The Z/y* mass threshold is set to 12 GeV in Pythia. Table 4 
list all the diboson signal samples used in this paper. 

Major physics backgrounds for diboson signal detection come from top pairs and hadronic jets as- 
sociated with W or Z gauge bosons. We have used MC@NLO to model it —>■ £ + X production (700k 
events). The inclusive W + X and Z + X (X = jets, or y) processes are modeled by the Pythia (30M 
events) and Alpgen [36] (1.1M events). 

Whenever LO event generators are used, the cross-sections are corrected to NLO by using k-factors 
from NLO matrix element calculations to normalize the expected signal and background events. 

2.2 MC generators for TGC studies 

Monte Carlo generators BosoMC [16] and BHO [17] are used for anomalous TGC studies. These MC 
programs are numerical parton level generators. They are used to calculate both LO and NLO cross- 
sections for all five diboson final states (W + W~ , W ± Z, ZZ, W ± y, Zy) with anomalous coupling pa- 
rameters. However, they do not include parton showers automatically. We use the BHO MC to model 
the ZZ, and Zy production cross-sections and kinematics with Standard Model and anomalous 

couplings. BosoMC is used for W ± Z and W ± y diboson final-state TGC studies. The calculated dibo- 
son production rates from these generators are accurate to NLO. The W ± Z and ZZ production 
calculations with the Standard Model couplings are compared with the MC@NLO calculations using 
PDF CTEQ6M. We found that both cross-sections and kinematic distributions are in good agreement, as 
shown in Figure 2. The left plot shows the W ± Z production differential cross-section as a function of 
the transverse mass of the W ± Z system. The right plot shows the production differential cross- 
section as a function of the W transverse momentum. The discrepancies between MC@NLO and the 
BHO MC are within 2%. 
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Table 4: Diboson signal production processes, cross-sections including branching ratios of W/Z leptonic 
decays and fully simulated number of MC events. The MC simulation 'filter' is the event selection at 
the generator level. The corresponding filter efficiencies are given in the table. We also indicate the MC 
generators used to produce the MC events and to calculate the cross-sections given in this table. 



Process 

qtf-tw+w- ->e+ve-v 
gg->w + w- -4fvrv 

(£ = e, n, t) 

qq' ^W + Z^£+v£ + r 

qq' -4rz^rvf + r 

(£ = e, n; Z on mass shell) 

q q> -^zz^ i+re+r 
qq' ^zz^e+e-vv 

(£ = e, n; Z on mass shell) 

qq' -^ZZ^ £+£-£+£- 

(£ = e, n, r,M z /r > 12 GeV ) 

(4 leptons (e, n), p ( T > 5 GeV, |tj £ | < 2.7) 

qq' -^W + y -> £ + vy 
qq' — ► W~y — > £~vy 
(£ = e, At; E y T > 10 GeV) 

qq' ^Zy^£ + £~y 

(£ = e, n; ^>10GeV) 



5-section (fb) £ filter N M c 



11718 
540.0 



1.0 180,000 MC@NLO 
0.96 180,000 gg2ww 



441.7 1.0 50,000 MC@NLO 

276.4 1.0 50,000 MC@NLO 



66.8 1.0 49,250 MC@NLO 

397 1.0 118,000 MC@NLO 



159 0.219 43,000 Pythia 



10220 1.0 38,400 Pythia 

6820 1.0 25,600 Pythia 



1.0 66,000 Pythia 



A somewhat different procedure is used in the estimation of neutral triple gauge couplings from 
ZZ events. In this case, the signal expectation with anomalous couplings was determined from the 
leading order Monte Carlo calculation of BHO, corrected using a p T dependent k-factor derived from 
MC@NLO. 



3 Diboson event selection 

This section discusses features of five diboson signals, (W ± Z, W ± y, W + W~, Zy, ZZ), the major back- 
grounds and the analysis cuts required to discriminate between them. The varied event topolology of 
each diboson signal precludes a common set of universal cuts. Two analysis approaches are employed. 
The first is based on a sequence of straight cuts on kinematic quantities. The second is a refined, multi- 
variate analysis based on Boosted Decision Tree (BDT) which is briefly described in Section 3.2 . Some 
of the diboson analyses employ both techniques. The major results of these analyses are included here. 

3.1 Physics objects 

The ATLAS detector and its performance is described in detail elsewhere [37]. A brief description of the 
physics objects used in diboson analysis is given below. 

The major physics objects used in diboson physics analysis are electrons, photons, muons, missing 
Ej ($t), and hadronic jets. Electrons are identified by their distinctive pattern of energy deposition in 
the calorimeter and by the presence of a track in the inner tracker that can be extrapolated from the 
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M^WZ)(GeV) VT PJGeV) 



Figure 2: Comparison of MC@NLO MC to BHO MC for W ± Z and W + W~ production. The histograms 
are normalized to production cross-sections. Left plots: the W ± Z Mr distribution from W ± Z production. 
Right plots: the W + pj distribution from production. 

interaction vertex to a cluster of energy in the calorimeter. To ensure high trigger efficiency, for events 
with a single electron, the transverse energy of an single electron must satisfy Ej > 25 GeV. For events 
with dielectrons, both electrons are required to have Ej > 10 GeV. The electrons must be isolated from 
other energy clusters. An electron is required to pass a set of cuts on shower shape, track quality and track 
to calorimeter cluster matching. Photon identification is similar to an electron in the EM calorimeter, 
but no charged tracks in the inner tracker should match the EM energy cluster. The average electron 
identification efficiency in the barrel is about 75% and in the endcaps about 60%. 

Muons are reconstructed using information from the outer muon spectrometer (MDT chambers and 
trigger chambers), the inner tracking detectors and the calorimeters. Muons are identified with a tracking 
algorithm that associates a track found in the muon spectrometer with the corresponding inner detector 
track, after the former is corrected for energy loss in the calorimeter. The combined muon detection 
rapidity coverage is |tj| < 2.5. The minimum pj of reconstructed muon track is 5 GeV. The candidate 
muons are required to be isolated in the calorimeter and inner tracker to minimize the contributions of 
muons originating from hadronic jets. The average muon identification efficiency is about 95%. 

The hadronic jets are reconstructed using the fixed-cone jet algorithm. The cone size used in this 
analysis is 0.7. The jet seed threshold on the transverse energy in a tower is set to E s = 1 GeV, and the 
final energy cut on a jet is Ej > 7 GeV. With this cut the minimum measurable jet Ej should be 20 GeV. 

Missing transverse energy, fj, is calculated from the energy deposited in all calorimeter cells and 
from muons. A collection is applied for the energy lost in the cryostat. For diboson events with neutrinos 
in final states, the $t resolution we found to be 6.5 GeV based on our studies using Z —>■ MC events. 

The ATLAS trigger consists of three levels of event selection: Level- 1 (LI), Level-2 (L2) and the 
event filter (EF). The L2 and EF together form the High-Level Trigger (HLT). According to the present 
physics trigger menu for initial running, diboson candidate events with multi-lepton final states will be 
recorded with single muon, single electron, dielectron and dimuon triggers. The trigger efficiencies 
for diboson events, defined as the fraction of events accepted by the analysis cuts that have satisfied 
the trigger requirements, are expected to be in a range of 95% - 100%, with the exception of a lower 
efficiency (~ 80%) for Wy events. The events that are accepted by the trigger are used for the analysis 
results. 
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3.2 Boosted Decision Trees 

A rather new multivariate analysis technique, Boosted Decision Tree, has been used in our analysis to 
improve the detection sensitivity for diboson signals. BDTs have been used in HEP data analysis in 
recent years [38], and details can be found in the Ref. [4]. A further development, allowing for weighted 
events, is used in these diboson physics studies [39]. 

The BDT technique involves a fast 'training' procedure for event pattern recognition. It works with 
a set of data including both signal and background. Data are represented by a set of physics variable 
distributions. A decision-tree splits data recursively based on cuts on the input variables until a stopping 
criterion is reached. Every event ends up in a signal (score=l) or a background (score=-l) leaf of the 
decision-tree. Misclassified events will be given larger weights in the next tree (boosting). This procedure 
is repeated several hundreds to thousands of times until the performance is optimal. The discriminator 
from the BDT training is the sum of the weighted scores from all the decision-trees. If the total score for 
a given event is relatively high this event is most likely a signal event, and if the score is low it is likely a 
background event. 

3.3 W ± Z -> i+vi+r selection 

The W ± Z production at hadron colliders uniquely probes the WWZ trilinear gauge boson coupling as 
shown in Figure 1, the Standard Model tree-level Feynman diagrams. 

W ± Z candidate events have three charged lepton final states, referred to as trileptons, produced when 
Z — ► £ + £~ and W ± — ► t°V, where £^ are e ± or }X ± . Standard Model backgrounds can be highly sup- 
pressed by requiring three isolated high pj leptons and large missing transverse energies (Jf.j). However, 
the pure leptonic decay mode of the W ± Z events only has a 1.5% branching ratio [2]. The total cross- 
sections times the branching ratio are 442 fb and 276 fb for W + Z and W Z, respectively. Event selection 
with high efficiency is important for early observations of this channel. 

Major backgrounds to the W ± Z trilepton final states come from ZZ —> £ + £~£ + £~ with one lepton 
undetected; Z + X —>■ £ + £~ +X (X=jets, or photon) with a jet or a photon faking a lepton; and tt — ► 

w+w-bb->£+e+e+x. 

After a trigger, W ± Z events are selected in two stages: (1) pre-selection with relatively loose cuts, 
and (2) final selection with tightened cuts or with BDT multivariate discriminator. The overall trigger 
efficiency for W ± Z events with trileptons in the final state is (98.9± 1.0)% using a combination of single 
lepton and dilepton triggers. 

The pre-selection of the W ± Z events is done by identifying three leptons (at least one lepton with 
Pt > 25 GeV) and requiring $j > 15 GeV in an event with characteristics consistent with Z dilepton 
decays (M u = (91.18 ±20) GeV) and W leptonic decays (10 GeV < M T {£,# T ) < 400 GeV). The overall 
pre-selection efficiency for W + Z events is 26%, and for W Z events is 29%. The acceptance difference 
is due to the differing tj distributions of the leptons decaying from W + and W~ . After the preselection, 
the known background is about 70 times larger than the signal. 

To bring the background level below the signal, a rejection power better than 100 in the second stage 
of event selections is required. To achieve this, events with Jf.j < 25 GeV and with significant hadronic 
jet activities are rejected. Events must contain no more than one hadronic jet with E J T et > 30 GeV and 
\r\i et \ < 3.0. The transverse recoil of the W ± Z system, calculated using the vector sum of the pj of 
the charged leptons and fa, is required to be less than 120 GeV and the sum of the hadronic transverse 
energy must be less than 200 GeV. These requirements effectively reject the tt and hadronic jet events. 
To reject the Z + X background, the third lepton (not from the Z boson decay) pj is required to be 
greater than 20 GeV and 25 GeV for muons and electrons, respectively. Any pair of leptons must satisfy 
AR = a/ (Atj 2 +A0 2 ) > 0.2. Leptons must be isolated from other energy clusters based on calorimeter 
and inner tracker measurements. All three leptons must be associated with isolated tracks that originate 
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Table 5: Number of expected W ± Z signal (Nwz) and background (Nb) for 1 fb 1 data with cut-based 
analysis. 





WZ 


ZZ 


ft 


Z+jet 


Z + 7 


Drell-Yan 


Total bkg 


Nwz/Nb 


N events 


53 


2.7 


.023 


1.9 


0.18 


2.5 


7.3 


7.3 


% of background 




37 


.32 


26 


2.5 


35 







from the same collision point. The dilepton invariant mass best matching the mass of Z must be within 
the Z-mass window of \Mz — M^ Lfl \ < 12 GeV or \Mz — M ee \ < 9 GeV. These mass windows are set by 
the mass resolutions. The Mj determined by the third lepton and the Tf.j must be within the transverse 
W-mass window: 40 GeV < M T < 120 GeV. 

The total and selected number of the signal and the background events for each trilepton final state, 
and for 1 fb -1 of integrated luminosity, are listed in Table 5. The overall signal efficiency is 8.7% and 
7.1% for W Z and W + Z, respectively. For 1 fb -1 of data, 53 W ± Z signal events and 7 background events 
are expected. The dominant background contributions are from ZZ, Z+jet and Drell-Yan processes, while 
Zy and it contribute a small fraction of the total background events. 

Table 6: Number of expected W ± Z signal (Nwz), and background (Nb) for 1 fb -1 of data with BDT 
analysis using the cut BDT > 200. 





WZ 


ZZ 


it 


Z+jet 


Z + y 


Other 


Total bkg 


N W z/Nb 


N events 


128 


7.7 


2.8 


2.5 


2.0 


1.1 


16 


7.9 


% of background 




48 


17 


16 


12 


7.0 







For 0.1 fb -1 of integrated luminosity, only 5 signal events (Ns) with 0.7 background event (Nb) 
contamination are expected. The W ± Z detection significance, defined as the probability from Poisson 
distribution with mean Nb to observe equal or greater than Ns + Nb events, converted in equivalent 
number of sigmas (standard deviations) of a Gaussian distribution, will be 3.6a only. To improve the 
detection sensitivity, the BDT analysis technique is employed. This BDT analysis is conducted with a 
total of 1000 trees with 20 tree-split nodes. Based on the variables used in the cut-based analysis, and the 
BDT training Gini-index (a measure of a variable effectiveness in separating signal from background), 
a total of 22 kinematic and topology variables are selected for the BDT training. About 12000 pre- 
selected signal events and 18000 pre-selected background events are used in this BDT-based analysis, 
where 50% of the signal and background events are used for the training, and another 50% of statistically 
independent events allocated to the BDT test sample sets. The BDT output discriminator from the testing 
sample, used to separate the signals from the background, is shown in Figure 8 (right), see Section 4.1. 
This spectrum will be used to determine (fit) the W ± Z production cross-section. 

Results of the W ± Z event selection with the cut BDT > 200 are shown in Table 6. Expected numbers 
of signal and background events are given for 1 fb -1 of integrated luminosity. The estimated background 
uncertainties are 15-20% due to the limited number of simulated events available. The overall W ± Z 
event selection efficiency is about 13.7% for signal, resulting in 12.8 signal and 1.6 background events 
for 0.1 fb -1 . The W ± Z detection significance is expected to be 5.9a (including both statistical and 20% 
systematic uncertainties) for 0.1 fb -1 of integrated luminosity. 
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3.4 W ± y — > £ ± vy selection 

The W ± y signal events are modeled with Pythia, which includes tree-level diagrams as shown in Figure 
1 for W production with initial state radiation (ISR) (the t— and u— channels) and the s— channel pro- 
duction depending on the triple-gauge-coupling WWy vertex. The WWy vertex introduces a destructive 
interference of zero amplitude at cos Qq Y = ±1/3 for W ± production, where Q^y is the photon scattering 
angle to the incoming anti-quarks. 

The W ± y diboson events are selected from pp collisions using pure W leptonic decays. The experi- 
mental signature is a final states with one high pj lepton (electron or muon), one high pj photon and 
large Major backgrounds are from the processes: 

• Inclusive W +X production with W +X —>■ £yv +X , where the y is from final state radiation (FSR) 
from the lepton. 

• Inclusive W +X — >• £v +X productions, with X = jets and the jets faking a photon. 

• Inclusive Z + X — >• tt + X productions, with one lepton escaping detection, and with X = yor jets 
faking a photon. 

A photon isolation cut is effective in suppressing these backgrounds. 

To study the W ± y detection sensitivity about 1.3 million inclusive W events and about 100,000 in- 
clusive Z events were generated with the Pythia MC generator. In these datasets, the photon transverse 
energy threshold is 10 GeV and the lepton and photon separation, AR(£, y) was required to be greater 
than 0.7. 

The W ± y candidates are inclusive ^7 or jx ± y events having one electron or muon observed and the 
absence of the oppositely charged lepton of the same flavor. The photon selected is the most energetic 
one in the events. Efficiencies for three trigger types are investigated: isolated muon with pj > 20 GeV, 
isolated electron with p T > 22 GeV and photon with E T {y) > 55 GeV. The overall W^y trigger efficiency 
is about 80% based on our study. 

Background to the W ± y signal is dominated by inclusive events with radiated jets faking a 
photon. Contamination from inclusive Z events with an undetected lepton is also significant. Figure 3 
shows the W ± y signal (first column) scatter plots compared to those for major backgrounds: the inclusive 
W events with final state radiation (FSR) (the 2nd column), and with fake photon (the third column), and 
the Z events with one lepton escaping detection and a photon of any type reconstructed (the 4th column). 

The BDT method is used to select the W ± y signal events. Three trainings are done to separate: 1) 
The £yv events with FSR photons from other sources, 2) The W ± y signal photons from fake photons, 
and 3) The signal photons from the contamination of Z inclusive events. Nineteen variables are used in 
the BDT analysis. Cuts are applied to the BDT output spectra for the three trainings. The cuts chosen 
for signal selection correspond to a W ± y selection efficiency of 65% with a signal to background ratio of 
0.95 (0.98) for electron (muon) final states, respectively. The numbers of events selected with the BDT 
cuts are listed in Table 7. The selected events are further required to pass the trigger requirements as the 
final accepted events, which are listed in the Table as well. 

The BDT discriminates signal photons effectively from FSR and fake photons in the high Ej{y) re- 
gion. This preserves a high detection efficiency in the region that is sensitive to discovery of phenomenon 
beyond Standard Model predictions. 

3.5 W+W -> £+v£-v selection 

The production of W pairs has been investigated extensively at LEP and at the Tevatron. The experi- 
mental signature is two high transverse momentum leptons with opposite charge associated with 
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Figure 3: Distributions of the W (ju v)y event variables, with a photon from the W y signal (ISR), 
and from backgrounds, FSR, fake y, and inclusive Z contamination. 



large transverse missing energy. production involves both WWZ and WWy triple gauge boson 

couplings and is most sensitive to the AKy anomalous coupling parameters. Furthermore, the W pair pro- 
duction provides an important background to Higgs boson searches in the pp — > H — > WW^ — ► £v£v 
channel at the LHC. In dileptonic IV-decays no Higgs mass peak can be reconstructed, so this back- 
ground cannot be estimated from the measured data via sideband interpolation. An understanding of the 
irreducible VK-pair continuum background is therefore crucial. 

At the LHC the major background processes (tt, inclusive W and Z, and Drell-Yan) have much higher 
cross-sections than W + VK~. It will be necessary to achieve very high background rejection power to 
suppress the events with mis-identified leptons (from jet, photon and instrumentation effects) and leptonic 
decays from heavy-flavor quark jets. In the Drell-Yan process the mis-measured fa also contributes non- 
negligible background. 

Event selection consists of the trigger, pre-selection (two high pj leptons plus fa), and final selection 
with a set of conventional cuts or with BDT selection cuts. The events are required to pass 

one of two high-level trigger paths: a single, isolated electron with p t > 25 GeV or a single muon, 
with p t > 20 GeV. The trigger efficiencies for WW — > ee, WW —> ixpi and WW —> e\l events with two 
opposite sign isolated leptons (pj > 20 GeV, | r\ |< 2.5) are estimated as follows: 98.2 %, 95.9%; and 
97.4% respectively. 

The cut-based analysis rejects background events with tight lepton identification criteria and iso- 
lation requirements as well as with event topology variables which clearly distinguish the signal and 
significantly suppress the main background processes tt and Z + X. The W + VK~ leptonic decay events 
are selected by requiring two well identified isolated leptons with opposite charge, and with p e T > 20 
GeV and | tj |< 2.5. A jet veto requirement rejects events with any jet (p } j > 20 GeV) in the rapid- 
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Table 7: The number of W ± y signal and background events after pre-selection, BDT selection and 
trigger requirement, for an integrated luminosity of 1 fb~ 1 . The signal and total background are then 
scaled to NLO cross-sections with the k-factors indicated. For the signal, the k-factor is obtained using 
BosoMC. For background, the k-factors are obtained by comparing the cross-sections calculated with 
MC@NLO and Pythia generators. 





Signal 




Background 






W ± y 


W+FSR_y 


W+fake_7 


Z(£/t)y 


Total 


£ = e Pre-selected 


1710 


11440 


7890 


32480 




BDT selection 


1145 


242 


791 


101 




Triggered 


966 


188 


628 


93 




NLO scaled 


1604 (k= 1.66) 








1183 (k=l. 3) 


£ = \x Pre-selected 


2680 


28410 


10250 


3950 




BDT selection 


1793 


413 


961 


409 




Triggered 


1305 


177 


595 


260 




NLO scaled 


2166 (k= 1.66) 








1342 (k=l. 3) 



ity region | r\ |< 3. This cut is efficient in tt suppression since tt contains one or two energetic b jets 
in addition to the signature. An event with #t < 50 GeV is rejected to reduce the background 

arising from the event pileup and from Z/y* events in the Drell-Yan process. To reject the dilepton 
events from inclusive Z production, a Z mass veto is applied. Finally, angular variable cuts are imposed. 
For cross-section measurements, events must pass a cut: <j>u < 2 rad, where <j>u is the angle between 
the transverse momentum of the two leptons. For anomalous TGC studies, this cut is replaced with: 
^(Pt ,£ ,Pt' ss ) > 175 deg, where <I> is the angle between the transverse momenta of the lepton pair, and 
the missing transverse momentum. The first angular cut (selection-A) results in high signal detection 
efficiency, but it is not optimized for high pr(£) and prill) event detection and thus could decrease the 
sensitivity to anomalous TGC's. The second angular cut (selection-B) results in lower signal detection 
efficiencies, but has higher efficiency for high pr{£) and pr{££) events. The yields of the se- 
lection with the cut-based analysis are summarized in Table 8. Both signal and background events are 
normalized to 1 fb _1 of data. Figure 4 shows the transverse momentum distributions of leptons (left) 
and lepton pairs (right) after applying the kinematic cuts of Selection-B. The distributions are shown for 
sum of signal and various backgrounds, and for individual backgrounds, for an integrated luminosity of 
1 ftr 1 of data. 

The overall signal detection efficiency is 1.4-3%, depending on the selection cuts and the final 

lepton states. The signal detection significance for 0.1 fb _1 of data is expected to be 4.7a (for Selection 
A), after taking into account a 20% background systematic uncertainty. The detection efficiencies can 
be improved by using the multivariate BDT technique. In this analysis one thousand decision 

trees, wherein one tree has 20 splitting-nodes, are used to separate signal from background based on the 
input variables. The input data for the BDT analysis must first pass the pre-selection cuts (two leptons 
with pj > 10 GeV and 1f.j > 15 GeV). The pre-selected simulated samples are divided into two equal 
parts: one sample is used for BDT training and the other to test event selection performance. The BDT 
output spectra for both signal and background are shown in Figure 8 (left). By varying the location of 
the cut along the BDT scores (x-axis), the signal to background ratio can be optimized. Table 9 presents 
the detection sensitivities with total integrated luminosity of 1 fb _1 : the selected number of signal events 
{Nww), the corresponding signal efficiency {£ww), the number of background {Nbkg) events, and signal 
to background ratio (N W w/Nbk g ) are shown. The breakdown of background contributions are also given 
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Table 8: Yield and total event detection efficiency of the WW selection for 1 fb 1 of data. The errors 
shown are statistical only. 



efficiency N ww ^background ^sig./^bkg. 
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Figure 4: Transverse momentum distributions of leptons (left) and lepton pairs (right) after applying 
kinematic cuts from Selection-B. The distributions are shown for sum of signal and various backgrounds, 
and for separated backgrounds for L=l fb -1 . 



in this table. For initial measurements using early LHC data based on 0.1 fb -1 of integrated luminosity, 
the application of BDT is compelling. As inferred from Table 8, the initial data is expected to yield a total 
for all decay channels of ~ 10 signal events using conventional cuts, whereas the BDT-based analysis, 
which gives a similar signal to background ratio as the conventional cuts is expect to yield total 47 signal 
events. With an estimated background contribution of 9.2 events the W + VK~ detection significance is 
about 10a (including 20% background systematic uncertainties). 

3.6 Zy -^£ + £~y selection 

Zy signals are produced through initial state radiation (ISR) of the photon from the quarks as illustrated 
by the t- and w-channel diagrams shown in Figure 1. The s-channel Zy production contains the ZyV (V = 
Z, y) vertex, which is forbidden at tree-level in the Standard Model. The cross-section measurement of the 
Zy production would provide a sensitive probe to anomalous ZyV couplings, which can be investigated 
through this channel by measuring the Ej(y) distribution, expecially at large values. 

The cleanest Zy experimental signature is two high pj leptons from the decay of the Z boson and an 
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Table 9: WW — > leptons detection sensitivities of accepted signal and background events for 1 fb _1 of 
integrated luminosity. Results from the BDT analysis are shown with cuts that give similar signal to 
background ratio as the cut-based analysis. The quoted efficiencies in the table are the BDT selection 
efficiencies including the trigger requirements based on pre-selected events. 











Background fraction 




Modes 


£ww(%) 


N W w 


Nbkg 


tt W ± Z z+x 


Nww/Nbkg 


£>V;UV 


32.7 


347±3 


64±5 


47.7% 27.8% 21.8% 


5.4 




12.1 


70±2 


17±2 


54.1% 34.6% 11.3% 


4.1 


evev 


13.7 


52± 1 


11± 1 


81.4% 7.2% 11.4% 


4.7 



isolated high pj photon (from ISR). The backgrounds to this Zy signal are: 1) the Z boson production 
with a FSR photon from the leptons decay from the Z, 2) the Z boson production with a fake photon from 
jets, and 3) a small contamination from W +X production reconstructed as a £ + ^~y final state. Some of 
the event variables for different photon sources are shown in Figure 5. 

Events are pre-selected with Ej{y) > 10 GeV, a value chosen to be above the Pythia generator 
threshold and as low as is reasonably achievable by detector reconstruction. The FSR event rate is almost 
an order of magnitude higher than the ISR rate in the inclusive Z production process. Backgrounds with 
a Z boson and a fake photon are comparable to the Zy signal photon rate. 

The event selection is conducted with BDTs trained to separate Zy events of different photon types. 
The training is in two stages: first to identify the FSR photon background events and then to distinguish 
the signal (ISR) photon from Z events with fake photons. Separate BDT training is done for the electron 
and muon Z decay channels. 

The BDTs are trained with 19 variables in total. As one example, FSR photons can be identified 
from the opening angle from the nearest lepton. Fake photons originating from high pj neutral mesons 
decaying to two photons can not be directly identified within the limits of the spatial resolution provided 
by the ECAL segmentation. However, they are often accompanied by jet secondaries or underlying 
remnant particles. By counting the charged tracks in a neighborhood (a cone of 0.45 rad in this case), or 
summing their energies, parameters useful for differentiating background from isolated ISR photons can 
be formed. 

With the chosen BDT cuts the signal selection efficiency is 67%, and the signal to background ratio 
is 2.0 and 1.8 for the electron and muon Z decay channels, respectively. The estimated numbers of 
reconstructed Zy candidates for an integrated luminosity of 1 fb _1 are listed in Table 10. 

The FSR and fake photons contribute approximately equally to the total background but have impor- 
tant distinctions. The fake photons populate the low E T {y) region, and are differentiated from the signal 
photon (ISR) in the Ej(y) > 20 GeV region. The shape of the Ej distribution in the low energy region 
is important for calibration and measurement of the production cross-section with ISR photons. On the 
other hand, the FSR photons have an Ej{y) distribution similar to the ISR photons which carry signa- 
tures of the coupling to the colliding quarks. Event rates in the high Ej{y) region, where the background 
is primarily FSR, is an important probe of new physics phenomena. 

3.7 ZZ -> £+£-£+£- selection 

The cleanest experimental signature for ZZ detection is through the four lepton decay channels: 

pp^ZZ^e + e~e + e~, }X + }X~ }X + }X~ , e + e~ jJ, + /J,~ . 
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Figure 5: Distributions of Z{ee)y event variables for ISR (left column), FSR (middle column.) and fake 
photons (right column). 

The ZZ — > At' (£' = e, ju, t) signal is modeled at LO by the Pythia event generator using the CTEQ6L 
PDF. The Z/y* interference terms are included in the generator. With the 12 GeV mass cut on the 
dileptons decay from Z/y*, the cross-section times the dilepton decay branching ratio, a x BR is 159 fb. 
A filter is applied to the simulated data to pre-select four lepton (e and fx only) events by requiring that 
the lepton transverse momenta must be greater than 5 GeV, and the lepton rapidity, r\£, must be in a range 
\eta\< 2.7. The overall filter efficiency is 0.219. The % lepton contribution to the four lepton channels 
(Ae, A^i, lelfl) is less than 4% in the event sample after the filter. The fraction of the on-shell ZZ events 
(both lepton pair masses are between 70 GeV to 1 10 GeV) in the sample is about 73%. Next-to-leading- 
order calculations give higher production cross-sections. The k-factor is about 1.35 when both Z's are on 
mass shell. However, when the Z/y* are off the Z mass shell, the k-factor varies from 1.15 to 1.52forthe 
(Z/y*) (Z/y*) mass range from 115 GeV to 405 GeV, which is determined by using the MCFM Monte 
Carlo calculations [1 1]. The k-factor is set constant at 1.35 to normalize the ZZ — ► Ai signal events. The 
four-lepton events have high trigger efficiencies close to 100%. 

Major background processes for four lepton final states are tt — > WWbb -^A£+X and Zbb -^Al+X. 
The tt background events, generated using MC@NLO, has a total production cross-section of 833 pb. 
The Zbb background events are generated using AcerMC, with cross-section scaled to NLO by a k- 
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Table 10: The number of Zy signal and background events after pre-selection and BDT selection is 
listed, for an integrated luminosity of 1 fb _1 . The signal and total background are then scaled to NLO 
cross-sections with the k-factors indicated. For the signal, the k-factor is obtained using BHO. 





Signal 




Back£ 


round 






Zy 


Z+FSR y 


Z+fake_y 


W{lv)y 


Total 


£ = e Pre-selected 


430 


2760 


490 


44 




BDT selection 


288 


70 


74 


0 




Triggered 


282 


65 


79 


0 




NLO scaled 


367 (k=1.3) 








187 (k=1.3) 


i = \l Pre-selected 


950 


7500 
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0 




Triggered 


578 


164 


165 


0 




NLO scaled 


751 (k=1.3) 








429(k=1.3) 




Figure 6: The four-lepton invariant mass distributions of ZZ signal, Zbb and tt background events with 
tight (left) and loose (right) Z mass cut on the lepton pairs. The number of events correspond to an 
integrated luminosity of 10 fb _1 . 

factor. Leptons from b quark decays in these processes are produced in association with hadrons. Their 
contributions can be highly suppressed by lepton isolation requirements. For muons, the ratio between 
the transverse energy deposited in a cone around the muon track of radius AR = 0.4 and the transverse 
energy of the muon Ej must be below 0.2. A similar isolation cut is applied to the electron selections. 
To reject background with leptons not originating from the Z decays, the two opposite sign lepton pairs 
must have at least one lepton with pj greater than 20 GeV, and at least one lepton pair must have the 
invariant mass between 70 GeV- 110 GeV. This is referred to as the loose Z mass cut. A tight Z mass cut 
requires the second lepton pair also to have invariant mass between 70 GeV- 1 10 GeV. The separation 
between the two leptons must be AR(£ + £~) > 0.2. Table 1 1 lists the signal selection cut efficiencies for 
all the four lepton final states. Each cut efficiency value is relative to the previous selection. The quoted 
uncertainties of the selected numbers of events are statistical only. 

The total selection efficiencies for the Zbb background using these same tight cuts are 0.13% ± 
0.06%, 0.61% ±0.14%, and 0.51% ±0.13% for the 4ju, 4e, and 2/j,2e channels, respectively. For tt 
these efficiencies are 0.07% ±0.07% for all three channels. The expected number of signal and back- 
ground events for L = 1 fb~ ! requiring tight Z mass cut are given in Table 12. The expected signal and 
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Table 11: ZZ — >• £ + £~£ + £~ signal selection cut efficiencies 



Lepton Preselection 70.7 62.3 65.4 

Pair formation, dR 99.3 88.0 93.4 

Isolation, pf" 81.1 58.6 59.1 

ZMass tight loose tight loose tight loose 

72.7 92.0 76.1 93.5 77.8 95.2 

Total 41. 4 ±0.6 52.4 ±0.7 24.4 ±0.5 30.0 ±0.6 28.1 ±0.4 34.3 ±0.4 



Table 12: Expected number of ZZ — >• £ + £ £ + £ signal and background events at L = 1 fb 1 using the 
tight Z mass cut. 





4jj, events 


4e events 


2\i2e events 


Total 


Signal 


4.5±0.05 


2.6±0.04 


6.2±0.06 


13.3 ±0.09 


Zbb 


0.01±0.003 


0.04±0.01 


0.04±0.01 


0.08±0.01 


it 


0.04±0.04 


0.04±0.04 


0.04±0.04 


0.12±0.07 


Total background 


0.05±0.04 


0.08±0.04 


0.08±0.04 


0.20±0.07 



background events with only one on-shell Z (Loose Z mass cut) are in Table 13. Uncertainties quoted 
in these tables are statistical only. Figure 6 shows the four-lepton invariant mass distributions for the ZZ 
signal, Zbb and it background with tight and loose Z mass cut on the lepton pairs. Based on these results 
the ATLAS experiment will establish the ZZ — »■ 4£ signal with a significance of 6.8a (after taking into 
account 20% background systematic errors) with the first 1 fb -1 of integrated luminosity. 

3.8 ZZ^£ + e-vv selection 

The ZZ — ► £ + f vv signature is two high-pr charged leptons with a large missing transverse energy (tfj) 
due to the neutrino pair leaving the detector. The main backgrounds will either come from channels with 
large cross sections, such as it and Z — >• £ + £~, or those with a similar signature to the signal, such as the 
W*Z diboson channel. Both signal (ZZ £ + £~vv) and background (ft, W ± Z, W + W~, and Drell-Yan 
dileptons) are modeled by the generator MC@NLO, except for high pj Z (pr(Z) > 100 GeV) events 
which are modeled by Pythia. 

To reduce the backgrounds, a set of simple cuts on discriminating parameters is invoked. In general, 
each cut is used to suppress a particular background channel, as described below. 

First, two oppositely charged good quality leptons with pj > 20 GeV are selected. This reduces much 



Table 13: Expected ZZ — >• £ + £ £ + £ signal and background events at L = 1 fb 1 using the loose Z mass 
cut. 





4/x events 


4e events 


2\l2e events 


Total 


Signal 


5.7±0.06 


3.2±0.04 


7.6±0.07 


16.5±0.1 


Zbb 


0.1±0.01 


0.5±0.02 


0.3±0.02 


0.9±0.1 


it 


0.1 ±0.06 


0.5±0.14 


0.4±0.13 


1.0±0.2 


Total background 


0.2±0.06 


1.0±0.14 


0.7±0.13 


1.9±0.2 
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of the tt background which contains softer leptons than the signal. This cut also reduces the background 
from Z —> T + T~ — > £ + £^V£V^v x v T as the electrons and muons are produced with reduced pj. The leptons 
must also lie within the inner detector pseudorapidity range, \r\ \ < 2.5. 

The charged lepton pairs are required to have an invariant mass close to the Z mass, specifically 
\Mu — 91.2 GeV| < 10 GeV. This is equivalent to ~ 5a of the signal width, and helps to reduce back- 
ground combinatorics where the lepton pair does not come directly from a Z decay. A lepton veto is 
imposed by combining the good quality and loose lepton selection to remove any events with more than 
two leptons in total. This reduces background from the W ± Z channel, whose Z has an almost identical 
signature to the signal, and the neutrino from W decay also appears as fa. The third-lepton veto sup- 
presses the W ± Z background by ~ 30%. If the lepton from the W is not reconstructed, however, this 
background channel becomes almost indistinguishable from the signal. 

A main characteristic of the signal decay is a large missing transverse energy (fa) from the Z — > vv 
decay. An important background, due to its large cross section, comes from the Z — > £ + £~ Drell-Yan 
process, where jets are produced in addition to the leptons. If these jets are aligned with cracks in 
the detector, then they will fake fa as they will not be fully accounted for in the calorimeters. This 
background can be significantly reduced by applying a 50 GeV fa cut. The background from ZZ —* At 
is also reduced, but this is less significant as it has a much smaller cross-section. The W ± Z channel is 
suppressed by this cut as only one neutrino is produced, and hence the fa distribution is slightly softer. 




Figure 7: The fa — pr{Z) magnitude (left) and angle matching (right) distributions before cuts for 
ZZ — > 4£ (solid), Z —> £ + £~ (dash) and W ± Z (dot). The plots are normalised to unit area for comparison 
of distribution shapes. 

The signal is expected to have missing pj equal and opposite to that of the reconstructed Z, when 
the ZZ pair is produced with no initial pj and they decay back-to-back. Figure 7 (left and right) shows 
a clear peak in the signal for both magnitude and angle matches. The W ± Z background shows a worse 
magnitude match as some of the W momentum is lost to either an electron or muon on decay. This 
means that the missing pj will not quite match up with that of the recoiling Z. The angular distribution 
shows a peak in both the W ± Z and Z — > // channels. In the case of W ± Z, this is because the W and Z 
are produced in approximately opposite directions. When the W decays, the neutrino will be deflected 
and so the peak has a wider distribution. In a similar way, in Z — > 11, the Z is likely to be produced with 
some quarks recoiling against it. These will manifest themselves as jets which can fake fa. Cuts at 
{\fa - p T {Z)\) / p T {Z) <0.35 and 145° < 0^ - (j) z < 215°, reduce the W±Z background. 

A jet veto reduces backgrounds with large hadronic activity. For example, the predominant decay 
channel for the top quark in tt is the t — > Wb final state, resulting in several high pj jets. Its contribution 
can be reduced by applying a veto on events containing any jet with pj 1 > 30 GeV and \r\i et \ < 3.0. 

The final cut to be applied is on the pr of the reconstructed Z boson. This reduces the background 
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Table 14: Cut flow table for ZZ — > £ + £~ vv signal and background after cuts for an integrated luminosity 
of 1 fb _1 . The values in brackets indicate the percentage of events passing each cut relative to the 
previous cut. Note that this Z — > // MC sample already requires pr(Z) > 100 GeV. 



Cut 


ZZ -► Uvv 


ZZ^4Z 


Z-*ll 


tt 






Z — > TT 


Leptons 


130.1 


54.3 


13100 


4530 


271.2 


491.1 


2170 


Third-lepton veto 


101.9 


3.1 


1900 


428.9 


52.9 


375.6 


1690 




(78.3%) 


(5.7%) 


(14.5%) 


(9.5%) 


(19.5%) 


(76.5%) 


(77.9%) 


Dilepton mass 


100.2 


2.7 


1740 


110.2 


45.3 


83.8 


40.1 




(98.3%) 


(87.1%) 


(91.6%) 


(25.7%) 


(85.6%) 


(22.3%) 


(3.4%) 


Missing Ej 


38.0 


0.34 


3.8 


17.9 


9.4 


18.3 


0 




(39.9%) 


(12.6%) 


(0.2%) 


(16.2%) 


(20.8%) 


(21.8%) 


(0.0%) 


Jet veto 


34.4 


0.30 


0.44 


6.0 


7.6 


16.7 


0 




(90.5%) 


(88.2%) 


(11.6%) 


(33.5%) 


(80.9%) 


(91.3%) 


(0.0%) 


Pr 


10.2 


0.08 


0.4 


3.0 


1.7 


0.02 


0 




(29.7%) 


(26.7%) 


(90.9%) 


(50.0%) 


(22.4%) 


(0.1%) 


(0.0%) 


Stat. Error [90%CL] 


0.2 


0.01 


0.2 


2.1 


0.1 


0.22 


[1.6] 



Table 15: The expected ZZ — > £ + £ vv signal yields and total signal selection efficiency for 1 fb 1 of 
integrated luminosity. The errors shown are statistical only. 



^signal 


Signal efficiency 


^background 


N S /N B 


10.2±0.2 


2.6% 


5.2±2.6 


2.0±0.8 



from the single Z channel, whose pr(Z) distribution drops much faster than the signal. A cut of pr(Z) > 
100 GeV significantly reduces this background, and has a negligible effect on the sensitivity to anomalous 
couplings, which predominantly manifest at high pj. 

Using the single isolated electron trigger (effective Ej > 22 GeV) and the single isolated muon trigger 
(effective pj > 20 GeV), the trigger efficiency for selected ZZ — ► £ + £~vv events is expected to be 97%. 

Table 14 gives a summary of the cuts applied and presents the expected number of events passing the 
cuts. The final row in each column gives the statistical error. If no events pass cuts, the figure given is 
the number of expected events at the 90% confidence level. The dominant background is tt . Table 15 
summarizes the expected yield and sensitivity of the ZZ — > £ + £~vv channel. 

4 Total cross section measurements 

A binned likelihood method is used to determine the most likely cross-sections. This likelihood method 
is also used to extract the sensitivities to the anomalous TGCs which will be described in Section 5. The 
likelihood is based on Poisson statistics convoluted with Gaussian probabilities to model the signal and 
background uncertainties. What follows is a more detailed description of the binned likelihood method 
followed by a description of the statistical and systematic uncertainties for the diboson cross-section 
measurements using the first 1.0 fb -1 of data. 
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4.1 Binned likelihood 

In the binned likehood method, expected events are determined from high statistics Monte Carlo sim- 
ulation, and observed events are also determined from Monte Carlo simulation for this work, but with 
appropriate statistical fluctuation according to the luminosity. The events are binned by one or more 
observables. As an example, in the case of the cross-section measurement the BDT output spectrum, an 
example of which is shown in Figure 8, could be used. In the TGC analysis described in Section 5 the 
M T (VV) and pr(V) spectra are choosen. 

For each bin expected signal and background are compared to the observed number of events (n 
events in the bin) with a likelihood, which is based on Poisson statistics. We assume the systematic un- 
certainties of the signal and background are Gaussian and uncorrected for each bin. Thus, two Gaussian 
distributions are convolved with the Poisson distribution to form the likelihood 

r l+3a b r i+3a s ( f v + f, Vh )n e -(f s v s +f b v b ) (1-0 /2af 

L= / gs gb Us " Ih h \ dfsdf b with gi = (i = s,b), 

Jl-3a b Jl-3a s n\ f 0 °° e (l-//)72ff, 

here the total systematic uncertainty of signal and background appear as o s and Gj,, respectively. 

From these likelihoods a total log-likelihood is formed from all the bin likelihoods. Some pro- 
cesses may also be separated into multiple channels (such as the three decay combinations of WW — > 
ee,efl,Hfl). Also, a factor of -2 is included to make this test statistic comparable to a chi-squared distri- 
bution. Thus, the negative log-likelihood is 

-2lnL=-2 £ £ log(Lf). 
k= channels ;=bins 

In cross-section measurements, the likelihood is determined as a function of cross-section in each bin of 
a measured spectrum for each channel (e.g. the BDT output spectrum for the — > evjiv channel as 

shown on the left in Figure 8). The log-likelihoods are then combined and the minimum of the negative 
log-likelihood determines the most likely cross-section (or anomalous TGC). The 68% C.L. limits (±la) 
are taken from the minimum of the negative log-likelihood plus 1.0. To set the 95% confidence-level 
interval of the anomalous TGC limits, likelihood minimum+1.92 is taken when fitting one parameter, 
and the minimum+2.99 for a fit of two parameters (e.g. two independent anomalous couplings). 



4.2 Statistical uncertainties 

Based on the diboson event selections described in Section 3, the expected number of signal and back- 
ground events for 1 fb -1 , and the expected detection significance of observing the Standard Model sig- 
nals, are summarized in Table 16, after taking into account the known background contributions and 20% 
systematic uncertainties of the background estimate. The expected signal statistical uncertainties are also 
given in the 5 th column of the table. For 1 fb -1 they range from 2.1% to 31% depending on the channel. 

For early LHC data with 0.1 fb -1 integrated luminosity, the statistical uncertainties are large. How- 
ever, based on BDT analysis, with an assuption of a 20% systematic uncertainty, the signal detection 
significances could reach 9.9a and 5.9a for and W ± Z, respectively. The overall detection sig- 

nificance is expected to be greater than 10a for both W ± y and Zy signals with 0. 1 fb -1 of data. 



4.3 Systematic uncertainties 

The major theoretical uncertainties on the production cross-sections come from the PDF uncertainties 
and the QCD factorization scaling uncertainties (for NLO calculations). By varying the PDF's and scale 
values for W ± Z, and ZZ cross-section calculations, the differences of the calculated cross- 

sections are found to range from 3.4% to 6.2%. 
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Figure 8: BDT-output spectra from a Monte Carlo experiment for (left) and W ± Z (right) detec- 

tion with 1 fb -1 . The dots in the plots are Monte Carlo 'mock data'. The dashed histograms represent 
the signal and the dotted, background. 



The major experimental systematic effects in the cross-section measurements arise from the un- 
certainties of the luminosity determination, the lepton identification efficiencies and energy/momentum 
resolutions, the jet energy scale and resolutions, and background model and estimate. 

A promising possibility for the precise determination of the luminosity is to use the W and Z pro- 
duction and leptonic decays. The estimates show that in this way the luminosity uncertainties could be 
controlled to ~5% [40]. It should be noted that a 6.5% luminosity uncertainty was quoted in Tevatron 
Run II physics papers. 

The lepton acceptance uncertainty is about 2-3% mainly due to the isolation requirement which 
involves the hadronic jet energy uncertainties. The lepton trigger efficiency uncertainties also contribute. 
With large Z samples, this uncertainty could be minimized. Z decays can typically be triggered and 
identified using only one of the two decay leptons. This leaves the second lepton unbiased from the point 
of view of trigger and offline identification. The rate at which the unbiased lepton passes the trigger and 
ID requirements provides a measurement of the respective efficiencies. 

In the studies using the ATLAS simulated events the background estimate dominates the systematics 
with uncertainties of 15-25% for all the diboson channels except for the ZZ — > £ + l~£ + £~ channel, where 
the background uncertainty should be less than 2%. Even though more than 30 million fully simulated 
events are used to estimate the background, the analyses are still largely limited by W + jets event 
sample statistics in the diboson background estimate. Tevatron experiments have used data to estimate 
the background, and typical uncertainties for diboson physics analyses are around 10% for 1 fb -1 of data. 
With early LHC data (0.1 fb -1 ), the background estimate uncertainty would be comparable to current 
Tevatron diboson background uncertainties, and with more data the uncertainties of the background 
estimate should decrease. 

The lepton and jet energy resolution uncertainties will contribute to additional background estimate 
uncertainties which will further propagate to the cross-section measurement uncertainties. A study has 
been performed in W ± Z analysis to estimate the size of such uncertainties. In this study, the W ± Z BDTs 
are first trained with Monte Carlo signal and background events simulated with the 'standard' detector 
energy resolutions and calibrated energy scale. For independent test samples, 10% and 3% are added 
to the jet and lepton energy resolutions, respectively, and the reconstructed energy related quantities are 
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Table 16: Summary of signal and background of all diboson final states (£ denotes e and jl) for 1 fb _1 
of integrated luminosity. The 4th column indicates the overall signal selection efficiency and the type 
of analysis, The 5th column gives the signal statistical uncertainty. The last two columns indicate the 
/rvalue and the significance (in Gaussian standard deviations) where /rvalue is the probability of the 
background fluctating to the expected total observation assuming 20% systematic uncertainties. 



Diboson mode 


Signal 


Background 


Signal eff. 


^signal 


p- value Sig. 


W + W~ -► e ± y^y 


347±3 


64±5 


12.6% (BDT) 


5.4% 


3.6 xl0~ 166 27.4 


W+W- -► ii + vii-v 


70±1 


17±2 


5.2% (BDT) 


12.0% 


8.8 xlO" 30 11.3 


W+W- -^e + ve-v 


52±1 


11±2 


4.9% (BDT) 


13.9% 


1.9 xlO" 24 10.1 


W + W~ — > £ + v£~v 


103 ±3 


17±2 


2.0% (cuts) 


9.9% 


1.4xl0" 54 15.5 


W ± Z^£ ± v£+T 


128±2 


16±3 


15.2% (BDT) 


8.8% 


3.0 xl0~ 76 18.4 




53±2 


8±1 


6.3% (cuts) 


13.7% 


3.1 xl0~ 30 11.4 




17±0.5 


2±0.2 


7.7% (cuts) 


24.6% 


6.0 xlO- 12 6.8 


ZZ^£+£-vv 


10±0.2 


5±2 


2.6% (cuts) 


31.3% 


7.7 x 10~ 4 3.2 


Wy^evy 


1604 ±65 


1180±120 


5.7% (BDT) 


2.5% 


significance > 30 


Wy^ juvy 


2166 ±88 


1340 ±130 


7.6% (BDT) 


2.1% 


significance > 30 


Zy^e + e~y 


367 ±12 


187 ±19 


5.4% (BDT) 


5.2% 


1.2xl0" 91 20.3 


Zy^\x + ]x-y 


751 ±23 


429 ±43 


11% (BDT) 


3.6% 


5.9 xlO" 171 27.8 



Table 17: Change of background acceptance in a test of BDT (W ± Z vs. ZZ) performed by smearing jet 
energy, Ej et , and missing Ej, Jfij, by an additional 10%; and the lepton energy Ej by an additional 3%. 



Signal Efficiency 


Background Eff. 


Background Eff. 


Background Eff. 




No additional smearing 


10% for E je t &$T 


10% for E jet & $ T , 3% for E ( T 


40% 


4.0% 


4.2% (+5.7%) 


4.2%(+6.7%) 


50% 


8.6% 


8.9% (+3.7%) 


9.0%(+4.8%) 


60% 


14.6% 


14.9% (+2.2%) 


15.1%(+3.7%) 


70% 


22.3% 


22.7% (+2.0%) 


23.0%(+3.4%) 
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'smeared' to reflect the uncertainties of the diboson detection sensitivity (signal to background ratio). In 
this study the signal efficiencies are fixed and changes to the background acceptance are gauged. The 
results are summarized in Table 17. As an example, for a BDT signal selection efficiency of 60%, the 
change of the signal to background ratio is 3.4%. For the W ± Z cross-section measurement with 1 fb _1 
of integrated luminosity, the 3.4-6.7% background contribution uncertainty would result in additional 
cross-section measurement uncertainties of about 2-3%. 

4.4 Measurement uncertainties vs. selection cuts and luminosities 

The cross-section measurement uncertainties are estimated for various event selection cuts and integrated 
luminosities in the VK + W~ and W ± Z BDT based analysis. The BDT output spectra are used to build the 
log- likelihood by using 'mock data', which is a sample of simulated events with appropriate statistics 
according to the luminosity and the Standard Model. For example, the BDT-output spectra for a Monte 
Carlo experiment with 1 fb _1 of data are shown in Figure 8 for — ► e ± v^ T v detection (left) 

and for W ± Z — > t^ sl vt r t~ detection (right). The Standard Model 'mock data' (points) are compared to 
expected signal (dashed histogram) and background (dotted histogram). 




-200 -100 0 100 200 300 400 -200 -100 0 100 200 300 400 

BDT Cut BDT Cut 

Figure 9: The total relative uncertainties for (left) and for W ± Z (right) cross section measure- 

ments as the BDT cut is varied for different luminosities. The optimal BDT cut is between 200 and 300. 
An overall 9.2% systematic uncertainty was included in the fitting process. 



To understand the optimal cut on the BDT spectra for cross-section measurements, the cuts on the 
BDT spectra are varied and the cross-section measurements are repeated. A total 9.2% systematic un- 
certainty is included in the fitting process. Figure 9 shows the cross-section measurement uncertainty as 
a function of the BDT cut for different integrated luminosities from the W + VK~ and the W ± Z analysis. 

Figure 10 shows the relative cross-section uncertainties as a function of integrated luminosity (with 
BDT spectrum cut at 200) for (left) and W ± Z (right) cross-section measurements. From these 

plots it should be noted that the systematic uncertainty starts to dominate after 5 fb~ ! of integrated 
luminosity for cross-section measurements, and after 10 fb -1 for W ± Z. 

5 Sensitivity to anomalous couplings 

The signature of anomalous couplings in diboson production is an increase in the cross-section at high 
values of gauge boson transverse momentum (pr) and diboson transverse mass (Mj). The ATLAS 
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10 1 1 10 j 10 1 1 10 j 

Integrated Luminosity (fb" ) Integrated Luminosity (fb" ) 



Figure 10: The (left) and the W ± Z (right) cross-section measurement uncertainties as a function 

of integrated luminosity (with BDT spectrum cut at 200). An overall 9.2% systematical uncertainty was 
included in the fitting process. 



sensitivity to anomalous TGC's is investigated by comparing the 'measured' diboson production cross- 
sections and the vector boson pj or diboson Mj distributions to models with anomalous TGC's. A 
binned likelihood fitting procedure using the Mj or pj spectrum for each channel is followed to extract 
the 95% C.L. intervals of anomalous coupling parameters. The most dramatic effect is an increase 
in the high Mj or high pj cross-sections, so it is important for the binned likelihood calculation to 
include events up to the highest values of the observables. Details of the binned likelihood method are 
described in Section 4.1. One- and two-dimensional limits are set on the charged CP-conserving coupling 
parameters from the W ± Z, and W ± y final states. The ZZ final state is used to probe the neutral 

anomalous TGC sensitivity. 

The values of the form factor scale A are chosen such that the extracted experimental anomalous 
coupling limit from data for a certain diboson production process is less than the unitarity limit [41]. 
For this study, with 0.1-1.0 fb -1 of integrated luminosity at early LHC running, A values of 2-3 TeV are 
used. The same A values of 2-3 TeV are also used to estimate the anomalous coupling sensitivities for 
higher luminosities for simplicity. It should be noted that as the luminosity increases, the A value should 
have increased accordingly. 

5.1 Re- weighting the fully simulated events 

To avoid producing an impractically large number of fully simulated events in non-Standard Model 
anomalous coupling parameter space, a re-weighting method was invoked to study the ATLAS de- 
tector sensitivities to anomalous coupling parameters. The BHO and the BosoMC calculations are 
used with different anomalous coupling parameters to re-weight the fully simulated events generated 
by MC@NLO. As an example, Figure 11 shows the production differential cross-section distri- 

butions for Standard Model and some anomalous coupling parameters (left plot) and the corresponding 
differential cross-section ratio, ^^"sM^m^ ( r ^ nt Pl ot )- These ratios have been used as weights to 
re-weight the fully simulated events to probe the anomalous TGC sensitivities. The weights are generated 
in one-dimensional and two-dimensional anomalous coupling space according to parton level kinematics 
using the BHO and the BosoMC programs. To produce the weights the step size in coupling parameter 
space ranges from 0.1 x 10~ 3 to 1.0 x 10~ 3 . For each point in the coupling parameter space 5 million 
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Figure 12: The expected signal+background of the Standard Model, superimposed with 'mock data' 
(points with error bars showing statistical uncertainty), and the non-Standard Model (anomalous cou- 
plings) predicted signal+background histograms (dashed and dotted histograms). The left plot is for 
1 ftr 1 of data and the right plot is for 30 ftr 1 of data. 



events were generated to obtain the theoretical 'reference' distributions. The fully simulated events with 
the Standard Model couplings are required to pass the event selection cuts, and then reweighted accord- 
ing to the parton level kinematics. The weighted events are equivalent to fully simulated events with the 
corresponding anomalous couplings. The distributions of variables, sensitive to the anomalous coupling, 
such as lepton pj, of the data events, can be compared to those of simulated events with anomalous cou- 
plings included, to extract the limits on the anomalous couplings. In this study the Standard Model 'mock 
data' are used to probe the ATLAS detector sensitivities to anomalous triple gauge boson couplings. 

5.2 WWZ anomalous TGC sensitivity in W ± Z analysis 

The W ± Z diboson production involves exclusively the WWZ coupling, in contrast to the diboson 
final state which contains both WWZ and WWy couplings. To extract the 95% C.L. sensitivity intervals of 
the anomalous parameters, AK"z,Agf, and Az, from the W ± Z diboson final state, both the transverse mass 
of W ± Z (Mt(W ± Z)) and the transverse momentum of Z {pr(Z)) spectra are used to fit the anomalous 
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Table 18: Summary of WWZ one-dimensional anomalous coupling parameter 95% CL sensitivities using 
the M T (W ± Z) fitting for A = 2 TeV and A = 3 TeV for integrated luminosities of 0. 1 , 1 , 1 0 and 30 fbr 1 . 



Int. Lumi Cutoff A Ak z A z Agf 

(fb- 1 ) (TeV) 



0.1 


2.0 


[-0.440, 


0.609] 


[-0.062, 


0.056] 


[-0.063, 


0.119] 


1.0 


2.0 


[-0.203, 


0.339] 


[-0.028, 


0.024] 


[-0.021, 


0.054] 


10.0 


2.0 


[-0.095, 


0.222] 


[-0.015, 


0.013] 


[-0.011, 


0.034] 


30.0 


2.0 


[-0.080, 


0.169] 


[-0.012, 


0.008] 


[-0.005, 


0.023] 


0.1 


3.0 


[-0.399, 


0.547] 


[-0.050, 


0.046] 


[-0.054, 


0.094] 


1.0 


3.0 


[-0.178, 


0.281] 


[-0.020, 


0.018] 


[-0.017, 


0.038] 


10.0 


3.0 


[-0.135, 


0.201] 


[-0.015, 


0.013] 


[-0.013, 


0.018] 


30.0 


3.0 


[-0.069, 


0.131] 


[-0.008, 


0.005] 


[-0.003, 


0.016] 



couplings. 

Monte Carlo experiments are performed with 0.1, 1, 10, and 30 fb -1 of integrated luminosities to 
study the anomalous coupling sensitivities. Figure 12 shows the expected signal+background of the 
Standard Model, superimposed with the 'mock data' (points with error bars), and the non-Standard 
Model (anomalous couplings) predicted signal+background distributions. Table 18 shows the summary 
of 1-dimensional 95% CL. anomalous coupling parameter intervals based on the Mt(W ± Z) spectra 
fitting. Results corresponding to 0.1, 1, 10 and 30 fb -1 of integrated luminosities for cutoff, A = 2 TeV 
and A = 3 TeV are listed. It should be noted that even for 0.1 fb -1 of integrated luminosity, the ATLAS 
sensitivity to WWZ anomalous couplings could be much better than the Tevatron limits based on 1 fb -1 
of pp collision data. 

To understand the systematic uncertainty effects on the TGC sensitivity, three different systematic 
uncertainty assumptions are considered: (1) ideal case with no systematic uncertainties: Os = 0, and 
o B = 0; (2) expected uncertainty of 7.2% for signal, and 12% for background, based on estimate from 
various contributions; and (3) worse than expected systematic uncertainty of 9.2% for signal, and 18.3% 
for background. Unless otherwise stated, (3) was used to evaluate coupling limits. 

The 95% CL. 1-dimensional limits for the WWZ anomalous couplings, obtained from the fits to the 
Pt{Z) assuming A = 2 TeV are shown in Table 19, for different scenarios of systematic uncertainties. 
From this table it is seen that only when reaching 30 fb -1 of integrated luminosity do the systematic 
uncertainties become significant enough to affect the TGC sensitivities. 

The studies on the WWZ anomalous couplings in two-dimensional space are also based on the pj{Z) 
fits for different integrated luminosities (0.1, 1, 10 and 30 fb -1 ) and for two cutoff values, A = 2 TeV 
and 3 TeV. The anomalous coupling limit contours are not very sensitive to these cutoff values. The 
effects of different systematic uncertainties on the 2-dimensional TGC sensitivity contour are shown in 
Figure 13. The left plot shows the 95% CL. anomalous TGC limit contour of Az vs. Akz = Agf without 
systematic uncertainties, and the right plot shows the 95% CL. TGC limit contour with the systematic 
uncertainties (<7s = 9.2%, <7g = 18.3%) included. Again, the systematic uncertainties become significant 
when the integrated luminosity reaches 30 ftr 1 . 

5.3 WWy anomalous TGC sensitivity in W ± y analysis 

The W ± y diboson production involves exclusively the WWy triple gauge coupling. To extract the 95% 
CL. sensitivity intervals of the anomalous parameters, A(C r , and A r , from the W ± y diboson final state, 
the photon transverse energy Ej{y) distribution is used to fit the anomalous couplings, with A = 2 TeV. 
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Figure 13: The left plot: 95% C.L. WWZ TGC limit contour of X z vs. Ak z = Agf; without the sys- 
tematic uncertainties. The right plot: the 95% C.L. WWZ TGC limit contour of Az vs. Ak z = Agf, with 
systematic uncertainties (as = 9.2%, Ob = 18.3%) included. The anomalous coupling limit contours 
from outer to inner corresponding integrated luminosities of 0.1, 1, 10 and 30 fb -1 , respectively. The 
systematic uncertainties become significant when the integrated luminosity reaches 30 fb -1 . 



Table 19: Comparison of WWZ one-dimensional anomalous coupling parameter 95% C.L. sensitivities 
for different systematic uncertainties. Results obtained in this table are using the pj (Z) fit for A = 2 TeV 
for integrated luminosities of 0.1, 1, 10 and 30 fb -1 . 



Systematic 
uncertainties 


Int. Lumi 

(fb- 1 ) 


Ak z 


K 




As 


z 


05 


= 0 


0.1 


[-0.942, 


1.130] 


[-0.203, 


0.193] 


[-0.227, 


0.324] 


Ob 


= 0 


1.0 


[-0.561, 


0.664] 


[-0.093, 


0.082] 


[-0.106, 


0.154] 






10.0 


[-0.233, 


0.231] 


[-0.033, 


0.024] 


[-0.025, 


0.061] 






30.0 


[-0.128, 


0.136] 


[-0.024, 


0.013] 


[-0.009, 


0.047] 


o s 


= 7.2% 


0.1 


[-0.950, 


1.140] 


[-0.204, 


0.194] 


[-0.228, 


0.325] 


Ob 


= 12.0% 


1.0 


[-0.574, 


0.692] 


[-0.093, 


0.083] 


[-0.106, 


0.158] 






10.0 


[-0.228, 


0.302] 


[-0.033, 


0.027] 


[-0.022, 


0.070] 






30.0 


[-0.164, 


0.212] 


[-0.026, 


0.018] 


[-0.009, 


0.055] 


o s 


= 9.2% 


0.1 


[-0.956, 


1.150] 


[-0.204, 


0.194] 


[-0.229, 


0.326] 




= 18.3% 


1.0 


[-0.583, 


0.706] 


[-0.094, 


0.084] 


[-0.106, 


0.159] 






10.0 


[-0.241, 


0.316] 


[-0.033, 


0.028] 


[-0.024, 


0.071] 






30.0 


[-0.184, 


0.228] 


[-0.028, 


0.020] 


[-0.011, 


0.056] 
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Table 20: 95% C.L. intervals for the anomalous WWy coupling parameters obtained from fitting the 
Et(y) distribution to the NLO expectations using the combined sample of W(ev)y and W(/xv)y events, 
with A = 2 TeV. 



W(£v)y 




1 fb 1 


10 fb" 1 


30 ft- 1 


A r 


[-0.09, 0.04] 


[-0.05, 0.02] 


[-0.02,0.01] 


Ak 7 


[-0.43, 0.20] 


[-0.26, 0.07] 


[-0.11,0.05] 



The intervals are calculated for W ± y events by combining the electron and the muon decay channels. 

Figure 14 shows an Ej{y) distribution from W ± (£ ± v)y normalized to 1 ft -1 of data. The signal 
expectations at LO and NLO are shown by the dashed and dotted lines on the left in Figure 14. On the 
right in Figure 14 is shown the 95% confidence contour in the A 7 -Afc r parameter space for 1 ft- 1 of data. 
The 1 -dimensional 95% C.L. intervals of A r and Aff r are listed in Table 20. 



~ ATLAS 


o Mock data 




Background 




LO Expected ; 




NLO Expected- 
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Figure 14: Left: the E T (y) distributions of W(£v)y (£ = e, pi) events for 1 ftr 1 of data. Right: the 
95 % confidence contour in the A r -Aff r parameter space (A = 2 TeV) for 1 ft- 1 of W ± y data (with 
W ->ev, juv). 



5.4 WWZ and WWy anomalous TGC sensitivity in W + W analysis 

The Mj spectrum of W+W - pair is fitted to obtain the WWZ and WWy anomalous TGC sensitivity 
intervals at 95% confidence level. A comparison of the Mt(WW) distribution of the 'mock data' to that 
of models with anomalous coupling is shown in Figure 15. Five anomalous coupling parameters, (AfCz, 
Az, Agf , Ak 7 , Ay), have been studied with only one parameter varied at the time; the remaining pa- 
rameters are fixed to Standard Model values. One dimensional anomalous coupling sensitivity intervals 
at 95% C.L. for different integrated luminosities are given in Table 21. The cutoff A = 2 TeV is used 
in these calculations. The two-dimensional anomalous coupling limits from production with 

different scenarios relating the anomalous coupling parameters have also been investigated in this study. 
The two-dimensional contours of the TGC limits at 95% confidence level for 0.1, 1, 10 and 30 ft- 1 
integrated luminosities are shown in Figure 16. The left contours are the limits calculated with the HISZ 
assumption [43]. The right contours are calculated by assuming A z = A r and Ak z = Ak 7 . 
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Figure 15: W + W transverse mass distributions for 1 (left) and 30 (right) fb 1 of integrated luminosities. 
The last bins in the plots are 'overflow'-bins. 



Table 21: One-dimensional 95% C.L. interval of the WWZ and WWy anomalous coupling sensitivities 
from the WW final state analysis for 0.1, 1, 10 and 30 fb -1 integrated luminosities, with A = 2 TeV. 

Int. Lumi (fb -1 ) Akz Az Agf Ak 7 \ y 

0.1 [-0.242,0.356] [-0.206,0.225] [-0.741,1.177] [-0.476,0.512] [-0.564,0.775] 

1.0 [-0.117,0.187] [-0.108,0.111] [-0.355,0.616] [-0.240,0.251] [-0.259,0.421] 

10.0 [-0.035,0.072] [-0.040,0.038] [-0.149,0.309] [-0.088,0.089] [-0.074,0.165] 

30.0 [-0.026,0.048] [-0.028,0.027] [-0.149,0.251] [-0.056,0.054] [-0.052,0.100] 
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Figure 16: The two-dimensional anomalous TGC limits at 95% C.L. for 0.1, 1, 10 and 30 ftr 1 integrated 
luminosities. 




0 100 200 300 400 500 600 700 800 100 200 300 400 500 600 700 800 

p T (Z) [GeV] p T (Z) [GeV] 



Figure 17: Example of a fit to one 'mock data' sample in each channel. The points show the total number 
of data events in each bin (not number per unit pj ). The histograms show the Standard Model prediction 
(solid), the best fit (dashed) and the 95% C.L. limit on \ ff | (dotted). 

5.5 ZZZ and ZZy anomalous TGC sensitivity in ZZ analysis 

Measurements of the pp — > ZZ differential cross-section can be used to measure, or set limits on, ZZZ 
and ZZy couplings. These couplings are zero at tree level in the Standard Model. Measurements of 
the couplings provide a sensitive test of the Standard Model, and non-zero values would indicate the 
presence of new physics beyond the Standard Model. 

In order to estimate limits on anomalous couplings which may be obtained from measurements of 
ZZ production in early ATLAS data, the pj distribution of the Z boson is considered. In the ZZ — ► ££vv 
channel the visible Z boson reconstructed from the charged leptons is used. In the ZZ — >• ££££ channel one 
of the two reconstructed Z bosons is chosen in each event at random. Simulated 'mock data' distributions 
are fitted with the sum of expected signal and background distributions, where the signal distribution 
depends on the anomalous couplings. A binned maximum likelihood fit is employed, with systematic 
uncertainties included by convolution with the predictions. Fits are performed to each channel separately, 
and a combined fit is performed by multiplying together the likelihoods from the two channels assuming 
no correlated uncertainties. 

An example fit for each channel is shown in Figure 17. The results presented here use four pj bins 
for the ££w channel and six pj bins for the 4-lepton channel, as shown in Figure 17. Reasonable 
modifications to the number or position of pj bins change the expected limits by up to 15% (12%) in 
the ££w {££££) channel. Removing the first two pj bins for the ££££ channel, and fitting only the region 
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Table 22: Expected 95% C.L. intervals on anomalous couplings from fits to the ZZ —> ££££ channel, 
the ZZ —> ££w channel and both channels together for various values of integrated luminosity, with 
A = 2 TeV. In each case, other anomalous couplings are assumed to be zero. 





Int. Lumi / fb 1 


fl 


Jf 


fl 


n 


zz -► tm 


1 


[-0.023, 0.023] 


[-0.024, 0.024] 


[-0.028, 0.028] 


[-0.029, 0.028] 




10 


[-0.010, 0.010] 


[-0.010, 0.010] 


[-0.012, 0.012] 


[-0.013, 0.012] 




30 


[-0.008, 0.008] 


[-0.008, 0.008] 


[-0.009, 0.009] 


[-0.009, 0.009] 


ZZ -► ££vv 


1 


[-0.024, 0.024] 


[-0.024, 0.025] 


[-0.029, 0.029] 


[-0.030, 0.029] 




10 


[-0.012, 0.012] 


[-0.012, 0.012] 


[-0.014, 0.014] 


[-0.015, 0.014] 




30 


[-0.009, 0.009] 


[-0.009, 0.009] 


[-0.011,0.011] 


[-0.011,0.011] 


Combined 


1 


[-0.018,0.018] 


[-0.018,0.019] 


[-0.022, 0.022] 


[-0.022, 0.022] 




10 


[-0.009, 0.009] 


[-0.009, 0.009] 


[-0.010, 0.010] 


[-0.011,0.010] 




30 


[-0.006, 0.006] 


[-0.006, 0.007] 


[-0.008, 0.008] 


[-0.008, 0.008] 



Pt > 100 GeV has a negligible effect on the limits. 

Table 22 shows the mean expected limits from each channel separately, and from combining the 
channels, for various values of integrated luminosity. With an integrated luminosity of 1 fb -1 the sensi- 
tivities of the two channels are very similar. At higher luminosities, the £111 channel becomes somewhat 
more sensitive, because it has lower background and hence a lower associated systematic uncertainty. 
With as little as 1 fb -1 of data it should be possible to improve the LEP limits [32] on ff, ff and /J 
by an order of magnitude using a single channel, while a similar improvement on fj will require both 
channels. 

At an integrated luminosity of 10 fb -1 , the expected limits have only a low sensitivity to the back- 
ground level and to the systematic uncertainties. With the same signal efficiency but no background, 
the limits from the ££vv channel improve by 10%, while those from the ££££ channel change by only 
~0.2%; in the latter case, doubling the background has an effect of only ~0.4%. Reducing all systematic 
uncertainties to zero improves the limits by 7% (6%) in the ££vv (££££) channel. Thus, the background 
level and systematic uncertainties are unlikely to be important factors in obtaining limits from early data. 

As discussed above, the expected limits are affected by the choice of pj bins. The number of bins is 
currently limited by the statistics of the fully simulated Monte Carlo events. Future studies would benefit 
from increased signal Monte Carlo statistics, particularly in the high p T region. In addition, samples 
of fully simulated events with anomalous couplings should be used to investigate the dependence of the 
efficiency at a particular pj value on the production diagram. 

6 Summary 

This note presents studies of the production of W ± Z, ZZ,W ± y and Zy dibosons from pp col- 

lisions at the LHC, using leptonic decays of and Z bosons. The simulated measurements are done 
using the ATLAS detector with full detector simulation and event reconstruction, and the statistics ex- 
pected in the initial data taking periods. It focuses on the sensitivities that ATLAS can achieve in the 
early running of LHC, rather than the ultimate sensitivites that ATLAS might reach after running at the 
design luminosity. The advanced analysis technique BDT is used in analysis of most of the final states, 
which improves the sensitivities significantly. Table 16 lists the expected numbers of signal and back- 
ground events using 1 fb -1 of data, and the significance of the Standard Model signals after taking into 
account the known background contributions with 20% systematic uncertainties. It concludes that with 
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Table 23: 95% C.L. interval of the anomalous coupling sensitivities from W + W~, W ± Z, W ± y final 
states with 10.0 ft -1 of integrated luminosity and the cutoff A = 2TeV. The table also indicates the 
variables used in the fit to set the AC sensitivity interval. For reference, some recently published limits 
from Tevatron and LEP are also listed. These limits caculation assumptions are given in the table as well. 



Diboson, 




Ak z Agf 


AfCy 




(fit spectra) 










WZ, (M T ) 


[-0.015, 0.013] 


[-0.095, 0.222] [-0.011,0.034] 






Wy, (p 7 T ) 






[-0.26, 0.07] 


[-0.05, 0.02] 


WW, (M T ) 


[-0.040, 0.038] 


[-0.035,0.073] [-0.149,0.309] 


[-0.088, 0.089] 


[-0.074,0.165] 


WZ, (DO) 










(1.0 ft" 1 ) 


[-0.17,0.21] 


[-0.12, 0.29] (Agf = Ak z ) 






W ± y (DO), 










(0.16 ft- 1 ) 






[-0.88,0.96] 


[-0.2,0.2] 


WW, (LEP) 




[-0.051,0.034] 


[-0.105,0.069] 


[-0.059,0.026] 


(A r = A z ,Ak- z 


= Agf - AfCytan 2 0w) 







Table 24: Expected 95% C.L. intervals on anomalous couplings from fits to the ZZ — > ££££ channel, the 
ZZ — > ££w channel and both channels together for 10 ft -1 of integrated luminosity, with A = 2 TeV. In 
each case, other anomalous couplings are assumed to be zero. The 95% C.L. limits on neutral TGC from 
LEP ZZ detection are also listed. 





fl 


Jf 


fl 


fl 


ZZ -► ££££ 
ZZ -► ££vv 
Combined 


[-0.010, 0.010] 
[-0.012, 0.012] 
[-0.009, 0.009] 


[-0.010, 0.010] 
[-0.012, 0.012] 
[-0.009, 0.009] 


[-0.012, 0.012] 
[-0.014, 0.014] 
[-0.010, 0.010] 


[-0.013, 0.012] 
[-0.015, 0.014] 
[-0.011,0.010] 


LEP Limit 


[-0.30, 0.30] 


[-0.34, 0.38] 


[-0.17,0.19] 


[-0.32, 0.36] 



0.1 ft -1 of integrated luminosity the Standard Model signals of W ± Z, W ± y and Zy can be 

established with significance better than 5a assuming 20% systematic uncertainties. ZZ production can 
be established with 1 ft -1 of data using the four-lepton decay channels. 

Any significant deviation from the Standard Model prediction for these final states can lead to in- 
dications of new physics phenomena. In Section 5, the sensitivities to anomalous TGC are presented. 
The sensitivities are expressed in terms of constraints on the anomalous triple gauge boson couplings in 
the effective Lagrangian. Table 23 compares the 95% confidence level sensitivity interval for charged 
anomalous TGC's using observables from different diboson final states with 10 ft -1 of integrated lumi- 
nosity. 

The neutral anomalous TGC's can be explored with the Zy and ZZ final states. In this note, only 
ZZ pairs are used for constraining the anomalous coupling, with the study using Zy still in progress. 
Both the ZZ — > £ + £~£ + £~ and ZZ — > £ + £~vv final states are used to constrain the neutral anomalous 
TGC parameters ( f%, f%, fl, /J ). The 95% C.L. intervals on the anomalous couplings for 10 ft -1 of 
integrated luminosity are listed in Table 24. 

The current status of the Monte Carlo generators for diboson is less than satisfactory. MC@NLO 
is integrated with a parton shower (Herwig), but it does not have matrix elements for the effective 
Lagrangian beyond the Standard Model with anomalous couplings. The BHO program can generate 
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parton-level LO and NLO diboson events with anomalous couplings, but can not be correctly integrated 
with the parton shower programs. In the current analysis, MC@NLO is used to simulate the Standard 
Model events. The BHO MC with anomalous TGCs is then used to re-weight the events so that the fully 
simulated events can effectively have the anomalous TGCs, and be used directly to compare with the 
simulated 'mock data'. 

Because of the higher center of mass energy at the LHC, the cross-sections for diboson production 
are an order of magnitude higher than at the Tevatron. This will allow ATLAS to improve the Tevatron 
measurements in the early running of the LHC. The large signal statistical significances and signal to 
background ratios determined from these studies suggest that early observations of these channels will 
take place at the LHC start up with 0.1 to 1 fb _1 of data. Systematic uncertainties will dominate the cross- 
section measurement errors starting from 5-30 fb _1 of data. With increasing luminosity, the constraints 
on the anomalous couplings will provide important probes of physics beyond the Standard Model. 
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Abstract 

In the early days of data taking at the LHC, top quark physics will have a role of 
primary importance for several reasons. At start-up, already with the first few 
fb~ 1 of integrated luminosity, a top quark signal can be clearly separated from 
the background even with an imperfectly calibrated detector and the top quark 
pair production cross-section can be extracted at better than 20% accuracy and 
with negligible statistical error. The first measurement of the top quark mass 
will provide feedback on the detector performance and top quark events can be 
used to understand and calibrate the light jet energy scale and the b-tagging. 
Additionally in scenarios beyond the Standard Model, new particles may decay 
into top quarks, therefore a detailed study of the top quark properties may 
provide a hint of new physics. A good understanding of top quark physics 
is also essential as top quark events are a background for many new physics 
searches. 

1 Introduction 

The top quark, discovered at Fermilab in 1995 [1], completed the three generation structure of the Stan- 
dard Model and opened up the new field of top quark physics. Produced predominantly, in hadron-hadron 
collisions, through strong interactions, the top quark decays rapidly without forming hadrons, and almost 
exclusively through the single mode t — > Wb. The W-boson can then decay leptonically or hadronically. 
The relevant CKM coupling is already determined by the (three-generation) unitarity of the CKM ma- 
trix. Yet the top quark is distinguished by its large mass, about 35 times larger than the mass of the 
next heavy quark, and close to the electroweak symmetry breaking scale. This unique property raises a 
number of interesting questions. For example if the top quark mass is generated by the Higgs mechanism 
as the Standard Model predicts and if its mass is related to the top-Higgs-Yukawa coupling, or if it does 
play an even more fundamental role in the electroweak symmetry breaking mechanism. Non Standard 
Model physics could first manifest itself in non-standard couplings of the top quark which show up as 
anomalies in top quark production and decays. By studying the top quark, some of these questions may 
be answered. Further insight in top quark properties will come from measurements done with the high 
statistics sample of tt pairs such as top quark and W polarization studies sensitive to anomalous Wtb cou- 
plings, searches for rare top quark decays indicating the presence of new physics, or for new resonances 
decaying to tt pairs. 

The LHC will be a top quark factory, producing millions of tt pairs in a sample of 10 fb _1 , which is 
expected to be collected during the first years of LHC operation. 

The understanding of the experimental signatures for top quark events involves most parts of the ATLAS 
detector and is essential for claiming potential discoveries of new physics. 

Since numerous single (anti-)top quark events are produced via electroweak interactions, the top quark 
properties, such as the Wtb coupling, can be examined with high precision at the LHC during its first 
years of running. 

2 Top quark pair production 

In proton-proton collisions top quark pairs are produced through both gluon-gluon and quark-antiquark 
scattering (Figure 1). The relative importance of both amplitudes depends on the center of mass energy 
of the collision and nature of the beams: at the LHC the gluon scattering process dominates (~ 90% of 
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the cases) while at the Tevatron, production of top quark pairs is kinematically restricted to the quark 
dominated region. This difference appears also in the values of the cross-section: that is about 100 times 
larger at the LHC than at the Tevatron. The large top quark mass (m t ) ensures that top quark production 
is a short-distance process, and that the perturbative expansion, given by a series in powers of the small 
parameter a s {m t ), converges rapidly. The cross-section used throughout this note for production at the 
LHC has been calculated up to NLO order including NLL soft gluon resummation, and results in about 
833±100 pb, where the uncertainty reflects the theoretical error obtained from varying the renormalisa- 
tion scale by a factor of two [2]. The effect of PDF errors accounts for a few percent uncertainty, while 
varying the top mass by a factor of two, an uncertainty of about 6% on the cross-section is obtained. A 
calculation that includes NNLL soft-gluon corrections results in a central value for the tf cross-section of 
872.8 pb [3]. This translates to about 83,000 top quark pairs in a sample of 100 pb -1 and of the order of 
10 7 top quark pairs produced per year before any selection or detection criteria are applied. 




Figure 1: Top production processes at lowest order: gluon-gluon scattering diagrams (a) 
and b)) and quark-quark scattering diagram c). 



2.1 Observables and phenomenology 

In the Standard Model, the decay of top quarks takes place almost exclusively through the t^Wb decay 
mode. A W-boson decays in about 1/3 of the cases into a charged lepton and a neutrino. All three lepton 
flavors are produced at approximately equal rate. In the remaining 2/3 of the cases, the W-boson decays 
into a quark-antiquark pair, and the abundance of a given pair is instead determined by the magnitude 
of the relevant CKM matrix elements. Specifically, the CKM mechanism suppresses the production of 
b-quarks as \V c b\ 2 ~ 1.7 x 10~ 3 . Thus, the quarks from W-boson decay can be considered as a clean 
source of light quarks. 

From an experimental point of view, one can characterise the top quark decay by the number of W- 
bosons that decay leptonically. A value of 10.8% and 67.6% has been used for the leptonic and hadronic 
branching ratio (BR) of the W-boson, respectively [4]. The following signatures can be identified: 

• Fully leptonic: represents about 1/9 of the tf events. Both W-bosons decay into a lepton-neutrino 
pair, resulting in an event with two charged leptons, two neutrinos and two b-jets. This mode is 
identified by requiring two high p T leptons and the presence of missing transverse energy (|? T ), 
and allows a clean sample of top quark events to be obtained. However, this sample has limited 
use in probing the top quark reconstruction capability of the ATLAS experiment, due to the two 
neutrinos escaping detection. 

• Fully hadronic: represents about 4/9 of the tt decays. Both W-bosons decay hadronically, which 
gives at least six jets in the event: two b-jets from the top quark decay and four light jets from the 
W-boson decay. In this case, there is no high pj lepton to trigger on, and the signal is not easily 
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distinguishable from the abundant Standard Model QCD multi-jets production, which is expected 
to be orders of magnitude bigger than the signal. Another challenging point of this signature is the 
presence of a high combinatorial background when reconstructing the top quark mass. 

• Semi-leptonic: represents about 4/9 of the tt decays. The presence of a single high pj lepton 
allows to suppress the Standard Model W+jets and QCD background. The pj of the neutrino can 
be reconstructed as it is the only source of #j for signal events. 

In this document, top pair production is studied in the semi-leptonic and fully leptonic decay modes. 

3 Single Top Quark Production 

In the Standard Model single-top quark production is due to three different mechanisms: (a) W-boson and 
gluon fusion mode, which includes the t-channel contribution and is referred to as t-channel or Wg as a 
whole (b) associated production of a top quark and a W-boson, denoted Wt, and (c) s-channel production. 
The corresponding diagrams are shown in Fig. 2. We note however that these definitions are valid only 
at leading order (LO): next to leading order (NLO) calculations may introduce diagrams which cannot 
be categorised so unambiguously. The total NLO cross-section amounts to about 320 pb at the LHC. 
Among those channels, the dominant contribution comes from the t-channel processes, which account 
for about 250 pb; the Wt contribution amounts to about 60 pb while the s-channel mode is expected with 
a cross-section of about 10 pb [5] [6]. 




Figure 2: Main graphs corresponding to the three production mechanisms of single-top 
quark events: (a) t-channel (b) Wt associated production (c) s-channel. 

In the following notes, when discussing the analysis strategy in the s- and t-channels, we will use only 
the leptonic decay of the W-bosons (lvbb and lvb(b)q final states, respectively) 1 . For the associated Wt 
production, we will consider events where one of the W-bosons ( either the one produced together with 
the top quark or the one appearing in the top quark decay) decays leptonically and the other hadronically. 
The t decay modes were included in all relevant simulated event samples, though signal selection is 
aimed at electron and muon signatures. 

We note that in pp collisions, the cross-section for single-top quark is not charge symmetric. The 
s-channel tb final state cross-section is predicted to be a factor 1.6 higher than the one corresponding 
to the tb final state. This ratio is 1.7 if only the t-channel processes are included. This feature is of 
special interest since it generates a charge asymmetry in the leptonic final state that can be exploited in 
the analysis to reduce the contamination from the top quark pair production, which constitutes the main 

'The hadronic decay modes have obvious disadvantages for triggering and the lack of a lepton signature increases the 
background significantly 
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background to our signal. On the other hand, the rates for the charge-conjugate processes W + 1 and 
W + + 1 are identical. 

Significant sources of uncertainties affect the theoretical predictions of the production cross-sections. 
The s-channel is known with a precision of 9.1% at NLO [5,7], while the t-channel has an uncertainty of 
4.8% [5,7]. An uncertainty of 3% is quoted for the Wt channel [6]. Those uncertainties come from three 
main sources. The uncertainty in the parton luminosity, depending upon the choice of the partem density 
functions, is particularly important (2 to 4% for the s- and t-channels), since the b parton or the gluons are 
involved in the hard processes. The choice for the renormalization and factorization scales accounts for 
about 2 and 3% uncertainty in s- and t-channel calculations respectively. Finally, a few GeV uncertainty 
on the top quark mass m t results in percent level variation of the cross-sections. The uncertainty in a s 
enters marginally in the total error at a value below the 1 % level. 

4 Monte Carlo samples 

The Monte Carlo samples which have been used for the top quark analyses reported are described in this 
section. The calculation of many processes benefit from methods such as resummation of next-to-leading 
log terms and some are calculated at the full NLO accuracy. All the samples used have been normalized 
using "K-factors", to the NLO theoretical cross-section calculations whenever available [8]. The value 
of m t = 175 GeV has been used for the generation of all samples and all cross-sections correspond 
to this value. Most samples were processed with the full GEANT4 ATLAS detector simulation and 
reconstruction code. In some cases, the fast simulation package ATLFAST has been used. 
The effect of pile-up in the cavern corresponding to a luminosity of 10 33 cm _2 s _1 has been simulated 
both for tt and single top events. 

4.1 Simulation of tt signal events 

Top quark pair production has been simulated using the Monte Carlo generator MC@NLO [9] version 
3.1. The hard process of tt production is calculated at NLO, so that diagrams that produce one additional 
parton in the final state are included at matrix element level. The parton density function CTEQ6M [10] 
is used. Fragmentation and hadronisation is simulated using HERWIG [11] and the underlying event 
by Jimmy [12]. The tt main samples for analysis are a sample of single and double leptonic events and 
one of fully hadronic events. There are no cuts applied at generation level other than the lepton flavor 
separation according to W-boson decay type that allowed subdividing the generated events into these 
two samples. A number of other samples mainly aimed at systematic studies have been produced and 
are listed in Table 1. Among these AcerMC [13] samples interfaced with PYTHIA for the hadronisation 
and fragmentation and simulation of the underlying event, aimed at initial and final state radiation (ISR 
and FSR) studies, that will be discussed later in the text. For the tt rare decays samples were produced 
with TopRex [14] interfaced with PYTHIA for the hadronisation and fragmentation and simulation of 
the underlying event. 

4.2 Simulation of single top quark events 

For the single top quark signal production, the AcerMC matrix element generator was used in conjunction 
with PYTHIA, that was used for hadronisation, fragmentation and simulation of the underlying event. 
The parton density functions CTEQ6M have been used. Compared to TopRex which was previously 
used in ATLAS, its t-channel generation method is based on a more physically motivated method [15] 
for combining LO and tree level NLO diagrams. The contribution from NLO diagrams is rather important 
for the t-channel as the gluon splitting to bb tends to be underestimated with the parton shower method. 
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Table 1 : tt and single top quark simulated samples used throughout the notes. (m t =175 GeV) 



is used as default in the generation. Given are a short description of the simulated physics 
process, the generator used, the production cross-section (a), and the K-factor that should 
be applied to the quoted cross-section. 


MC@NLO + HERWIG tt - fully simulated events - K-factor = 1.0 


a(pb) xBR 


Fully leptonic and semi-leptonic tt 


450 


Fully hadronic tt (ra t =175 GeV) 


380 


Fully leptonic and semi-leptonic tt (m t =160 GeV) 


450 


Fully leptonic and semi-leptonic tt (m t =170 GeV) 


450 


Fully leptonic and semi-leptonic tt (m t =180 GeV) 


450 


Fully leptonic and semi-leptonic tt (m t =190 GeV) 


450 


Fully leptonic and semi-leptonic tt no UE 


450 


Inclusive tt (pj (t) > 200 GeV) 


100 


AcerMC+PYTHIA - tt - fully simulated events - K-factor = 1.0 


a(pb) 


Fully leptonic and semi-leptonic tt 


450 


Fully leptonic and semi-leptonic tt, different ISR/FSR (low top mass) 


450 


Fully leptonic and semi-leptonic tt, different ISR/FSR (high top mass) 


450 


AcerMC+PYTHIA single top quark Wt-channel - full sim. events K-factor = 1.14 


a( P b) 


Single top quark associated Wt production, semi-leptonic decay 


25.5 


AcerMC+PYTHIA single top quark s-channel - full sim. events K-factor = 1.5 




Single top quark s-channel leptonic decay 


2.3 


AcerMC+PYTHIA single top quark t-channel - full sim. events K-factor = 0.98 




Single top quark t-channel leptonic decay 


81.3 


TopRex+PYTHIA - tt rare decays - fully simulated events 




tt^bW(^v)+q7 




tt^bW(£v)+qZ(££); i = e, \i 




tt^bW(£v)+qg 





The s-channel and Wt-channel are generated at LO accuracy only. All three channels were generated 
with W-bosons forced to decay leptonically (e or }X or t). In the case of Wt, either the associated W- 
bosons or the W-bosons from top quark decay is forced to decay leptonically and no dileptonic events 
are included. The Monte Carlo samples that have been used in the analyses and the corresponding 
normalisation cross-sections can be found in Table 1. 

4.3 Simulation of background W + jet events 

For the W + jets production, the ALPGEN [16] generator with HERWIG [11] clustering has been used. 
HERWIG has been used for the simulation of the fragmentation and the hadronisation and Jimmy for the 
underlying event. The MLM [17] algorithm has been used to match the parton shower and the matrix 
element calculations. The matching parameters are the minimum pj of the partons and the minimum AR 
among two partons, defined as the separation of two objects in the tj-0 space, (AR) 2 = (At]) 2 + (A0) 2 . 
Here, tt. is the pseudorapidity of an object, defined as 77 = — ln(tan(0/2)) and 0 and 6 are the azimuthal 
and polar angles, respectively. All cones are defined in tt -0 space. The values of the matching parameters 
that are used in this note are p T = 20 GeV and AR = 0.3. 

A fraction of this background contains heavy quarks. This background is treated separately in ALPGEN 
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by producing W + bb and W + cc (plus light jets) samples. The W-boson background samples used 
throughout the notes are described in Table 2 2 . 

4.4 Simulation of background Z — ► U+ jets events 

The Z —>■ il + n jets background events, where the lepton flavor can be any charged lepton, has been 
generated with ALPGEN, while HERWIG has been used for the simulation of the fragmentation and 
the hadronisation and Jimmy for the underlying event. The MLM algorithm has been used to match the 
parton shower and the matrix element calculations. The matching parameters values are pj = 20 GeV 
and AR = 0.3. 

The contribution of those backgrounds to our analyses is non-negligible only when there is at least one 
jet in addition to the Z. The samples have been generated with up to 5 additional jets. Samples generated 
with PYTHIA have also been used: the complete list of the samples can be found in Table 3 3 . 

4.5 Simulation of di-boson background 

Di-boson events produced with light jets can be a background for the tt and single top quark signal. WW, 
WZ and ZZ processes with all decay modes have been generated with HERWIG: a filter was applied to 
select those events with an electron or a muon with pj > 10 GeV. WW events with the W-boson decaying 
leptonically into final states with two electrons, an electron (muon) and a T-lepton and two T-leptons have 
been generated with MC@NLO interfaced with HERWIG for the hadronisation and fragmentation ( see 
Table 4). 

4.6 QCD background 

QCD multi-jet events are a background for tt and single top quark analyses if at least one of the jets 
in the event is misidentified as an isolated lepton. The level of QCD multi-jet background has large 
uncertainties with the currently available generation tools, which are based on a leading order description: 
ALPGEN has been used to generate these events with the same matching parameters as discussed in 
section 4.3. Given the large cross-section for this process only fast simulated events (ATLFAST) have 
been produced and used for the present studies. Events with 2 to 5 light jets and bb + 0, 1 , 2, 3 light jets 
have been generated requiring at least 3 jets in the final state with pj > 30 GeV. Di-jet fully simulated 
PYTHIA events are available and have been used for comparison with the ATLFAST results (see Table 5). 
In practice, the level of background will be derived directly from the data and will strongly depend on the 
lepton fake rate, and can depend on the topology of the event. Different cuts can be applied to strongly 
reduce this background. In fully hadronic tt decays, QCD multi-jet events are the main background and 
a different strategy has to be developed. These decays are not studied in details in this document. 

5 Reconstruction of physics objects 

We use definitions of high level reconstructed objects (electrons, muons, jets, etc.) that are standard in 
ATLAS. They are described in the following sections. In the definitions, we often use the distance AR 
between objects. 

2 The difference in the cross section for the different leptons in the first set of events in Table 2 (fully simulated W boson 
events with ALPGEN) is due to the truth jet filter. Electrons and T leptons can also be reconstructed as a jet: it is more probable 
for electrons than for T leptons that are reconstructed via their visible decay products. 

3 In the truth jet filter applied to the fully simulated Z boson events with ALPGEN in Table 3 jets made from leptons are 
removed at the filter level. 
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Table 2: W-boson background samples used throughout the notes. Given are a short descrip- 
tion of the simulated physics process, the generator used, the production cross-section (a) 
including filter, matching efficiency for ALPGEN and selection efficiency, and the K- factor 
that should be applied to the quoted cross-section. 



ALPGEN + Jimmy - W-boson - fully simulated events - K-factor = 1.15 


a( P b) 


ev + 2 partons; truth filter 3jets with > 30 GeV 


214 


ev + 3 partons 


124 


ev + 4 partons 


54 


ev + 5 partons 


22 


iiv + 2 partons; truth filter 3jets with pij. > 30 GeV 


16 


jUV + 3 partons 


65 


/xv + 4 partons 


36 


jiiv + 5 partons 


20 


TV + 2 partons; truth filter 3jets with p* T > 30 GeV 


88 


W— > xv + 3 partons 


87 


W— > xv + 4 partons 


46 


TV+ > 5 partons 


21 


/ v i . i v 1 1 . . > -i- jimmy — V t uosoii — /\uiasi sample — jv-i<tciur — 1.13 


a(pb) 


> _i_ q parton 


13400 




2610 


> gy _(_ 2 partons 


826 




239 


ev + 4 partons 


67.4 


W — ► ev-\- > 5 partons 


24.0 


/xv + 0 parton 


13400 


/xv + 1 parton 


2590 


/xv + 2 partons 


826 


/xv + 3 partons 


236 


/xv + 4 partons 


68.3 


/XV+ > 5 partons 


24.3 


W— *■ TV + 0 parton 


13400 


tv + 1 parton 


2620 


TV + 2 partons 


828 


TV + 3 partons 


239 


TV + 4 partons 


67.7 


TV+ > 5 partons 


24.4 


ALPGEN + Jimmy - W + bb - fully simulated events - K-factor = 2.57 


a(pb) 


W + bb + 0 parton; no filter 


6.26 


W + bb+ 1 parton 


6.97 


W + bb + 2 partons 


3.92 


W + bb + 3 partons 


2.77 


ALPGEN + Jimmy - W + cc - fully simulated events - 


a( P b) 


W + cc + 0 parton; no filter 


6.72 


W + cc + 1 parton 


7.49 


W + cc + 2 partons 


4.36 


W + cc + 3 partons 


2.45 
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Table 3: Z boson background samples used throughout the note. Given are a short descrip- 
tion of the simulated physics process, the generator used, the production cross-section (a) 
including filter, selection efficiency and matching efficiency for ALPGEN, and the K-factor 
that should be applied to the quoted cross-section. 



ALPGEN + Jimmy — Z boson — fully simulated events — K-f<ictor — 1.24 


a(pb) 


Z — > e + e~+ 1 partem; p\ > 10 GeV, one jet with > 20 GeV 


138 


Z —>■ e + e~+ 2 partons 


50.5 


Z — > e + e~+ 3 partons 


16.2 




4.6 


Z — > e e~ -\- ^ 5 partons 


1.7 


a — > jU jU + iparion, /?j ^ iu ucv , one jei wnn pj ^ zu ucv 




Z — > 2 partons 


51.7 


Z — > jU. + /x^-i- 3 partons 


16.3 


Z — > 4 partons 


4.6 


Z -> > 5 partons 


1.7 


Z -> t+t~+ 1 parton; p{ > 10 GeV, one jet with p\ > 20 GeV 


57 


Z^T + T~+ 2 partons 


21.3 


Z — > T + T~+ 3 partons 


7.0 


Z^T + T~+ 4 partons 


2.2 


Z -> T+T~ + > 5 partons 


0.8 


PYTHIA - Z boson - fully simulated events - K-factor = 1.22 


a(pb) 


Z -► e + <?~; > 10 GeV, m u > 20 GeV 


1432 


Z p£ ^ 10 GeV ' m « ^ 20 GeV 


1497 


Z -► t+t-; p£ > 5 GeV, m u > 20 GeV 


77 



5.1 Electron definition 

Electron candidates are reconstructed and identified by the calorimeters and inner tracker of ATLAS and 
are reconstructed in the pseudorapidity range \r} \ < 2.5. An electron candidate is defined as a medium 
electron identified by the isEM algorithm [18]. 

If an electron is found in the calorimeter crack region 1.37 < 1 7] | < 1.52, it is vetoed. The electron 
has to be isolated based on calorimeter energy: the additional transverse energy Ej in a cone with radius 
AR = 0.2 around the electron axis is required to be less than 6 GeV. The p T and |t]| cuts depend on the 
event signature and are detailed in the various relevant sections. 

For electrons above 20 GeV and |t]| < 2.5 and outside the crack region, the average identification 
efficiency in tt events is about 67%, with a purity of about 97%. 

5.2 Muon definition 

Muons are reconstructed by the muon spectrometer and the inner detector. The muon reconstruction is 
performed using the Staco algorithm [19], and muons are defined from the best match combination of 
the muon chambers and the tracker information. Muons are reconstructed in the pseudorapidity range 
|tj| < 2.5 and have to be isolated based on calorimeter energy: the additional transverse energy E T in 
a cone with radius AR = 0.2 around the muon is required to be less than 6 GeV. The pj and |t]| cuts 
applied to the muons are given in the selection cuts described in the various sections. For muons above 
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Table 4: Diboson background samples used throughout the notes. Given are a short descrip- 
tion of the simulated physics process, the generator used, the production cross-section (a) 
including filter, matching efficiency and selection efficiency for ALPGEN, and the K-factor 
that should be applied to the quoted cross-section. 



HERWIG + Jimmy — WW — fully simulated events — K-factor = 1.57 


a (pb) 


1 e or n p^ > 10 GeV 


24.5 


HERWIG + Jimmy - ZZ - fully simulated events - K-factor = 1.29 




1 e or pi p<f > 10 GeV 


2.1 


HERWIG + Jimmy - WZ - fully simulated events - K-factor = 1.89 




1 e or n p* > 10 GeV 


7.8 


MC@NLO + HERWIG - WW - fully simulated events - K-factor = 1.0 


a(pb) 


W+W — > e+v e~v; no filter 




W+W" -► e+vrv 




W+W" -► T+ve-v 




W+W -► n + v TV 




W+W -► T+V \l~V 




W+W -► t+v TV 





20 GeV, the average reconstruction efficiency in tt events is 88%. The fake rate, defined as the rate at 
which an object that is not associated with a true muon is mis-identified as a muon, is 0.1 ±0.01 %. 

5.3 Jet definition 

Jets are reconstructed with the standard ATLAS cone algorithm in tj — <j) space, for \r] | < 2.5 and a cone 
radius of 0.4, operating on energy depositions in calorimeter towers [20]. In tt events, jets coinciding 
within AR < 0.2 with electrons (as defined in section 5.1) are removed. 

A jet is identified as originating from a b-quark by determining the probability that it contains a secondary 
vertex. The three-dimensional impact parameter (IP3D) and the secondary vertex (SV1) [21] algorithms 
are used and we require that the resulting output weight of the event is larger than 7.05. The requirement 
on the weight is chosen such that an efficiency of about 60% for jets with p T > 30 GeV in tt semi-leptonic 
events is reached, with a mistag rate of 100. 

5.4 Missing transverse energy definition 

For the missing transverse energy ($t)> we determine the sum of five components. 

1. the contribution of cells in identified electron or photon clusters; 

2. the contribution of cells inside jets; 

3. the contribution of cells in topological clusters outside identified objects; 

4. the contribution from muons; 

5. the cryostat correction. 

For the calculation of fa we use the standard ATLAS variable [22]. The sum of the tranverse energy in 
semi-leptonic top events is about 500 GeV, which gives a typical $t resolution of the order of 10 GeV. 
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Table 5: QCD background samples used throughout the notes. Given are a short descrip- 
tion of the simulated physics process, the generator used, the production cross-section (a) 
including filter, matching efficiency and selection efficiency for ALPGEN. 



PYTHIA - QCD Dijets - fully simulated events - 


a(pb) 


Dijets; e/y filter p T > 15 GeV 


1.91 - 10 8 


Alpgen + HERWIG - jets - Atlfast - 




2 partons; one jet with p^ > 30 GeV 


1.13 -10 6 


3 partons 


2.03 • 10 6 


4 partons 


1.10- 10 6 


> 5 partons 


0.32 -10 6 


Alpgen + HERWIG - bb + jets - Atlfast - 


a(pb) 


bb + 0 parton; one jet with p^ > 30 GeV 


5.3 -10 3 


bb + 1 parton 


33.6 -10 3 


bb + 2 partons 


29.5 -10 3 


bb+ > 3 partons 


18.9 -10 3 



5.5 Overlap removal 

In some cases when there is ambiguity in the definition of an object, the overlaps are removed: if an 
object is identified as an electron it is not counted in the category of photons or taus or jets. 

6 Systematics 

In this section, we list the sources of systematics common to all analysis and describe how they have 
been consistently treated. 

6.1 Estimate of the luminosity and its uncertainty 

At the LHC start-up only a rough measurement of the machine parameters will be available. The expected 
uncertainty on the luminosity during this phase will be of the order of 20-30%. A better determination 
of the beam profiles using special runs of the machine will lead ultimately to a systematic uncertainty of 
the order of 5%. The proposed ALFA detector will measure elastic scattering in the Coulomb-nuclear 
interference region using special runs and beam optics, determining the absolute luminosity with an 
expected uncertainty of the order of 3%. The optical theorem, in conjunction with a precise external 
measurement of the total cross-section, can achieve a similar 3% precision [23]. 

6.2 Lepton identification efficiency 

For the first 100 pb _1 of integrated luminosity, the lepton identification efficiency error is expected to be 
of the order of 1 % for electrons and muons, while the error on the fake rate is expected to be 50% and 
20%, respectively [18,19]. 

6.3 Lepton trigger efficiency 

The lepton trigger efficiency is measured from data using Z events. We expect the uncertainty to be of 
the order of 1% for an integrated luminosity of 100 pb _1 [24]. 
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6.4 Jet energy scale 

In the difficult hadron-hadron collision environment, the determination of the jet energy scale is rather 
challenging [25]. While several methods are proposed such as using y+jet events to propagate the 
electromagnetic scale to the hadronic scale, the jet energy scale depends on a variety of detector and 
physics effects. This includes non-linearities in the calorimeter response due, for example, to energy 
losses in "dead" material, and additional energy due to the underlying event. Energy lost outside the jet 
cone can also affect the measured jet energy. Effects due to the initial and final state radiation (ISR/FSR) 
modelling could also affect the jet energy scale but they are evaluated separately. The ultimate goal in 
ATLAS is to arrive at a 1 % uncertainty on jet energy scale though such performance is only reachable 
after several years of study. The jet energy resolution used throughout the note is 60% / \[E ©5%. 

To estimate the sensitivity of the analyses to the uncertainty on the jet energy scale in early data we 
have repeated them while artificially rescaling the energies of the jets by ±5%. The resulting variation in 
the measurement from the analyses (e.g. cross-section, mass etc.) gives a good measure of the systematic 
uncertainty due to the jet energy scale. 

Since several analyses are using the missing transverse energy in the event to reduce the backgrounds, 
the effect of the variation of the jet energy scale on the missing energy has been taken into account by 
calculating the contribution to the transverse missing energy coming from unsealed jets and subtracting 
it from the overall missing transverse energy in the event. Finally the contribution from the rescaled jets 
is calculated and added to the missing transverse energy of the event. 

6.5 b-tagging uncertainties 

The use of b-tagging in tf and single top quark events is essential in order to reduce the backgrounds, 
in particular that from W+jets, and the combinatorial background when reconstructing the top quark. 
At the beginning of data taking the b-tagging performance will need to be understood and tt events will 
be used as a calibration tool for the determination of the b-tagging efficiency. To avoid having a large 
dependence on the b-tagging efficiency in the early days of data taking we have studied methods to 
extract the tt cross-section and the top quark mass without applying b-tagging. The uncertainty on the 
b-jet efficiency is currently estimated to be of the order of ±5% and the uncertainty on the mistag rate is 
50% [21]. 

6.6 ISR and FSR systematics 

More initial and final state QCD radiation (ISR and FSR) increases the number of jets and affects the 
transverse momentum of particles in the event. Selection cuts for top quark events include these quan- 
tities, therefore ISR and FSR will have some effect on the selection efficiency. In order to evaluate the 
effect of the ISR and FSR systematics, several studies have been performed using the AcerMC [13] 
generator interfaced with the PYTHIA parton showering. 

Samples of tt and single top quark events with separate variations of the PYTHIA ISR and FSR 
parameters have been generated. The study was limited to parameters which have been shown to have 
the biggest impact on the reconstructed top mass at the generator level. The choices of the parameters 
depend on the analysis and include ISR Aqcd, the ISR cutoff, FSR Aqcd, and the FSR cutoff. The effect 
of the selection efficiency can be as large as 10%, depending on the analysis cuts. 

6.7 Parton density uncertainties 

The systematic error due to the parton density functions (PDF) uncertainties is evaluated on tt signal 
samples. Both the PDF error sets CTEQ6M and MRST2002 [26] at NLO are used. Both sets have pos- 
itive and negative error PDFs. In order to evaluate the systematic effect on an observable, the following 
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formula [27] could be adopted: 



AX = - 



where i varies over the set of PDF errors and X + and X is the observable evaluated with the positive 
and negative error PDFs respectively. 

The error PDFs do not guarantee that X^ > Xq and X^ < Xq Vi (here X® is the value of the observable 
as computed with the default PDF set). An alternative approach has thus been proposed in reference [28], 
which entails the use of the following asymmetric formulas, that are used here: 



In order to avoid generating a sample for each set of PDF errors, a re-weighting method has been 
used. Once an event has been generated with a certain PDF, PDF1, it can be re- weighted as if it would 
have been generated with a different PDF, PDF2, by using the following re-weight factor: 



The variable flav; is the flavor of parton i= 1,2, which initiates the hard scattering, x,- its momentum 
fraction, and Q is the mass scale used by the Monte Carlo in the computation of the PDFs. 

Eq. 1 is an approximation and should be checked by comparing samples obtained with the re-weighting 
technique and generated ones. Using the re-weighting technique is not the same as generating the sam- 
ples with the different sets of PDFs, the reason being that in the generator the Sudakov factors of the 
initial state radiation depend on the PDFs and the dependence does not factorize, i.e. it is not a multi- 
plicative factor. The effect is believed to be generally small, but has been checked for each process by 
comparing ATLFAST subsamples of our signal events generated with the PDFs and re-weighted sam- 
ples: the selection efficiencies for those samples have been compared for the tt and single top quark 
analyses and the results agree within the statistical errors. 

6.8 W+jets normalization 

For final states with at least one top decaying leptonically, the ones studied in this document, one of the 
most important backgrounds is W-boson production in association with jets where the W-boson decays 
leptonically. The uncertainty on the normalization of this process is particularly relevant in analyses 
where the background contribution is estimated on the basis of the Monte Carlo expectations rather than 
fitted from the data. 

To evaluate this background we used Monte Carlo samples of W+0,1,2,3,4 jets produced with the 
ALPGEN Monte Carlo, and we evaluated the selection efficiencies for this background. For the nor- 
malisation of the exclusive samples we used the leading order cross-section prediction of the ALPGEN 
Monte Carlo, with standard settings for the renormalisation and factorisation scales. The inclusive cross- 
section is then normalised to the NNLO value [8]. However, the normalisation of the exclusive samples 
has a large theoretical uncertainty due to the matching: we therefore envisage to determine it from data 
itself. 

The exclusive cross-section for W-boson produced in association with jets (a(W + nj)) can be ex- 
tracted from the data by using the following relation: 





event {reweights) = 



fpDF2(xi,flav\ , Q) fpDFijxiJlavi, Q) 
fpDFi (xijlavi , Q) fpDFi(x2,flav 2} Q) 



(1) 



g(W ind ) = a(Z incl ) 
a(W + nj) a(Z + nj) 



(2) 
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where the inclusive W-boson and Z-boson cross-sections ((7(Wi nc i) and a(Zi nc i)) and the exclusive cross- 
section for Z produced in association with n jets (a(Z + nj),n =0 — 4) are extracted from the data. 
Equation (2) is demonstrated to be valid to a few percent level in [29], depending on the selection cuts 
applied (e.g. p T cut on jets). 

When the data will be available the normalisation of the W+j background will be extracted from 
relation (2) using Z+nj samples selected in Z — > ee events. 

Obtaining the normalisation for this background from the data has shown to be essential, since the 
uncertainty on the exclusive cross-section of the W+nj background obtained from ALPGEN can be as 
large as 50% [30]. The largest uncertainties affect the topologies with higher number of jets that are 
typical of tt events. We have verified that such large variations of the exclusive cross-sections are found 
when selecting events with at least 4 jets with pj > 40 GeV in ALPGEN W+nj fast simulated samples 
generated with different sets of matching parameters. While the normalisation has large uncertainties, the 
shapes of the distributions are found to be basically independent of the choice of the matching parameters. 
With data driven methods and about 1 fb _1 of luminosity, a 20% uncertainty on the W+nj normalisation 
should be reachable. 
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Triggering Top Quark Events 

Abstract 

Collisions at the LHC occur at a rate of up to 40 MHz, much larger than the 
200 Hz storage capacity of the ATLAS experiment. The ATLAS trigger sys- 
tem has the challenging task of rejecting 99.9995 % of the events produced in 
collisions, while keeping those needed to achieve the physics goals of the ex- 
periment. This note evaluates the expected performance of the trigger system 
in top quark events by investigating the response of the trigger system to single 
objects such as a muon, an electron or a jet originating from top quark decays. 
In addition, the methodology needed to efficiently select top quark events in 
the online trigger system is discussed including methods to determine trigger 
efficiencies from data. 

1 Introduction 

Triggering at the Large Hadron Collider (LHC) is a challenging task. The selection system is required 
to reduce the initial bunch-crossing rate of 40 MHz down to a manageable 100-200 Hz while retaining 
and recording the five in one million most interesting physics events. ATLAS has designed a three-level 
trigger which aims to select these events, with the highest possible efficiency and lowest possible bias. 

The ATLAS trigger is largely based on signatures of high transverse-momentum particles and large 
missing transverse energy. The first trigger level (LI) is implemented using custom electronics and 
is based on coarse -resolution data from the calorimeters and dedicated fast muon detectors [1]. The 
level-2 trigger (L2) and the Event Filter (EF), referred to together as the High Level Trigger (HLT), are 
implemented in software and run on a commodity computing cluster [2]. The L2 design is based on the 
concept of Regions of Interest (Rols). Algorithms request full-resolution data only from the region of the 
detector corresponding to the LI candidate object and perform a more refined analysis of its features. The 
EF works also in a seeded mode although it has access to the full-resolution data of the entire detector. 
It runs more sophisticated offline reconstruction and selection algorithms. 

At the LHC top quarks are mainly produced either in pairs or singly. A large set of different and 
complex event signatures is expected. For tt events, approximately 44% of the decays will be fully 
hadronic, resulting in a final state with six jets 1 : four light jets from the W-boson decays and two b-jets. 
Another 44% of the decays will be semi-leptonic, with a final state containing one lepton, one neutrino, 
two b-jets and two light jets. Roughly 11% of the tt decays will be purely leptonic, with two leptons, 
two neutrinos, and two b-jets in the final state. In single-top events, the W-boson decays hadronically 
two-thirds of the time and leptonically one-third. More details about the properties and decay modes of 
top events are discussed in the introduction [3]. 

The trigger efficiencies for top events of the most relevant single object triggers are studied in Sec- 
tion 2 of this note. The rich topologies of top events results in large overlap between trigger signatures. 
This feature can be exploited to monitor trigger efficiencies as discussed in Section 3. Finally, in Sec- 
tion 4, the electron trigger efficiency is extracted from Z^ee events and is compared to the true efficiency 
in tt events. 

2 Single object trigger performance in top quark events 

Results are presented for the basic trigger signatures using electron, muon and jet trigger objects, as well 
as missing transverse energy {E™ ss ). The main trigger criteria from the ATLAS trigger tables for the 

1 Neglecting for now initial or final state radiation. 
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initial data taking phase are evaluated using simulated top quark signal samples. Two types of tt signal 
samples are used in this note. The first is the fully hadronic sample with only jets in the final state and 
the second is a leptonic sample, consisting of a combination of semi-leptonic and di-leptonic events, and 
will be referred to as leptonic sample in the following. For top quarks produced singly, the s, t, and Wt 
channels are simulated and only those events which result in an electron or a muon from the W-boson 
decay are considered here. More details about the data samples are given in [3]. 

Trigger items are described by a combination of letters and numbers. The trigger object is represented 
by an abbreviation consisting of one or two letters. It is preceded by a number representing the object 
multiplicity and followed by a number signifying the transverse momentum (pj) of the trigger threshold. 
If an isolation requirement is applied, it is indicated by the letter T after the pj threshold. For example, 
the item name '2EM18F represents a trigger on two electromagnetic objects, with a threshold of 18 GeV 
each, including isolation requirements. The L2 and EF item naming conventions are the same as LI 
except that lower-case letters are used. A trigger chain consists of the LI, L2, and EF trigger items an 
object must satisfy and is referred to using the HLT notation. A trigger menu is a list of triggers enabled 
during a data taking run. For a more complete description, see [4]. 

The dependence of a trigger on the actual pj cut will be investigated here by means of turn-on 
curves, which shows the fraction of objects passing a certain trigger as a function of the reconstructed or 
true simulated p T of that object. Trigger candidate objects are matched to reconstructed or Monte Carlo 
simulated objects by cutting on AR = ^At] 2 + A0 2 between the two objects. 

In the following sections, single object triggers are evaluated in detail first in tt events, and then in 
single-top events. 

2.1 Electron triggers in tt events 

2.1.1 Introduction 

Leptonic tf decays will form a statistically large sample of events early on at the LHC. Figure 1 (a) shows 
the simulated pj spectrum of the electron from a W-boson decay in leptonic tt events. The plot gives an 
idea of the expected fraction of events the trigger system is confronted with above certain pj values. The 
single-electron trigger threshold is expected to be around 20 GeV, indicating that complex multi-object 
triggers (with correspondingly lowered thresholds) may be unnecessary. 

2.1.2 Single electron triggers 

The LI electron triggers operate on reduced granularity (0.1 x 0.1 in At] xA0) calorimeter trigger towers 
which cover the range \t]\ < 2.5. A central cluster of four towers is formed in the electromagnetic and 
hadronic calorimeters, along with a ring of 12 towers around this central cluster. The ring is used to 
select candidates using isolation criteria by cutting on the amount of energy deposited around the central 
cluster. At L2, electromagnetic clusters are formed, tracking is then performed for the first time, and, 
finally, the reconstructed cluster is matched to a track. In the final stage, the EF, tracking and cluster 
determination is performed with more accurate algorithms, further refining the trigger decision. More 
details can be found in [4]. 

2.1.3 Single electron trigger efficiencies 

The lowest unprescaled trigger threshold depends on the luminosity. For a luminosity of 10 33 cm~ 2 s -1 , 
the e22i trigger is the unprescaled trigger with the lowest pj threshold. For the start-up luminosity of 
10 31 cm~ 2 s -1 , an el2i trigger is available. The el2i chain has a lower threshold and hence a higher 
efficiency in leptonic tt events. 
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Figure 1: (a): pj spectrum of electrons from the decay W^ev in leptonic tt events. The number 
of electrons is scaled to an integrated luminosity of 1 pb -1 , and is shown as a function of the 
true simulated electron pj. (b): Turn-on curves for the e22i trigger determined with leptonic tt 
events. The efficiencies are shown with respect to an offline selection (excluding the cut on pj) as 
a function of the offline reconstructed pj of the electron that fired the trigger. 



Table 1: Efficiency of the e22i and el2i trigger for leptonic tf events with a W^ev decay. The binomial 
errors A on the efficiencies for the different trigger levels are quoted for an integrated luminosity of 
100 pb -1 , and are calculated as A 2 = £(1 — e)/N, where £ is the efficiency. Note that no matching 
constraint is imposed between the trigger and the reconstructed or true simulated electrons. Moreover, 
the efficiencies are determined for \r] | < 2.5 by cutting on the r\ of the trigger as well as the reconstructed 
or simulated e lectrons. 



Trigger 


Compared to Monte Carlo 
Eff. [%] 


Compared to offline selection 
Eff. [%] 


e22i: 






LI EM18I 


74.7 ± 0.5 


96.0 ± 0.6 


L2 e22i 


59.6 ± 0.6 


92.7 ± 0.9 


EF e22i 


52.9 ± 0.6 


89.8 ± 1.0 


e!2i: 






LI EM7I 


83.6 ± 0.4 


98.6 ± 0.3 


L2 el2i 


66.7 ± 0.5 


92.6 ± 0.8 


EFel2i 


63.5 ± 0.5 


91.8 ±0.8 



Table 1 shows the fraction of events passing each trigger level in the e22i chain per 100 pb -1 . The 
e22i trigger chain consists of the LI trigger L1_EM18I, and the e22i trigger at L2 and EF. The efficiency 
has been calculated for each trigger level with respect to the total number of leptonic ft events with a 
W^ev decay and with respect to the number of events reconstructed and selected in the commissioning 
analysis [5], requiring \ r\ \ < 2.5 for the trigger and reconstructed or simulated electrons. The cuts are: 

• At least one reconstructed isolated electron with p T > 20 GeV; 

• Ef ss > 20 GeV; 

• At least 3 reconstructed jets with p T > 40 GeV and at least 4 reconstructed jets with p T > 20 GeV. 
Also shown in Table 1 are the percentage of events passing each trigger level in the el2i chain. 
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Compared to e22i, a larger fraction of events passes this looser chain at each level, as expected. Note 
that the large value of the trigger efficiency with respect to the selection, exceeding 90%, is partly due to 
jets fulfilling the electron trigger. Eventually this effect has to be accounted for when determining trigger 
efficiency corrections to be used in an analysis. 

Figure 1 (b) shows the turn-on curves for each trigger level in the e22i chain, determined from 
simulated leptonic tt events. The turn-on behaviour of the trigger is sharp at all levels. From the LI to the 
HLT the pi dependence is very similar, the curves reveal a flat plateau once beyond the turn-on region. 
Note that the slight decrease in efficiencies at large p T values are due to the isolation requirements of the 
e22i trigger. This small loss can be recovered using a high-threshold non-isolated trigger, although it has 
not been considered in this study. 

In summary, the single electron trigger efficiency in tt events is high for tt events with a W^ev 
decay in the current physics trigger menus. The electron trigger chain will be, together with the muon 
chain discussed in the next section, the main trigger for selecting tt events in the golden leptonic decay 
channel. Simple and efficient high- pi isolated single electron triggers exist to select tt events, avoiding 
considerable complications in the determination of trigger efficiencies for complex combined trigger 
signatures. 

2.2 Muon triggers in tt events 
2.2.1 Introduction 




Figure 2: (a): pi spectrum of muons from jj,v decays in leptonic tt events. The muon pi is 
the true simulated value. The distribution is scaled to correspond to an integrated luminosity of 
1 pb _1 . (b): Trigger efficiencies for the mu20 with respect to an offline selection (excluding the 
cut on pi), given as a function of reconstructed muon pi. 

The pi spectrum of truth muons from the jiv decay in tt events is shown in Fig. 2 (a). As for 
electrons, rather high-p T single muon triggers can be used to select tt events with at least one W-boson 
decaying to a jj, and without much loss of efficiency. 

2.2.2 Single muon triggers in ATLAS 

The LI muon trigger consists of fast electronics establishing coincidences between hits of different detec- 
tor layers of the muon system inside programmed geometrical windows. The size of the window defines 
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the transverse momentum interval corresponding to the deflection of the muon in the toroidal magnetic 
field. One of six programmable pj thresholds is assigned to the candidate. The L2 processing consists 
of three reconstruction steps applied to full granularity data of the region defined by LI. First, the muon 
candidate is reconstructed in the muon spectrometer. Then inner detector tracks are reconstructed around 
the muon candidate. Both are combined to form the L2 muons upon which the trigger decision is based. 
Currently only the pj of the muon candidate is checked. Isolation requirements or constraints on the 
X 2 of the combined inner detector and muon spectrometer track are not imposed here (but might be in 
future). The muon reconstruction in the Event Filter is done using offline algorithms. The EF muon 
trigger decision at the moment is also based solely on the pj of the reconstructed muon candidate. A 
more complete description of the muon trigger can be found in [4]. 

2.2.3 Single muon trigger efficiencies 

Using the standard LI muon thresholds, six different trigger items have been defined in the simulated 
samples under study; L1^1U06, L1^1U08, L1JMU10, Ll^lUll, L1^4U20 and L1.MU40. The effi- 
ciencies for each trigger item are shown in Fig. 3 (a). Each efficiency is calculated with respect to the 
number of simulated leptonic tt events with a jj,v decay. It can be seen that the high-px LI single 
muon triggers provide a high efficiency for selecting the events. When running at an increased luminosity 
of 10 34 cm~ 2 s -1 , the threshold could be raised to 40 GeV without an unacceptable loss in efficiency. 




Figure 3: Efficiencies of the single muon triggers for LI (a), L2 (b), and EF (c), determined using 
leptonic tt events with a jj,v decay. For the LI items, the efficiencies are calculated for the 
six available thresholds. The HLT efficiencies are plotted as a function of the p T cut applied. In 
Fig. (b) the L2 chain is started by LI Rols passing the MU06 threshold and in Fig. (c) the EF 
chain is started by muon candidates passing the mu06 signature at L2. Note that the efficiencies 
are calculated with respect to the number of simulated muons within \t] \ < 2.4. 

Figure 3 (b) shows the absolute efficiency of the L2 single muon trigger in the case of the HLT 
chain being started by a MU06 Rol. The pj cut applied by the L2 algorithm is shown on the horizontal 
axis. Similar efficiencies are shown in Fig. 3 (c) for the EF when the EF chain is started by a L2 muon 
candidate fulfilling the mu06 signature. The absolute trigger efficiency for the HLT cannot in principle 
be inferred directly from these plots, as for instance the L2 mu20 processing will only be initiated by LI 
Rols which fulfill the L1JVIU20 requirements. On the other hand, the efficiencies for each trigger level, 
normalized to the number of events with a true simulated muon from a W-boson decay within | r\ | < 2.4, 
have been calculated for a few trigger signatures and are shown in Table 2. Comparing Table 2 with 
Fig. 3, one can see that the differences are small. Hence these figures can be used to indicate the effects 
of a change in trigger threshold cuts. 

The efficiencies of the single muon trigger signatures have also been evaluated for the events selected 
by the commissioning analysis [5]. The same cuts as described in the electron section 2.1.3 are applied. 
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Table 2: Efficiencies of the mu06, and mu20 trigger chains for leptonic tt events with at least one 
jj,v decay. The numbers are absolute trigger efficiencies, the reference offline selection is 
given in [5]. The binomial errors A are calculated as A 2 = £(1 — e) /N, where e is the efficiency, 
for an integrated luminosity of 946pb _1 , which corresponds to the available Monte Carlo statistics. 
Note that the efficiencies are determined for tj < |2.4| by cutting on the tj of the trigger as well as 
the reconstructed or true simulated muons. 



Trigger 


Compared to Monte Carlo 
Eff. [%] 


Compared to offline selection 
Eff. [%] 


mu06: 






LI 


83.8 ±0.3 


91.9 ±0.4 


L2 


80.2 ± 0.3 


88.7 ± 0.4 


EF 


73.1 ±0.2 


83.1 ±0.4 


mu20: 






LI 


74.6 ± 0.2 


86.4 ± 0.4 


L2 


66.3 ± 0.2 


82.3 ± 0.4 


EF 


58.8 ±0.2 


76.6 ± 0.4 



The results are given in Table 2 for the mu20 trigger chain, which consists of the LI trigger L1JVIU20, 
and the mu20 trigger at L2 and EF. The table shows that the mu20 signature is effective in selecting 
events for the commissioning analysis. 

One can also determine the turn-on curve of the trigger efficiency for single muons in tt events. 
For that purpose a LI muon is matched to a reconstructed muon by requiring AR < 0.15 while L2 and 
EF trigger muons are matched by requiring AR < 0.1 2 . Figure 2 (b) shows the mu20 trigger chain 
efficiencies for reconstructed muons as a function of their reconstructed pj. The muons are required to 
have < 2.4, as this is the reach of the muon trigger chambers. The LI turn-on is less steep than 
those for the higher trigger levels, due to the coarser pi threshold assignment at LI than at the HLT. 
The thresholds at the different levels are set such as to reach the plateau at the same value of the offline 
reconstructed pj. At all trigger levels, once the plateau is reached, the efficiencies remain flat versus pj. 

In summary, as for the electrons, the single muon trigger efficiency in tt events is reasonably large. 
The mu20 is the most relevant muon trigger for selecting tt events with at least one W-boson decaying to 
a muon. 

2.3 Jet triggers in tt events 

Jet triggers are dominated by QCD multi-jet events, which have production cross-sections orders of 
magnitudes larger than the top signal processes. Therefore it is not obvious that multi-jet triggers giving 
acceptable rates will correspond to p T thresholds efficient for selecting top events. In order to character- 
ize jet distributions in tt events, Fig. 4 (a) shows the pj distribution of the six highest-px jets (ordered 
in pi and defined using a cone jet algorithm with radius 0.4) for hadronic tt events as obtained after a 
full event reconstruction. The characteristics of top events are a large number of high-px jets. This is 
expected to provide some discriminatory power against the background, which has a steeply falling pj 
spectrum. 

In Fig. 4 (b), turn-on curves are given for different pj thresholds at LI, for leptonic tt events. The 
plot is obtained by matching a jet triggered at LI with a corresponding fully reconstructed jet within 

2 The AR cuts were chosen after studying the Aft distributions for the different trigger levels. 
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Figure 4: (a): Reconstructed pj distribution of the six leading jets for hadronic tt events, (b): 
Turn-on curves relative to the reconstructed jet pj for jets triggered at LI at various threshold 
values for leptonic tt events, (c): LI trigger efficiency versus trigger threshold for multi-jet triggers 
for hadronic tt events. The efficiency is calculated with respect to all events in the sample. 



AR < 0.2. The curves reveal, compared to the lepton triggers, a rather slow turn-on of the LI jet triggers, 
resulting from coarse resolution at LI (cf. [6]). 



Table 3: Jet trigger efficiencies for tt signal and trigger rates, the latter also for the most important 
background events (the background samples are described in [3]). The efficiencies are for the 
whole trigger decision (after the EF), and are calculated with respect to the total number of events. 
The order of magnitude of the trigger rates are given for a luminosity of 10 31 cm -2 s _1 . 



Process 


j20 

Eff. [%] Rate 


jl60 

Eff. [%] Rate 


4j50 

Eff. [%] Rate 


Leptonic tt 
Hadronic tt 

QCD 
W-boson+Jet 


99.8 O(10- 3 Hz) 
100.0 O(10 3 Hz) 
O(10 3 Hz) 
O(10 3 Hz) 


11.2 O(10~ 4 Hz) 
12.9 O(10~ 4 Hz) 
O(Hz) 
O(10 4 Hz) 


9.9 0(10 4 Hz) 
21.0 0(10 4 Hz) 
0(10 1 Hz) 
0(10 4 Hz) 



Figure 4 (c) shows the LI trigger efficiency for different jet multiplicities as a function of the trigger 
pj threshold for hadronic tt events. Single jet triggers will be more than 90% efficient for thresholds 
up to about 60 GeV, and roughly 45% efficient at 100 GeV. Table 3 summarises trigger efficiencies for 
signal and background rates for a selection of single and multi-jet triggers. Depending on how high the 
jet trigger thresholds are set, the signal efficiency varies between 10 and 100%. Due to large QCD rates, 
only high single-jet thresholds or carefully optimised multi-jet triggers are affordable. For example, for 
the j 160, a rate of O(Hz) is to be expected from QCD multi-jets already for an instantaneous luminosity 
of lO^cnrV 1 . 

2.3.1 Multi-jet triggers for the fully-hadronic tt channel 

The multi-jet trigger item 4J50 listed in the previous section is intended to be general purpose, appli- 
cable to a wide range of physics channels. In order to optimize the signal efficiency of the jet triggers, 
especially for the fully-hadronic tt decays not accessible with lepton triggers, other trigger combina- 
tions for multi-jet triggers are studied. The aim is to identify optimum trigger combinations that reduce 
QCD background rates, while keeping a sizable signal acceptance in the fully-hadronic tt decay channel. 
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Table 4: Set of optimum multi-jet trigger combinations for fully-hadronic tt events. The QCD 
multi-jet background rates are given relative to the background rate of the 4j50 trigger (cf. Table 3). 
Efficiencies and rates at the EF are calculated with respect to the total number of events. Note that 
the names of the triggers are inclusive, "4j60_2j 100_j 170" means 4 jets have to pass the 60 GeV 
trigger, 2 jets the 100 GeV, and another 1 jet has to pass the 170 GeV trigger. 



Trigger 


Signal Efficiency \ ' < | 


Relative Background Rate 


S/B 


4j60_2jl00_jl70 


6 


0.13 


2.8- 10~ 3 


5j45_2j60_jl00 


16 


0.34 


3.0- 10" 3 


6j35_5j45_4j50_3j60 


10 


0.18 


3.7 -10- 3 



Therefore, trigger combinations requiring at least 4, 5 or 6 jets for a given set of jet thresholds, were 
tested. 

Table 4 summarizes signal efficiencies and increased background suppression for a set of trigger 
combinations. As figure of merit, the ratio of the trigger efficiency for the signal and QCD events is 
used. A strong suppression of QCD is observed when tightening the cuts on the jet energies, as expected. 
The signal-to-background ratio of the 4J50 trigger (cf. Table 3) is improved by a factor of two in the 
5J45_2J60J100 trigger (cf. Table 4) for roughly the same signal efficiency. Overall, the three trigger 
combinations given in Table 4 are optimal in terms of the figure of merit, and the 5 jet trigger combination 
leads to the best signal efficiency with the lowest background rate for fully-hadronic tt events. It is worth 
pointing out, that the reliable determination of trigger efficiencies from data is another challenging aspect 
of multi-jet triggers. 

Large uncertainties are inherent in predictions for LHC energies, hence the QCD background trigger 
rates given here and the jet trigger definitions are preliminary and subject to tuning as soon as data 
taking starts. This is especially true for the effect of pile-up, which also impacts the signal trigger 
efficiencies. It is shown nevertheless that it should be possible to trigger on fully-hadronic tt decays 
with reasonable efficiency by optimising the choice of thresholds and multiplicity. Further HLT studies 
are underway to fully exploit also advanced methods for background suppression, such as multi-variate 
analysis techniques. 

2.4 Missing E T triggers in tt events 

The LI Ej lss trigger performance in tt events is presented in this section. The HLT performance is not 
discussed due to ongoing development effort at the time of this writing. 

The LI energy triggers calculate missing transverse energy (Ef lss , trigger item name: XE) based on 
reduced granularity calorimeter data (the trigger towers) without taking muons into account, across an T7 
range of \r] \ < 5.0 [4]. 

Figure 5 shows the Ef lss spectra at LI for tt signal and background events normalized to an integrated 
luminosity of 1 pb~ 1 . Shown is the QCD background for which Ej iss is mainly faked by jet response 
fluctuations and losses in non-instrumented regions of the calorimeters. In the W-boson+jets background, 
a neutrino from the W-boson decay is present and produces real E™' ss . Single top quark events are also 
shown, with a combined rate of about one third of tt. As can be seen from the figure, high QCD rates 
will not permit inclusive Ej lss triggers with low thresholds, hence LI trigger efficiencies for tt events 
will be small at larger luminosities. While for example L1_XE30 has an efficiency of 81% for leptonic 
events (applying no selection cuts in the efficiency calculation), the efficiency decreases to 19% for 
L1_XE100. For comparison, the lowest un-prescaled Ef lss threshold for the early running at a luminosity 
of 10 31 cm~ 2 s" 1 is expected to be L1JXE70 [4]. 
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Figure 5: E™ lss spectra as measured by the LI trigger for leptonic and fully -hadronic tt signals and 
the most significant backgrounds to tt. The samples are normalized to an integrated luminosity of 
1 pb _1 . Note that the W + jets samples used here include W-boson decays to electrons, muons, 
and taus, as described in Table 2 of [3]. 



2.5 Single-top trigger 

The final state of leptonic single-top events in the s, t, and Wt channels are characterized by one high-px 
muon or electron, Ef lss , and by one to three jets. Since these leptons are among the high-px products 
of a W-boson decay, a relatively large lepton pj threshold can be used to select the single-top events at 
the trigger level. Therefore, the most important triggers for selecting single-top events are the high-px 
muon and electron triggers. Multi-jet and Ef lss triggers can be used in combination with lower threshold 
electron and muon triggers in order to enhance acceptance. 

The turn-on curves for the mu20 and e22i trigger are shown in Fig. 6. The trigger efficiencies are 
shown with respect to the single-top selection (the cut-based one listed in detail in [7]), and are very 
similar to the ones found in tt events shown in the previous sections. Both triggers exhibit a sharp turn- 
on behaviour and a flat plateau up to 100 GeV. 

The single-top event selection will include the trigger combination e22i OR e55 OR mu20. The 
efficiency for this combination, as well as the total trigger efficiencies for the individual items in the 
muon and electron channels, are shown in Table 5. The efficiencies are found to be high, above 80% in 
the muon channel, and up to 90% in the electron channel. 

3 Trigger redundancy in top quark events 

Due to the rich event topologies, top quark events satisfy many trigger items simultaneously. This redun- 
dancy may be exploited to enhance signal over background rates by forming combined trigger items and 
to monitor the trigger by providing information on trigger efficiencies from data. 

Combining suitable trigger items may reduce significantly the background rate. For example, in 
a combined jet and Ej lss trigger, the Ej lss requirement will suppress the QCD background. For the 
same allocated bandwidth, the jet pj threshold of the combined trigger can be lowered compared to the 
inclusive single jet trigger. Overall this will increase the number of recorded leptonic tt events which 
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Figure 6: Turn-on curves are shown for the mu20 (a) and the e22i (b) trigger. In both plots, 
the circles represent Wt-channel single-top, the squares represent s-channel single-top, and the 
triangles represent t-channel single-top events. 



have good acceptance for a moderate threshold on E™ ss . 

On the other hand, if there is sufficient overlap between two triggers, the efficiency for one of the 
triggers can be measured by selecting offline a clean sample of tf events which satisfied the other trigger. 
For example, the lepton triggers used to select leptonic tt events could be monitored in events that have 
been triggered by the combined jet and Ef ss trigger. Possible biases due to correlation should be checked 
and eventually corrected for. 

3.1 Trigger item overlap in tt events 

The correlation among different trigger items is given in Fig. 7, with plot (a) showing overlaps for fully - 
hadronic tt events and plot (b) showing overlaps for leptonic tt events. The plots show the percentage of 
events triggered by the item on the x-axis that were also triggered by the item on the j-axis. The trigger 
items on the axes were representative of the ATLAS trigger menu at the time of this writing. 

As expected, there is considerable overlap for the leptonic tt events, and the event topology of these 
events is clearly illustrated. The leptonic triggers have very low acceptance in the fully-hadronic tt 
events. Only the very low-pj muon triggers, which will be prescaled [4], have considerable efficiency 
due to muons from b-jets. Any combination of trigger items used to enhance the signal-to-background 
ratio in the fully-hadronic sample will have to rely on the jet triggers (cf. Section 2.3.1) only. For the 
leptonic channel, trigger combinations involving Ef iss are potentially useful. Studies done at LI have 
shown that Ef' ss trigger items indeed overlap considerably with the leptonic and jet triggers in leptonic 
tt events. 



4 Determination of trigger efficiencies from data and application to top 
quark events 

Trigger efficiencies should be determined from data in order to reduce systematic uncertainties, as the 
shape as a function of pj and the absolute value of trigger efficiencies are hard to describe precisely in 
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Table 5: Total trigger efficiencies for the three single-top channels. Efficiencies for the muon 
(W— >• jJ,v) and electron (W^ev) channels are shown separately. The errors are calculated using 
the full available Monte Carlo statistics. Note that the reference event selection [7] selects one and 
only one isolated lepton for the electron and muon channel. Hence the trigger effciency in the Wt 
channel does not increase when taking the OR of electron and muon triggers as compared to the 
single electron or muon triggers. 



Sample 


Muon Channel 


Electron Channel 




Trigger 


Efficiency (%) 


Trigger 


Efficiency (%) 




mu06 


88.4 + 0.6 


e22i 


87.1 + 0.7 


Wt 


mu20 


82.5 ± 0.7 


e22i OR e55 


90.6 + 0.6 




e22i OR e55 OR mu20 


82.5 ± 0.7 


e22i OR e55 OR mu20 


90.8 + 0.6 




mu06 


88.0 + 0.6 


e22i 


89.2 + 0.7 


s 


mu20 


82.6 + 0.7 


e22i OR e55 


90.7 + 0.6 




e22i OR e55 OR mu20 


82.6 + 0.7 


e22i OR e55 OR mu20 


91.0 + 0.6 




mu06 


86.0 + 0.7 


e22i 


89.5 + 0.7 


t 


mu20 


79.6 + 0.8 


e22i OR e55 


90.6 + 0.7 




e22i OR e55 OR mu20 


79.6 + 0.8 


e22i OR e55 OR mu20 


90.9 + 0.7 




Figure 7: Trigger item overlap at the HLT for fully-hadronic (a) and leptonic (b) tt events. The 
numbers in the plots indicate the percentage of events triggered by the item on the x-axis also 
triggered by the item on the y-axis. Note that, by definition, the values on the diagonal have full 
acceptance. The bottom row in each plot gives the total efficiency of the corresponding item on 
the x-axis. 



simulations. 

A methodology is tested to determine the electron trigger efficiency for top events in data. Events 
with Z-boson decays are used to parameterize the electron trigger efficiency as a function of pj and tj by 
means of the tag and probe method [8]. These parametrized efficiencies yield correction factors which 
can then be applied to top events. 
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Electron ti Electron <|> 

Figure 8: Plots of the reconstructed top mass, reconstructed electron pj, r\ and (j). The darker 
histogram is the reconstructed value, the black points include the correction from Z^ee. The 
lighter histogram is from simulation where no trigger selection is applied. 



The Standard Model Z-boson decaying to two leptons is relatively free of backgrounds at the LHC, 
and therefore provides a clean source of events for the tag and probe method. This method relies on the 
ability to obtain a clean sample of Z-boson events using selection cuts after obtaining the initial sample 
using a single electron trigger. From the di-electron final state one can tag the electron that fired the 
trigger, and probe the other in order to measure the trigger efficiency. Once this efficiency is extracted 
it can be applied to other processes which have been obtained using the same trigger, in this case top 
events. 

Having selected top events using the commissioning selection [5] , a correction for the trigger effi- 
ciency is applied to recover the number of events that would be measured by an ideal detector. For each 
event, a weight is applied according to the p T and the position in r\ of the electron in the event. The 
weight is taken as the inverse efficiency extracted from look-up tables, which have been obtained with 
the tag-and-probe method. 

Figure 8 shows the pj r\ , and 0 distributions of the electron in the tt event. The dark histogram rep- 
resents events that have passed the trigger selection, the black points show the events after the correction 
is applied. The lighter histogram is from a sample of events run with the trigger disabled. Also shown in 
the figure is the top quark mass as measured using the above selection, after a correction for the trigger 
efficiency. It can be seen from the figure that the efficiency correction works well, the effect of the trigger 
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can be recovered to reproduce the simulated distributions. The effect of additional jet activity in tf events 
as compared to Z^ee events is thus indicated not to have a big effect at the current level of precision. 

The final value of the trigger efficiency, used as a correction to the cross section measurement of the 
leptonic tt decay channel with one electron in the final state, is a value e tr ig ger of 92.6 ± 0.4 % for an 
el2i trigger chain and 91.5 ± 0.4 % for an e22i trigger chain for 100pb _1 . These values are obtained 
by taking the ratio of the number of events before and after the trigger correction, and the uncertainty is 
statistical only. Note that to obtain these numbers, a matching between the trigger and the reconstruction 
electron candidates has to be done (AR < 0.2), in contrast to the numbers in Table 1. 

In conclusion, it is shown that the tag-and-probe efficiency can be successfully applied to leptonic 
top decays. The method appears robust, however, the issue of background events passing the Z^ee 
selection and the possible bias introduced thereby needs to be investigated. 

5 Conclusions and outlook 

The first studies of the expected ATLAS trigger performance for top events have been shown. All levels 
of the electron and muon triggers are found to be highly efficient for the golden leptonic tt decays. The 
jet triggers, especially important for hadronically decaying tops, are much more challenging to use due 
to the high rate of QCD jets. High jet multiplicities of 5 or 6 jets with optimised pj thresholds may 
provide the best acceptance. 

Top quark events at the LHC will have especially rich event topologies and will result in many trigger 
items being fulfilled simultaneously. The overlap of trigger items was investigated to identify combina- 
tions of trigger items with improved signal efficiency and trigger rate, as well as potential monitoring 
triggers. The use of the E™ ss trigger should be further investigated. 

The trigger efficiency should be determined from data with the least possible bias. This must be val- 
idated with Monte-Carlo simulations, comparing the efficiencies derived with the data driven methods 
to the true efficiencies. Such a study, albeit not yet complete, was shown for the electron trigger effi- 
ciency using simulated Z —>■ ee samples. By using the trigger efficiency correction to the reconstructed 
kinematical distributions, it was shown that the simulated top quark distributions are very well recovered. 
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Abstract 

This note describes the properties of jets from light quarks (u, d, s and c) in 
tt events, with the goal of obtaining a good reconstruction of the hadronically 
decaying W-boson, which will be used for an in-situ jet energy scale mea- 
surement and will enter, together with jets from b quarks, into the top quark 
reconstruction. 

1 Introduction 

The measurement of the top quark mass with a precision of 1 GeV is one of the goals of the LHC 
experiments. In order to achieve such a precision, several complementary methods can be used [1]. The 
most straightforward method, namely the measurement of the tri-jet invariant mass from the hadronically 
decaying top quark in the tt — > Ivb qqb channel, can in principle reach such a precision, provided that 
the three jets are well reconstructed and calibrated. 

This note describes some studies of the reconstruction and calibration of jets from light quarks (u, d, s 
and c) in tt events, with the goal of obtaining the best possible reconstruction of the hadronically decaying 
W-boson. After a comparison of various jet algorithms, two methods of in-situ jet calibration using the 
W-boson mass distribution will be presented. Finally, the stability of the W-boson reconstruction with 
different gluon radiation settings in the Monte Carlo event generator will be shown. 

Unless otherwise specified, the studies presented here have been performed using tt events generated 
with MC@NLO + Herwig, with the top quark mass equal to 175 GeV, at least one of the W-bosons 
decaying leptonically, and a full GEANT simulation of the ATLAS detector. The simulation statistics 
corresponds to a luminosity of about 0.9 fb _1 . By default, no pile-up is included in the simulation, except 
in a few dedicated studies. Triggering was not considered, as tt^lvb qqb events will be triggered mainly 
by the leptonic decay of the second top quark, without biasing the jets on the hadronic side. 

2 A comparison of jet reconstruction algorithms 

The current default jet algorithm used for tt event reconstruction in ATLAS is the cone algorithm with a 
size AR = 0.4. A small cone size was indeed found in previous studies to give the best mass resolution and 
signal over background ratio. This section presents new studies of the optimal jet algorithm, performed 
by looking at the jet energy, angular resolutions and at the W-boson mass reconstruction. For these 
studies, the cone and kx algorithms, with various parameters, were tried. Details on jet algorithms can 
be found in Ref. [2]. 

Cone algorithms define jets as the combination of input objects from generated stable particles or 
calorimeter energy deposits within a cone of radius R around the jet directions tj and 0. In an iterative 
procedure, the jets are repeatedly reconstructed until a stable configuration is found. Two different 
implementations exist: the first one, referred to as seeded cone jet finder, used in this paper, uses high 
Ej objects in the event as a starting point, whereas the second, seed-less implementation, is much slower 
but theoretically more accurate. In both scenarios, the jets obtained undergo a split-merge procedure, to 
define non-overlapping exclusive jets. The resulting jets then need to be calibrated as explained below. 

The second class of jet algorithms consists of the kj algorithm, which reconstructs jets via a cluster- 
ing procedure. Such jets do not necessarily have a cone-shape with a fixed radius. Instead, the algorithm 
clusters "nearest" protojets together, depending on their relative transverse momentum. Thus, it also 
ensures a unique association of input clusters to jets, without the need for split/merge procedures as in 
cone algorithms. 
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The kj algorithm introduces a distance measurement AR and a recombination scheme [3]. The 
distance measurement decides which pair of protojets should be merged and when to stop merging a 
protojet any further. Deciding whether to merge the two closest jets or flagging one of the protojets as 
a final jet is executed recursively in the algorithm, until there are no more mergeable protojets left. The 
recombination scheme determines how to combine two protojets into a new protojets. Here we use the E 
recombination scheme. 

There are two modes of operation for the kx algorithm, the inclusive and exclusive mode. The 
inclusive kx algorithm is intended for defining inclusive jet cross sections, as they are typically used 
in hadron-hadron interactions. It has a parameter R, which controls the decision to merge protojets or 
declare them as final jets, and plays a role comparable to the cone size in cone jet algorithms. 

The kx algorithm in its exclusive mode must account for the proton remnants and the underlying 
event in the p-p interaction for the calculation of exclusive cross sections. Its parameter D cut controls the 
scale which separates "beam jets" from the "hard jets" during the clustering. Moreover, the value of D cut 
can also be determined dynamically if one fixes the desired number N of final jets. 

The basic input objects for both cone and kx jet algorithms are either calorimeter towers, defined as a 
group of cells in a fixed (Atj, A<j>) grid, or topological clusters, defined as a group of cells formed around 
a seed cell [4]. These calorimeter towers or topological clusters are given at the electromagnetic scale. 
The jets obtained are then calibrated to the hadronic scale using different weighting schemes such as the 
Hl-style [5], whose major drawback is that the set of corrections is specific for each type of jet finder 
and each value of its parameters. 

The comparisons of jet algorithms were performed on tt — > Ivbjjb events. In order to consider only 
jets relevant for the top quark mass measurement, an event selection was performed, requiring: 

• a missing transverse energy > 20GeV to account for the unmeasured neutrino. 

• one isolated electron or muon, defined as described in [6], with p T > 20GeV and |tj| <2.5. 

• at least 3 jets with p T > 40 GeV, a 4 th jet with p T > 20 GeV, with \r} | < 2.5. 

2.1 Jet energy resolution and linearity 

This section presents the energy and position measurement performance for various jet algorithms, cal- 
ibrated with the HI weighting scheme. Although we investigated, in this section, all the combinations 
of cone / inclusive kx algorithm, 2 jets sizes, made from towers / topological clusters, with / without 
pile-up, for the sake of clarity not all results are shown in the figures. 

In the events passing the event selection cuts, the jets considered for the resolution studies are the 
two jets closest to the two quarks from the W-boson decaying hadronically, when their distance AR to 
the quark is less than 0.3. As the final goal is to improve the two jet invariant mass reconstruction, the 
jet energies (and directions) are hereafter compared directly to the ones of the associated quarks. 

Figure 1 shows, for four jet algorithms, the distributions of the energy difference between the matched 
quark and jet, divided by the quark energy, for two different jet energy ranges. Larger jets lead, especially 
at low energies, to a worse resolution and to larger tails because the jet energies are overestimated when 
they overlap with other particles in the underlying event. 

These energy distributions were fitted with a Gaussian distribution 1 . The width of the distribution, 
as a function of the energy, is shown on figure 2 and in Table 1. The resolutions obtained here are 
significantly worse than the detector resolution itself, as shown in Ref. [5], mainly because they include 
the fluctuations of the energy lost outside the jets, as relevant to determine the best algorithm for the 

make the fit stable, a first Gaussian was fitted between the mean of the histogram ± 2 times the RMS. A second fit was 
then performed in the same way using the mean and width of the first fit. 
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Figure 1: Distributions of {E quar k - Ej et )IE quar k for the jets from the W-boson decay, for two 
different quark energy ranges (left figure: quark energy between 15 and 50 GeV, right figure: 
quark energy above 350 GeV). 



W-boson (and the top quark) mass reconstruction. To quantify this effect, Figure 3 compares these 
resolutions to the ones obtained by using the jets made from the Monte Carlo hadrons ("Truthjets")- The 
latter ones are typically 20% better, and comparable to the resolutions shown in [5]. 

Jet energy resolutions were also studied with events including the pile-up expected at a luminosity of 
10 33 cm-V 1 (figure 4 and Table 1), showing that: 

• When jets are made from towers, small sizes are preferable, especially for the cone algorithm. For 
example, Figure 4 shows that the energy resolutions of cone 0.4 jets made from towers is degraded 
by 30% to 50% in presence of pile-up. 

• The algorithms using topological clusters seem to behave very well, even in the presence of pile- 
up, as can also be seen on this figure. 

For cone 0.4 algorithms, the reconstructed jet energy is on average 4% lower than the quark energy, 
for quarks with transverse momenta above 40 GeV. This 4% miscalibration can a priori come from the 
detector calibration itself, from the energy lost outside the jet, or from the underlying event. Figure 5 
shows that, in the current simulation, the detector is very well calibrated, to a precision better than 3% 
for all algorithms, and to better than 1% for the cone 0.4 algorithm. 

On the other hand, Figure 6 shows that the out-of-cone energy is important, and accounts for the 4% 
energy shift observed with cone 0.4. 

The difference between the quark and the reconstructed jet energies, for quark energies between 15 
and 50 GeV, is shown in Figure 7 with and without pileup, for the cone 0.4 from topological clusters 
algorithm. For such a small jet size and a topological cluster noise suppression, the average energy is 
shifted by 2.5 %. 

2.2 Jet angular resolution 

Figure 8 shows, for four jet algorithms, the distributions of the distance AR between the quark and 
the matched jet, for the lowest and the highest energy bins. It can be seen that smaller jets have a 
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Table 1: Energy resolution (in %) as a function of the quark energy, for various jet algorithms, with and 
without pile-up. 





Energy range [GeV] 


algorithm 


15 - 50 


50 - 100 


100 - 150 


150 - 250 


250 - 350 


> 350 


Cone AR = 0.4, Tower 


18.3 


14.7 


13.0 


11.2 


9.4 


8.2 


Cone AR = 0.4, Topo 


18.5 


14.8 


12.6 


11.0 


9.5 


8.3 


Cone AR = 0.7, Tower 


25.2 


16.3 


13.5 


11.6 


10.3 


8.7 


Cone AR = 0.7, Topo 


20.4 


14.3 


12.5 


11.2 


9.3 


8.6 


kj R = 0.4, Tower 


21.4 


16.7 


14.4 


11.7 


9.6 


8.0 


k T R = 0.4, Topo 


20.4 


16.2 


13.1 


10.9 


9.5 


8.2 


k T R = 0.6, Tower 


22.1 


16.1 


13.4 


11.2 


10.3 


8.9 


k T R = 0.6, Topo 


19.3 


14.8 


12.6 


10.6 


9.8 


8.2 


Cone AR = 0.4, Tower, pile-up 


27.8 


20.0 


16.1 


13.6 


11.1 


8.8 


Cone AR = 0.4, Topo, pile-up 


20.9 


15.8 


13.1 


12.2 


10.6 


8.5 


Cone AR = 0.7, Tower, pile-up 


41.7 


26.8 


20.4 


19.1 


15.8 


13.4 


Cone AR = 0.7, Topo, pile-up 


34.2 


19.4 


14.6 


13.2 


11.8 


9.6 


k T R = 0.4, Tower, pile-up 


30.4 


23.7 


18.3 


15.5 


12.8 


10.3 


kj R = 0.4, Topo, pile-up 


24.4 


17.3 


14.6 


12.9 


10.9 


9.3 


kj R = 0.6, Tower, pile-up 


33.3 


24.0 


19.6 


17.0 


14.8 


11.6 


k T R = 0.6, Topo, pile-up 


26.1 


18.3 


14.4 


12.8 


11.6 


9.3 



more precise direction determination. This can also be seen from Figure 9 where the full width at half 
maximum (FWHM) of these distributions are plotted. It was checked that, as for the energy resolution, 
the same conclusion remains when pile-up is added, and that jets made from topological clusters are less 
sensitive to pile-up. 

It is observed that, for large cone sizes and large transverse momenta, the distributions of the distance 
between the quark and the matched jet have larger tails (larger fraction of jets with a distance to the 
associated quark between 0.1 and 0.3), because of a higher overlap probability. 

2.3 W-boson mass reconstruction 

This section compares the W-boson mass reconstruction performances for the cone algorithms with sizes 
0.4 and 0.7 and for the k T jet algorithms with different choices of parameters, both using topological 
clusters. 

The choice of the jet algorithm parameters is crucial for the performance of the W-boson reconstruc- 
tion. Indeed, the probability, averaged over the W-bosons selected in this section, for the two quarks 
from the W-boson decay to be at a distance AR smaller than 1.4 is 25%, leading to non-negligible over- 
lap effects for jet sizes of the order of 0.7, as already seen in section 2.1, and eventually to the merging 
of the two jets coming from the light quarks into a single one. On the contrary, the probability for the 2 
quarks from the W-boson decay to be at a distance smaller than 0.8 is only 4%. 

The same event selection as the one described in section 2.1 is performed, apart for the requirement 
that the fourth jet satisfies pj > 40 GeV, which is needed to improve the W-boson purity. On the selected 
events, the three jets whose four-momentum sum has the highest pr are chosen to belong to the hadronic 
top quark. Among these three jets, the di-jet combination whose mass is closest to the known value of the 
W-boson mass (80.4 GeV) is taken to represent the W-boson. The remaining third jet is assumed to be 
the b quark; no further b-tagging information is used in this study. The distribution for the reconstructed 
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Figure 2: Energy resolution for jets from light quarks (integrated over all Tj values), as a function 
of the quark energy, for various jet algorithms (events simulated without pileup). 



W-boson mass is shown in Figure 10. 

To obtain mass values from the invariant mass spectra of Figure 10, the sum of a Gaussian and a 
4 th degree Chebychev polynomial is fitted to the distribution. The polynomial describes the background 
from wrong jet combinations and from background events, whereas the mean value of the Gaussian and 
its error are interpreted as the mass (mw) and its statistical error, o stat . The width of the Gaussian is 
referred to as Ocauss- Table 2 shows the obtained W-boson mass with the E recombination scheme for 
various choices of the kx parameters. 

The error resulting from 1% variation on the jet energy scale (JES) is obtained by varying the recon- 
structed jet energies by ±1% followed by a linear fit across the three W-boson masses (-1%, nominal 
value, +1%). 

With increasing R and D cut parameter values, the reconstructed W-boson (and also top quark) mass 
rise mono tonic ally. Higher parameter values lead to fewer but more energetic jets and thus their com- 
bination has a higher invariant mass. Part of this effect could in principle be absorbed by the in-situ 
calibration, but on the other hand, the event is more likely to be misreconstructed due to unwanted jet 
merging, which makes the choice of the three jets maximizing the py-sum unpredictable. 

The efficiencies quoted in Table 2, defined as the number of events in the gaussian part of the mass 
distribution divided by the initial number of events, which can be as high as 5.8%, drop to less than 4% 
when the jets become too big. 

Table 2 also shows the purity of the W-boson reconstruction, defined as the fraction of events in the 
gaussian part of the distribution, in the range [— l<7 gaus , l(T gaus ] from the mean value of the gaussian fit. 
A purity around 64% can be obtained with some choices of the algorithms / parameters, for example 
cone 0.4 or kp with R = 0.4, whereas other choices may lead to purities about 15% smaller. 

This study indicates that, for the W-boson mass reconstruction in tt events, the cone 0.4 algorithm 
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Figure 3: Comparison of the jet energy resolutions with respect to the Monte Carlo hadrons 
("Monte Carlo jets") and with respect to the initial quarks from the W-boson decay, for events 
simulated without pileup. 
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Figure 4: Comparison of the jet energy resolutions for events simulated with and without pile-up. 
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Figure 5: Relative difference between the Monte Carlo jet energy and the reconstructed jet energy, 
as a function of the Monte Carlo jet energy, for events simulated without pileup. 
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Figure 6: Relative difference between the quark energy and the Monte Carlo jet energy, as a 
function of the quark energy, for events simulated without pileup. 
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Figure 7: Relative difference between the quark and the jet energy, for quark energies between 15 
and 50 GeV, with and without pileup, for the cone 0.4 algorithm from topological clusters. 
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Figure 8: Distance between the jet and the quark, for various jet algorithms / parameters and for 
two different quark energy ranges (left figure: quark energy between 15 and 50 GeV, right figure: 
quark energy above 350 GeV) 
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Figure 9: Resolution (defined as the full width at half maximum) on the distance between the jet 
and the quark, as a function of the quark energy, for various jet algorithms / parameters. 



Table 2: Reconstructed hadronic W-boson mass for different kr-parameters with statistical uncer- 
tainty, efficiency, purity and the systematic uncertainty from jet energy scale. (*) The cone jet 
algorithms use uncalibrated topological clusters with Hl-style cell weights. 



W mass 


kp parameter 


% 


OGauss 


Ostat 


efficiency 


purity 


1% AJES 




[GeV] 


[GeV] 


[GeV] 


[%] 


[%] 


[GeV] 


R = 0.4 


78.30 


7.37 


0.18 


4.4 


63 


±0.26 


R = 0.6 


79.75 


7.09 


0.18 


4.4 


63 


±0.28 


R = 0.8 


80.47 


6.95 


0.23 


3.1 


58 


±0.20 


D cut = (10GeV) 2 


81.25 


7.03 


0.17 


5.4 


54 


±0.21 


D cut = (20GeV) 2 


81.69 


7.40 


0.18 


5.8 


55 


±0.27 


D cut = (30GeV) 2 


81.84 


7.44 


0.20 


5.3 


57 


±0.33 


D cut = (40GeV) 2 


82.00 


7.12 


0.22 


4.2 


57 


±0.39 


N = 5 


82.02 


7.35 


0.21 


4.5 


54 


±0.34 


* Cone R = 0.4 


81.00 


6.38 


0.14 


5.6 


64 


±0.36 


* Cone R = 0.7 


85.15 


8.03 


0.26 


3.9 


53 


±0.55 



leads to the best performance both in terms of resolution and in terms of efficiency / purity. 

Finally, in addition to this study which uses jets calibrated with Hl-style weights, a study of the W- 
boson reconstruction using local hadron calibration, described in detail in [5] and in [7], has been started. 
Although the local hadron calibration algorithm is still being improved, the W-boson mass reconstruction 
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Reconstructed W mass [GeV] 



Figure 10: Invariant di-jet mass as an estimate of the hadronically decaying W-boson mass in the 
selected events. The luminosity used corresponds to 1 fb _1 of signal and background events. The 
inclusive kj algorithm in the E scheme with R=0.4 is used for jet reconstruction. The data points 
for signal and background events are shown in black. The hatched histogram shows the distribution 
for background events (fully hadronic tt events, single top events and W + jets events). 



capabilities are similar to the ones obtained with Hl-weighted jets, both in terms of mass resolution and 
purity. 

2.4 Conclusion 

The studies shown here indicate that a small jet size should be preferred for a proper reconstruction of 
the jets in tt events, to avoid overlap effects spoiling the energy and angular resolutions and, eventually, 
leading to the merging of jets. Thus, the cone 0.4 jet algorithm from towers used in the following sections 
and in most top physics studies is a good choice for looking for top events in the first LHC data, although 
it could be complemented by an algorithm using topological clusters, which seem to be robust against 
pileup. The effect of the pile-up on the W-boson mass reconstruction needs however to be assessed. 

3 Measuring the light jet energy scale 
3.1 Introduction 

The miscalibration of jet energies is one of the main sources of systematic error in the measurement 
of the top quark mass in the three-jet invariant mass method [1]. Indeed, a 1% error on the jet energy 
translates into a 0.9 GeV error on the top quark mass, 0.2 GeV coming from the light jet energy scale 
and 0.7 GeV from the b-jet energy scale. The goal of measuring the top quark mass with a precision of 
1 GeV thus puts the limit on the miscalibration at 1 % or better. 



907 



Top - Jets from Light Quarks in tt Events 



Several physics processes will be used in ATLAS to achieve such a precision [8]: di-jet events will 
help check the uniformity of the response as a function of rapidity; Z+jet and y+jet events will be used 
to measure the energy scale as a function of energy. However, these processes cannot be applied directly 
to the jets in tt events, for several reasons: 

1. These processes contain a mixture of jets from light quarks, jets from b quarks and jets from 
gluons, with calibration factors probably different by a few % due to their different fragmentation 
and neutrino content 

2. The underlying event may be different in tt events 

3. Jet selection performed to purify the tt sample could lead to a different average jet energy scale in 
this sample 

The use of W — > jj from the tt events themselves (in-situ calibration) will thus be very important. 
After having shown how to select a clean W — > jj signal from tt events, two possible methods for 
extracting the light jet energy scale will be presented. All the studies presented in this section (apart 
from the check of the stability with different top quark masses shown in section 3.6.3) use the default 
MC@NLO + HERWIG simulated events (with a top quark mass of 175 GeV), without pileup. The 
cone 0.4 jet algorithm from towers with the Hl-style calibration is used for all studies in this section. 

3.2 Event and jj pair selection 

In order to measure the jet energy scale with a precision of about 1%, a clean W — > jj sample must be 
selected. In this section the following selection cuts, which are stricter than the ones used previously, are 
thus used: 

• One and only one isolated lepton (electron or muon) with Pj > 20 GeV 

• Missing E T > 20 GeV 

• At least 4 jets with pj > pf 

• Among them, 2 and only 2 jets must be tagged as &-jets. The other jets are called "light jets" in 
the following. 

In order to increase the purity, the W-boson candidate jj pairs are selected among all light jet pairs 
which lead to a reconstructed top quark mass between 150 and 200 GeV. In addition, one may want to 
consider only events with only two light jets. 

The estimated number of jj pairs selected for 1 fb -1 and the purity of the selection, defined as the 
fraction of jj pairs with both jets within AR = 0.25 of the two W-boson quarks, are shown in table 3 as a 
function of pf", and whether or not one uses only events with two light jets. 

Requiring two light jets only reduces the number of jj pairs by about a factor of almost two, but is 
necessary if one aims to measure the jet energy scale down to 20 GeV, in order to minimize the bias due 
to the combinatorial background. 

Figure 11 shows the jj mass distribution for pf 1 = 40 GeV and exactly two light jets. The com- 
binatorial background (shown in grey) is almost flat. The systematic error due to the knowledge of the 
combinatorial background will be discussed later. Because of the requirement for 2 b-tagged jets, the 
background from other processes is of the order of a few percent and is neglected here. 
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Table 3: Number of selected W —> jj pairs (for 1 fb ') and purity of the selected pairs (the statistical 
uncertainty on the purity is about 1 %) 





2 jets only 


2 jets 


or more 


pf[GeV] 


jj pairs 


purity [%] 


jj pairs 


purity [%] 


20 


3100 


76.9 


7100 


64.3 


30 


2300 


77.7 


4100 


71.5 


40 


1200 


81.0 


1900 


79.1 




rrijj [GeV] 



Figure 1 1 : Invariant mass mjj of the selected jet-jet pairs when the two jets are at AR < 0.25 from 
the two quarks from the W-boson decay (white histogram) and when at least one of the jets is not 
from the W-boson decay (grey histogram). 



3.3 The P T cut effect in the Jet Energy Scale measurement 

When measuring the jet energy scale, one must be aware of a shift introduced by any cut on the transverse 
momentum of the jets needed for the event selection. Indeed, when one selects jets with a reconstructed 
Pt greater than p c j lt , jets with a true pj lower than the cut but a reconstructed one above are selected, 
whereas jets with a true pj above the cut but a reconstructed one below are lost. These two effects lead 
to a ratio (reconstructed energy) / (true energy) higher than one near the cut, even if the calibration is 
perfect. 

This can easily be seen, and the size of the effect measured, with a simple Monte Carlo simulation 
where the momenta of the quarks from the W boson decays in tt are smeared according to the expected 
jet resolution 2 , and a pj cut on the smeared quark momenta is applied. 

Figure 12 shows the apparent calibration as a function of the jet pj cut, using this simple simulation. 

2 The value o(E) = 3.8 GeV + 0.063 x E q was used. 
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This apparent calibration is greater for jets whose energy is closer to the value of the pj cut. The size 
of the effect is fully determined once the p, pr spectra and the detector resolution are known. With the 
expected resolution of the ATLAS calorimeter, the global apparent calibration is 2% for a p T cut = 40 
GeV. 




40 50 

p= ut [GeV] 



Figure 12: The apparent jet energy scale due to a cut on the transverse momenta of the recon- 
structed jets. The error bars indicate how the shift varies when changing the jet resolution by ± 
20%. This figure was obtained using the simple simulation described in the text. 



The shift in the energy scale also leads to a shift on the reconstructed W boson and top quark masses, 
which are shown in Figures 13 and 14. With a pf l at 40 GeV, the bias on the W boson mass is about 
1.5 ± 0.6 GeV, whereas the bias on the top quark mass is about 2.0 ± 0.4 GeV. 

The two jet energy scale determination methods described below handle this effect in a different way: 

• The iterative method measures the effective jet energy scale above the pf', which will not reflect 
the bare jet energy scale, but will lead to W boson and top quark masses with the correct values. 

• The template method aims at finding the bare jet energy scale (which was by definition 1 in the 
simple Monte Carlo simulation). The reconstructed W boson and top quark masses will then be 
shifted, and must be corrected. 

3.4 Expected Jet Energy Scale 

The energy scales fitted by the two methods described below will be compared with the true jet energy 
scale, obtained by comparing the reconstructed jet energy and the quark energy. 

In the simulation version used for this note, the mean value of the ratio of jet to quark energies, 
when the two jets chosen to form the hadronic W-boson are at AR < 0.25 from the two quarks from the 
W-boson, for quarks with Pr > 40 GeV is 0.961 ± 0.003, as can be seen in figure 15. 

As shown in section 2.1, this 4% miscalibration is mainly due to the out-of-cone energy. 
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Figure 13: Shift of the reconstructed W boson mass, < nijj >, due to the pf f on the jet energies. 
The error bars indicate how the bias varies when changing the jet resolution by ± 20%. This figure 
was obtained using the simple simulation described in the text. 




40 50 
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Figure 14: Shift of the reconstructed top mass, < rrijjb >, due to the pf f on the jet energies. The 
error bars indicate how the bias varies when changing the jet resolution by ± 20%. This figure 
was obtained using the simple simulation described in the text. 
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Figure 15: Ej et /E quar k for jets originating from a W-boson and with pr(quark) > 40 GeV. 



3.5 The iterative rescaling method 
3.5.1 Method 

This method uses the precisely known W-boson mass as a reference to extract the light jet energy scale. 
The invariant mass of the two jets with energies E\ and E 2 and an opening angle 6jj, originating from 
the W-boson can be written as : 



M, 



<„ = ^2E l E 2 (l-cos(d jj )) 



Figure 16 shows that the angle between the two jets is measured without any significant bias for most 
jet-jet pairs. Any deviation of Mjj therefore comes mainly from the energy miscalibration. 

The peak value of this invariant mass matches the PDG W-boson mass value if the appropriate jet 
energy scale factors K(E\) and K{E 2 ) are used: 



Mi 



'w G = ^2(K{E 1 )E 1 ){K{E 2 )E 2 ) x (1 -cos(0 ;7 )) = ^K(E l )K(E 2 )M jj 

If the jet calibration is independent of the energy (K = K(E\) = K(E 2 )), the jet energy scale factor^ 
is simply K = M^ DG /Mjj. The simple rescaling gives the effective jet energy scale, taking into account 
the pr cut effect from the jet selection. 

With 1 fb _1 of luminosity, around 4000 jets originating from the W in the tt sample are available 
for calibration. Normalizing the sample to 1 fb _1 , the truth calibration, for jets with Pj > 40 GeV, 
is Ktrutt, = {Ejet/EpartonY 1 = 1.014 ±0.002. The value obtained by the simple rescaling of jet pairs 
originating from a W boson (no background contribution) is K = M^ DG /Mjj = 1.014 ±0.003 . The 1% 
goal on the precision on the global jet energy scale is therefore obtainable with 1 fb _1 , provided the 
luminosity dependence is small or can be corrected for. 
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Figure 16: (1 — cos(0 ;; mc))/(1 —cos(djj)) as function of the reconstructed cos(djj), where djj 
is the reconstructed angle between the two jets and Ojjmc the angle between the two associated 
quarks. 
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Since the calibration is not constant over the energy and pseudorapidity, and since the two jets from 
the W-boson decay have in general different energies, the rescaling method has to be adapted to get the 
calibration K(var) (var=jet energy, jet pseudorapidity or any other jet variable). Instead of building one 
jj mass distribution, the first step is to split the studied jet variable in N bins, and to build all associated jj 
invariant mass distributions. In such a way, the distributions become correlated to the studied variable. 

It can be shown that, if the rescaling method is iterated M times, extracting at each iteration j the 
correction factor for the bin i (K-) and recomputing the invariant mass by adding the new K- for the next 
iteration, the exact correction K t per bin i can be obtained (thus the function K(var)): 

Ki = U K I (D 

This method converges in 3-4 iterations, and is stopped when the maximum difference between K-> 
and K j ~ l is less than 1%. The number of bins N depends on the available jet statistics. 

3.5.2 Results 

The simple rescaling method allows to determine a global jet energy scale factor. A W-boson mass 
value Mw = 79. 12 ± 0.25 GeV is obtained from a fit with a polynomial+gaussian function to the jet-jet 
invariant mass spectrum. The resulting jet energy scale factor is thus K = M™ G /M ;; - = 1.016 ±0.003 3 . 
This value is very close to the one obtained previously by a fit on the same sample without combinatorial 
background included. 

Figures 17 and 18 show the effective calibration to be applied on the selected jets after the tt selection 
for 10 fb _1 of integrated luminosity 4 . The observed discrepancy between the fit result and the truth K 
factors is due to the (T7, E) correlations existing on the jet energy scale factors. This is particularly true 
for high jet r\ and high jet energy. The discrepancy at low energy is a threshold effect which could be 
removed by lowering the p T cut on the jets to 30 GeV when measuring the jet energy scale above 40 
GeV. Nevertheless the jet energy scale as a function of Y\ and E can be extracted with a precision of the 
order of 1%, without any a priori knowledge of the function shape before the fit. The treatment of the 
correlation could be resolved by a fit on W mass distributions defined as a function of both jet r\ and E. 

For 1 fb _1 of integrated luminosity, the number of available jets (« 4000) limits the number of bins 
and therefore the accuracy of the extracted jet energy scale functions {K(r\) and K(E) ). The resulting 
plots for 1 fb _1 are shown in Figures 19 and 20. A precision better than 2% is obtained in each energy 
or r\ bin. 

3.5.3 Systematic uncertainties 

The systematic uncertainties associated to each point are of three types: 

• the first is related to the method itself. Performing the fit on a large sample shows up to 1% 
deviation, due to the (T7,E) correlations on the jet energy scale which are not taken into account 
yet. 

• the second is related to the impact of the background on the mass peak measurements. It is rather 
low since the W-boson sample has a purity above 80%. Further purification cuts can be applied 
to increase the purity. The associated error depends on the available statistics and is higher for 

3 The W boson mass is set to M w = 80.4 GeV in the simulation. 

4 The calibration factors are large in the barrel-endcap transition region. In principle this dependence is corrected at the level 
of jet reconstruction algorithm using a pj balance method in di-jet events. This correction has not been applied on the used 
sample. 
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Figure 17: Result of the iterative rescaling fit as a function of Ej et , with 10 fb _1 : the expected 
effective calibration factors are shown as square points, in addition to the fitted calibration factors 
with circles. The calibration factor obtained from Monte Carlo truth information K trut h(E) = 
(Eparton/Ejet) are shown as square points while the calibration factor K(Ef) obtained in the iterative 
rescaling procedure as described in section 3.5.1 are described with circles. 



low jet energy where the background contribution is higher. It should be noticed that the jet 
pairs contributing to the background are mainly composed of one jet originating from the W- 
boson, the others being jets coming from the remaining part of the event or gluon radiation. An 
event mixing technique, described in [1] paragraph 4.2, will be useful to assess the shape of the 
background contribution on real data. To evaluate the systematics induced by the background 
contribution, the background contribution was changed by +20%, separately for the FSR and non- 
FSR contributions 5 . The fitted M w values are all within 0.2 GeV, compatible with the statistical 
uncertainty leading to an uncertainty of 0.003 ±0.003 for the global jet energy scale factor. 

• the last comes from the correction factor to be applied on the fitted value to take into account the 
Pt bias discussed in section 3.3. This correction of the effect is fully determined once the jet p, pj 
spectra and energy resolution are known. The size of this correction is —1% above 100 GeV and 
up to 2% at 40 GeV. In order to assess the error on these values, the resolution has been rescaled 
by ±10%, leading to variation less than one percent of the correction factors. 

5 The shapes of the background when a jet coming from the W-boson decay is associated with a FSR or a non FSR jet are 
different. They are taken from a fast simulation Monte Carlo. 
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Figure 18: Result of the iterative rescaling fit as a function of r\j et , with 10 fb _1 : the expected 
effective calibration factors are shown as square points, in addition to the fitted calibration factors 
with circles. 



3.6 The template method 
3.6.1 Method 

The template method for the light jet energy scale determination is similar to the method described in [9] 
for the electromagnetic energy scale determination using Z° — > e + e~ events. It uses template histograms 
with various energy scales a and relative (to the default jet energy resolution) energy resolutions /3 . The 
X 2 between each template histogram and the "data" is then computed. The minimum of the x 2 is found 
in the (a, /3) plane. With 1 fb -1 , this method fits both the average jet energy scale and a relative jet 
resolution. With enough data, it can be extended to measure these quantities as a function of the jet 
energy or rapidity. 

The template histograms were generated from W — > qq decays in 1 .2 million PYTHIA tt events by 
smearing the quark energies by a gaussian number with width equal to 3.8 GeV + 0.063 x E q , determined 
on reconstructed jets in an older simulation of tt events. As will be shown later, the exact choice of the 
default energy resolution in the templates is not very important, as it will lead at the end only to a change 
in the fitted /3 and not in the fitted a. However, in order to obtain a j3 value close to 1, the jet angles 
are also smeared, according to the expected resolution, in the templates and the observed correlation 
between the two jet energies is added by using correlated gaussian numbers for the energy smearing. 

After smearing the quark energies, the same pf* as used in the data is applied. This ensures that the 
fitted energy scale is not affected by the bias described in section 3.3. Figure 21 shows the reconstructed 
jet-jet mass (for events with two jets or more, with the combinatorial background included), the template 
histogram for a = 1, j3 = 1 and the best fit histogram. 
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Figure 19: Result of the iterative rescaling fit as a function of Ej et , with 1 fb : the expected 
effective calibration factors are shown as square points, in addition to the fitted calibration factors 
with circles. 



3.6.2 Results 

Table 4 summarizes the results obtained for jets with Pj > 40 GeV. The combinatorial background, 
which is flat as shown in Figure 11, doesn't affect the fitted jet energy scale (less than 0.6%), but degrades 
the fitted relative resolution by 30 to 40%. The choice to use or not events with more than two jets is also 
not changing the fitted jet energy scale. 

Finally, these fitted energy scales are in good agreement with the expected value from the Monte- 
Carlo (0.961 ±0.003), as obtained in section 3.4. 



Table 4: Fitted jet energy scale and relative jet energy resolution using the template method, for jets 
with transverse momentum greater than 40 GeV. Good combinations refers to the cases where the two 
jets come from the W-boson decay, while the results with all combinations include the effect of the 
combinatorial background. 





a 


P 


2 jets, good combinations 

2 jets, all combinations 
> 2 jets, good combinations 
> 2 jets, all combinations 


0.9693 ± 0.0045 
0.9638 ± 0.0049 
0.9696 ±0.0033 
0.9660 ±0.0036 


1.145 ±0.054 
1.434 ±0.064 
1.089 ±0.035 
1.308 ±0.041 
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Figure 20: Result of the iterative rescaling fit as a function of rfj et , with 1 fb _1 : the expected 
effective calibration factors are shown as square points, in addition to the fitted calibration factors 
with circles. 



3.6.3 Systematic uncertainties 

Several checks of the stability of the method have been performed: 

• The influence of the combinatorial background, and the choice of using or not events with more 
than two jets was already described in the previous section. All fitted energy scales are compatible 
within ± 0.3 %. 

• The stability with respect to the ingredients used in the templates was checked by generating 
template histograms without the smearing of the angles, or without the correlation between the jet 
energies, or with a default jet resolution degraded by 20%. The fitted jet energy scales were found 
to be compatible within ± 0.3%, and only the fitted relative resolutions were changing. This shows 
that the method is not very sensitive to the simulation used for the templates. 

• The jet energy scales were fitted on tt events simulated with various top masses but the same 
version of the simulation and reconstruction codes. The results are shown in Figure 22. All results 
are compatible within ± 0.5%, even when including top masses in a very wide range. 



3.6.4 Stability of the template method with smaller integrated luminosities 

In order to check the stability of the template method at lower luminosities, the 770 pb _1 full dataset was 
split into sixteen parts, each part thus corresponding to 48 pb _1 . The fitted jet energy scales in the sixteen 
pseudo-experiments are shown in Figure 23. The average of the sixteen measurements is 0.9534, in good 
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Figure 21: Jet-jet invariant mass in fully simulated events (dots) superimposed on the template 
histogram with a = 1, j3 = 1 and the best fit histogram. 
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Figure 22: Fitted jet energy scale as a function of the top mass. 
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agreement with the fit on the full sample (0.9562 ±0.0039). The RMS of the sixteen measurements is 
1.9%, only slightly worse than the value expected by a scaling with the square root of the luminosity 
(1.6%). In addition, the average of the errors given by the fit is 1.5%, showing that this error is correctly 
estimated by the fit even at low luminosities. 



ATLAS I 




0 2 4 6 8 10 12 14 16 

Measurement number 



Figure 23: Fitted jet energy scale for sixteen 48 pb 1 pseudo-experiments. The horizontal lines 
show the result of the fit on the full sample with its error bar. 



3.7 Conclusion on light jet energy scale 

Two complementary methods for measuring the light jet energy scale from the W —> jj mass distribution 
in tt events have been studied. The template method is well suited to measure the bare jet energy scale 
(integrated over some transverse momenta), with a precision which could be around 2% with 50 pb -1 . 
Within the statistical precision of the available datasets, no systematic error larger than 0.5% has been 
identified. The required precision of 1% on the light jet energy scale should thus be achievable. The 
iterative method, on the other hand, is well suited to measure the jet energy scale of the selected jet 
sample, possibly as a function of the energy or as a function of rapidity. 

In this study, the invariant mass of selected jet pairs present a clear W-boson signal on top of a 
flat background dominated by combinatorial from top events (Figure 11). This is possible because b 
jets are tagged with a high efficiency and purity. In the early data-taking phase, b-tagging may be less 
performant and the level of background will be much higher with contribution from QCD or W+jet 
events. The performance of the methods for calibration of the light jet energy scale in these conditions 
need to be evaluated. 

The dependence of the jet energy scale with the level of pile-up has to still to be investigated, in order 
to decide whether a specific correction is needed. 
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4 W-boson mass reconstruction and gluon radiation 

The stability of the W-boson reconstruction with different QCD-related jet activity, commonly referred 
to as initial- and final-state radiation (ISR and FSR) has been evaluated by using the standard HERWIG + 
MC@nlo simulation and the "low mass" and "high mass" ACERMC + PYTHIA event samples described 
in [6]. 

4.1 Reconstructed W-boson masses 

The same event selection (described in the previous section) and mass reconstructions is applied on the 
3 samples, with jets being reconstructed with the AT? = 0.4 cone algorithm from towers. 

Figure 24 shows that the W-boson mass distributions for the 3 samples, normalized to the same 
number of events after selection, are clearly not compatible before calibration. The results of the mass 
peak fits are shown in Table 5. The difference between the high mass dataset and the low mass dataset is 
1.7±0.2GeV. 



> 300r 




mjj [GeV] 



Figure 24: Reconstructed W-boson mass distribution (before calibration) in datasets with different 
gluon radiation settings. The distributions are normalized to the number of events after selection 
in the standard simulation and fitted with a gaussian function. 



Table 5 : Fitted top and W masses for the three datasets with different gluon radiation settings. 





maximum mass 


minimum mass 


MC@NLO + Herwig 


W mass (GeV) 
fitted energy scale 
W mass after calibration (GeV) 


81.29±0.12 
0.9779 ±0.0024 
82.46 ±0.12 


79.59±0.15 
0.9489 ± 0.0026 
82.38 ±0.14 


80.44 ±0.14 
0.9636 ±0.0025 
82.63 ±0.14 
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The jet energy scales obtained with the template method described in section 3.6 are also shown in the 
table, together with the W-boson masses obtained after recalibrating the jet four- momenta. One observes 
that the calibrated W-boson masses are above the true W-boson mass because of the bias explained in 
section 3.3, but are all compatible within errors, showing that the calibration method is indeed working 
well. 



4.2 Light jet properties 

Although the in-situ calibration leads to W-boson mass distributions which are independent of the gluon 
radiation level, we looked for additional measurable quantities which could help to understand these 
effects and tune the Monte Carlo simulations. 

Figure 25 show the pj distributions for the light jets coming from the W-boson decay 6 . Only a very 
small difference, at low energies, is visible, which would probably be very difficult to measure in the 
data. 




Figure 25: p T distribution of the jets from the hadronic W-boson decay for datasets with different 
gluon radiation settings. The distributions are normalized to the number of events after selection 
in the standard simulation. In order to look for a possible difference at low values, the cut on the 
p T value of the jets has been reduced to 20 GeV for this figure. 



On the contrary, a very large difference is visible in the number of jets, as can be seen for example 
in the total number of jets with pr > 10 GeV reconstructed in the events, shown in Figure 26. If 
the jet reconstruction efficiency is well understood, this distribution in the data could help to tune the 
simulations, although the number of jets may also depend on the underlying event and may need to be 
corrected for luminosity. 

Finally, no large difference has been seen in the pj distributions for the jets not assigned to top quark 
decays, as shown in Figure 27. 

6 A cut pj > 20 GeV is applied in these figures 
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Figure 26: Total number of reconstructed jets with pj > 10 GeV for datasets with different gluon 
radiation settings. The distributions are normalized to the number of events after selection in the 
standard simulation. 



> 




0 20 40 60 80 100 

P T [GeV] 



Figure 27: pj of the additional jets for datasets with different gluon radiation settings. The distri- 
butions are normalized to the number of events after selection in the standard simulation. 
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5 Conclusions 

This note has presented several aspects of the W-boson reconstruction in tt events. 

First, it was shown in section 2 that small jet sizes should be prefered to avoid overlap effects. The 
current default (cone algorithm with AR = 0.4) is well suited for the study of the first data, but could 
be complemented by a reconstruction using topological clusters, which seems robust in the presence of 
pileup. 

Two methods for the in-situ calibration of the light jet energy scale have been studied (section 3). 
The two methods should be able to provide a light jet energy scale to the 1% level with 1 fb _1 , and 
could perhaps be used to follow the jet energy scale with time and/or luminosity. The two methods use 
complementary approaches and should be both applied in real data, up to the level of a top quark mass 
measurement, to allow for cross-checks. 

A study of simulations performed with different gluon radiation settings (section 4) has demonstrated 
that, although the reconstructed W-boson mass can change by a large amount with different settings, the 
in-situ calibration method is able to correct for it. 

More generally, the clean and large W-boson sample which should be available in tt events will be a 
good laboratory to study the performance of jet reconstruction and calibration in real data, and compare 
them with the performance in Monte Carlo simulated events. 

Most studies presented in this note were performed without including pile-up effects. The depen- 
dence of the results with luminosity should thus be checked. In addition, a scenario of not yet optimal 
detector performance, which would lead to higher backgrounds, should also be studied in order to un- 
derstand the potential of the method for the early phase of data-taking. 

Finally, in order to reconstruct properly the top quarks, the knowledge of the b jets will be of partic- 
ular importance. For instance, the & jet energy scale is, in our simulation, about 5% lower than the light 
jet energy scale. This difference should be checked in the data to reach the ultimate precision for the top 
quark mass measurement. 
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Abstract 

An accurate determination of the top quark pair production cross-section at the 
LHC provides a valuable check of the Standard Model. Given the high statis- 
tics which will be available, corresponding to about one top-quark pair per 
second, at a luminosity of 10 33 cm~ 2 s _1 , the cross-section measurement can 
be performed relatively fast after the turnon of the LHC. Prospects for measur- 
ing the total top quark pair production cross-section with the ATLAS detector 
during the initial period of LHC are presented in this note. The cross-section 
is determined in the semi-leptonic channel, and in the dilepton channel. For 
the semi-leptonic channel we perform the measurement both with and without 
relying on the tagging of b-quark initiated jets. 

1 Introduction 

The determination of the top quark pair production cross section is one of the measurements that will 
be carried out once the first data samples are available at the ATLAS experiment. It casts light on the 
intrinsic properties of the top quark and its electroweak interactions. Cross section measurements are also 
an important test of possible new production mechanism, as non Standard Model top quark production 
can lead to a significant increase of the cross section. New physics may also modify the cross section 
times branching ratio differently in various decay channels, as for example predicted by Supersymmetric 
models [1] with charged Higgs particles, t — ► H b, or with super-partners of the top quark, t — ► t%o- The 
selection of top quark events is based on the identification of a jet from a b-quark, assuming a branching 
ratio BR(t — »■ Wb) = 1. The consistency of this assumption, performed with kinematic methods, is 
another important check of the Standard Model prediction but falls outside the scope of this paper. 

Last but not least, the top pair production process will be valuable for the in-situ calibration of 
the ATLAS detector during the commissioning phase. The large cross section and the large signal to 
background ratio for the semileptonic channel, allows to identify high purity samples with large statistics 
in a short period of time. Understanding the experimental signatures of top events involves most parts of 
the ATLAS detector and is essential for claiming discoveries of new physics. 

The cross-section values and the Monte Carlo samples which have been used throughout this note, 
are described in [9]. 

1.0.1 Cross section measurements at Tevatron 

During Tevatron Run I (1992-1996) an integrated luminosity of about 100 pb _1 at a centre of mass energy 
of yfs = 1.8 TeV allowed to measure top pair production cross sections of 6.5+}'4 pb and 5.7 ± 1.6 pb 
by the CDF and D0 collaborations respectively [2]. The Tevatron Run II started in 2001 and until 
spring 2006 about 1 fb -1 of pp collisions at y/s = 1.96 TeV have been collected and analysed. At this 
higher centre of mass energy an increase of about 30% in the cross section is expected. The most recent 
calculations predict a cross section of 6.7^9 pb [3] at NLO+NLL or 6.8 ±0.6 pb [4] at NLO+ threshold 
resummation for a top mass, m t = 175 GeV. CDF and D0 measured a combined channels cross-section 
equal to 7.3±0.5(stat) ±0.6(sys)±0.4(lum) [5] and 7.4±0.5(stat)±0.6(sys)±0.4(lum) [6] respectively. 
All the measurements are in good agreement with the predictions. From a combination of all results, an 
experimental error of the order of the theoretical error is expected. 
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2 Single lepton channel 

In this section the strategy for the determination of the tt cross-section in the semi-leptonic decay mode 
is described. This channel, which has a branching fraction of approximately 45%, has a clear signature, 
is experimentally easily accessible and is expected not to suffer from large backgrounds. 

A robust analysis is presented of the first 100 pb _1 ATLAS data, which are expected to be collected 
during the first few months of the LHC data taking period. In particular it is studied whether a pure tt 
sample can be identified without utilizing the full ATLAS b-tagging capabilities. This is brought about 
by the fact that efficient tagging of jets originated from the hadronization of b-quarks, called b-tagging 
from now on, is non trivial and implies a precise alignment of the inner detector, which will probably 
require several months of data taking. This analysis solely relies on the measurement of jets, leptons and 
£'™ 1SS (transverse missing energy), and requires a functioning lepton triggering system. Thanks to the 
over-constrained kinematics of the tt system, with the selected events it will be possible to measure the 
b-tagging performance and the £"™ ss , as well as to calibrate the light jet energy scale. 

2.1 Event selection 

The identification of semi-leptonic tt events starts by requiring a highest level (event filter) lepton trigger 
to have fired. In this study we assume that either the single isolated electron trigger e22 or the muon 
trigger mu20 (for the definition see [7]) has fired. A correct description of the trigger efficiencies is vital 
for the cross-section determination. The strategy for determining the trigger efficiencies from the Monte 
Carlo, as well as from the data without relying on Monte Carlo, is not pursued in this note, but included 
in [7]. 

Further, we define a candidate tt event as having one reconstructed high-/? r isolated lepton (electron 
or muon), a minimal amount of missing energy and at least four reconstructed jets. The definition of 
electrons, muons and jets in our analysis has been discussed before. 

For our default off-line selection the events are required to fulfil the following: 

• One lepton (electron or muon) with p T > 20 GeV. 

• Ef ss > 20 GeV. 

• At least four jets with p T > 20 GeV. 

• Of which at least three jets with p T > 40 GeV. 

The fraction of events passing the individual selection requirements and the overall selection effi- 
ciency are shown in Table 1 for semi-leptonic events. In this table we split the efficiencies for semi- 
leptonic tt events according to the W decay in the Monte Carlo generator: tt (electron) where it decayed 
to an electron and a neutrino and tt (muon) where it decayed to a muon and a neutrino. 

We observe a combined efficiency for these requirements which is somewhat larger for the tt (muon) 
events compared to the tt (electron) events. 

2.1.1 Reconstructing tt events 

Before discussing additional requirements to improve the purity of the tt event selection, we present 
the second step in the event reconstruction. In this step we test the events for compatibility with a tt 
hypothesis. In the tt candidates, three of the reconstructed jets are expected to form the hadronic top- 
quark. In the absence of b-tagging there is an additional ambiguity in choosing the correct three -jet 
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Table 1: Fraction of events passing the various selection criteria and the combined 'default' selection 
efficiency for semi-leptonic (electron and muon) analyses respectively. The statistical uncertainties on 
these numbers are negligible. 





Trigger 


Lepton 




Jet req. (I) 


Jet req. (II) 


Combined 




eff (%) 


eff (%) 


eff (%) 


eff (%) 


eff (%) 


eff (%) 


tt (electron) 


52.9 


52.0 


91.0 


70.7 


61.9 


18.2 


tt (muon) 


59.9 


68.7 


91.6 


65.5 


57.3 


23.6 



Hi, 



Figure 1: (a): Three-jet invariant mass distribution for the electron analysis default selection, normalised 
to 100 pb _1 . The statistical errors in each bin are indicated, (b): The same distribution after the additional 
W-boson mass constraint.. 



combination among the reconstructed jets. We define our top-quark decay candidate as the three -jet 
combination of all jets that has the highest transverse momentum sum. 

Fig. 1 (a) shows the reconstructed top mass for this selection (from now on referred as default selec- 
tion) for the tt sample. The top mass peak is clearly visible, and the tails of the distributions correspond 
to the combinatorial background. 

2.1.2 Selection variations: I 

Apart from the default event selection as described above, a number of additional criteria are defined to 
further increase the purity of the top sample. Here we improve on the simple tt analysis by exploiting 
additional information: every three-jet combination that originates from a top decay also contains a two- 
jet combination that originates from a W-boson decay. To illustrate the presence of the W-boson we take 
the three jets that constitute the top quark, and select from the three combinations of di-jets the one that 
results in the highest value of the sum of the pj of the two jets. The IV-boson mass is then the invariant 
mass of the two jet system. In Fig. 2 (a) this mass distribution is shown for the electron analysis, and the 
VK-boson mass peak around 80 GeV is clearly visible. 

However we prefer an unbiased W -boson mass distribution in the analysis, for which we choose not 
to pick/define one particular W -boson di-jet pair out of the three combinations, but rather require that at 
least one of the three di-jet invariant masses is within 10 GeV of the reconstructed mass of the W-boson 
(taken as the peak value of the mass distribution of the W-boson candidates). This selection will be 
referred to as the W-boson mass constraint selection. The distribution of all three di-jet combinations in 
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Ay 



Figure 2: (a): The di-jet combination with highest pj (left) for the electron analysis, (b): The three di-jet 
combinations invariant masses among the top-quark candidates in a 100 pb _1 event sample for the muon 
analysis. 



the top candidate is shown in Fig. 2 (b). Note that each event enters three times in this distribution. In 
this figure the background, as discussed in the next section, is already included. 

The distribution of the three-jet invariant mass after the additional requirement that at least two jets 
are compatible with the mass of the W -boson is shown in Fig. 1 (b). This requirement shows a substantial 
reduction in the tt combinatorial background compared to Fig. 1 (a). Notice that, compared to the default 
selection, the top mass peak becomes narrower and the tail of the distribution is reduced. However, the 
W -boson mass constraint also introduces a visible shoulder in the distribution which makes fitting to the 
data more subtle. 

In Table 2 we show the fraction of tt events that pass these various selection requirements. 

Table 2: Efficiencies at different stages of the electron and muon analyses for several event types: after 
trigger and event selection (left column), after a cut on the di-jet masses (see text for details) in the top 
candidate (middle column) and events with, in addition to the di-jet mass cut, a hadronic top mass 141 < 
m, < 189 GeV (right column). The first three rows correspond to the single-lepton final states, the fourth 
row to the di-lepton final state and the last row to the hadronic final state. 





Electron analysis 






Muon analysis 


Event type 


Trigger+Selection (%) 


Trigger+Selection (%) 






W const. m t 






+ W const. m t win 


tt (elec) 


18.2 


9.2 


4.5 


0.1 


0.0 0.0 


tt (muon) 


0.0 


0.0 


0.0 


23.6 


12.0 5.8 


tt (tau) 


1.4 


0.7 


0.3 


2.0 


1.0 0.5 


tt (di-lepton) 


2.2 


1.0 


0.2 


3.0 


1.3 0.4 


tt (hadron) 


0.0 


0.0 


0.0 


0.1 


0.1 0.0 



2.1.3 Background evaluation 

We consider a number of background processes. The dominant expected background is IV-boson+jets, 
but also single top production, Z-boson+jets and Wbb are sizeable. Tables 3 and 4 summarise the ex- 
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Table 3: Number of events which pass the various electron selection criteria for the tt signal and for the 
most relevant backgrounds normalised to 100 pb -1 . 



Electron analysis 


S ample 


default 


W const. 


m, win 


W const. 


W const. 


W const. 










-4- In 1 <r 1 








2555 


1262 


561 


303 


329 


208 


hadronic tt 


11 


4 


0.0 


0.8 


0.6 


0.0 


W+jets 


761 


241 


60 


38 


7 


1 


single top 


183 


67 


23 


12 


18 


7 


11 +jets 


115 


35 


8 


5 


2 


0.4 


Wbb 


44 


15 


3 


5 


5 


0.7 


Wcc 


19 


6 


1 


1 


0.4 


0.0 


WW 


7 


4 


0.4 


0.0 


0.0 


0.0 


wz 


4 


1 


0.4 


0.2 


0.0 


0.0 


zz 


0.5 


0.2 


0.1 


0.0 


0.0 


0.0 


Signal 


2555 


1262 


561 


303 


329 


208 


Background 


1144 


374 


96 


63 


33 


10 


S/B 


2.2 


3.4 


5.8 


4.8 


10.0 


20.8 



pected numbers of signal and background events for the electron and muon analysis respectively. The 
first column of the two tables shows the event numbers obtained by applying the default selection, whilst 
the second column gives the corresponding numbers with the W -boson mass constraint. All numbers 
are normalised to 100 pb _1 . The evaluation of the QCD fake rate deserves a separate discussion. The 
QCD production of pp — »■ bb is characterised by a cross-section of about 100 fib, and can therefore be 
an important background for our signal. Requiring the presence of a high pj lepton and missing energy 
can reduce its contribution, but since the cross-section enhancement relative to the signal is so large, 
there might be QCD events with a fake lepton and/or poor missing energy reconstruction that pass these 
requirements as well. 

The rate for extra (medium [8]) electrons is studied and found to be roughly 1.0 x 10~ 3 per jet. This 
number is divided between semi-leptonic B(D) decays and true fakes, i.e. hadronic objects identified as 
electrons. The origin of extra isolated muons is dominated by semi-leptonic B decays, i.e. by the presence 
of hard b-quarks. The isolated muon rate per b-parton reaches a few times 10~ 3 for b-parton momenta 
around 40 GeV, while the fake rate is only a few times 10~ 5 . By studying their origin and dependence on 
jet/parton kinematics like the p T , tj , jet multiplicity and quark content of the jet, we can get an estimate 
of the fraction of multi-jet events that will pass the lepton requirement in the event selection. The validity 
of this approach has been checked using a large sample of di-jet events at various transverse momenta. 
As a result, the QCD background has been evaluated to be smaller than the W-boson+jets background 
and will not be discussed further. 

The distribution of the invariant mass of the three -jet combination that forms the hadronic top-quark 
candidate with the default selection and with the backgrounds added together, is shown in Fig. 3 (a). The 
events where the correct jets were selected to reconstruct the hadronically decaying top quark candidate 
are clearly visible as the mass peak (open histogram) on top of a smooth background distribution. This 
background is partially composed of events from non-top processes (light shaded histogram), but is 
dominated by the (combinatorial) background from semi-leptonic tt events (dark shaded histogram). The 
combinatorial background was determined using the matching of the top candidate with the generated 
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Table 4: Number of events which survive the various muon analysis requirements for the tt signal and 
for the most relevant backgrounds normalised to 100 pb -1 . 



Muon analysis 


S ample 


default 


W const. 


m t win 


W const. 


W const. 


W const. 










+ In 1 <? 1 








3274 


1606 


755 


386 


403 


280 


hadronic tt 


35 


17 


7 


6 


5 


2 


W+jets 


1052 


319 


98 


47 


11 


0.0 


single top 


227 


99 


25 


19 


19 


10 


Z^ 11 +jets 


84 


23 


3 


2 


0.5 


0.0 


Wbb 


64 


19 


4 


4 


5 


2 


Wcc 


26 


9 


3 


0.7 


0.1 


0.0 


W W 


7 


3 


0.7 


0.7 


0.0 


0.0 


W Z 


7 


3 


0.8 


0.5 


0.0 


0.0 


zz 


0.7 


0.3 


0.1 


0.0 


0.0 


0.0 


Signal 


3274 


1606 


755 


386 


403 


280 


Background 


1497 


495 


143 


84 


42 


14 


S/B 


2.2 


3.2 


5.3 


4.6 


9.6 


20.1 



top-quark in a cone of size AR < 0.2. 

In Fig. 3 (b) the reconstructed three-jet mass after the W-boson mass constraint is presented. The 
background is also shown. 

Table 3 and 4 show the number of signal and background events in a 100 pb _1 data sample. To give 
an indication of the signal purity in the top mass peak region, in the third column of Tables 3 and 4 we 
give the number of events in a hadronic top mass region: 141 < m t < 1 89 GeV. Although not all signal 
events are correctly reconstructed, in both the electron and muon analyses the purity of the signal in the 
top mass window is close to 80%. 




Figure 3: (a): Expected distribution of the three -jet invariant mass after the standard selection. The white 
area represents the tt signal in the muon channel. The dark shaded area is the combinatorial background 
and the light shaded area represents the background contribution, (b): The same after the W-boson mass 
constraint in a 100 pb~ ! event sample. Both plots are for the muon analysis. 
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2.1.4 Selection variations: II 

Additional ways to kinematically select top events other than the W-boson mass constraint, or to improve 
the signal purity after having applied the W -boson mass cut itself, were explored. In the commissioning 
phase, it can happen that the barrel calorimetry will be better calibrated than the forward one. Therefore, 
it can be useful to apply the additional request that the three highest p j jets are all at \r\\ < 1. The 
reconstructed top mass in this case is shown in Fig. 4. 




Figure 4: (a): Reconstructed top mass after W -boson mass constraint for the electron analysis. The 
white area represents the tt signal in the electron channel, while the three shaded areas corresponds, 
going from the lighter to the darker, to the background from W-boson+jets, single top and all the other 
background sources considered in the analysis. The distribution is normalised to 100 pb _1 . (b): The 
same distribution, but in addition requiring that the three highest pj jets are at 1 77 1 < 1 . 



The centrality requirement applied after the default selection allows to reach the same signal-over- 
background that one obtains after applying the W-boson constraint. Tables 3 and 4 show the signal-over- 
background and signal efficiencies for the electron and muon analyses if the centrality requirement is 
applied in addition to the W-boson constraint (fifth column). 

Other variables were exploited as well, like the cos0* 1 and the total invariant mass of the event. In 
the following no cuts on these variables are used in the analysis. 

2.2 Determination of the cross-section 

In this section two complementary methods to determine the it cross-section of the commissioning 
analysis are presented. The first method estimates the tt signal by performing a maximum likelihood 
fit on the three-jet invariant mass distribution. The second is based solely on counting the number of 
top candidate events that pass the selection, and subtracting all backgrounds in order to get the yield of 
tt events in the sample. The two methods are affected by different systematics. Whereas this counting 
method needs all backgrounds to be properly addressed and normalised, it does not rely on a correct 
reconstruction of the top quark. The peak-fit method is rather insensitive to background normalisation 
and efficiencies, but requires a fairly well understood top mass peak. 

'it is the angle that one jet forms with the direction of the incoming proton in the centre of mass of the event (it is expected 
that the top decay products are emitted more centrally than the W-boson+jets and jets from QCD) 
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2.2.1 Likelihood fit method 

To extract the number of completely reconstructed it events (after having applied the default + W-boson 
mass constraint selection) a maximum likelihood fit is performed to the three-jet mass distribution with a 
Gaussian signal on top of the background described by a Chebychev polynomial, see Fig. 5. It has been 
verified that the background model correctly describes the combined tt combinatorial and background 
distribution in the signal region by comparing the fitted background to the subset of events that are not 
fully reconstructed signal as determined from truth matching information. 

Using 10000 pseudo-experiments, based on the input from the full simulation Monte Carlo events, one 
can extract the average fraction of signal events that pass all selection requirements and enter in the 
peak (i.e., the fraction correctly reconstructed). The average number of events in the peak in the muon 
analysis, i.e. correctly reconstructed semi-leptonic tt (muon) events, in 100 pb _1 is 508 events as shown 
in Fig. 5 (a). This corresponds to an efficiency of (4.23 ± 0.57)%. For the electron analysis this efficiency 
is (2.73 ± 0.47)%. 

The signal significance is defined using the likelihood ratio from two hypotheses: the presence of a 
signal (a peak) and its absence (only the Chebychev polynomial). The amount of data needed to make 
a statistically significant observation of the tt signal depends on the amount of background. For low 
luminosities, for example for 25 pb~\ the sampling fluctuations are too large and there is no typical 
plot like the one of Fig. 3. To quantify the relation between signal significance, luminosity and the 
amount of background, 10000 pseudo-experiments based on the full simulation distribution of the three- 
jet mass as a function of the integrated luminosity, have been modelled and fitted. The expected statistical 
significance is shown in Fig. 5 (b), where the yellow band is obtained by assuming the nominal level of 
QCD W-boson+jets background, while the red one refers to the case when this background is multiplied 
by two. . 




Figure 5: (a): Fit to the top signal. The Chebychev polynomial fit to the background is indicated by the 
dotted line and the Gaussian fit of the signal events is indicated by the full line, (b): Distribution of the 
expected statistical significance of the top signal in the peak as a function of the integrated luminosity 
for two background scenarios. The yellow band is obtained by assuming the nominal level of QCD 
W-boson+jets background, while the red one by assuming that this background is doubled. 



To go from a fitted number of properly reconstructed hadronic top quarks to a cross-section, one 
needs to correct for the event selection efficiency and the hadronic top reconstruction efficiency. The 
statistical error is estimated from having simulated 100000 pseudo-experiments, applying the fluctuations 
which are expected in 100 pb _1 to both signal and background and fitting the peak in both the electron 
and muon channels. 

One of the biggest uncertainties is the correct modelling of the jet multiplicity distribution as it affects 
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the fraction of tt signal events that are correctly reconstructed using the algorithm described earlier. An 
overview of the systematic uncertainties is given in [9]. 

2.2.2 Counting method 

The tt cross-section can be obtained by performing a counting experiment: 

_ Nsig _ Nobs - Nbkg 

./Vbkg, the number of background events estimated from Monte Carlo simulations and/or data samples, 
is subtracted from N 0 b S , the number of observed events meeting the selection criteria of a top-event 
signature. This difference is divided by the integrated luminosity Jz? and the total efficiency e. The 
latter includes the geometrical acceptance, the trigger efficiency and the event selection efficiency, and is 
slightly dependent on m t . The advantage of using event counts in the commissioning phase is that, early 
on, the Monte Carlo simulations may not predict the shapes of distributions very well. 

In order to perform the counting experiment the Monte Carlo samples were divided into two, statisti- 
cally independent: one which represents real data, used to obtain N a b s and the other one used as a Monte 
Carlo to obtain both e and Nbkg- 

2.2.3 Systematic uncertainties 

The main sources of systematic uncertainties are described in [9]. Some relevant points for the analyses 
presented here are discussed for the case of the default selection plus the W-boson mass constraint. The 
systematic uncertainty on the cross-section due to the luminosity determination, is factorised and men- 
tioned as a separate uncertainty on the overall results. The event selection efficiencies have a nearly linear 
dependency for jet energy scale variations, which affects the counting method directly. The hadronic top 
reconstruction efficiency has an inverse dependence on the jet energy scale, caused by the algorithm that 
picks the three -jet combination that are considered to be the hadronic top. If the jet energy scale is low- 
ered, the jet multiplicity (and therefore the number of three-jet combinations) increases. The probability 
that the algorithm picks the right combination generally decreases with the number of combinations to 
choose from. 

The effect of ISR and FSR parameter variations (chosen in such a way to maximise the effect on 
the cross-section measurement) has been evaluated. For the uncertainties related to the PDFs, both the 
uncertainty coming from CTEQ and MRST, have been considered and the largest one (coming from 
CTEQ) has been used for the final systematics evaluation. 

The main systematics uncertainties for the two analysis are listed in Table 5. For the likelihood 
method a 5% change in jet energy scale causes a 2.3% (0.9%) change in the combined reconstruction 
efficiency for the electron (muon) channel. The different jet multiplicity distributions affect not only the 
event selection efficiencies (jet requirements), but also the overall efficiency of the hadronic top recon- 
struction algorithm. A comparison between the two generators MC @ NLO and ALPGEN has been made to 
study these systematic effects. ALPGEN predicts 7% and 4% larger selection efficiencies in the electron 
and muon channels respectively. For the overall efficiencies the values are 10.5% and 4.7% larger in the 
electron and muon channel. However, these numbers were not added in the final result since there is 
overlap with the ISR/FSR systematics. Systematics effects on the shape of the fit as well as the normali- 
sation of the peak fit w.r.t. background is estimated with toy Monte Carlo's. Deviations of 14.0 (10.4)% 
for the electron (muon) channel are found, while the effect of changing the fit-ranges is negligible. 
For the counting analysis the uncertainty arising from the Monte Carlo used to generate the signal pro- 
cess has been taken into account as well. This has been done by comparing the cross-section obtained 
applying the same analysis to tt events generated with MC@NLO and with the ACERMC Monte Carlo. 
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Table 5: Systematic uncertainties on the commissioning likelihood and counting method cross-section 
measurement, in percent. 





Likelihood fit 


Counting method (elec) 


Source 


Electron 


Muon 


Default 


W const. 




(%) 


(%) 


(%) 


(%) 


Statistical 


10.5 


8.0 


2.7 


3.5 


Lepton ID efficiency 


1.0 


1.0 


1.0 


1.0 


Lepton trigger efficiency 


1.0 


1.0 


1.0 


1.0 


50% more W+jets 


1.0 


0.6 


14.7 


9.5 


20% more W+jets 


0.3 


0.3 


5.9 


3.8 


Jet Energy Scale (5%) 


2.3 


0.9 


13.3 


9.7 


PDFs 


2.5 


2.2 


2.3 


2.5 


ISR/FSR 


8.9 


8.9 


10.6 


8.9 


Shape of fit function 


14.0 


10.4 







The W-boson+jets normalization uncertainty has been evaluated using the Z-boson+jets sample as dis- 
cussed in the general introduction of systematic uncertainties, but also varying the level of the expected 
W-boson+jets level by 20%, 50% and even by a factor of two. For the final selection (including the 
W -boson mass constraint) this corresponds to an uncertainty of 4%, 10% and 19% respectively. As 
reference value to calculate the overall systematic error, the 50% case will be used. 

2.2.4 Contributions of new physics 

Many models of physics beyond the Standard Model contain new particles which couple to top-quarks. 
For example, in a supersymmetric estension of the Standard Model (SUSY) these new particles are top 
squarks [10] and in warped extra dimensions they are Kaluza- Klein resonances [11]. Since the new 
particles are expected around the TeV scale, the typical cross-sections are of order a few pico barns and 
hence one can expect a few hundreds new physics events in the first 100 pb -1 of data. In principle, these 
new physics events could represent a significant background to the cross-section measurement. For new 
physics models with much lower masses, e.g. low mass supersymmetry models, there will be many more 
events: this case will be considered later on. Here the existence of a new particle V which decays only 
into tt pairs, V —>■ tt it is assumed. It is further assumed that this particle has a production cross-section 
of 5 pb, so that 25/9 pb is the cross-section for the non-fully hadronic decays of V. As a model for V a 1 
TeV Z' is used. The efficiency for these events with respect to the default selection + the W -boson mass 
constraint is roughly twice the one obtained for Standard Model tt events, and the number of events 
passing the selection will be of the order of 1% or less of the tt events. Hence the new particle V will 
not affect the cross-section determination significantly. 

The predictions at several mSUGRA benchmark points [12] have been studied as well. The results 
are shown in Table 6 and demonstrate that the expected signals are small. At specific parameter points 
however, like SU4 [12], the cross-section is sizeable and the event topology is similar to that from top 
quark pairs which results in additional backgrounds as large as the total Standard Model background. In 
Fig. 6 one can see that the shape of the SU4 supersymmetry signal in the top quark candidate three -jet 
invariant mass distribution is very similar to that from the Standard Model background. 

The separation of tt events and these new physics signals is addressed in more detail in [12], dedi- 
cated to searches for supersymmetry. 
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Table 6: Expected number of events in a 100 pb _1 data sample at different stages of the analysis for 
several event types: after trigger and event selection (left column), after a cut on the di-jet masses in the 
top-quark candidate (middle column) and events with in addition to the di-jet mass cut a hadronic top 
mass cut 141 <m t < 189 GeV(right column). 





Electron analysis 






Vluon analysis 


Event type 


Trigge 


r+S election 




Trigger+Selection 




W const. m t win 




W const. m t win 


SU1 


53 


9 


1 


64 


12 2 


SU2 


10 


2 


0.5 


13 


3 0.7 


SU3 


108 


22 


4 


124 


26 4 


SU4 


1677 


541 


155 


2141 


700 199 


SU6 


29 


5 


0.6 


35 


6 0.6 


SU8 


27 


5 


0.6 


33 


6 0.8 




M [GcV] M | [GeV] 

Figure 6: (a): Expected distribution of the three-jet invariant masses among the top-quark candidates after 
the requirement on the mass of the di-jet system in the electron channel in a 100 pb _1 event sample, (b): 
Same distribution for the muon channel. The light histogram represents the Standard Model background, 
i.e. the background from mSUGRA point SU4 is shown separately. 



2.3 Implementation of b-tagging 

The possibility to identify b-flavoured jets (b-tagging) will improve the signal to background ratio of the 
selection. The b-tagging requirements are described in [9]. The number of "tagged" b-jets in the tt , 
single top and W-boson+jet events which pass the default selection is shown in Fig. 7. 

Tables 3 and 4 list the number of tt and background events in the electron and muon channel which 
survive the default selection plus the W -boson mass constraint, and the request of having one and only 
one, or two and only two b-jets (column six and seven). For all these cases, the corresponding signal to 
background ratios are given. Requiring one or two b-tagged jets improves the purity of the sample by 
more than a factor of four, while the signal efficiency is only reduced by a factor of two. 

In Fig. 8 the reconstructed three-jet mass is shown when one or two b-tagged jets are required for the 
default selection (a) and for the default selection + the W-boson mass constraint (b). To reconstruct the 
top mass, we find the three-jet combination with the highest possible pj, obtained by requiring that one 
and only one of the three jets is a b-jet. The W-boson mass constraint can then be applied to the two jets 
which are not b-tagged (among the three). If the three-jet combination chosen above is such that the two 
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Number of b-tagged jets 



Figure 7: Number of jets tagged as coming from a b-quark in it, single top and W-boson+jet events after 
the default electron selection. 




Figure 8: (a): Reconstructed three-jet mass for it , single top and W -boson + jet events for the default 
electron selection, requiring one or two jets tagged as coming from a b-quark. (b): Same distribution for 
the default selection + the W -boson mass constraint and requiring one or two jets tagged as coming from 
a b-quark. 



The statistical error on the cross-section which is obtained by requiring one or two b-tagged jets is 
4.5%. The systematic error due to the jet energy scale is in this case of 4.9%, while a wrong normalization 
of the W-boson+jets background by a factor of 20%, 50% or even a factor two, brings a systematic error 
on the cross-section of 3.4%, 4.7% and 6.9% respectively. A 5% relative error on the b-tagging efficiency 
is expected from present studies for an efficiency of 50-60% and for a luminosity of 100 pb _1 . The 
resulting uncertainty on the cross-section turns out to be negligible. The undertainty on the mistag rate 
is assumed to be of the order of 50%. 

2.4 Results 

With the first 100 pb _1 of data, we can observe a it signal and determine its production cross-section. 
This will be determined with a number of methods and we expect to reach the following accuracies (for 
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the default selection with the W -boson mass constraint, using electron and muons): 

Likelihood method: Aa/a= (7(stat) ± 15(syst) ±3(pdf) ±5(lumi))% (1) 
Counting method: Aa/a= (3(stat) ± 16(syst) ±3(pdf) ±5(lumi))% (2) 

2.5 Differential cross-sections 

We studied several differential distributions for it production. First, we present the momentum and 
rapidity distribution of the hadronically decaying top quarks, after having applied the default selection 
and the W-boson mass constraint. The results are shown in Fig. 9 (a) and (b). 




pt (top) [GeV] ti (top) 



Figure 9: (a): Momentum distribution of the reconstructed three -jet mass for it , single top and W- 
boson+jet events for the electron analysis. Right plot: Rapidity distribution of the hadronically decaying 
top quark. 

A more detailed study has been performed for the differential cross-section as a function of the ti 
system mass, and for several double differential distributions as shown in the following sections. 

The differential cross-section for ti production can be measured as a function of the invariant mass 
of the ti system in the semi-leptonic channel (with no tau leptons in the final state). Such a measure- 
ment provides an important check of the Standard Model and, at the same time, deviations from the ti 
continuum could indicate the presence of new physics, for example new heavy resonances decaying into 
a ti pair [14]. 

The standard commissioning selection is applied and the momenta of the four jets with highest pj , 
the lepton and the best estimate of the £"™ 1SS vector are used as inputs to a least squares fit with the con- 
straints that, in each event, the masses of both the W -boson and top-quark are consistent with 80.4 GeV 
and 175 GeV respectively. The fit procedure is documented in [15]. The goal is to improve the measure- 
ment of the reconstructed final state particles' four vectors by incorporating the precise knowledge of the 
masses of the W-boson boson and the top-quark. No b-tagging is used in this analysis and hence there 
are 12 possible combinations to assign jets to the (anti-) top. All combinations were investigated and and 
one was chosen: the assignment which returned the smallest weighted sum of squared residuals obtained 
from the kinematic fit. A simpler reconstruction scheme obtains the ti mass by deriving the leptonic 
W -boson momentum from the lepton and missing energy momenta with the W -boson mass constraint 
and then combining it with the momenta of the four highest pj jets. 

The reconstructed mass distributions for the two methods are compared to the true di-top mass distri- 
butions in Fig. 10 (a) for the non-hadronic signal. The true di-top mass distribution is in better agreement 
with the result obtained by making use of the full event fitting technique. 
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In this case, the expected mass resolution ranges from 5% to 9% between 200 and 850 GeV. A 
variable bin size of about twice the expected resolution is used to take such variation into account and 
reduce bin-to-bin migrations. The di-top mass spectrum (dN /dm tt ), reconstructed with the full event fit, 
is shown in Fig. 10 (b) for the signal and the backgrounds studied. Backgrounds include: full hadronic 
top, single top, W-boson+jets, Wbb , Wcc , inclusive Z-boson to leptons. The contribution from the 
di-boson (IVW.WZ and ZZ) backgrounds is negligible. 




Figure 10: (a): Normalised di-top mass distribution for the more complex (dashed line) and the simple 
reconstruction (dotted line). The normalised true di-top mass is also shown for reference (solid line), 
(b): Expected reconstructed di-top mass distribution after all cuts for signal and studied backgrounds, 
normalised to 100 pb~ ! . 



2.5.1 Double differential cross-section as function of pr and y 

The double differential cross-section for it production is sensitive to possible new physics beyond the 
Standard Model, e.g. extra dimensions based on studies of the top quark spin correlation [13], which 
depends on the knowledge of the top quark's momentum. A measurement investigates the decay products 
of the top quark in its rest frame and therefore good knowledge of its pj and y as defined in (3), for a top 
quark of energy E and longitudinal momentum p z , is needed. 

Theoretical predictions can be found in [4]. Here we present a feasibility study which, since the 
neutrino momentum cannot be directly measured, concentrates on the reconstruction of the hadronically 
decaying top quark in semileptonic tt events. Since in this case a high purity is needed, the default 
event selection is tightened by requiring exactly two b-tagged jets. The reconstruction of the hadronic 
top quark proceeds as follows: all possible combinations of two non-b-tagged jets with 60 GeV< mjj < 
100 GeV are selected as W -boson candidates. The nearest b-tagged jet for every W -boson candidate is 
found. The combination with the highest transverse vector sum momentum is then taken as the recon- 
structed hadronic top quark. This results in a purity of well reconstructed top quarks of 45%. The main 
background is due to combinatorics. 

Figure 1 1 shows the reconstructed double-differential distribution of the hadronic top scaled to an 
integrated luminosity of 1 fb -1 . In (a) the truth distribution of the tt signal is presented, while in (b) 
the distribution of reconstructed hadronic top-quarks is shown. In this distribution the contribution of 
background (from single top, W -boson + jet, Wbb and Wcc), which is very small after the requirement 
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Figure 11: (a): Distribution of pj and y of hadronically decaying top quarks calculated by MC@NLO. 
(b): Reconstructed distribution of pj and y of hadronically decaying top quarks from fully simulated 
samples scaled to an integrated luminosity of 1 fb -1 . 

of two b-tagged jets, has been added. After such a selection, the number of expected events limits the 
region of interest to \y\ < 2 and 50 GeV< pj< 280 GeV. 1 fb -1 is a reasonable statistic which allows 
to fill a significant number of bins with an appropriate statistical error. A lower integrated luminosity 
of 100 pb _1 would increase the statistical error of the bin contents at the edge of the interesting area 
from 10% to approximately 30%, which would limit the measurement to a smaller phase-space. The 
systematic uncertainties of this feasibility study are expected to be small and under control. 

The potential to determine the double differential cross-section of it events decaying semileptonically 
has been estimated. The phase space can be determined with an average efficiency of 3.98 ± 0.04% and 
a peak efficiency of 8% near the central rapidity region and for pj < 140 GeV. 

The main systematic uncertainties for this study will come from the jet energy scale and from the 
initial and final state radiation: each source contributing with an uncertainty of the order of ± 15% in the 
central region. 

3 Di-Lepton channel 

In this section the determination of the tt cross-section where both W-bosons decay leptonically is pre- 
sented. This measurement depends crucially on the correct identification of leptons. We proceed by 
investigating the channel with two electrons, one electron and a muon, and two muons in the final state. 
Final states with tau leptons are not studied here. The determination of the cross-sections is performed 
with a simple 'cut and count' method, a template method and a likelihood fit. 

3.1 Event selection 

The di-lepton sample is expected to be triggered with high efficiency using a combination of single-lepton 
and di-lepton triggers [7]. The overall trigger efficiency for the electron-muon channel is (97 ± 1)%, for 
the two electron channel it is (98 ± 1)% and for the two muon channel it is (96 ± 1)%. Due to the trigger 
OR condition between the channels and the high statistics available for efficiency measurements, the 
uncertainty is small. 

The offline selection of di-lepton events is based on the identification of leptons as described in the 
introductory section. Two high-propposite signed leptons, i.e. two electrons, two muons, or one electron 
and one muon, are required. These requirements define the preselection sample and additional selection 
criteria are imposed depending upon the tt cross-section extraction method. The expected number of 
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events produced after the preselection cuts of two isolated opposite charged leptons is shown in Table 7. 
None of the selections makes use of b-tagging. 

Table 7: Simulated Monte Carlo samples and expected events produced for 100 pb _1 of integrated lumi- 
nosity. Cross-sections a are at least Next-to-Leading Order total cross-sections, and <7 e ff are the effective 
Monte Carlo cross-sections including generator level filter efficiencies (reported in the "Filter(%)" col- 
umn). The last three columns show the number of preselected events after requiring two opposite signed 
leptons. 



Sample 


a(pb) 


Filter(%) 


OtfKpb) 


e/J, 


ee 


nn 


tt (di-lepton) 


833 


7(2/) 


55 


699 


312 


381 


it (semi-leptonic) 




48(1/) 


397 


31 


20 


8 


Z e+e- 


2015 


86 


1733 


5 


37418 


0 




2015 


89 


1793 


153 


0 


51139 


Z -► T+T- 


2015 


5 


101 


249 


101 


159 


W -^ev 


20510 


63 


12920 


42 


69 


0 


W -»jiv 


20510 


69 


14150 


152 


0 


40 


WW 


117 


35 


41 


76 


32 


44 


wz 


48 


29 


14 


6 


41 


52 


zz 


15 


19 


3 


1 


25 


31 


single top 


324 


31 


99 


5 


3 


2 



3.1.1 Lepton identification and isolation 

The inclusive di-lepton selection requires at least one of the electrons to be identified with the 'tight- 
electron' algorithm 2 to improve the fake lepton rejection. 

One of the major backgrounds in this channel is given by semi-leptonic tt events where the second 
lepton candidate is faked by a jet. A requirement on the isolation of the electrons lowers this background 
substantially. Fig. 12 (a) shows the distribution of the variable £a#<o.2£t, defined to be the energy 
deposited in a hollow cone with radius AR of 0.2 around the electron candidate. The figure shows this 
variable for reconstructed electrons in the semi-leptonic tt background sample which are close to a 
monte carlo truth electron (AR < 0.1 to a truth electron from the W -boson decay) and for reconstructed 
electrons with AR > 0.1. With the requirement £ar<o.2£t< 6 GeV, the signal is reduced by only ~ 4% 
whereas the semi-leptonic tt background is reduced by a factor of two. 

Two cuts are used to select muons from W -boson decays while simultaneously rejecting muons from 
b-jets. The first cut selects the muon candidate with tracks in the muon spectrometer that match best with 
tracks in the inner detector. Additionally it is required that the muon is not closer than AR < 0.2 to a jet, 
otherwise the muon is removed. 

Fig. 12 (b) shows the distance of muons to the closest jet in the event. Muons that are close (AR < 0.2) 
to a monte carlo truth b-quark are separated from muons that are close to a monte carlo truth muon 
(AR < 0.1). A AR cut of 0.2 removes the muons that do not originate from the W -boson decay. This 
requirement reduces the semi-leptonic tt background by ~ 25%, whereas the signal is reduced by only 
0.8%. 

2 The tight-electron algorithm makes full use of the TRT information as described in Ref. [16]. 
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Figure 12: (a) The "Lar^^Et variable for reconstructed electrons of the semi-leptonic tt background, in 
black are electrons matched to truth electrons, in white otherwise, (b) AR from a muon to the closest jet 
in the semi-leptonic tt sample. In white are muons matched to truth muons, in black are muons that are 
close to a b-quark. 

3.2 Backgrounds 

The backgrounds to the tt di-lepton signal can be classified into two main categories: prompt leptons 
originating from an electroweak decay, or non-isolated non-prompt leptons and jets that are falsely iden- 
tified as leptons mainly originating from QCD jets. 

For the estimation of the electroweak backgrounds we use Monte Carlo samples, as shown in Ta- 
ble 7. For the same-flavour Drell-Yan processes we made an exception, as the size of this contamination 
depends on the determination of the £"™ ss which is difficult to model. As a default we use the shape 
of the £"™ ss distribution for events with a di-lepton mass inside a window around the Z-boson mass, to 
correct the shape of the £"™ 1SS distribution in the Monte Carlo. The same correction factor is used for 
Drell-Yan processes with di-lepton invariant masses outside the Z-boson mass window. 

The backgrounds from fake leptons are estimated from the QCD dijet samples. These estimations of 
fake leptons are subsequently applied to all objects that can lead to fake leptons (e.g. jets) in the inclusive 
single electron or muon samples. 

3.3 Cross-section measurement 

As mentioned, three methods to determine the cross-section for the di-lepton channel are presented. The 
di-lepton channel profits from having smaller backgrounds and systematics and the methods are comple- 
mentary to each other. The robust 'cut and count' method can be replaced with the more sophisticated 
template and likelihood method with increased accumulated data. 

3.3.1 'Cut and count' method 

A 'cut and count' analysis is the most straightforward method to determine the cross-section. It can 
be used as the basis and reference for the more elaborate likelihood methods and act as a cross check 
between the different approaches. 

The selection criteria are defined to maximise the efficiency e and the purity p at the same time. The 
figure of merit is the product 




which is referred to as significance. 
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The variables that best characterise the signal events are the transverse momenta of the two leptons 
and the two jets, as well as the missing transverse energy. The jet momenta are high since they originate 
from the b-quarks, not present in the prominent background processes Z — > or dibosons — ► £ + £~ 
(WW, WZ and ZZ). A large amount of missing transverse energy is expected in signal events due to the 
two escaping neutrinos. In addition a veto on events with a di-lepton invariant mass around the Z-boson 
mass is applied. Fig. 13 (a) shows the di-lepton mass distribution in Z —> events. Most of the events 
are found to have an invariant mass between 85 and 95 GeV. In the case of the Z —> t + t~ events the 
peak is shifted and broadened, since the visible leptons do not come directly from the Z-boson. Also 
the neutrinos from the x decay add to the missing transverse energy. It is therefore expected that this 
background will be dominant, although the branching ratio for both t's decaying leptonically is only 
~ 9%. The optimal selection was found from a multidimensional scan of the significance s. Exactly two 
leptons are required and at least two jets. The requirements on the lepton and jet transverse momenta and 
on the Ej lss is then varied from 20 to 60 GeV. Finally, the cuts with the maximum significance are used 
to evaluate the cut and count performance. 

As a result, the multidimensional scan indicates that the values of the preselection requirement of 20 
GeV for the two leptons and for the two jets with the highest transverse momentum already maximise the 
significance. One additional cut is imposed: the £"™ 1SS is required to be at least 30 GeV for a selection of 
two leptons (all sub-channels together), at least 20 GeV for the e\i decay channel and at least 35 GeV for 
the same flavour lepton channels. The distribution of the missing transverse momentum for the signal 
and the background samples is presented in Fig. 13 (b). The efficiencies, signal over background ratios 
and the significance for an integrated luminosity of 100 pb _1 are shown in Table 8. 

The cross-section is derived from: 

_ Nsig _ AUs-Nbkg 
°~^xe~ ifxe 

To evaluate the statistical uncertainty on a, the error on N 0 b S is taken to be Gaussian assuming it will be 
measured in the data. The error on A^kg is calculated here from Monte Carlo and scaled to the desired 
luminosity. The relative error on the efficiency e (the product of geometrical acceptance and selection 
efficiency) here is also calculated from the Monte Carlo. The expected statistical error on the cross- 
section measurements for different integrated luminosities is given in Table 9. In data the efficiencies 
regarding leptons will be estimated from Z data events using tag and probe. Also the rates of backgrounds 
containing misidentified leptons can only be measured in data reliably. 



942 



Top - Determination of the Top Quark Pair Production Cross-Section 



Table 8: Number of events which survive the optimised selection criteria for signal and background 
samples, scaled to a luminosity of 100 pb _1 . 



dataset 


e\l 


ee 


flfX 


all channels 


tt (di-lepton) 


555 


202 


253 


987 


£ [%] 


6.22 


2.26 


2.83 


11.05 


tt (semi-leptonic) 


24 


11 


4 


39 


Z — > e + e~ 


0.0 


9 


0.0 


20 




5 


0 


51 


79 


Z -► T+T~ 


17 


4 


6 


25 


WW 


6 


2 


2 


10 


zz 


0 


0.2 


0.4 


0.9 


wz 


1 


0.6 


1 


3 




7 


7 


0.0 


14 




25 


0.0 


7 


33 


single top 


0.7 


0.5 


0.0 


1 


single top s-chann. 


0.0 


0.0 


0.0 


0.1 


single top t-chann. 


2 


0.8 


1 


4 


Total bkg. 


86 


36 


73 


228 


S/B 


6.3 


5.6 


3.4 


4.3 



Table 9: Expected statistical error on the cross-section determination for the cut and count analysis for 
different luminosities. 



Luminosity [pb' 1 ] 10 100 1000 

Aa/a e\i 14.1% 4.5% 1.5 % 

ee 23.1% 7.6% 2.6% 

HH 22.5% 7.6% 3.6% 

All channels 11.0% 3.6% 1.5% 



3.3.2 Inclusive template method 

The inclusive template method is based on the observation that the three dominant sources of isolated 
leptons which can be selected in the e\i channel are tt, WW and Z — ► TT. However, these three processes 
can be separated looking at the two-dimensional plane spanned by £"™ 1SS and number of jets, as shown in 
Fig. 14. Table 7 shows that there might be instrumental effects that introduce non-prompt non-isolated 
or falsely identified leptons, primarily in single W -boson and Drell-Yan decays to muons. These non- 
prompt non-isolated or falsely identified leptons can weaken the separation of the templates if their 
contribution is too large. To reduce this effect we add a 'tight-electron' requirement on one of the 
electrons. Signal sensitivity and robustness against systematic uncertainties are improved by adding also 
the ee and jljl channels. In the jljl channel we reject events where the £™ ss is aligned along any of the 
reconstructed muons. To further reduce the Drell-Yan background in the ee and jj, jx channels a Ej lss > 35 
GeV cut and a Z-veto are added in those channels. The estimated number of events remaining after these 
additional background rejection cuts are shown in Table 10. 

By constructing normalised 2D templates for each channel we determine the relative size of the tt, 
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Figure 14: Monte Carlo e/x templates spanning the plane £"^ 1SS and number of jets for tt (a), WW (b) 
and Z — > TT (c). 



Table 10: Estimated number of events remaining after the background rejection cuts used in the inclusive 
template analysis. 



Sample 


e\i 


ee 


MM 


tt (di-lepton) 


516 


213 


178 


tt (semi-leptonic) 


12 


11 


2 


Z -► e+e- 


3 


9 


0 




9 


0 


18 


Z - T+T- 


151 


10 


2 




35 


28 


0 


W ^IXV 


11 


0 


11 


WW 


57 


17 


19 


wz 


5 


3 


2 


zz 


1 


1 


1 


single top 


3 


1 


1 



WW and Z — > TT contribution by performing a binned log likelihood fit to pseudo-experiments. To en- 
sure a fit with a pull distribution compatible with a Gaussian with zero mean and unit sigma we relax the 
template likelihood using nuisance parameters constrained to their expected errors. The nuisance param- 
eters are: common acceptance and common background normalization for each final state. Bias from 
potential new physics is avoided by adding one free parameter with an associated orthogonal background 
template. Orthogonal in this context means that an additional background template does not introduce a 
bias when only SM events are present in the pseudo-experiments. The fit has in total ten free parameters 
including the tt, WW and Z — > TT cross-sections. 

Many different configurations of background templates are scanned and the template with the highest 
probability is selected. For robustness, the fit is first performed in the ejj, channel, and then the fitted 
parameters are fed as start values into the full fit which includes all channels: ejj,, ee and The result 
from the full fit applied to pseudo-experiments including all systematics is shown in Fig. 15. 

Systematic uncertainties enter in two ways: in the acceptance of the two leptons, and in the shapes 
of the 2D templates exemplified in Fig. 14. The impact of the systematic uncertainties is estimated using 
pseudo-experiments where all uncertainties and correlations, as discussed in section 3.4, are included, 
except the luminosity. Note that the acceptance is by definition not affected by the jet energy scale, 
but has a 2% uncertainty from QCD showering. The shape has a 2% uncertainty from QCD initial 
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state showering, and 2% from the jet energy scale. QCD final state showering only comes from tt and 
is found not to have any measurable effects on the fit when all channels are included. The projected 
significance versus integrated luminosity including all systematic uncertainties, except luminosity, is 
shown in Fig. 15. 




Number of Jets Integrated luminosity (pb 1 ) 

Figure 15: (a) Total composition of the inclusive di-lepton selection versus number of jets, (b) Pro- 
jected significance from pseudo-experiments for the inclusive template fit versus integrated luminosity 
including all channels and all systematic uncertainties. 



3.3.3 Likelihood method 

The likelihood method uses the following log-likelihood function to extract the parameters A^g and N\± g 
given the fixed total number of events iV tot and the measurements (*,-): 

Mot 

L = - £ln(G[x^ sig ,Ak g ]) + N tot 

!=1 

with 

G(x)=N sig xS(x)+N hkg xB(x) 

The multidimensional function G(x) is the sum of the functions S(x) which describes the signal 
distribution and of B(x) that describes the background distribution. The functions are determined by 
fitting Chebychev polynomials to the signal and background Monte Carlo distributions after the cut and 
count cuts were applied in the variables ^(^(leptono,^™ 88 )! (A<p between the highest pj lepton and the 
missing transverse energy vector) and | A^> | (j eto , £'^ 11SS ) | (A<p between the highest pj jet and the £"™ 1SS 
vector). Fig. 16 shows one of the distributions and the solid line shows the fits to the distribution that are 
used as Six) and B(x). 

The sum of the semi-leptonic tt ,Z—> l + £~ and WW events are considered as background and added 
up according to their cross-section to produce one single background distribution. 

To estimate the error on the cross-section, ensemble tests were performed for different integrated 
luminosities ranging from 10 pb _1 to 1 fb _1 . The relative statistical errors are presented in Table 11. 

3.4 Systematic uncertainties 

The systematic uncertainties have been evaluated according to the standard prescription [9]. In particular, 
the lepton ID efficiency will be measured from data using Z-boson events and the uncertainty is expected 
to be of the order 1%. The lepton trigger efficiency will be measured from data using Z events with an 
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Figure 16: Distribution of A<p between the highest pj lepton and the E™ ss momentum vector for (a) 
signal, (b) background and (c) signal/background composition according to the ratio obtained from the 
cut analysis. 

Table 1 1 : Expected statistical error evaluated from ensemble tests on the cross-section measurement for 
the likelihood method for different luminosities. For 10 pb _1 the fits in the subchannels do not converge 
well enough to give results. 



Luminosity [pb l ] 


10 


100 


1000 


Ao/o e\i 




9.1 % 


2.7 % 


ee 




16.0 % 


4.6 % 


]X]X 




7.8 % 


2.5 % 


All channels 


18.2 % 


5.2% 


1.7 % 



Table 12: Uncertainties on the cross-section measurement for the cut and count and the likelihood meth- 
ods. 





cut and count method 


likelihood method 




Ao/o [%] 


e\l 


ee 




all 


efl 


ee 




all 


CTEQ6.1 set 


2.4 


2.9 


2.0 


2.4 


0.3 


0.4 


0.2 


0.2 


MRST2001E set 


0.9 


1.1 


0.7 


0.9 


0.2 


0.2 


0.1 


0.2 


JES-5% 


-2.0 


0.0 


-3.1 


-2.1 


-5.4 


1.1 


4.9 


8.3 


JES+5% 


2.4 


4.1 


4.7 


4.6 


7.8 


3.9 


-4.6 


-4.4 


FSR 


2.0 


2.0 


4.0 


2.0 


0.2 


0.4 


0.0 


0.3 


ISR 


1.1 


1.1 


1.2 


1.1 


2.5 


1.8 


0.0 


1.7 


parameters- la 










-3.0 


-0.2 


-2.1 


-1.8 


parameters+la 










3.2 


0.8 


2.0 


2.0 



uncertainty which should also be of the order 1%. The lepton fake rate uncertainty is estimated from 
dijet events, and during the initial phase of data taking we expect this uncertainty to be of the order of 
50%. The systematic effect due to the uncertainty of the jet energy scale is investigated by scaling the 
reconstructed jet energies by ±5%. The value of the E™ 1SS is rescaled accordingly. The initial and final 
state radiation was investigated and the results are summarised in Table 12 as well. 

For the likelihood, the fit parameters of the Chebychev polynomials were varied by 1 a at the same 
time in the same direction. The results can be seen in Table 12. 
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The reweighting technique, to assess the systematic uncertainties from parton densities is also de- 
scribed in Section [9]. To cross-check the technique ATLFAST samples with different PDFs were used. 
For the cut and count method the efficiency using MRST PDFs is larger than the efficiency using CTEQ 
PDFs. Table 12 summarises the changes in the selection efficiency. For the cut and count analysis this 
directly translates into an uncertainty on the cross-section determination. For the likelihood method the 
reweighted events were used to fit the N s - lg and A^kg to the templates which were generated with the 
unmodified fully simulated events. 

3.5 Contribution of new physics 

Many models of physics beyond the Standard Model can have a significant branching ratio to the di- 
lepton final state. In particular the 'cut and count' method is sensitive to these potential contributions. In 
some SUSY scenarios the contribution can be as large as one-third of the total tt signal. This indicates 
how crucial it is to include all final states in order to verify the global consistency of the tt cross-section. 
The template/likelihood methods are able to consider more distinctive tt event properties and can be 
made much more robust against non-Standard Model sources. 

Data-driven methods which consider the full kinematics of the di-lepton tt system can further help 
to disentangle new physics, as described in other notes [12] of this volume. 

3.6 Results 

The final results in percent for the combined di-lepton channels for an integrated luminosity of 100 pb _1 
are summarised here 

Cut and Count method: Ao/o= (4(stat)+^(syst) ±2(pdf) ±5(lumi))% (4) 
Template method: Ao/o = (4(stat) ±4(syst) ±2.(pdf) ±5(lumi))% (5) 
Likelihood method method: Ao/o= (5(stat)tf(syst) ±0.2(pdf) ±5(lumi))% (6) 

4 Discussion and outlook 

In this note we have demonstrated that ATLAS will be able to reliably determine the tt production 
cross-section already from the startup period of LHC. We have determined this cross-section for the tt 
system decaying both into a single electron or muon with associated jets, or two electrons or muons with 
jets. For the single-lepton mode, we have investigated robust selection criteria that do not depend on the 
&-quark tagging. For the di-lepton channel various complementary channels have been investigated. 

With only 100 pb" 1 of accumulated data, we have shown that we can observe the top-quark signal 
and measure its production cross-section. Various methods have been presented and the corresponding 
uncertainties studied. Apart from the luminosity uncertainty, the overall uncertainties are of the order of 
(5-10)% and are dominated by systematics. Consistency between the methods constrain contributions of 
new physics as they affect the various methods differently. 
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Abstract 

At the LHC, the single top quarks will be produced at a third of the rate of 
top quark pairs. With more than two million single top quark events produced 
every year during the low luminosity run, a precise determination of all con- 
tributions to the total single top quark cross section seems achievable. Com- 
parison between the measured cross section and the theoretical prediction will 
provide a crucial test of the Standard Model. These measurements will lead to 
direct measurement of V t b at the few percent level of precision, and constitute a 
powerful probe for new physics, via the search for evidence of anomalous cou- 
plings to the top quark or the measurements of additional boson contributions 
to single top quark production. 

The single top quark production mechanism proceeds through three different 
sub-processes resulting in distinct final states, topologies and backgrounds. 
Given the level of backgrounds affecting the individual selections and the im- 
portance of the systematic uncertainties, the use of sophisticated methods ap- 
pears mandatory to unambigiously observe each process and determine pre- 
cisely the corresponding cross sections. This report presents the methods de- 
veloped to optimize the selection of single top quark events in the three chan- 
nels and establishes the ATLAS potential for the cross section measurements 
for the early data period and for a 30 ftr 1 low luminosity run. 

1 Introduction 

The top quark is one of the key particles in the quest for the origin of particle mass. In particular the 
electroweak interaction of the top quark is sensitive to many types of new physics. The electroweak 
production of top quark leads to a final state of a single top quark plus other particles. The production 
cross section is sensitive to contributions from new particles such as new heavy bosons W' or charged 
Higgs bosons H*. Other processes such as flavor-changing neutral currents also result in a single top 
quark final state [1]. Furthermore, single top quark production is an important background to many 
searches for new physics. 

The DO [2] and CDF [3] collaborations at the Fermilab Tevatron reported evidence for single top 
quark production and a first direct measurement of the CKM matrix element V t b- This involved advanced 
analysis methods to extract the small single top quark signal out of the large backgrounds. The Tevatron 
experiments will collect several fb _1 of data, and expect to not only observe single top quark production 
at the 5 sigma level but also separate two different production modes: the s-channel and the t-channel. 
However, the single top quark production cross section is small at Tevatron energies and single top quark 
measurements will be limited by statistics. 

At the LHC, the number of signal events is not a problem anymore. The LHC will not only be a 
strong interaction top quark factory but will also produce several million single top quark events. The 
cross section for all three modes of single top quark production as well as the CKM matrix element 
Vtb can be measured with high precision [4]. Once the single top quark signal has been established, 
detailed measurements of the process will follow, for example of the top quark polarization, ratios of 
cross sections, and charge asymmetries. Searches in each of the single top quark channels are sensitive 
to new physics even before the Standard Model single top quark signal is found. The DO collaboration 
has published limits on W' boson production and flavor changing neutral currents [5,6]. 
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This paper describes the cross section measurements for the three single top quark production modes 
and is organized as follows: the single top quark phenomenology is introduced in Section 2. All studies 
are based on a common event preselection, presented in Section 3. The prospects for the cross section 
measurements for the t-channel, s-channel and Wt-channel are then presented in Sections 4, 5, and 6. 
Finally, we summarize our conclusions in Section 7. 

2 Phenomenology and strategy for single top quark analyses in ATLAS 

In the Standard Model single-top quark production is due to three different mechanisms: (a) W-boson 
and gluon fusion mode, which includes the t-channel contribution and is referred to as t-channel as a 
whole (b) associated production of a top quark and a W-boson, indicated as Wt-channel, and (c) s- 
channel production coming from the quark anti-quark annihilation. Among those channels, the dominant 
contribution comes from the t-channel processes which account for 2461}^ pb [7, 8]. The Wt-channel 
contribution amounts to 66 ± 2 pb [9] while the s-channel mode is expected to have a cross section of 
11 ± 1 pb [7, 8]. The cross sections are calculated at the next-to-leading Order (NLO) with an input 
top quark mass of 175 GeV. A 4.3 GeV uncertainty on the top quark mass is included in t-channel 
and s-channel cross section uncertainties, while the Wt-channel uncertainty includes strong energy scale 
variations only. 

In the Standard Model, the top quark is assumed to decay almost exclusively into a W-boson and a b 
quark. The W-boson can then decay leptonically or hadronically. In the following, when discussing the 
analysis strategy in the s- and t-channels, only the leptonic decays of the W-boson are considered (lvbb 
and lvbbq final states, respectively) 1 . For the associated production (Wt-channel), only the modes where 
a lepton originating either directly from the W-boson produced together with the top quark or from the 
top quark decay are used. The x decay modes are included in all relevant samples and treated as a signal 
for leptonic x decays, since the signal selection is aimed at electron and muon signatures. 

Top quark pair production constitutes a dominant background to single-top quark events. The LHC 
total production cross section computed at NLO with next-to-leading logarithmic resummations is a(tt) = 
833 ± 100 pb [10] [11], about 3 times larger than the single top quark cross section, and more than 80 
times that of the s-channel. Given the fact that single top quark final state topology is characterized 
by one high p T lepton, missing energy and jets, tt production represents a significant background in 
its lepton+jets decay mode, i.e. when one of the Ws decays leptonically and the other hadronically 
(tt — > lvbjjb), with a final state containing two jets from the hadronization of b quarks and two jets from 
light quarks, a high pj lepton and missing energy. The "dilepton" channel (tt — > lvblvb) where a lepton 
is lost in acceptance also constitutes a major background. Finally, top quark pairs with one or both W- 
boson(s) decaying into a x lepton where the x decays into an electron or a muon, may also survive the 
selection. 

W+jets events constitute another major source of background because of a cross section several 
orders of magnitude above the one of the single top quark production. The Leading Order (LO) Alpgen 
[12] generator with the HERWIG [13] parton shower algorithm was used for the generation of W+jets 
and Wbb+jets events in this analysis. The corresponding LO cross-sections have also been used. A 
20% uncertainty on the W+jets and Wbb+jets cross-sections is considered in the following. WW and 
WZ diboson processes were also studied using samples generated with the HERWIG generator. The 
corresponding cross-sections are reported in [10]. 

While QCD dijet and multijet processes do not have features of the signal, they contribute to the 
background due to their overwhelming cross section and small but finite rate of object misidentification. 

^he hadronic decay modes have obvious disadvantages for triggering and the lack of lepton signature increases the back- 
ground significantly. 
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The estimation of this background needs to be done using data-driven methods and has been studied 
using full simulated dijet events generated with PYTHIA [14]. 

3 Single top quark event preselection 

The three production modes of single top quark events are characterized by similar features that motivate 
a common set of preselection criteria. This preselection, which classifies the events according to exclu- 
sive electron and muon selections, is also aimed at reducing the level of three types of processes which 
constitute the main backgrounds to single top quark analyses: the top quark pair production, the W+jets 
and the QCD events. 

3.1 Triggering and event preselection 

The triggers for single top quark events rest upon the use of the inclusive isolated electron and muon 
triggers. Their design and performance for the three levels of trigger are presented extensively else- 
where [15]. Triggering efficiency is 84±1% overall for top pair quark events. Only mariginal differences 
are seen in single top events. Note that for fast simulation samples like W+jets datasets, where no trigger 
information is available, a trigger weight derived from turn on curves established on tt events is applied 
to every event. 

Selected events must have at least one offline high pj isolated lepton in the central region with 
1 7] | < 2.5. The isolation criterion requires that the energy in a cone of AR= 0.2 around the lepton direction 
be less than 6 GeV and is important for the rejection of QCD background in which a jet can fake an 
electron or a muon [10]. Muons and electrons are required to have a transverse momentum greater 
than 30 GeV ensuring that the trigger efficiency is on the plateau and hence less sensitive to trigger 
uncertainties. Finally, the sign of the highest pj lepton gives the flavor of the decaying top quark. 

The event must then pass a second isolated lepton veto cut, applied to any lepton with a pj above 
10 GeV and in the pseudo-rapidity region \r\\ < 2.5 in order to reduce contamination from dilepton 
backgrounds. The rejection of tt events in the dilepton channel is increased by a factor 2.5 as this 
requirement is applied. The impact on the signal efficiency is limited, with a loss of 3.6% in both the s- 
and t-channel and 2.7% in the Wt-channel. 

Events are preselected if at least two jets are reconstructed with a pj above 30 GeV in the pseudo- 
rapidity region \t] \ < 5.0. The tf production being the dominant background to single top quark analyses, 
the use of a jet veto appears mandatory. The event jet multiplicity is thus further required to be lower or 
equal to four, where the extra jets must be reconstructed with p T above 15 GeV. It is important to keep 
this threshold as low as possible, since lowering the veto pj threshold from 30 to 15 GeV results in a 
rejection of tt events increased from 5 to almost 10. Among those events, the 1+jets decay modes are 
the most sensitive to such a requirement with a rejection rate that is doubled compared to the 30 GeV 
threshold. Dilepton events are less affected with a rejection rate going from 5 to 6.6. This requirement 
results in a relative loss of 20 to 25% of the single top quark in the t- and Wt-channel channels and of 
7% in the s-channel. 

Among the highest pj jets, one at least must be b-tagged, have a pj above 30 GeV and be in the 
central pseudo-rapidity region |?]| < 2.5. The b-tagging algorithms are described elsewhere [10]. A cut 
on the b-tag weight corresponding to a 60% efficiency and a mistag rate of light jets of 100 [16] is used. 
The mistag rates being small, we use an additional method to estimate backgrounds which do not contain 
b partons in the final states but are present in the selected sample because of a cross section several orders 
of magnitude above that of the signal. This is the case for the W+jets production. In those events, each 
jet is assigned a tag weight based on the parametrization of the mistag rate as function of the jet pseudo- 
rapidity and transverse momentum. The combination of all jets results in a weight assigned to the event 
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to be considered as a "1 b-tag inclusive", "2 b-tags" etc... event. Known as the "tagging rate function", 
this method, which applies on Monte Carlo events, improves the statistics in the distributions of such 
events, while keeping a global normalization corresponding to the actual mistag rates [17]. 

Finally the transverse missing energy is required to be larger than 20 GeV. This criterion is consistent 
with the leptonic decays of a W-boson while reducing the contamination from QCD events. 



3.2 Preselection efficiency 

Preselection efficiencies as well as event yield for an integrated luminosity of 1 fb -1 are reported in 
Table 1. The single top quark efficiency ranges between 5 and 6% in the electron channel and between 
5.5 and 7.5% in the muon channel respectively. These numbers translate into a total of approximately 
5,000 and 4,200 single top quark events in the muon and electron channel respectively including x decays. 

The production of tt events constitutes the dominant source of background to single top quark analy- 
ses. The tt events in the 1+ jets (1 = e,/i) modes are selected with an efficiency lower than 5%, resulting 
in a total of 22,580 events. The T + l (and XX) modes also contribute significantly with about 6,000 ex- 
pected events. Despite the use of a second lepton veto dilepton events, where a lepton escapes detection, 
also contribute to the preselected sample significantly with about 4,000 events surviving the selection. 
The production of W+jets events is also an important source of background, with about 13,000 events 
in the preselected sample. However these events populate mostly the lower jet multiplicity bin and their 
selection depends crucially on the mistag rates. The selection of Wbb+jets events results in about 1,000 
events. 

QCD events may also contribute to the selected sample of events. However, the requirements of the 
presence of at least one isolated identified lepton and one jet tagged as a b-jet reduce very significantly 
their level as discussed in [11]. Typically QCD background is expected to be smaller than the W+jets 
background. Further rejection may be achieved by applying more restrictive requirements on the lepton 
identification, or on the missing transverse energy and its correlation in 0 with reconstructed objects like 
leptons and jets [18]. The uncertainty on the remaining level of QCD background may be large. The 
QCD background contamination in selected samples should preferably be monitored, for example by the 
reconstruction of the transverse W-boson or top quark reconstructed masses [19]. 



3.3 Cross-section determination and systematic uncertainties 

Specific selections have been defined for the t-, s- and Wt- channels analyses. A general procedure to 
determine the cross-section on the selected samples and assess the errors associated to the measurement 
has been defined and is common to all three single top quark analyses. The following expression is used 
to calculate the cross section, a: 

where N s i g and N tot are the number of signal and all selected events respectively, B is the number of back- 
ground events, a is the signal acceptance and Jz? is the luminosity of the data sample. In the following 
studies, these numbers are estimated using Monte Carlo samples only. Data driven methods are expected 
be used to estimate specific backgrounds in the forthcoming data analyses. 

The propagation of the errors into the cross-section has been done in a consistent way among the 
three channels. They are combined and propagated to the measured cross section using a Monte Carlo 
method, which randomly generates N tot according to a Poisson distribution, and varies randomly B and 
a for every systematic source by an amount chosen around its central value, according to a gaussian 
distribution. This procedure is performed a few thousand times and the RMS of the resulting distribution 
is interpreted as the total uncertainty. 
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Table 1: Preselection efficiency, including trigger, for signal and background events. The 
uncertainties come from Monte Carlo statistics only. The convention 1 = is used. 



Process 


Muon channel 


Electron channel 




e(%) 


N(lft)- 1 ) 


e(%) 


NClfbr 1 ) 


s-channel^ 1 


7.1 ±0.1% 


166±3 


5.8 ±0.% 


136±3 


s-channel^ x 


0.7 ±0.1% 


8±1 


0.5 ±0.1% 


6±1 


t-channel^ 1 


5.9 ±0.2% 


3143±80 


5.2±0.1% 


2787±76 


t-channel^ x 


0.6 ±0.1% 


169±19 


0.3 ±0.05% 


92±14 


Wt-channel^ 1 


6.8±0.1% 


1314±27 


5.6±0.1% 


1091 ±24 


Wt-channel^ x 


1.0±0.1% 


93±7 


0.8 ±0.1% 


77±7 


tt -> 1 + jets 


4.9 ±0.05% 


11846±130 


4.4 ±0.05% 


10734±124 


tt — > x + jets 


0.6 ±0.05% 


757±34 


0.5 ±0.02% 


625±31 


tt->ll 


5.8±0.1% 


2257±58 


4.6±0.1% 


1762±51 


tt -> 1 + X 


7.9 ±0.2% 


3055±66 


6.7 ±0.2% 


2595±61 


tt— ► XX 


1.6 ±0.2% 


158±16 


1.6 ±0.2% 


151±15 


Wbb+jets— > 1 or x 


2.6 ±0.05% 


514±21 


2.1 ±0.05% 


424±19 


W+jets^ 1 or x 


0.014 ±0.001% 


7437±105 


0.011 ±0.002% 


5449±90 


WW 


0.3 ±0.04% 


78±11 


0.3 ±0.04% 


65±10 


wz 


0.6 ±0.04% 


50±3 


0.5 ±0.04% 


39±3 
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The total uncertainty is calculated using a common procedure as described in Section 4.2.3 in the 
three individual analyses. The main sources of experimental systematic errors taken into account are 
the b-tagging efficiency, jet energy scale, luminosity, for which central values are provided in the corre- 
sponding sections. Theoretical uncertainties that have been considered are the errors on the background 
cross-sections, as well as the impact of the uncertainties in the parton distribution functions and the 
b-quark fragmentation in the selection efficiency. Note that we consider the errors as fully correlated 
between signal and background for jet energy scale, b-tagging, and luminosity. 



4 Measurement of the t-channel cross section 

The t-channel is the most promising channel for single top quark observation at the LHC. It has the 
largest theoretical cross section of the three channels and its event features give us reasonable visibility 
of the signal according to the studies done so far [20]. In addition to the cross section measurement, 
the t-channel single top quark is one of very few candidate channels for the V t b and the top quark po- 
larization measurements. It is hoped that these measurements will shed light on our understanding of 
the electroweak symmetry breaking since these quantities have not been studied with high precision in 
past experiments. Compared to tt , the t-channel single top quark analysis suffers from a higher level 
of background due to its lower jet multiplicity, which makes the selection sensitive to QCD and W+jets 
backgrounds whose cross sections are several orders of magnitude higher than that of the signal. Thus, 
while the signal production rate is not statistically limited at the LHC, a good strategy for signal extrac- 
tion is required and a careful estimation of the background rate is necessary. 




Figure 1: Signal and background distributions of the b-tagged jet pj and multiplicity of jets with pj > 
30 GeV after the t-channel selection. 



4.1 Cut based event selection 

After the common preselection, the contribution from the background is still high mainly due to W+jets 
at the lower end of the kinematic distributions (such as in the jet pj spectrum) and tt at the higher end. 
The b-tagged jets coming from tt and the t-channel single top quark tend to have high pi and to be located 
more centrally since they mainly originate from top quark decays. On the other hand, b-tagged jets from 
W-boson events are much softer as they primarily come from mistagged jets originating from extra gluon 
radiation. Furthermore, the recoiling forward quark in the t-channel single top quark produces a high pj 
light jet in the forward direction. This was found to be one of few features useful to reject tt background 
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events because light jets from the tf events are typically initiated from hadronic W-boson decays and are 
therefore more central. A cut on b-tagged jet pj > 50 GeV reduces the W+jets significantly, while a cut 
on the hardest light jet 1 1] | > 2.5 can reject tt . Figure 1 shows the pi distribution of the hardest b-tagged 
jet and the composition of the sample in terms of the jet multiplicity after the additional event selection. 

Table 2 lists the number of preselected events, the number of events selected after the cut on b-tagged 
jet pi and the number after the light jet r\ cut for the signal and the background processes. The signal 
efficiency is 1.81% and the signal to background ratio after all selection is 37%. Note that the single top 
quark x decays are included in the signal count. By far the largest background contribution comes from 
the tt process; the 1+jets top quark pair events contribute the most, although dilepton events have a higher 
survival probability of 1.36% compared to 0.64% of the 1+jets. W+jets is the second largest background 
while the Wbb contribution is relatively small. The s-channel single top quark contribution is almost 
negligible and the diboson background is even smaller and therefore is not included in the table. 

Table 2: Number of events selected after each cut in the t-channel analysis. The last column shows 
the number of remaining events in the cut-based analysis at the integrated luminosity of 1 fb -1 . 
Note that the convention 1 = e, \x is used. 



Process 


Preselected 


b Jet pj 


light jet tj 




e(%) 


NClfb- 1 ) 


e(%) 


NClfb" 1 ) 


e(%) 


NClfb" 1 ) 


t-channel 


7.7% 


6191±112 


5.5% 


4412±95 


1.8% 


1460±56 


jj, channel 


4.1% 


3312±83 


2.9% 


2352±71 


0.9% 


728±40 


e channel 


3.6% 


2879±78 


2.6% 


2060±66 


0.9% 


732±40 


s-channel 


9.0% 


316±5 


7.0% 


245±4 


0.8% 


26±1 


Wt-channel 


8.9% 


2575±37 


6.4% 


1854±32 


0.4% 


122±9 


tt 1+jets 


9.3% 


22580±176 


7.3% 


17775±158 


0.6% 


1556±48 


tt -> l + T/r + r/T+jets 


4.3% 


7342±104 


3.4% 


5776±93 


0.4% 


740±34 


tt 11 


10.4% 


401 8 ±75 


8.1% 


3143±67 


1.3% 


520±28 


W+jets 


0.025% 


12886±138 


0.012% 


6082±95 


0.0017% 


873±36 


Wbb 


4.7% 


939±27 


3.0% 


597±22 


0.4% 


69±8 


S/B 




0.12 




0.12 




0.37 


S/VB(o) 




27.5 




23.4 




23.4 


VsTb/s 




3.9% 




4.5% 




5.0% 



As seen in Table 2, a precision (y/S+B/S) of 5% can be obtained after the final selection. Note, 
however, that the selection cut does not optimize the precision nor the significance of the signal obser- 
vation. From purely statistical arguments, the t-channel cross section can be measured to a few percent 
accuracy with a few fb -1 of data. 

In comparison to the previous study in the ATLAS physics TDR [20], which reported S/B a ratio of 3, 
there is a significant apparent reduction in the event selection performance. The cut-based selection used 
here is similar to that used in the TDR and we would have expected a similar result. Further investigation 
revealed several issues with the TDR analysis. Firstly, Monte Carlo (MC) generators changed drastically 
in recent years. PYTHIA introduced a new parton shower algorithm, which is much more radiative. 
The matrix-element generator for the signal has also changed to AcerMC, which combines the NLO 
diagram contribution to the LO ones as opposed to PYTHIA, which only uses LO. For the W+jets 
background production, Alpgen is used because it is shown that it reproduces better the high pj tails of 
jet distributions at the TeVatron [21], while Herwig was used in the TDR. In addition, the TDR analysis 
did not include some of the crucial background processes, like dilepton tt channels and x decay modes 
which are shown to contribute significantly in the present analysis. Therefore, the result of the TDR can 
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now be seen as too optimistic. 

Note that the presence of pile-up affects the reconstruction of the objects selected in the event. In 
particular, the selection efficiency is directly affected by the presence of extra jets in the events. Specific 
Monte Carlo samples of single top quark and tt events have been produced with or without pile-up events 
superimposed. The use of events with pile-up results in a decrease of about 25% for signal events, and 
34% for tt events. No systematic uncertainties will be associated with this value. The use of data will 
be mandatory for the tuning of the pile-up modeling and the corresponding uncertainty is expected to be 
brought down to a negligible value with respect to the other sources of error. 

4.2 Systematic uncertainties 

The systematic uncertainties have been evaluated following a procedure common to all three analyses. 
This procedure is defined here and the results specific to all analyses are reported in the corresponding 
sections. 

4.2.1 Experimental systematic uncertainties 

Various criteria need to be considered to assess the systematic effects as it depends on a number of 
detector sub-components and theoretical assumptions. A general procedure has been defined to assess 
all systematic errors in a consistent way for the three single top quark analyses. The estimate is shown 
here for the t-channel analysis only. 

The effect of the uncertainties on the measured cross section is summarized in Table 4. While the 
statistical uncertainty is not a constraint for the t-channel cross section analysis even with the early data, 
systematic uncertainties can only be reduced with detailed understanding of the detector performance 
and theoretical uncertainties. The measurement in the t-channel is largely limited by such effects. 

Table 3: Effect of b-tag and jet energy scale systematics. Numbers are 
quoted from relative variation of selected event. 



Process 


b-tag -5% 


b-tag +5% 


JES -5% 


JES +5% 


t-channel 


-4.0% 


+4.8% 


-4.8% 


+5.0% 


s-channel 


-5.2% 


+3.6% 


-6.6% 


+12.7% 


Wt-channel 


-8.9% 


+8.9% 


-2.0% 


+6.9% 


tf 


-6.4% 


+6.8% 


-3.1% 


+5.9% 


Wbb 


-4.0% 


+8.9% 


-12.6% 


+12.7% 


W+jets 


-1.6% 


+2.5% 


-9.9% 


+14.6% 


Total bkgd. 


-4.6% 


+6.1% 


-4.8% 


+8.1% 



The uncertainty in the b-tagging performance and jet energy scale can have crucial effects on the 
measured cross section since we rely on jet kinematics to reduce the background. In particular, the 
large background contribution from tt can cause a large fluctuation on the total event rate with these 
uncertainties, which leads to a large error on the cross section measured. The jet energy scale uncertainty 
affects the pj distribution of jets strongly at the low end of the distribution. Cutting on pj in this region 
can lead to a larger uncertainty on the signal acceptance. The effect of 5% variation of the b-tagging 
performance has been considered together with the corresponding change in the mistag rate of non b- 
tagged jets. The effect of a 5% change of the jet energy scale determination (JES) has also been evaluated 
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on the efficiency. The impact of both effects on the selection is summarized in Table 3. It should be noted 
that the Wbb and W+jets backgrounds are more affected by the jet energy scale due to the fact that most 
of the jets in these sample are in the lovj-pj region, which means that they are more likely to enter or 
leave the acceptance as the scale is changed. 

The uncertainty due to the trigger requirements has been estimated by computing the variation in 
the signal and background rates resulting from a bias of 1 % in the inclusive lepton trigger efficiency. 
This corresponds to the expected level of precision derived from turn on curves determination with the 
early data. Similarly, a 1 % uncertainty in the lepton identification was considered and the corresponding 
impact on the cross section measurement assessed. These two effects are expected to contribute to less 
than 2% of the total uncertainty with lfb x . 

Note that for all those results, the limited statistics of the Monte Carlo samples may result in fluctu- 
ations in the variable distributions used to discriminate signal from background events. The uncertainty 
associated to the low statistics is found to be 6-8% with W+jets being the largest cotrigger requirements. 

4.2.2 Theoretical and Monte Carlo systematic uncertainties 

Since the event selection relies heavily on the jet multiplicity, the number of additional jets from the 
parton shower can affect the selection efficiency. To evaluate the effect of ISR/FSR uncertainty, two sets 
of PYTHIA parameter variations were considered for comparison, which control the free parameters in 
the underlying event, initial- and final-state radiation, showering, and multiple interactions. The varia- 
tions were combined to maximize the variation in the number of selected events to give a conservative 
estimate. The effect on the jet multiplicity is shown in Figure 2. It can be seen that the loose jet (pj 
> 15 GeV) distribution is affected more severely, which affects the selection efficiency due to the jet 
veto cut. The parameter variation set that gave the largest uncertainty leads to an overall uncertainty in 
the t-channel selection efficiency of —11% +7%, and is quoted as a systematic uncertainty. 
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Figure 2: Variation in t-channel multiplicity for the loose (pr > 15 GeV, left) and the tight (jpj > 30 
GeV, right) jets due to ISR/FSR parameter variation. The distributions are shown without applying the 
t-channel selection cuts. The jet veto cut (< 4 loose jets) and the requirement of b-tagged jet were also 
removed for these plots to show the uncertainty over all multiplicities 2 . Black points are the nominal 
entries while the mesh band shows variation from "3/4 jet parameter set" and the filled band shows 
that of "2/3", which are the two sets of parameter variations compared to study different jet multiplicity 
events. 



The uncertainty due to PDF was calculated using a re-weighting method together with error PDF 
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sets provided by the PDF packages. While the event generation was done only for the central value PDF, 
weights were calculated in each event according to: 

, = fa(x h Q;Sf)-fa(x 2 ,Q;Sf) 

' /i(*i,G;So)-/2(* 2 ,G;So) ' 

where, fa and fa are the PDF values for a given hard scattering process (characterized by flavors / 
and momentum fractions x of the initial partons, and by Q, the event energy scale) evaluated for the i th 
error PDF pair Sf. The variation in efficiency was calculated by applying an event selection similar to 
the preselection cuts at the generator level. The variations in efficiency were added using the Hessian 
formalism and the results from CTEQ and MRST PDF error sets [22] [23] were consistent. The effect of 
the PDF uncertainty on the signal was evaluated to be +1.4% -1.1% and it is a minor contribution to the 
final systematics. A larger effect of +6.2% -5.5% was seen on the tt background. The PDF uncertainty 
was estimated for the leading contribution from the tt process only. 

In addition to the above, the t-channel process has a fairly large uncertainty from MC generator pre- 
dictions. Comparing various combinations of ME and PS generators, it was observed that the Pythia and 
Herwig parton shower algorithms give significantly different jet multiplicities. While we assume that this 
difference can be eliminated by tuning the parameters to the observed data 3 , the instability of theoretical 
prediction from matrix element generators is an outstanding issue and a 4.2% variation in signal accep- 
tance was seen by comparing AcerMC+Herwig and MC@NLO+Herwig. We quote this as an estimate 
for systematic uncertainty from the theoretical prediction. In our present analysis, the background is es- 



Table 4: Summary of all uncertainties that affect the measured cross section, shown for the cut- 
based analysis and the BDT analysis. "Data statistics" represents the Poisson error one would 
expect from real data 



Source 


Analysis of 1 fb 1 




Analysis of 10 fb 1 






Variation 


Cut-based 


BDT 


Variation 


Cut-based 


BDT 


Data Statistics 




5.0% 


5.7 % 




1.6% 


1.8% 


MC Statistics 




6.5 % 


7.9% 




2.0% 


2.5% 


Luminosity 


5% 


18.3 % 


8.8% 


3% 


10.9 % 


5.2% 


b-tagging 


5% 


18.1 % 


6.6% 


3% 


10.9% 


3.9% 


JES 


5% 


21.6% 


9.9% 


1% 


4.4% 


2.0% 


Lepton ID 


0.4% 


1.5 % 


0.7% 


0.2% 


0.6% 


0.3% 


Trigger 


1.0% 


1.7% 


1.7% 


1.0% 


3.6% 


1.7% 


Bkg x-section 




22.9% 


8.2% 




6.9% 


2.5% 


ISR/FSR 


+7.2 -10.6% 


9.8% 


9.4% 


+2.2 -3.2% 


2.7% 


2.5% 


PDF 


+1.38 -1.07% 


12.3 % 


3.2% 


+1.38 -1.07% 


12.3 % 


3.2% 


MC Model 


4.2% 


4.2% 


4.2% 


4.2% 


4.2% 


4.2% 


Total 


45% 22% 


22% 10% 



timated from Monte Carlo and its normalization is currently estimated based on theoretical uncertainties. 
When data will be available, the tt and the W+jets backgrounds will be measured from data as well. It 

3 The current tunings of Pythia and Herwig parton shower weres obtained independently based on extrapolation from the 
Tevatron data. [24] 
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is beyond the scope of this paper to discuss methods for the data-driven background estimation and the 
uncertainty from the background is currently estimated based on theoretical uncertainties. 

4.2.3 Summary of uncertainties 

A large part of the final uncertainty is due to the overwhelming amount of tt background. For this 
reason, the event selection optimized for statistical significance or statistical precision does not give the 
smallest total uncertainty when systematics are included. Table 4 reports the uncertainties associated 
the cross section determination. The total uncertainty is dominated by systematic effects related to the 
background normalization. Reducing the background contamination to increase signal to background 
ratio would help to minimize the systematic uncertainty. It is therefore desirable to further optimize the 
selection beyond what can be achieved with simple cuts on the existing variables. 

Note that for an analysis based on 10 fb -1 , the table reports the expected performance assuming a 
better understanding of the experimental aspects of the detection. Assuming a b-tagging efficiency known 
with a precision of 3%, the jet energy scale determined at 1%, a luminosity known at 3% and a better 
understanding of the background to the 3% level and the radiation modeling at 3%, a total systematic 
uncertainty of 10% seems achievable for the BDT analysis. 

4.3 Multivariate event selection 

In the cut-based analysis no variable was found that was effective to reject the tt background. One 
commonly employed separation technique is a Multivariate Analysis (MVA), which effectively factorizes 
the process of cut optimization using a set of rules for separating the signal from the background events. 
In the present analysis, an attempt to eliminate the remaining background contribution from tt makes use 
of the Boosted Decision Tree (BDT) method [25] within the TMVA [26] framework. The sample used is 
selected with the b-tagged jet pj cut selection, without applying the forward jet tj cut. This cut is indeed 
effective to reduce the tt contribution but not very efficient. About 40 object/event level variables were 
studied using a genetic algorithm, which scanned a large number of variable sets. Extensive studies of this 
variable sets revealed that a small subset of the variables can achieve signal to background discrimination 
close to what was achieved using all variables. This set was further refined so that the chosen variables 
are not too sensitive to JES systematics: 

• pt of the leading b-tagged jet and non-b-tagged jet; 

• T7 of the leading non-b-tagged jet and cos 6 of the leading jet; 

• Centralit y( ^±^); 

• Scalar sum of the p T of the two highest energy jets, the transverse missing energy $ T , and the 
lepton p T ; 

• AR between the two jets with the highest transverse mometum 

• AR between the leading jet and the lepton; 

• AR between the leading non-b-tagged jet and the lepton; 

• the lepton, and the W-boson transverse mass; 

• r\ of the jet with largest | r\ | ; 

• the number of jets with pj > 30 GeV. 
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Figure 3 (left) shows the BDT output discriminator constructed from the variables discussed above. 
By cutting on a high value of the discriminator, the tt background can be removed more effectively than 
by cutting on individual input variables. Since the BDT was optimized for the tt separation, as expected 
the output is not effective against W+jets. It can be seen in the right figure that a high level of signal 
purification is achieved using the BDT discriminator. We optimized the cut on the BDT output by min- 




Figure 3: Boosted decision tree output for signal and background after the b-tagged jet pj cut (left) and 
leptonic top quark mass distribution using cut on BDT output at 0.6 (right). 

imizing the final uncertainty on the measured cross section, including the systematic effects. For each 
cut, the systematic uncertainties were calculated assuming that the relative systematic uncertainty stays 
constant for each channel for each source of uncertainty. This is a fairly reasonable assumption consid- 
ering that the relative uncertainty changes slowly with the cuts and the variables were chosen to avoid 
larger systematics. For most cut values, the systematic effects are dominant and the total uncertainty is 
reduced with increasing signal to background ratio. On the other hand, with very tight cuts the statistical 
error becomes larger than systematics. The overall minimum was found to be at 0.6 where the signal to 
background ratio is 1.3 and 542 signal events are left as shown in Table 5. The reconstructed top mass 
distrubution after applying BDT selection is shown in figure 3 (right). The top mass was reconstructed 
from the lepton, $ T and the highest pj b-tagged jet. The W-boson mass was constrained to be 80.4 
GeV to obtain two solutions for neutrino kinematics and the one with the smaller pz was selected. The 
statistical uncertainty is 5.7% while the total uncertainty is 22% as shown in Table 4. 

Table 5: Final event yield after the cut on BDT discriminator at lfb -1 . 



Process 


t-channel 


s-channel Wt-channel 


tt W+jets 


Wbb 


S/B 


N(lfb x ) 


| 542 


3 15 


184 201 


10 


1.31 



4.4 Sensitivity at 1 fb 1 and the measurement of \V tb \ 

Although the estimated systematic uncertainty of the cut-based analysis is rather large (44.7%), it has 
been shown that this can be reduced by rejecting the tt background using boosted decision trees. The 
total uncertainty decreases as the S/B ratio increases and the estimated uncertainty at 1 fb -1 is 

Ao 

— = ±5 .7 % stat ± 22%, y . v = ±23 % . (3) 
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The t-channel cross section is proportional to [/zy tb | 2 , wnere the parameter fi is the weak left-handed 
coupling and ft = 1 in the Standard Model. In the theory predictions, the product [/^ V tb | 2 is always set 
to unity. Thus, if one measures the cross section, and then divides by the theoretical cross section, one 
obtains a measurement of | V tb | 2 , making the Standard Model assumption that f L = 1 [27]. 

The relative uncertainty on V t b is the relative uncertainty on | V th \ 2 divided by two since S \ V tb \ / \ V th \ = 
S\ V th \ 2 /2\ V tb \ 2 . However, there are additional systematic uncertainties in the V t \, measurement due to the 
presence of the theoretical cross section in the denominator. Here, we quote the uncertainty calculated 
in [7], in which a theoretical uncertainty of +3.8 -4.1% is reported including the contributions due 
to the strong scale, PDF and top quark mass uncertainties. We use the average of the positive and the 
negative uncertainties. Therefore, the estimated uncertainty on the measured value of V t b is 

= ±U% stat+sys ±4% theo = ±12%. (4) 

I v tb\ 

4.5 Summary 

The cross section measurement of the single top quark t-channel was studied in this chapter. The charac- 
teristics of the signal and background were investigated in detail and an analysis strategy was developed 
first using simple cuts and then using boosted decision trees. 

While a cut-based event selection can achieve a statistical precision of a few percent at an integrated 
luminosity of 1 fb -1 , the tt background is difficult to reduce. This results in large systematic uncer- 
tainties coming from both experimental and theoretical origins. Uncertainties in the jet energy scale, 
b-tagging and luminosity all affect the measurement considerably. The uncertainty on the background 
cross section is also rather large though we expect that it will be constrained at higher accuracy with 
the data. This is also true for the QCD background, which is not included in our current analysis. The 
results shown are obtained on samples without any pile-up and no systematic uncertainty was associated 
to it. Data-driven background estimation methods should be developed once data is available. Among 
the theoretical issues, the ISR/FSR uncertainty degrades the measurement more significantly than other 
theoretical effects such as the PDF and the Monte Carlo generator model. 

A multivariate background discrimination method is very effective in reducing the background and 
thus reducing the total uncertainty to nearly a half of the cut-based analysis. We conclude that multivari- 
ate analysis tools are highly effective for a t-channel cross section measurement and further studies of 
these techniques will be very beneficial for the improvement of the analysis in the future. However, to 
reach a precision at a few percent level, studies of systematic uncertainties and an excellent understanding 
of the detector response will be necessary. 

5 Measurement of the s-channel cross section 

The measurement of the single top quark s-channel appears the most delicate of the three main single- 
top quark processes. Suffering from a low cross section compared to the main backgrounds, the event 
topology makes this channel very sensitive to the presence of both tt and W+jets events. Because of 
the low jet multiplicity of such events, the analysis is also expected to be sensitive to dijet production, 
despite the tight requirements on the presence of at least two b jets. The s-channel is however one of the 
most interesting because the production of tb final state events is directly sensitive to contributions from 
extra W-bosons or charged Higgs bosons as predicted in two Higgs doublet model (2HDM) [28]. 

The event selection is presented in three steps. A first one makes use of a standard cut-based analy- 
sis, and will serve as a reference with the early data. In a second step, likelihood functions designed to 
improve the discrimination against specific backgrounds are presented together with the sets of discrim- 
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inant variables that enter their definition. Finally, the selection criteria applied on those likelihoods are 
defined so that the total uncertainty affecting the cross section determination is minimized. 

5.1 Sequential cut analysis 

After applying the common preselection described in Section 3, only two b-tagged jet events with pj 
above 30 GeV are selected, and a jet veto is applied on any other jet with a transverse momentum above 
15 GeV. This strong requirement is used to reject tt events which represent the dominant background 
to our signal at this stage. This set of requirements also reduces the W+jets and QCD multijet contam- 
ination, since those events feature much softer b jets or no b jets at all. The selection also requires the 
opening angle AR between the two jets to be between 0.5 and 4.0, the scalar sum of total jet transverse 
momenta H T (jet) to be above 80 and below 220 GeV and finally the sum of the transverse missing energy 
($ T ) and lepton transverse momentum (p T ) to be in the range between 60 and 130 GeV. 

Selected event yields are reported in Table 6 for the three single top quark processes, and all tt 
and W+jets backgrounds. When adding all contributions, the overall signal efficiency is about 1.1% in 
the electron channel and 1.6% in the muon channel. This corresponds to a total of about 25 selected 
candidates for an integrated luminosity of 1 fb _1 . The signal to background ratio is about 10%. The 
dominant background is composed of the tt events which account for about 60% of the total background 
yield. Among those, the tt in the lepton+jets mode, including t decays, contribute about 40%. The 
remaining backgrounds originate from the Wbb+jets production, which constitutes about 14% of the 
background yield, and almost equally from the single top quark t-channel (11%) and W+jets (9%) events. 
As expected, W+jets events are removed due to the requirement of b-tagged jets, with a final yield 
depending upon the mistag rate. Diboson contributions (WW and WZ) are found to be negligible. 

Table 6: Event yield for signal and background for the cut-based analysis in the 2-jet multi- 
plicity bin for 1 fb _1 . Uncertainties come from Monte Carlo statistics only. The convention 
1 = e,;ii is used. 



Events in lfb 1 


e channel 


jU channel 


e + ji combined 


s-channel 


10.3 ± 0.8 


14.5 ± 1 


24.8 ± 1.3 


t-channel 


17.0 ± 5.7 


13.6 ±5.1 


30.6 ± 7.9 


Wt-channel 


6.5 ± 1.9 


2.4 ± 1.2 


8.9 ± 2.3 


tt 1+jets 


18.8 ±4.3 


20.5 ± 5.3 


39.3 ± 5.3 


tf 


29.0 ± 5.5 


15.4 ±5.0 


44.4 ± 7.4 


tt ->1t 


20.5 ± 5.6 


40.9 ± 6.9 


61.4 ±8.9 


Wbb+jets 


18.9 ±2.0 


19.7 ± 2.0 


40.6 ± 2.5 


W+jets 


14.8 ± 1.4 


11.0 ±2.2 


25.8 ±2.3 


Total Bkg 


125.5± 10.3 


123.5 ± 10.6 


251.0 ± 14.2 


S/B 


8.2% 


11.7% 


9.8% 


s/Vb 


0.9 


1.3 


1.6 


VsTb/s 


1.1 


0.8 


0.7 
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5.2 Likelihood selection 

The cut based analysis shows that a simple approach to select single top quark s-channel events is ham- 
pered by a high level of background. The use of a likelihood discriminator is aimed at improving the per- 
formance of the discrimination against backgrounds in order to purify the selected signal samples. This 
approach assumes that the distributions entering the definition of likelihood functions are well known 
and validated on data itself. This result can be achieved by cross-checking at every step the agreement 
between data and Monte Carlo distributions on selected sub-samples where a high level of tt and W+jets 
background is expected. In the following, only pre-selected events with exactly two jets, both of which 
are b tagged are considered. A jet veto on any other jet is applied. 

5.2.1 Definition of the likelihood functions 

The main background processes to our signal show very distinct features in the final state and topology 
that lead us to define several likelihood functions each devoted to the discrimination of a specific process: 
three likelihood functions are devoted to tt events in the dilepton, the 1 + T and the 1+jets decay modes and 
two likelihood functions have been developed to discriminate against W+jets and t-channel events. Due 
to the limited Monte Carlo statistics, these likelihood discriminators have been defined by combining 
both muon and electron channels. 

The list of variables entering a likelihood function is derived from a procedure of optimization that 
selects only the variables that bring a significant discrimination between signal and the considered back- 
ground. The discriminating power of a given variable is computed using the selection efficiencies for 
both signal and background in the plane (£5, £5). When the discriminating power of a variable is low, the 
variation of the background efficiency £5 follows that of the signal £5. On the contrary, a high discrim- 
inating variable results in a larger decrease of Eb compared to the variation seen in £5. This variation, 
computed for each £5 and integrated over the full range of £5 can thus be seen as an estimator of the 
discriminating power of the variable. In this analysis, the variable is selected if the discriminating power 
is about a few percent. The use of higher thresholds results in a degradation of the performance due to 
the loss of discriminating power of the formed likelihoods. The set of discriminant variables is built from 
16 relevant kinematical variables: 

• the opening angles between the lepton and the jets AR(l,b\ ) , AR(l, b-i) 

• the angles cosA<E>(7,£>i) and cosA<J>(/,& 2 ); 

• the opening angle between the two b tagged jets AR(bi ,b 2 ); 

• the pseudo-rapidity of the b tagged jets Tjbi and ^2; 

• the invariant mass formed by the systems of the two b tagged jets Mi nv (bi, b?) 

• the invariant mass formed by the reconstructed W leptonic boson and the b tagged jets M(W lep ,bi) 
and M(W lep ,b 2 ); 

• the transverse momentum of the reconstructed top quark candidates pj {top\) and pi it opt) 

• the sum of the missing transverse energy and lepton transverse momentum $ T + p T (I); 

• the transverse mass of the leptonic W-boson candidate Mj(W\ ep ); 

• the scalar sum of the jet transverse momenta Hjijets); 

• global event shape variables: sphericity, aplanarity and centrality. 
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likelihood 

Figure 4: Distributions of the five likelihood functions for an integrated luminosity of lfb -1 . a) Like- 
lihood against tt in the 1+jets channel; b) Likelihood against tt in the dilepton channel; c) Likelihood 
against tt in the 1 + T channel; d) Likelihood against W+jets events; e) Likelihood against t-channel 
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5.2.2 Optimization of the likelihood selection 

The likelihood function distributions corresponding to an integrated luminosity of 1 fb" 1 are shown in 
Figure 4. Thresholds have been applied on each of the 5 likelihood values. In the present analysis, the 
thresholds on the likelihood values have been set so that the total uncertainty affecting the cross section 
measurement is minimized. The total uncertainty was calculated as described in Section 4.2.3. The main 
sources of systematic errors taken into account are the b-tagging efficiency, jet energy scale, luminosity 
and uncertainties on the background level, for which central values are provided in Section 5.3. Note that 
we consider the errors as fully correlated between signal and background for jet energy scale, b-tagging, 
and luminosity. 

The thresholds set on the five likelihood outputs resulting from the minimization of the total uncer- 
tainty are listed below: 

■^tt/lepton+jets > 0.34, 
■%dilepto„ > 0.56, 
-^tf/T+lepton > 0.80, 
-^W+jets > 0.32, 
-^-channel > 0.46 



Table 7 reports the number of events expected for all signal and backgrounds for an integrated lu- 
minosity of 1 fb -1 . The overall signal to background ratio is improved significantly compared to the 
sequential cuts analysis with a purity increased from 9.8% to 18.7%. This is an expected outcome of the 
optimization procedure since the main source of errors comes from the uncertainties in the background. 

Table 7: Numbers of expected events for an integrated luminosity of 1 fb _1 expected from 
the likelihood analysis in the two jet final state events. The results are shown separately 
for the electron and muon channels. Statistical uncertainties correspond to the Monte Carlo 
statistics only. The convention 1 = e ;i u is used. 



Events in lfb 1 


e channel 


jU channel 


e + jl combined 


s-channel 


6.3 ± 0.7 


9.1 ±0.8 


15.4 ± 1.0 


t-channel 


negl. 


1.7 ± 1.7 


1.7 ± 1.7 


Wt-channel 


1.8 ± 1.0 


negl. 


1.8 ± 1.0 


tt — »• 1+jets 


7.6 ±2.6 


7.7 ±3.5 


15.3 ± 4.4 


tt->ll 


6.0 ±2.6 


6.0 ±2.6 


12.0 ±3.8 


tt->l+T 


6.8 ± 3.4 


14.5 ±4.1 


21.3 ±5.3 


Wbb+jets 


10.0 ± 3.2 


7.0 ±2.6 


17.0 ±4.1 


W+jets 


6.2 ± 1.2 


7.3 ±2.1 


13.5 ± 2.4 


wz+ww 


negl. 


negl. 


negl. 


Total Bkg 


36.8 ± 5.8 


45.9 ± 6.6 


82.7 ± 8.6 


S/B 


17.3% 


19.8% 


18.7% 


s/Vb 


1.0 


1.3 


1.7 


VS + B/S 


1.0 


0.8 


0.6 
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The dominant background comes from tt production, which contributes about 60% of the total event 
yield. Among those events, the main contribution originates from the 1 + T decay modes (l=e, jj,), which 
corresponds to 45% of the tt yield, followed by the dilepton and the 1+jets channels. The production of 
Wbb+jets represents about 20%, while the W+jets events still contribute 16% of the total background 
yield. Finally, the contamination originating from the other single top quark channels is smaller than the 
signal expected event yield with a contribution of 4% from the sum of t- and Wt- channels. The results 
show a similar statistical sensitivity compared to the standard cut-based analysis, with an improved signal 
to background ratio. 

As described in Section 4. 1 the presence of pile-up affects the reconstruction of the objects selected 
in the event. The use of events with pile-up results in a decrease of about 9% for signal events, and 15% 
for tt events. No systematic uncertainty is associated to this difference, as tt events completely dominate 
the selected sample. Dedicated studies with data will be used to tune all Monte Carlo generators. 

5.3 Systematic uncertainties 

The systematic uncertainties have been evaluated on the likelihood analysis. We follow the procedures 
defined in Section 4.2. 

5.3.1 Experimental systematic uncertainties 

The uncertainties on the b-tagging efficiency and the mistag rates have been estimated by varying the b 
weight cut value corresponding to a change of ±5% in the b-tag efficiency with respect to the reference 
value of 60%. Table 8 shows the impact of such effects on signal and background events. As the b- 
tag efficiency is varied by +5% and -5%, the signal selection efficiency is shifted by 7.1% and -7.7% 
respectively. For background events, the impact of such variations result in both cases in an increase 
of the background level which reaches +15% and +5% respectively because of the change affecting the 
mistag rates. Table 9 reports the impact on the total cross-section determination. The understanding of 
the b-tagging performance completely drives the analysis strategy. 

Table 8: Effect of the main systematic effects, b-tag efficiency and mistag rate variation and of the jet 
energy scale variation on the number of expected events for an integrated luminosity of 1 fb _1 expected 
from the likelihood analysis in the two jet final state events. Numbers in parentheses are the relative 
variations. 



Events in lfb 1 


b-tag -5% 


b-tag +5% 


JES -5% 


JES +5% 


s-channel 


14.2 (-7.7%) 


16.5( +7.1%) 


16.9 (+9.7%) 


15.4 (negl.) 


t and Wt-channel 


6.9 (+97%) 


3.5 (negl.) 


5.1 (+45%) 


5.1 (+45%) 


tt combined 


47.8 (-1.6%) 


58.1 (+19.5%) 


49.5 (+1.8%) 


46.9 (-3.5%) 


Wbb+jets 


15.5 (-8.8%) 


16.4 (-3.5%) 


17.0 (negl.) 


17.0 (negl.) 


W+jets 


17.0 (+26%) 


17.0 (+26%) 


15.4 (+14%) 


15.1 (+11.8%) 


Total bkg 


87.2 (+5.4%) 


94.9 (+14.8%) 


87.7 (+6%) 


84.9 (+2.6%) 



A variation of -5% and +5% of the jet energy scale has been propagated to the jet reconstruction 
and the selection efficiencies were re-assessed. Backgrounds change by about 6% and 3% respectively, 
while the signal acceptance is found to vary by about ±10%. Again, in both cases the background is 
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increased with any change of the jet energy scale. Indeed, as the jet scale factor is decreased, top quark 
pair events in the dilepton and 1+jets modes tend to have a lower multiplicity, resulting in an increased 
contamination. As the scale factor goes up, the low multiplicity events are favored while top quark pair 
production is almost not affected. Table 8 shows the effects of the jet energy scale variation on the signal 
and background event yield. 

The impact of the trigger efficiency and lepton identification uncertainties has been estimated as 
explained in the t-channel Section. This reflects as an uncertainty of 6% on the total cross-section mea- 
surement. Table 9 reports the impact on the total cross-section determination. 

Table 9: Summary of all uncertainties that affect the measured cross section. Data statistics 
is the Poisson error one would expect from real data while MC Statistics is the uncertainty 
on the estimated quantities due to MC statistics. 



Source of 


Analysis for 1 fb 1 


Analysis for 10 fb 1 


uncertainty 


Variation 


Aa/a 


Variation 


Aa/a 


Data Statistics 


64% 


20% 


MC Statistics 




29% 






Luminosity 


5% 


31% 


3% 


18% 


b-tagging 


5% 


44% 


3% 


25% 


JES 


5% 


25% 


1% 


5% 


Lepton ID 


1% 


6% 


1% 


6% 


Bkg x- section 


10.3% 


47% 


3% 


16% 


ISR/FSR 


9% 


52% 


3% 


17% 


PDF 


2% 


16% 


2% 


16% 


b-fragmentation 


3.6% 


19% 


3.6% 


19% 


Total Systematics 


95% 


48% 



5.3.2 Theoretical and Monte Carlo systematic uncertainties 

In the present analysis, uncertainties on the background estimates come from the theoretical uncertainties 
associated with their cross section. The main background contributions are tt (60%), Wbb+jets (20%) 
and W+jets (16%), so 10% uncertainties on the tt cross-section prediction and 20% on the W+jets and 
Wbb+jets events lead to a total of 10.3% uncertainty on the total background. The understanding of the 
background level is thus a crucial point for a precise cross-section determination. Note that despite very 
distinct topologies, the selections of the s-, t- and Wt- channels analyses are not orthogonal. However it 
is believed that correlations can be properly addressed with dedicated study and enough statistics. 

The selection of a 2-jet final state is very sensitive to the presence of extra jets originating from gluon 
radiation. Any uncertainty in the ISR/FSR modeling is thus expected to have a significant impact on the 
selection efficiencies, in particular for the s-channel and the tt events. Specific Monte Carlo samples 
have been generated with ISR/FSR settings leading to the largest cross section variations. Variations of 
5% of the signal selection efficiencies are expected, and are due to the change in the jet multiplicity. 
The uncertainties however reach large values for the tt production, with variations of 17.8% between 
the two extreme cases. Note that these variations have been assessed with non-calibrated jets, although 
there is a high expected correlation between jet energy scale and ISR/FSR gluon modeling. An overall 
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9% uncertainty corresponding to half the variation is quoted. Note that the s-channel being produced via 
quark-antiquark annihilation, some constraints coming from the use of W-boson events should allow the 
tuning of the showering interfaces. 

Another uncertainty is related to the choice of the PDFs. The procedures used to assess the impact of 
such choice is presented in Section 4.2.2. The expected bias on the signal is below 3% while this number 
is below 2% for tt events. 

Finally the effect of the b-fragmentation parametrization has also been investigated using fast simu- 
lation. The b quark fragmentation is performed according to the Peterson parametrization, with one free 
parameter e h . Varying the default value from e b = -0.006 by ±0.0025 [29] and taking the difference as 
a systematic uncertainty leads to a change of 3.6% in the tt and signal selection efficiencies. 

Table 9 lists all sources of uncertainties and reports their impact on the cross-section determination. 
Two cases are considered: one defined by the level of uncertainty in the b-tagging, JES, luminosity 
that will presumably characterize the early data taking period and a second one assuming a reasonable 
improvement on those effects with an integrated luminosity of 10 fb _1 . The assumptions made in the 
latter case are the same as the ones listed in Section 4.2.3. 

5.4 Summary 

The determination of the s-channel cross section constitutes a challenging measurement, due to the pres- 
ence of large backgrounds from the tt production and from the W+jets channels. With an expected 
signal to background ratio of 18% the measurement will be hampered not only by a significant statistical 
uncertainty but also by the systematic effects affecting both signal and background. The measurement 
with an integrated luminosity of 1 fb~ 1 is thus both statistically and systematically limited, with about 
60% of statistical and 90% of systematic uncertainties. 

Given the present limitation on the background knowledge, early measurements will have to be 
devoted to the understanding of the background, in terms of shape and absolute normalization. For 
this purpose, specific studies performed on enriched background samples can be used. In this area, the 
knowledge of the effects of the ISR/FSR gluon radiation will need dedicated studies, in particular in 
tt events. The constraint from data itself will thus be very important, and a tuning a la CDF [30] will 
be crucial for the understanding of these radiations. From the detector side, a reliable cross-section 
measurement requires a good knowledge of the b-tagging tools performance, since double-tag events are 
considered. A good determination of the jet energy scale is also mandatory for the selection, at the level 
of better than 5%. PDF and b-fragmentation effects are expected to have a significant impact only at 
higher luminosity. 

In this context, the use of sophisticated statistical methods appears mandatory to discriminate the 
signal from the background and to establish convincing evidence for a signal. Their use however requires 
an a priori good understanding of the background normalization and shapes. With an improved situation 
for the b-tagging and jet energy scale, with a background normalization determined from the data and a 
better ISR/FSR knowledge, evidence at 3 a should be achievable with 30 fb -1 . 

6 Measurement of the Wt-channel cross section 

The Wt-channel is characterized by the associated production of a top quark and a W-boson. At the 
LHC, this single top quark process is the second highest cross-section after t-channel production, with 
an expected cross section of 66 pb. The final state features two W-bosons and a b jet, making this channel 
experimentally very close to top quark pair production, from which it differs by the absence of a second 
b jet in the final state. With a cross section 15 times larger than the Wt-channel processes, top quark pair 
events will thus constitute the dominant source of backgrounds and drive the definition of the selection 
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criteria in higher jet multiplicity events. On the other hand, low jet multiplicity events will suffer from 
W+jets contamination. In this report, we consider only the case where one of the two bosons decays into 
leptons while the other decays into jets. Two approaches have been used to estimate the sensitivity to the 
Wt-channel cross section measurement: a sequential cut-based analysis, which will provide reference 
numbers, and an analysis based on the use of Boosted Decision Trees. 

6.1 Sequential cuts analysis 

The common preselection defined for all three single top quark processes has been extended further to 
account for the specific topology of Wt-channel events. Selected events must have exactly one high pj 
b-tagged jet above 50 GeV. A veto on any other b tagged jet above 35 GeV is applied in order to reject 
tt events. This b-tag veto utilizes a looser b-tag weight cut which has been optimized according to the 
signal over tt background ratio: a ratio of 18% is reached in 2- and 3- jet final states events while 14% 
is found in 4-jet events. The corresponding efficiency is about 30% on signal events and 10% on tt 
events. For events containing more than three high pj jets, the selection requires that the invariant mass 
of the two highest pi non b-tagged jets to be between 50 and 125 GeV. The number of selected events is 
reported in Table 10 for signal and backgrounds. 

Table 10: Number of expected events for an integrated luminosity of 1 fb _1 as function 
of the jet multiplicity and for the electron and muon channel combined in the sequential 
cut-based analysis. Errors shown are statistical only. The convention 1 = is used. 



Events in lfb 1 


2 jets (lblj) 


3 jets (lb2j) 


4 jets (lb3j) 


Wt-channel 


435 ± 16 


164± 10 


40±5 


t-channel 


1218 ±47 


94 ± 13 


58 ± 11 


s-channel 


42 ±2 


5 ±0.6 


0.6 ± 0.2 


tt -> 1 + jets 


1260 ± 38 


664 ± 27 


240 ± 16 


tt — > dilepton 


291 ± 18 


50 ±7 


17 ±4 


tt — ► 1 + T 


428 ± 22 


55 ±8 


17 ±5 


W+jets 


2983 ±71 


207 ± 19 


38 ±6 


Wbb+jets 


137 ± 33 


13 ±3 


6±2 


TOTAL bkg 


6359 ± 232 


1088 ± 74 


377 ± 42 


S/B 


6.8% 


15.0% 


10.6% 


s/Vb 


5.4 


5.0 


2.1 


Vs+b/s 


0.19 


0.21 


0.51 



In two jet events (labeled as 'lblj'), the signal yield is about 430 signal events for an integrated lumi- 
nosity of 1 fb -1 , with a signal to background ratio of 6.8%, combining both electron and muon channels. 
The W+jets production is the dominant background with more than 45% of the total background yield. 
The tt contamination in the 1+jets channel constitutes about 20% of the background, while the other tf 
modes combined contribute 11%. In this low jet multiplicity bin, the single top quark t-channel contam- 
ination is significant and represents 19% of the total. 

In the three jet final state events (labeled as 'lb2j'), 160 signal events are expected in 1 fb -1 , with 
a signal to background ratio close to 15%. In this final state, the use of the hadronic W boson mass 
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constraint helps improve the rejection of W+jets and the other single top quark events. The background 
is made up of tt events which represent 70% of the total background yield, and of about 20% of W+jets 
events. Among the tt events, the 1+jets mode is dominant and constitutes about 85% of the total. 

The four jet final state events (labeled as 'lb3j') only bring a marginal improvement to the analysis. 
The number of expected Wt-channel single top quark events is small and expected to be 40 for 1 fb _1 
with a signal to background ratio of 10.6%. In this high jet multiplicity bin, the main background comes 
from the tt events in the 1+jets mode, which constitute 64% of the background. The rest originate mostly 
from the single top quark t-channel and from W+jets events, each being of the same order as the signal. 

6.2 Boosted decision tree analysis 

As in the s-channel analysis, several multivariate discriminators have been defined to optimize the dis- 
crimination against backgrounds. Two Boosted Decision Tree (BDT) functions are defined to separate 
signal and tt events, one devoted to the discrimination from the dominant 1+jets channel ^tt/i+jets> and 
the other against the dilepton channels ^tt/diiepton including t's. A function ^w+jets has been formed to 
separate signal from the W+jets sample, defined as the total contribution from light and heavy flavor jets. 
Another BDT discriminator is devoted to the separation of the signal and the single top quark t-channel 
events ^-channel- The analysis being based upon events with jet multiplicity between two and four, spe- 
cific BDTs have been defined in each jet multiplicity bin for each background. The electron and muon 
channels are being treated in a combined way, thus leading to the definition of 3 (jet multiplicity) x 4 
(background) ie: 12 BDT discriminators in total. 

6.2.1 Definition of the discriminant variables 

The set of discriminant variables is derived from the same procedure of optimization as the one explained 
for the s-channel analysis. The final set of discriminant variables is built from 25 relevant kinematical 
variables: 

• the opening angles between the lepton and the jets AR(l,b), AR(l,ji), A/?(/,j2) where j\, 72 and 
j'3 are the non b-tagged pj ordered jets 

• the opening angle between the jets AR(b,ji), AR(b, 72), AR(j\ , 72) ; 

• the angles cosA<I>(ji, j'2) and the pseudo-rapidity of the non b-tagged jets T/j2 and Tjj3; 

• the invariant mass formed by the sum of all jets M inv (jets) and by the reconstructed W-boson and 
top quark candidates M(W + 1); 

• the mass of the hadronic W-boson candidate M( W had ) ; 

• the sum of the missing transverse energy and lepton transverse momentum $ T + pr(l); 

• the transverse mass of the leptonic W-boson candidate Mj(W\ ev ) 

• the transverse mass of the systems formed by the b jet and the reconstructed W-bosons M(b, W had ) 
andM(6,W lep ); 

• the scalar sum of the jet transverse momenta H T (jets) and of all objects in the events // T (tot) ; 

• the transverse jet momenta pi (b), p T (ji), Pj(j2)', 

• the longitudinal momentum of the neutrino solution computed with the top quark mass; 
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• the global event shape variables, sphericity, aplanarity and centrality. 

The BDT output distributions associated to each of the four backgrounds are represented in Figure 5 for 
the 3 jet final state analysis for the electron+muon channels for 1 fb _1 . 





Figure 5: Distributions for the four BDT defined in the 3 jet (' lb2j') final state analysis, a) BDT against 
tt in the '1+jets' channel; b) BDT against W+jets events; c) BDT against tt in the dilepton lepton (+tau) 
channel; d) BDT against single top quark t-channel events. 



6.2.2 Results with the Boosted Decision Trees 

Several selections can be designed, allowing for different levels of signal purity. In this analysis as in 
the s-channel, the cuts on the discriminant BDTs have been set so that the total uncertainty affecting the 
cross section measurement is minimized. 
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The thresholds set on the three BDT outputs are: 

^ttij > 0.6 
%y x > -0.36 
^Wjet > 0.30 

^t-chan>0.18 

The number of events for signal and the different backgrounds is reported in Table 1 1 for an integrated 
luminosity of 1 fb _1 . In two jet events ('lblj'), the signal yield is about 60 with a signal to background 
ratio of 35%, combining electron and muon analyses. This result represents an improvement by a factor 6 
compared to the cut-based analysis. This feature is important since one of the main sources of systematic 
uncertainty originate from the imperfect knowledge of the background levels. Regarding the composition 
of the background, the W+jets production constitutes the dominant background and contributes 58% 
of the background yield, tt production in the 1+jets mode represents about 40% of the total. The 
only remaining single top quark events originate from the t-channel which accounts for 6% of the total 
background yield. The statistical significance for this final state analysis alone is 4.5a. 

Table 11: Number of expected events for an integrated luminosity of 1 fb -1 as function of 
the jet multiplicity after the BDT analysis. The convention 1 = e,;U is used. 



Events in lfb 1 


2 jet (lblj) 


3jet(lb2j) 


4jet(lb3j) 


Wt-channel 


58.0 ±5.8 


20.9 ± 3.5 


6.6 ± 2.0 


t-channel 


10.2 ± 4.2 


negl. 


1.7 ± 1.7 


s-channel 


1.4 ±0.3 


negl. 


negl. 


tt — ► all jet 


negl. 


negl. 


negl. 


tt -> / + jet 


56.3 ± 8.2 


41.8 ±6.3 


13.7 ±3.4 


tt —> dilepton 


1.7 ± 1.2 


negl. 


negl. 


tt->l + T 


negl. 


negl. 


negl. 


W+jets 


92.1 ±8 


3.2 ± 1.4 


0.2 ±0.1 


Wbb+jets 


3.9 ± 3.9 


negl. 


negl. 


Total bkg 


165.6 ±9.2 


45.1 ±6.3 


15.6 ±3.4 


S/B 


35.0% 


46% 


36.2% 


s/Vb 


4.5 


3.1 


1.7 


VS + B/S 


0.25 


0.39 


0.71 



In three jet final state events ('lb2j'), about 20 signal events are expected with 1 fb -1 , with a signal 
to background ratio of 46%. Again, one notices a gain of more than a factor 3 compared to the cut-based 
analysis. The background events are made up almost exclusively (95%) of tt events in the lepton+jets 
events. The use of the jet-jet invariant mass reduces the W+jets background to a few percent. The 
selection in this final state alone provides a signal statistical significance of 3.1 o. 

In the four jet final state channel (' lb3j '), the number of expected Wt-channel single top quark events 
is 7 with 1 fb -1 . In this bin, the signal to background ratio is about 36%, which corresponds to a gain of a 
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factor 4 compared to the sequential cut analysis. In this jet multiplicity bin, the only competing processes 
is the tt events in the 1+jets. The signal significance remains small, around 1.7a with a corresponding 
statistical precision of 72%. 

The effect of pile-up has been investigated with signal and tt samples produced with a pile-up 
corresponding to a 10 32 cm 2 s~ 1 luminosity run. The results show decreases of 18%, 47% and 26% of the 
signal event yield in the 2, 3 and 4 jet final states. For backgrounds, similar variations of 16%, 39% and 
20% in the 2, 3 and 4 jet final states are seen. As expected the increase of the number of light jets seen 
in the events directly impacts the tight selection of 2- and 3- jet final states. No systematic uncertainty is 
associated to this effect, as for the two previous analyses, as only dedicated studies using data itself will 
be used to tune all Monte Carlo signal and background generators. 

6.3 Systematic uncertainties 

The systematic uncertainties have been evaluated on the BDT analysis. We follow the procedures defined 
in Section 4.2. 

6.3.1 Experimental systematic uncertainties 

The b-tagging uncertainties affect the Wt-channel selection because of the requirement of one b-tagged 
jet on one side, and the use of a veto for any second b tagged jet on the other side. The variation by 
5% of the b tagging efficiency and the corresponding mistag rate results in a 7% change of the signal 
selection efficiency in the 2 jet final state. The sensitivity is higher in the higher multiplicity bins, with 
effects of 10% seen in the 3 jet bin. Regarding backgrounds, the impact of this uncertainty increases with 
the jet multiplicity, with tt events being the dominant background. Variations of 3% and 5% are seen 
respectively in 3 jet and 4 jet final states. Table 12 reports the relative changes in signal and background 
events. Table 13 reports the impact on the total cross section determination, all final states combined. 

Table 12: Effect of the b-tag efficiency and mistag rate variation and of the jet energy scale 
variation on the number of expected events for an integrated luminosity of 1 fb _1 expected 
from the BDT analysis. 



Process 


Events in lfb 1 


b-tag ± 5% 


JES ± 5% 


2-jet events 








- signal 


58.0 


± 7.0% 


±0.5% 


- total bkg 


165.6 


± 3.0% 


±3.1% 


3 jet events 








- signal 


20.9 


±10.1% 


±7.0% 


- total bkg 


45.1 


±3.2% 


±3.0% 


4 jet events 








- signal 


6.6 


±3.1% 


±7.9% 


- total bkg 


15.6 


±5.1% 


±4.0% 



The precise knowledge of the jet energy scale is important for the Wt-channel analysis because of 
the requirements made on the mass reconstruction and pj thresholds used to select jets. A 5% variation 
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of the jet energy scale has been propagated to the jet reconstruction and the selection efficiencies are 
re-assessed. Table 12 shows the impact of the jet energy scale variation on signal and background event 
yields. In two jet events, the effects of the scale variation is about 1% on the signal events and 3% on the 
sum of all backgrounds. In three jet events a variation of 7% of the signal event yield is observed, with 
a change of 3% in the backgrounds. In four jet events, variations of 8% and 4% are seen in signal and 
background respectively. Table 13 reports the impact on the total cross section determination. 

An uncertainty of 1 % in the lepton identification or in the trigger efficiency would impact the number 
of selected events. Such an uncertainty would reflect in an uncertainty of 2.6% on the total cross section 
measurement, which is negligible with respect to the others. 

6.3.2 Theoretical and Monte Carlo uncertainties 

Uncertainties on the background estimates come from the theoretical uncertainties associated to the cross 
sections. An uncertainty of 10% is quoted for the tt events while 20% is associated to the W+jets and 
Wbb+jets events. This translates into a total of 12.5% in the 2 jet final state and 10% in the higher jet 
multiplicity bins where tt events completely dominate the background. Note that despite very distinct 
topologies, the selections of the Wt- and the t- channels analyses are not orthogonal. However it is 
believed that correlations can be properly addressed. 

Table 13: Summary of all uncertainties that affect the measured cross section. Data statistics 
is the Poisson error one would expect from real data while MC Statistics is the uncertainty 
on the estimated quantities due to MC statistics. (*) background to 2j (12.5%) and 3j and 4j 
final states (10%) 



Source of 


Analysis for 1 fb 1 


Analysis for 10 fb 1 


uncertainty 


Variation 


Aa/a 


Variation 


Aa/a 


Data Statistics 


20.6% 


6.6% 


MC Statistics 




15.6% 






Luminosity 


5% 


20% 


3% 


7.9% 


b-tagging 


5% 


16% 


3% 


6.6% 


JES 


5% 


11% 


1% 


1.5% 


Lepton ID 


1% 


2.6% 


1% 


2.6% 


Bkg x-section 


12.5/10%(*) 


23.4% 


3% 


9.6% 


ISR/FSR 


9% 


24.0% 


3% 


7.8% 


PDF 


2% 


5.2% 


2% 


5.2% 


b-fragmentation 


3.6% 


9.4% 


3.6% 


9.4% 


Total Systematics 


48% 


19.4% 



The selection of a low jet multiplicity final state is very sensitive to the presence of extra jets origi- 
nating from gluon radiations. Any uncertainty in the ISR/FSR modelling is thus expected to have a sig- 
nificant impact on the selection efficiencies, in particular for the Wt-channel and top quark pair events. 
We quote an overall 9% uncertainty due to the modelling of gluon radiation in the tt events. 

The uncertainties in the PDF may affect the topologies as well as the momentum distributions of 
the final state objects, hence impacting the determination of the selection efficiency for both signal and 
backgrounds. The procedure to estimate the impact of the choice of the PDF to the selection efficiency is 
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detailed in Section 4.2.2. Complete studies have been performed in each final state and a 2% uncertainty 
is quoted for signal and tt events. 

Finally the effect of the b-fragmentation parametrization has also been investigated following a sim- 
ilar procedure of that defined in Section 5.3.2 from fast simulation. The relative uncertainties of the 
different channel and final state selection efficiencies due to the b-fragmentation uncertainty is found to 
be 3.6%. 

Table 13 lists all sources of uncertainties and reports their impact on the cross section determination. 
Two cases are considered: one defined by the level of uncertainty in the b-tagging, jet energy scale and 
luminosity that will presumably characterize the early data taking period; another assuming reasonable 
improvements on those effects with an integrated luminosity of 10 fb _1 . The assumptions made in the 
latter case are listed in Section 4.2.3. 

6.4 Summary 

The determination of the Wt-channel cross section constitutes a challenging measurement with the early 
data, due to the presence of important tt and W+jets backgrounds. This measurement makes use of 
the events with a jet multiplicity between two and four jets. With a signal to background ratio of about 
30-40%, the analysis requires a good knowledge of the W+jets production in the lower multiplicity bins, 
and of the tt process in higher bins. The estimates of the shapes and normalization of those processes 
will have to rely upon the use of data. Strategies exist for QCD and W+jets events, but the discrimination 
against tt events remains a challenge. 

As for the t-channel single top quark analysis, the cross section determination will very early be 
dominated by systematic uncertainties. The dominant effect is constituted by the background uncertain- 
ties, followed by the modeling of the gluon radiation. From the detector side, the dominant source of 
uncertainty is the b-tagging because of the imperfect knowledge of the b-tag and b-tag veto efficiencies 
as well as of the mistag rates. Another source is the determination of the jet energy scale, which affects 
the reconstruction of the W-boson mass and all the jet energies used in the analysis. Note that the de- 
termination of the luminosity to better than 5% is required in order to ensure a good measurement, or 
the use of ratio of different Wt-channel final states [31] can be used as well with higher luminosity. A 
3 o evidence can be reached with a few fb _1 of data taking and a precision of 20% on the cross section 
measurement is achievable with about 10 fb _1 provided that improvements are made in both the experi- 
mental aspects of the detection (backgrounds from data, b-tagging, jet energy scale and luminosity) and 
from the theoretical side. 

7 Conclusion 

At the LHC the production of single top quark events accounts for about a third of the tt production, 
which leads to about 2.5 million events per year during a run at 10 32 cm 2 s _1 . Similarly to the situation 
at the Tevatron, the selection of single top quark events will suffer from the presence of both W+jets 
and tt backgrounds, which are produced at much higher rates. Thus, careful approaches devoted to 
the understanding of these backgrounds in terms of shape and normalization performed directly from 
data will have to be defined. Besides, except for the s-channel, single top quark analyses will be very 
early dominated by the systematic uncertainties, and will require a good control of b-tagging tools and a 
reliable determination of the jet energy scale. 

In a context of low signal over background ratio, the use of sophisticated tools like genetic algorithms, 
likelihoods and Boosted Decision Trees appears very useful if one wants to establish the signal or to 
determine its cross section precisely. These techniques, which are now in common use at the Tevatron, 
will require the use of reliable event samples for modeling signal and backgrounds, that will presumably 
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be produced from the data. The analyses should also be optimized with respect to the total level of 
systematic uncertainty, which will be the main limiting factor for 30 fb _1 measurements. 

Finally, a precise determination of single top quark cross sections can be achieved for a few fb^ 1 
in the t-channel and the Wt-channel , while for the s-channel, higher statistics will be required. Their 
possible interpretation in terms of new physics should thus come at a later stage, once the systematic 
effects are under control. 
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Top Quark Mass Measurements 

Abstract 

This note summarizes studies performed in order to estimate the potential 
of ATLAS to measure the top quark mass from the first few hundred pb _1 
of data. The analyses shown here, based on fully simulated events, have 
been performed for the channel where the top quarks decay into one lepton 
plus a number of jets, using various methods to extract the top quark mass. 
The performance of the detector has been evaluated, including triggering, 
particle identification, and jet reconstruction. For each method, the expected 
background, statistical and systematic uncertainties, and linearity have been 
studied. This work shows that the precision on the top quark mass depends 
mainly on the jet energy scale uncertainty: a precision of the order of 1 
to 3.5 GeV should be achievable with 1 fb _1 , assuming a jet energy scale 
uncertainty of 1 to 5%. 

1 Introduction 

At the LHC, top quarks will be produced mainly in pairs through the hard processes gg — > tt (90%) 
and qq —> tt (10%). The corresponding cross section, at next-to-leading order, is 833 pb [1]; therefore, 
we expect roughly 800 000 tt pairs to be produced with an integrated luminosity of 1 fb _1 , correspond- 
ing to about two weeks at a luminosity equal to 10 33 cm _2 s _1 . In contrast to earlier measurements 
of the top quark mass [2], which concentrated on preserving as much of the signal as possible in or- 
der to minimize the statistical uncertainty, the LHC will produce so many top quark pairs that rather 
stringent requirements can be imposed, to restrict the measurement to regions in which the systematic 
uncertainties can be well-controlled. With 1 fb~ 1 of data, the measurement will already be completely 
dominated by systematic uncertainties of the order of 1 GeV. The dominant contribution to the sys- 
tematic uncertainty is expected to be the jet energy scale. In-situ calibration methods will allow the 
light jet energy scale to be known to the percent level precision after 1 fb _1 of collected data [3]. This 
high precision can be, at least in part, translated to the ft-jet energy scale, although differences are 
expected. Detailed studies with the data will be necessary to reach the desired precision. 

The top quark mass measurements described here are based on finding the peak in the invariant 
mass distribution of the top quark's decay products: a W boson and a ft-quark jet. This closely 
corresponds to the pole mass of the top quark. Because of fragmentation effects, it is believed that 
the top quark mass determination in a hadronic environment is inherently ambiguous by an amount 
proportional to Aqcd [4]: the intrisic amibiguity is of the order of 100 MeV. 

This note describes several ways to measure the top quark mass in the semi-leptonic channel, 
which corresponds to a tt final state where one W boson decays leptonically while the other one 
decays into two jets. The primary results rely on a full reconstruction of the final state, with the mass 
estimator taken as the invariant mass of the three jets from the hadronically-decaying top quark. This 
has been studied for two cases. In the first case, both fr-jets are identified via displaced vertices, and 
two different mass measurement methods are presented. In the second case, fewer b-jets are identified. 
This latter case is expected to be important during early running, before the detector is fully calibrated. 
An analysis which relies on a kinematic fit is also presented. Other tt decay channels can also be used, 
but have not been considered in this note. 

Another possibility relies on the determination of the mean distance of travel of b-flavoured 
hadrons from top quark decays [5]. The top quark mass can be inferred from the averaged lifetime 
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in the laboratory frame of the b-flavoured hadron, since the bottom quark's boost directly impacts the 
lifetime of the b-flavoured hadron. Rather than measuring the lifetime, the transverse decay length of 
the b-flavoured hadron is used in this method. The mean of the transverse decay length distribution is 
determined for different assumed top quark masses, providing an estimator for the experimental data 
in the mean decay length as a function of the top quark mass. Since this analysis relies mainly on 
tracking, the systematic uncertainties are mostly uncorrelated with those of other methods; in partic- 
ular, the influence of the jet energy scale as a source of uncertainty is negligible. Work is ongoing on 
this method, which is not shown here. 

A last possibility is to estimate the top quark mass from the measured tt production cross section. 
The errors on these quantities are related, in the Standard Model, via Ao t j/o t j ~ 5Am t op/«Jtop [6]. This 
would allow a determination of m top independent of the kinematic reconstruction. In addition, it would 
be clear that the top quark mass is the one used in the perturbative calculations (i.e. the pole mass). 
However, even without considering experimental uncertainties, the achievable precision would already 
be limited to 2 GeV due to the uncertainty of the theoretical calculations. The scale dependence, 
which is the dominant source of uncertainty, might be reduced to the order of 6 % by performing the 
computation at higher orders, including the resummation of next-to-leading logarithmic corrections 
(NLL). This scale dependence can also be reduced by using a different mass definition such as the 
running mass 1 . This method is not considered further in this note. 

2 Motivations for a precise top quark mass measurement 

Electro weak precision observables in the Standard Model and the Minimal Supersymmetric Standard 
Model (MSSM) depend on the value of the top quark mass. Therefore, a precise measurement of 
the top quark mass is important for consistency tests of the Standard Model, to constrain the Higgs 
boson mass within the Standard Model, and to increase the sensitivity to physics beyond the Standard 
Model. 

The most important dependency of the electroweak observables on the top quark mass arises via 
the one-loop radiative correction term Ar [9], which is related to the W boson mass through the relation 
m w = 7757/7-3— (1 + Ar) (&w being the weak mixing angle, a the fine structure constant and Gp 

the Fermi coupling constant). The top quark mass is present in Ar as terms proportional to m t 2 op /m|, 
while the Higgs boson mass is present only in terms proportional to log(m#/mz). Therefore, the 
dependence on the Higgs boson mass is much weaker than the dependence on the top quark mass. 

The precision of the indirect prediction of the Higgs boson mass depends mainly on the uncertainty 
on the following quantities: the hadronic contribution to the electromagnetic coupling at the scale mz 
ktthad, sin^, W boson mass and top quark mass. For the current value of the top quark mass (m top 
= 172.6 ± 1.4 GeV) [10], m H = 87±|f GeV, implying m H < 160 GeV at 95% C.L [11]. In order to 
ensure a similar contribution to the indirect prediction of the Higgs mass, the precision on mw and 
m top must satisfy Amw — 0.07Am top . At the LHC, we expect to reach an accuracy of 15 MeV on 
mw [12] and 1 GeV on m top . With these precision measurements, the relative precision on a Higgs 
boson mass of 115 GeV would be of the order of 18% [13]. 

'The dependency of the perturbative calculations convergence on the quark mass definition has been observed in other 
observables in which the running mass has led to better results [7, 8]. This would allow as well a determination of the top 
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3 Top quark mass measurement in the semi-leptonic channel with the 
standard ATLAS Magging 

In this section we investigate the determination of the top quark mass using the standard ATLAS 
^-tagging. 

3.1 Physics background 

The major sources of background, listed in Table 1, are single top events (Wt and t channels) and 
W boson production (W boson with W —> lv decay, W+bb and W+cc). It has been shown that back- 
grounds from Z + jet events with Z — > //, WW, WZ, and ZZ gauge boson pair production have much 
smaller contributions [14]: therefore, their contribution has not been re-evaluated here. Backgrounds 
from QCD multi-jets and bb production have not been investigated with full simulation. Nevertheless, 
studies performed on fast simulation have shown that these backgrounds are negligible after leptonic 
cuts (lepton p T , $ T ) [14]. 

Another source of background comes from tt events themselves (fully leptonic or fully hadronic 
channels). Moreover, tt events in which the W boson decays into TV T are classified in the following 
way: X decaying leptonically belong to signal events, whereas T decaying hadronically belong to 
background (fully hadronic tt). 

Before any selection requirements, the signal to background ratio is of the order of g^ffi , i.e. 

i<r 8 . 

Table 1: Main backgrounds to the semi-leptonic (£ = e,/i) signal listing the number of events in 1 fb _1 
before and after the selection cuts (samples used here have comparable weights). 



Process 


Number 


1 isolated lepton 


>= 4 jets 


2 b-jets 




of events 


p T > 20 GeV 
and # r >20GeV 


p T > 40 GeV 


p T > 40 GeV 


Signal 


313200 


132380 


43370 


15780 


W boson backgrounds 


9.5 x 10 5 


154100 


9450 


200 


all-jets (top pairs) 


466480 


1020 


560 


160 


di-lepton (top pairs) 


52500 


16470 


2050 


720 


single top, t channel 


81500 


24400 


1230 


330 


single top, W t channel 


9590 


8430 


770 


170 


single top, s channel 


720 


640 


11 


5 



3.2 Combinatorial background 

When a tt semi-leptonic decay is reconstructed, one must choose which jets in the event to associate 
with the hadronic W boson decay and also which jets correspond to each of the two b-jets. When at 
least one of these choices is wrong, the event is classified as combinatorial background. 



980 



Top - Top Quark Mass Measurements 



3.3 Jets 

All particles (electrons, muons, jets) forming the it final state are required to lie within |tj| < 2.5, 
since b-jets cannot be tagged and muons are not reconstructed for higher values of \r\ |. See Ref. [15] 
for a description of the features of jet selection that are common to all top quark analyses; only the 
points specific to the analyses in this note are discussed below. 

3.3.1 Jet calibration 

This analysis uses jets defined as in [15]. The jet calibration effects are removed by performing a jet 
calibration to the partem level using the Monte Carlo information. This allows jet calibration to be 
disentangled from other effects on the top quark mass measurement from the other effects (selection, 
reconstruction, measurement methods). This is done by matching jets to partons before radiation 
(requiring A/?(quark, jet) lower than 0.2) and deriving the difference between reconstructed jet energy 
and the parton energy as a function of the parton energy. This is performed separately for Z>-quark jets 
with and without muons and for light jets. This procedure leads to a perfect and unbiased jet energy 
scale. 

Uncertainties on the top quark mass measurement arising from the jet energy scale uncertainties 
will be assesed here by applying miscalibration factors to the calibrated jets. 

Jet scales on data will be obtained in a different way. The light jet energy scale determination is 
explained in detail in a separate note [3]; the b-jet energy scale will be measured using Z+jets samples. 
The Z + jets yield will be low at the start of LHC running; therefore, the b-jet scale will probably be 
derived from the measured light jet scale, together with a Monte Carlo correction term modelling the 
difference between the two jet energy scales. 

3.3.2 Jet labelling 

A jet is called purely electromagnetic if the distance AR to the nearest electromagnetic cluster is lower 
than 0.2 and the ratio between the cluster and jet energies is greater than 0.8. Only 0.15% of jets 
produced by hadronisation pass these cuts. Such objects are ignored for the remainder of the analysis. 

A jet is tagged as a b-jet if its b-weight is larger than 6 (the weight is defined by the three- 
dimensional space Z>-tagging, described in the introduction of this chapter [15]). The ^-tagging effi- 
ciency for this weight in tt events is equal to 62% and the light (i.e. u, d and s flavoured) jet rejection 
to 130 for isolated (A/?(jet,jet) > 0.8) jets with p T greater than 15 GeV (Fig. 1). 

All remaining jets (i.e. non b-tagged jets nor purely electromagnetic jets) are called light jets. 
Therefore, a b-jet which is not tagged by the b-tag algorithm is considered as a light jet. 

3.3.3 Jet multiplicity 

• The light jet multiplicity, for signal events, is illustrated in Fig. 2.(a), for all light jets, and for 
light jets above the pj cut applied (40 GeV). The multiplicity is often higher than two (47% of 
events, for jets above the pj cut), due to the presence of initial and final state radiation. This 
has of course an impact on the combinatorial background size and shape. 

• The ft-jet multiplicity, illustrated in Fig. 2.(b), reflects the ^-tagging efficiency. 
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Figure 1: ^-tagging efficiency as a function of light (i.e. u, d and s flavoured) jet rejection in tt events, 
for isolated (A/?(jet,jet) > 0.8) jets with pj greater than 15 GeV. 



3.4 Event selection 

3.4.1 Trigger 

The following Event Filter trigger [16] selections have been applied: 

• At least one isolated electron with pj greater than 25 GeV ("e22i"). This trigger is satisfied by 
53% of tt e+jets events, and 71% of e+jets with pr(&) greater than 25 GeV pass. 

• At least one isolated muon with pj greater than 20 GeV ("mu20")- This trigger is satisfied by 
59 % of tt n + jets events, and 74% of {X + jets with prfjU) greater than 20 GeV pass. 

3.4.2 Standard cuts 

A sequence of consecutive cuts is applied in order to reduce the contribution from physics background. 

• Exactly one isolated lepton, with pj > 20 (25) GeV for muons (electrons) and |n| < 2.5. This 
cut corresponds to the trigger selection. Moreover, the isolation criteria reject a large fraction 
of the leptonic Z?-decays in the all -jets channel: 99.6% of the tt events with both W bosons 
decaying hadronically are rejected by this first selection step. 

• Missing transverse energy cut: $j > 20 GeV. Together with the lepton requirement, this cut 
reduces QCD background. 

• At least four jets with pj > 40 GeV. Below 40 GeV, jets are known to be less precisely cal- 
ibrated; the jet energy scale will be discussed later on, as a source of systematic uncertainty. 
Therefore, they are removed in order to improve the precision of the top quark mass measure- 
ment [3]. Only 34% of hadronically-decaying W bosons have both jets passing this requirement. 
If it is relaxed so that one jet can have pj down to 20 GeV, 88% of these W bosons pass, but 
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Figure 2: a) Light jet and b) b-jet multiplicity in semi-leptonic tt events before (solid line) and after 
(dashed line) requiring that the jet pj be greater than 40 GeV(plots are normalised to unity). 

the combinatorial and physical backgrounds increase. Moreover, due to initial and final state 
radiation (ISR, FSR), 30% of the signal events have more than two light jets with pj > 40 GeV. 
Therefore, no requirement is made on the number of light jets. 

• Among these jets, exactly two must be ^-tagged. 
3.4.3 Purity definition 

When a it semi-leptonic decay is reconstructed, one must choose which jets in the event to associate 
with the hadronic W boson decay and also which jets correspond to each of the two b-jets. The success 
of an algorithm for making such as choice is quantified as its purity, the fraction of events in which 
this choice is correct, based on looking at the Monte Carlo parentage information. For this purpose, 
jets are matched to the closest Monte Carlo partem with AR < 0.25. Purities are defined for identifying 
the hadronically-decaying W boson (both light jets chosen are within 0.25 from the quark stemming 
from the W boson), the hadronic b-quark, and the hadronically-decaying top quark. Note that the top 
quark purity is not the product of the other two due to correlations. 

3.5 Hadronic W boson mass reconstruction 

Several algorithms have been tried to choose the two light jets from the hadronically-decaying W bo- 
son. Three have been identified that give the best compromise between efficiency and purity: 

• the x 2 minimization method, 

• the geometric method: this method consists in choosing the two closest jets, 

• choosing the two light jets that give the mass closest to the known mass of the W boson [14]. 

The first and last methods are quite similar, but the first one contains in addition an event-by-event 
reseating. Therefore, the last method will not be described here. 
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Figure 3: Invariant mass of light jet pairs for 
events with only two light jets (the contribution 
of physics background is shown in black, the one 
from combinatorial background, in grey). 



Figure 4: Hadronic W boson mass after x 
minimization (the contribution of physics back- 
ground is shown in black, the one from combina- 
torial background, in grey). 



3.5.1 Hadronic W boson mass reconstruction with x 2 minimization method 

Events kept after the selection described above have at least two light jets above the pj threshold (50% 
of the events have more than two). Figure 3 shows the distribution of the invariant mass of the light 
jet pairs, in events with only two light jets. As a first step, we select the hadronic W boson candidates 
in a mass window of (± 30 GeV) around the peak value of this distribution (82 GeV). 

The energy scale of the jets may be shifted, due to effects that include the energy lost out of the 
jet cone and initial and final state radiation effects, not taken into account in the default jet energy 
calibration [17]. Moreover, the effects of the jet pj cut applied during event selection (explained 
in [3], section 3.3) contribute to the jet energy scale shift. To reduce the effect of such shifts on the 
final measured top quark mass, the jets are rescaled by constraining the pair to the known W boson 
mass using a x 2 minimization. The quantity to be minimized is shown in Eq. (1). The first term 
constrains the jet pair mass Mjj to the W boson mass and width from the Particle Data Group [18] 
(M™ G , r^P°). The other two terms are the usual X 2 terms for scaling the jets with multiplicative 
constants a^, j2 ; are the light jet energy resolutions 2 . 

2 _ {M-^a^a^) - M™) 2 (£ji(l - a E]l )) 2 (£ j2 (l - a E]2 )) 2 

x (r™ G ) 2 + of + a 2 ■ w 

This X 2 is minimized, event by event, for each light jet pair. The pair with the smallest x 2 is kept 
as the hadronic W boson candidate. This minimization procedure also gives the corresponding energy 
collection factors ct^, and (Xe s2 , shown in Fig. 5. Given the jet energy resolution, the first term in Eq. 
(1) dominates the x 2 ■ Therefore, the hadronic W boson mass reconstructed with the light jets chosen 

-The light jet energy resolution has been estimated from the gaussian fit of the difference between a light jet energy and 
the corresponding quark energy. The expression thus obtained is the following: rj£ = E * \J (a 2 /E) +b 2 , where a = 0.989 
GeV 1 / 2 and b = 0.075. 
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by this x 2 minimization is very narrow, as shown in Fig. 4. Further on, only the hadronic W boson 
candidates within a mass window of ± 2 Tm w (r^ 1 = 2.1 GeV) are kept; this cut is called CO. 




50 100 150 200 250 300 350 400 

jet energy [GeV] 



Figure 5: Energy correction factors estimated by the % minimization. 



3.5.2 Hadronic W boson mass reconstruction with geometric method 

In this method, the light jet pair with the smallest AR distance between the two jets is taken as the 
hadronic W boson candidate. This method is simple and does not depend on the accuracy of the jet 
energy scale. The resulting W boson mass distribution is shown in Fig. 6. Only hadronic W boson 
candidates within a mass window of ± 2 Gm w (Cm w = 10.4 GeV) around the peak value of the invariant 
mass distribution of all light jet pairs are kept; this cut is called CI. 



3.6 Leptonic W boson mass reconstruction 

The main difficulty in reconstructing the leptonic W boson comes from the kinematics of the neutrino. 
The missing transverse momentum is used as an estimate of the neutrino transverse momentum. 
This is, however, only an approximation, illustrated in Fig.7, as there may be other, softer, neutrinos 
in the event, such as from leptonic Z>-decay. With this hypothesis, p v T is underestimated by more than 
2% (miscalibration of #t and fake missing fyj also contributes to this shift). 

Using the known W boson mass, four-momentum conservation for the W — > £ + V decay gives a 
quadratic equation for the longitudinal component of the neutrino momentum p\ (£ stands for lepton 
and v stands for neutrino): 
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Mjj [GeV] 

Figure 6: Hadronic W boson mass with the geometric method (the contribution of physics background 
is shown in black, the one from combinatorial background, in grey). 




E miss /p MCv 

Figure 7: Ratio of Jfij to the generated neutrino transverse momentum, after CO cut. The neutrino 
considered here is the one from the leptonic W boson decay. 
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= m\ - 2(p[p v x + p\p v y ) + 2E^$ T 2 + (plf - 2(p[p v z ). 

This equation has no solution if the measured lf.j fluctuates such that the neutrino-lepton invariant 
mass is above the W boson mass; this happens in 30% of the remaining events after the CO cut. In 
this case, p v T is reduced until a solution is found, with the restriction that the transverse W boson 
mass remain below 90 GeV (see Fig. 8). Only 11% of the events still have no solution after this 
procedure. Otherwise, the equation has two solutions. The choice among the two pY is performed 
together with the association of the ft-jet to the corresponding W boson: the combination giving the 
smaller difference between the hadronic and leptonic top quark masses is kept. 




Figure 8: Transverse W boson mass (distributions are normalized to unity). On the left, the Jacobian 
peak is clearly seen at the generator level (solid line); events for which no pY solution is found (dashed 
line) can be distinguished from events for which a solution is found (dotted line). On the right, the 
effect of the p v T modification for events for which no pY solution is found, before (dashed line) and 
after (solid line) modification: a fraction of these events are recovered. 



3.7 Top quark reconstruction 

The two methods used for the hadronic W boson reconstruction lead to two methods for the top 
quark mass reconstruction. Moreover, additional cuts are applied in order to increase the purity of the 
selected sample; their relevance is illustrated below. 

Once the hadronic W boson is reconstructed, the next step is to choose from the two b-jets the 
one to associate with the hadronic W boson in order to reconstruct the hadronic top quark. Several 
methods have been investigated: 

• choose the b-jet that maximizes the hadronic top quark pj. 

• Choose the £»-jet closest to the hadronic W boson. 

• Choose the b-jet furthest from the leptonic W boson. 

All three methods give similar results, but the second method has a slightly higher purity, so that 
one has been chosen. The remaining b-jet and the leptonic W boson then define the leptonic top quark. 

The performance of the analyses before any additional cuts is summarized in Tables 2 and 3, in a 
full top quark mass window and within ± 3 <7 m , , where a m = 10 GeV. 



987 



Top - Top Quark Mass Measurements 



Table 2: Efficiency of cuts applied and final purities (full top quark mass window and within ± 3 
<J m , where a mtop = 10 GeV), with respect to tt semi-leptonic sample (e,/i), for both methods. 



|| Cuts applied | Efficiency (%) | W boson purity (%) | b purity (%) | top purity (%) |] 



% l minimization method 










Cut CO 


2.22 ± 0.03 


59.4 ± 0.8 


61.6 ±0.8 


40.2 ± 0.8 


Cut CO 


1.40 ±0.04 


70.4 ± 0.8 


77.3 ±0.8 


60.7 ± 0.8 


within ± 3 o",„ lop 










Cuts CO, C2 and C3 


1.25 ±0.04 


59.3 ± 0.8 


82.2 ± 0.8 


56.5 ±0.8 


Cuts CO, C2 and C3 


0.90 ±0.03 


74.9 ±0.9 


91.1 ±0.9 


73.6± 0.9 


within ± 3 o"„, lup 










Cuts CO, C2, C3, C4 and C5 


0.91 ± 0.05 


80.1 ±0.8 


92.8 ± 0.8 


77.1 ±0.8 


Geometric method 










Cut CI 


1.26 ±0.03 


68.8 ±0.8 


69.7 ± 0.8 


53.8 ±0.9 


Cut CI 


1.01 ±0.04 


77.6 ± 0.8 


77.7 ± 0.8 


65.9 ± 0.8 


within ± 3 o" miop 










Cuts CI, C2 and C3 


0.85 ± 0.03 


68.7 ±0.9 


84.7 ±0.8 


66.1 ±0.9 


Cuts CI, C2 and C3 


0.70 ± 0.03 


79.4 ±0.9 


90.7 ±0.7 


78.1 ±0.9 


within ± 3 (7 mtop 










Cuts C2, C3, C4 and C5 


0.57 ± 0.05 


86.9 ± 0.9 


94.0 ± 0.6 


86.4 ± 0.9 



Table 3: Number of events in 1 fb _1 (signal and background) after selection cuts (full top quark mass 
window and within ± 3 a,„ to , where a mto = 10 GeV), for both methods, x + jets events correspond 
to: hadronic W boson — > TV and X decays hadronically. Leptonic x decays are counted together with 
the signal. 



Number of events for 1 fb 1 


signal 


W + 
jets 


X 

^jets 


di-lepton 


all-jets 


single 
top 


X 2 minimization method 














Cut CO 


6946 


19 


14 


191 


51 


148 


Cut CO 


4382 


12 


3 


62 


20 


71 


within ± 3 O mtop 














Cuts CO, C2 and C3 


3918 


7 


10 


104 


28 


67 


Cuts CO, C2 and C3 


2863 


4 


3 


24 


28 


24 


within ± 3 (7 mtop 














Cuts CO, C2, C3, C4 and C5 


2850 


1 


2 


10 


17 


19 


Geometric method 














Cut CI 


3949 


9 


10 


19 


39 


89 


Cut CI 


3155 


4 


6 


9 


7 


56 


within ± 3 o,„ top 














CutsCl,C2andC3 


2643 


4 


5 


11 


33 


48 


CutsCl,C2andC3 


2198 


4 


3 


5 


0 


27 


within ± 3 a,„ top 














Cuts C2, C3, C4 and C5 


1785 


0 


1 


2 


7 


13 
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3.7.1 Additional cuts 

Additional cuts can be applied in order to increase the final top purity (combinatorial background 
rejection). 

• Cut C2: the invariant mass of the hadronic W boson and the b-jet associated to the leptonic 
W boson must be greater than 200 GeV. 

• Cut C3: the invariant mass of the lepton and the b-jet associated to the leptonic W boson must 
be lower than 160 GeV. 

These cuts are illustrated in Figs. 9 and 10. Their effects on efficiency and purity are shown in 
Tables 2 and 3. 




Figure 9: Invariant mass of the hadronic W bo- 
son and the leptonic b-jet for events satisfying 
CI. The vertical line corresponds to Mjjbi = 
M(W had ,&i ep ) > 200 GeV. 



Figure 10: Invariant mass of the lepton and 
the leptonic b-jet for events satisfying CI and 
C2. The vertical line corresponds to M/w = 
M(l,b lep ) < 160 GeV. 



The combinatorial background can be further suppressed with two more cuts [19]. These are 
defined based on the following variables, where E* denotes the energy of a particle in the top quark 
rest frame: 

Ml -Ml 



Xi — £\v ■ 



E* h =E* x +E* 2 -E* h = 



X 2 = 2E* h -- 

We call the peak and width of the X\ 2 distributions jUi,2 an d <7i,2, as is found from simulated it 
events with m top = 175 GeV that satisfy all previous requirements. Then the two following cuts are 
defined: 

• CutC4: |Xi-/ii| < 1.5 Oi, 

• CutC5: \X 2 -n 2 \ < 2a 2 . 

These cuts are illustrated in Figs. 11 and 12. With respect to C2 and C3, these cuts reduce the 
efficiency by 30% but increase the purity to 85%, as shown in Tables 2 and 3. The numbers are 
identical in the full top quark mass window and within ± 3 CJ miop , since C4 and C5 restrict the top 
quark mass to a more stringent window than ± 3 a„ hop around the peak value. 

Table 4 summarizes the cuts applied in these analyses. 
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ATLAS \ 





Figure 1 1 : Distribution of X\ = — E£ for 
events passing C2 and C3. Vertical lines corre- 
spond to the bounds of the C4 cut. 



Figure 12: Distribution of X 2 = 2E£ for events 
passing C2, C3, and C4. Vertical lines corre- 
spond to the bounds of the C5 cut. 



Table 4: Additional cuts applied, after the event selection, for both methods (X,-, jU,- and a,- are defined 
in the text of this section). 



Cut label 


Description 


Cut CO (x 2 minimization) 


|Mjp -<*>|< 213? 

(Mw c is the reconstructed hadronic W and F^ G = 2. 1 GeV) 


Cut CI (geometric method) 


\M% C - M^ ak \ < 2a Mw (a Mw = 10.4 GeV) 


Cut C2 (both methods) 


M(W had ,b lep ) > 200 GeV 


Cut C3 (both methods) 


M(lepton,ftiep) < 160 GeV 


Cut C4 (both methods) 


|Xi-jUi| < 1.5a! 


Cut C5 (both methods) 


\X 2 -H 2 \ < 2a 2 
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3.8 Top quark mass measurement 

In Fig. 13, the hadronic top quark mass reconstructed with the % 2 minimization method is fit to the 
sum of a Gaussian and a polynomial (third degree). For 1 fb -1 , the fit Gaussian has its mean at 
175.0 ± 0.2 GeV and a width of 1 1.6 ± 0.2 GeV (£ 2 /dof = 137/67). It is seen that C2 and C3 do not 
significantly shift the top quark mass: m top = 174.8 ± 0.3 GeV with a width equal to 11.7 ± 0.4 GeV 
(# 2 /dof = 82/67: C2 and C3 improve significantly this value). 




Figure 13: The hadronic top quark mass reconstructed with the x 2 minimization method, fit with a 
sum of a Gaussian and a third order polynomial, scaled to 1 fb _1 . Left, after CO: m top = 175.0 ± 0.2 
GeV, with a width equal to 1 1.6 ± 0.2 GeV. Right, after C2 and C3: m t0 p = 174.8 ± 0.3 GeV with a 
width equal to 11.7 ± 0.4 GeV 

Figure 14 shows the result of the hadronic top mass reconstruction using the geometric method, 
fit to the sum of a Gaussian and a threshold function 3 (left, # 2 /dof = 97/75) and to a pure Gaussian 
(right, £ 2 /dof = 38/24). After all cuts, the Gaussian mean fits to 175.0 ±0.4 GeV with a width of 
14.3 ±0.3 GeV. The width is larger than with the x 2 minimization method since no attempt is made 
to perform an event-by-event rescaling of the light jets. Nevertheless, the contribution of the light jets 
to the top quark mass resolution can be removed to first order by computing the top quark mass as 
m top = Mjjb — Mjj ±M^ ak . The results of this geometric method with rescaling are shown in Fig. 15. 
The width decreases to 10.6 GeV, consistent with the results from the x 2 minimization method. 

Table 5 summarizes the fit results from all the methods discussed here. 
3 The formula of the threshold function used is the following: 

A ■ e~ + CstBdf ■ (x - thresholdf ■ e -<*-'hreshold) (4) 
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Figure 14: The hadronic top quark mass reconstructed with the geometric method, fit with a sum of a 
Gaussian and a threshold function (left) and with a pure Gaussian (right), scaled to 1 fb _1 . Left, after 
CI, C2, and C3: m top = 174.6 ± 0.5 GeV, with a width equal to 11.1 ± 0.5 GeV; right, after C2, C3, 
C4, and C5: wi top = 175.0 ± 0.4 GeV, with a width equal to 14.3 ± 0.3 GeV. 
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Figure 15: The hadronic top quark mass reconstructed with the geometric method with rescaling, 
fit with a sum of a Gaussian and a threshold function, scaled to lfb 1 . Left, after CI, C2, and C3: 
m top = 175.4 ± 0.4 GeV, with a width equal to 10.6 ± 0.4 GeV (^ 2 /dof = 109/73); right, after C2, 
C3, C4, and C5: m top = 175.3 ± 0.3 GeV, with a width equal to 10.6 ± 0.2 GeV (^ 2 /dof = 43/16). 
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Figure 16: % 2 distribution (with combinatorial 
background), with cuts C2 and C3 applied. 



3.9 Kinematic fit 

The top quark mass can be extracted from a kinematic fit using a % 2 based on the entire final state, as 
defined in equation (5). The terms in the first line consist of the usual x 2 terms 4 , while the last four 
constrain the object masses. The resolutions are extracted from Monte Carlo. The % 2 is calculated 
event by event and the resulting x 2 distribution, shown in Fig. 16, exhibits a higher purity for lower 
X 2 values. The purity of the final sample could be improved by cutting on the x 2 - 

^- = E«^) 2 + (^) 2 ) + E (^ + E(^r- 

jets °V a 0 jets.lepton °E x,y,z °iv 

Ow <5\V ®t Of 

The x 2 minimization provides a high constraint on the jets from W boson. In an earlier study [14], 
it has been shown that the accuracy of the determination of the top quark mass depends on the X 2 - 
For events in which the b-quark jets are well measured and extra final state effects are small, the x 2 
of the fit is close to 0 and produces a top quark mass value reflecting the Monte Carlo generated top 
quark mass. For larger x 2 value, this top quark mass value decreases while the fraction of b-quarks 
with a large gluon radiation increases. This observation can be used by performing a linear fit to the 
extracted top quark mass as a function of the X 2 - The fit is shown on Fig. 17 where the top mass 
distribution is fitted by a gaussian for each x 2 slice. The most accurate estimate of the top quark mass 
is obtained by extrapolating this linear fit to x 2 = 0. This procedure should lead to a lower sensitivity 
to final state radiation effects of the extracted top quark mass, which is found to be m top = 174.8 ±0.4 
GeV, where the uncertainty is statistical. 

This method is more computationally intensive than the others to assess the systematics, and is 
not considered further in this note. 

4 The in index stands for measured quantities and the/ index, for fitted quantities. E, 7] and <j> are respectively the energy, 
pseudo-rapidity and polar angle of the considered objects. 



Figure 17: Top quark mass as a function of % . 
The points are fitted by a linear function to ex- 
tract m top = (x 2 = 0) 
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Table 5: Fitted peaks for the hadronic top quark mass and corresponding widths for several methods. 
From simulated events with m top = 175 GeV, for 1 fb _1 . 



Method 


Cuts 


Top quark mass 


a 


Pull 


Pull 






[GeV] 


[GeV] 








CO 


175.0 ±0.2 


11.6 ±0.2 


-0.3 


1.23 


% l minimization 


CO, C2, and C3 


174.8 ± 0.3 


11.7 ±0.4 


-0.2 


1.11 


X 1 minimization 


CO, C2, C3, C4, and C5 


174.8 ± 0.3 


11.8 ±0.4 


-0.5 


1.00 


Geometric 


CI, C2. and C3 


174.6 ± 0.5 


14.1 ±0.5 


-0.39 


1.03 


Geometric 


C2, C3, C4 and C5 


175.0 ± 0.4 


14.3 ± 0.3 


-0.37 


1.11 


Geometric and rescaling 


CI, C2. and C3 


175.4 ± 0.4 


10.6 ± 0.4 


0.51 


1.10 


Geometric and rescaling 


C2, C3,C4, and C5 


175.3 ±0.3 


10.6 ± 0.2 


0.17 


1.15 


Kinematic fit 




174.8 ± 0.4 









3.10 Statistical uncertainties 




Figure 18: Pull distributions for the top quark mass measurement. Left, x 2 minimization method and 
right, geometric method. Both plots after C2, C3, C4, and C5. 

The statistical uncertainties quoted have been evaluated using a single simulated experiment with 
statistics corresponding to lfb 1 . To evaluate the reliability of these estimates, a bootstrap resampling 
technique has been used [20]. 

The pull distributions for the top quark mass measurement (M; — MgenVdMj have been produced 
for each method using 1200 pseudo-experiments. The biases and widths are reported in Table 5 for 
all the analyses shown in this note. Fig. 18 shows the pull distributions for the x 2 minimization 
method and for the geometric method, after cuts C2 through C5. All pull mean values are of size 0.5 
or less: this indicates that the biases induced by the methods of measuring the top quark mass are of 
the order of 0.1 to 0.2 GeV depending on the cuts applied. The pull widths are slightly larger than 1; 
this indicates that the statistical uncertainty of the fit is slightly underestimated. 
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3.11 Systematic uncertainties 

The statistical uncertainty on the top quark mass being negligible with a few fb~ 1 of collected data, 
the total uncertainty will quickly be dominated by the systematic uncertainties. All the contributions 
are listed below and summarized in Table 6. 

Table 6: Systematic uncertainties on the top quark mass measured in the semi-leptonic channel. 



Systematic uncertainty 


X 2 minimization method 


geometric method 


Light jet energy scale 


0.2 GeV/% 


0.2 GeV/% 


b jet energy scale 


0.7 GeV/% 


0.7 GeV/% 


ISR/FSR 


-0.3 GeV 


~ 0.4 GeV 


b quark fragmentation 


< 0.1 GeV 


<0.1 GeV 


Background 


negligible 


negligible 


Method 


0.1 to 0.2 GeV 


0.1 to 0.2 GeV 



3.11.1 Jet energy scale (JES) 

The effect of the uncertainty of the jet energy scale on the top quark mass measurement has been 
estimated by multiplying separately the light jet and b-jet momenta by several rescaling factors (20 
factors, between —10% and +10%). Neither the event selection nor the fyj have been changed after 
this jet energy rescaling. 

The resulting top quark mass depends linearly on the rescaling factor. The related systematic 
uncertainty on the top quark mass can therefore be expressed as a percentage of the light jet and Z>-jet 
energy scale miscalibration. 

• The uncertainty in the &-jet energy scale produces an uncertainty in the top quark mass of 0.7 
GeV/%. The Z?-jet scale will ultimately be determined with data from Z+jets. However, at 
the start of LHC running, the Z+jets statistics will be low, so the Z>-jet scale will be derived 
from the measured light jet scale togheter with a Monte Carlo correction term modelling the 
difference between the two jet energy scales. The systematic uncertainty associated with these 
methods has not been yet evaluated. 

• The uncertainty in the light jet energy scale produces an uncertainty in the top quark mass of 
0.2 GeV/%. The reduced dependence compared to that of the &-jet energy scale is due to the 
W boson mass constraint used in the rescaling (j 1 minimization method or kinematical fit) or 
the definition of the top quark mass estimator (geometric with rescaling method). It has been 
shown that the light jet energy scale should be known with a precision of 1% in 1 fb _1 of 
data [3]: the corresponding uncertainty on the top quark mass would therefore be 0.2 GeV. 

3.11.2 Initial and final state radiation (ISR and FSR) 

The study of the effect of initial and final state radiation on the top quark mass measurement is still 
preliminary. Several samples have been simulated for this study, corresponding to different sets of 
parameters: 
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Table 7: Top quark mass measured with several ISR/FSR parameters {% 2 minimization method, after 
cut CO and jet calibration). 



Sample 


Top quark mass [GeV] 


a [(GeV] 


1 


176.6 ± 0.4 


13.4 ±0.6 


2 


176.3 ± 0.7 


13.7 ±0.8 


3 


176.3 ± 0.5 


12.7 ± 0.6 



• Sample 1: AcerMC tt events, with maximum reconstructed top quark mass (half the default 
value of A(QCD) for FSR; twice the default value of A(QCD) for ISR). 

• Sample 2: AcerMC tt events, with default ISR and FSR parameters. 

• Sample 3: AcerMC tt events, with minimum reconstructed top quark mass (twice the default 
value of A(QCD) for FSR, half the default value of A(QCD) for ISR 

For each sample, a specific jet calibration has been applied (separately for Z?-jets and light jets). 

The measured top quark masses are summarised in Table 7. A shift on the top quark mass is 
observed. A first estimate of the systematic uncertainty due to initial and final state radiation can be 
estimated from the relative difference between these three top quark mass values: it is approximately 
equal to 0.3 GeV (more statistics would be necessary for a more precise estimate). 

This preliminary study shows the impact on the top quark mass measurement of a given change 
of ISR/FSR parameters, but the estimate of the systematic uncertainty due to ISR or FSR will benefit 
from a measurement of these effects with ATLAS data. Figure 19 shows, for example, that the jet 
multiplicity could help to determine the size of the initial and final state radiation contribution. Initial 
state radiation could be measured with Drell-Yan events, as has been done at the Tevatron. 
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Figure 19: Light and b-jet multiplicity for several values of the ISR/FSR parameters (prQet) > 40 
GeV). 



3.11.3 b-quark fragmentation 

The effect of b-quark fragmentation has been estimated by varying the Peterson parameter within its 
uncertainty (study performed on fast simulation [14]). 
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160 165 170 175 180 185 190 160 165 170 175 180 185 190 



Generated top mass [GeV] Generated top mass [GeV] 

Figure 20: Reconstructed top quark mass with 2 ^-tagged jets as a function of the generated top quark 
mass (left: % 2 minimization method, after CO cut; right: geometric method after CI cut). The method 
has good linearity. 

The resulting uncertainty is lower than 0.1 GeV. 
3.11.4 Background estimate 

Variations in the size of the background have no noticeable effect on the extracted top quark mass. 
Nevertheless, it is important to extract the shape of the background from data. 

3.12 Linearity of the method 

The analysis leading to the top quark mass measurement has been applied to samples corresponding 
to several values of generated top quark mass. Figures 20 and 21 show that both methods have good 
linearity. The reconstructed top quark mass lies above the generated one by an average offset equal to 
0.2% of m top . This comes from the pj jet spectrum which is different for each sample with a different 
generated top quark mass [3]. The jet energy scale is then a little bit different in each sample due to 
the pr cut applied on jets. 

4 Top quark mass measurement in the semi-leptonic channel with re- 
laxed requirements on the ^-tagging 

At the start of LHC running, the detector will not be optimized and will require a commissioning phase 
with first data to calibrate its sub-components. This is particularly true for the pixel detector and its 
capability to tag b-jets. Thus, it could be useful to have a top quark mass measurement analysis in 
which the use of Z>-tagging is reduced. This is addressed in this section by performing a first analysis in 
which exactly one £»-jet is Z>-tagged (the assumed Z?-tagging efficiency is the same as before: £b ~ 60%). 
This sample has no events in common with the sample used in section 3. In a second analysis, b- 
tagging is not used in the reconstruction of the it events. Even if not used, this sample contains events 
with 0, 1, or 2 ^-tagged jets and overlap with the other two samples. 
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Figure 21: Difference between the reconstructed and generated top quark mass, as a function of the 
latter (left: x 2 minimization method, after CO cut; right: geometric method after CI cut). 

4.1 Displaced vertex ^-tagging, 1 ^-tagged jet 

The sample is here limited to events with no more than five jets. In this case, the best method to select 
the two light jets from the W boson is the one which minimizes My — M^f*. The b-jet associated with 
the hadronic top quark decay is then chosen among the remaining light jets and the tagged b-jet. The 
best choice is the one which minimizes yj (Xy — jX\) 2 + (X2 — H2) 2 (see equations (2) and (3)). 

A large fraction of the non-hadronic &-jets is removed by requiring the b-]et to be closer to the 
reconstructed W boson than the lepton originating from the leptonic top quark decay: 

• cut C6 : A/?(lepton,Z? had ) - AR(W,b had ) > 1 

As in previous sections, the purification cuts CI and C3 are applied. Harder purification cuts (C4 
and C5) could also be applied. The top quark mass spectra derived after applying the two sets of 
purification cuts are shown in Fig. 22. 

Table 8 summarizes the reconstruction efficiency and purity related to this reconstruction method 
for both sets of cuts. 



Table 8: Single ^-tagged sample: efficiency of cuts applied and final purities (within ± 3 a ratop ), with 
respect to semi-leptonic events. 



Cuts 


Efficiency (%) 


W boson purity (%) 


b purity (%) 


top purity (%) 


Number of events 


C1+C3+C6 


0.54±0.02 


70 ±1 


69±1 


62±1 


1063 


C3+C4+C5+C6 


0.52±0.02 


71±1 


70±1 


63±1 


1016 



The systematic uncertainties on the top quark mass measurement, determined in the same way as 
in the previous section, are summarized in Table 9. 

The top quark mass values obtained are shown in Table 10 together with the pull distribution 
results. The single-tag sample can be added to the double-tagged sample to perform a top quark mass 
measurement with 50% more signal events. 
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Figure 22: Events with only 1 Z?-tagged jet: m top =M^ — Mjj with the geometric method and 

rescaling, after purification cuts (left, CI, C3, and C6 cuts: m top = 176.0 ± 0.7 GeV, with a width 
equal to 9.4 ± 0.8 GeV(# 2 /dof = 58/52) ; right, C3, C4, C5, and C6 cuts: m top = 174.0 ± 0.4 GeV, 
with a width equal to 12.7 ± 0.4 GeV) (^ 2 /dof = 27/13). 



Table 9: Systematic uncertainties on the top quark mass with rescaling measured in the semi-leptonic 
channel, with 1 ^-tagged jet and no ^-tagging, using the geometric method. 



Systematic uncertainty 


1 ^-tagged jet 


No ^-tagging 


Light jet energy scale 


0.3 GeV/% 


0.4 GeV/% 


b jet energy scale 


0.7 GeV/% 


0.7 GeV/% 


ISR/FSR 


~ 0.4 GeV 


~ 0.4 GeV 


b quark fragmentation 


< 0.1 GeV 


< 0.1 GeV 


Background 


< 1 GeV 


1 GeV 



Table 10: Fitted top quark mass value and corresponding widths for 1 b-tag and no b-tag samples, in 
1 fb _1 . Pull bias and width are also given. 



Geometric method 


Cuts 


Top quark mass 


a 


pull 


pull 


with rescaling 




[GeV] 


[GeV] 


bias 


width 


1 /^-tagged jet 


CI, C3 and C6 cuts 


176.0 ± 0.7 


9.4 ± 0.8 


-0.48 


1.17 


1 /^-tagged jet 


C3, C4, C5 and C6 cuts 


174.0 ± 0.4 


12.7 ± 0.4 


-0.05 


1.07 


No ^-tagged jet 


CI and C3 cuts 


175.0 ± 0.4 


11.7 ±0.5 


0.40 


1.12 


No /^-tagged jet 


C3, C4 and C5 cuts 


175.2 ± 0.5 


12.4 ±0.8 


-0.15 


0.94 
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4.2 Top quark mass measurement in the semi-leptonic channel assuming no 6-tagging 

The same analysis is performed with no use of ^-tagging information. The main difference with 
the two ^-tagged analysis comes from the larger contribution of the combinatorial and physics back- 
ground. 

The corresponding top mass spectra are given in Fig. 23 for both sets of cuts. Table 1 1 summarizes 
the efficiency and purity related to this method. 




M ijbh -M i+ MP eak [GeV] ' ' ' ' M^-Mj+Mf^GeV] 



Figure 23: Events without ^-tagging: m top = Mjjb — Mjj +M^ ea with the geometric method, after 
purification cuts (left, CI and C3 cuts: m top = 175.0 ± 0.4 GeV, with a width equal to 11.7 ± 0.5 
GeV(j£ 2 /dof = 130/69) ; right, C3, C4 and C5 cuts: m top = 175.2 ± 0.5 GeV, with a width equal to 
12.4 ±0.8 GeV(;t 2 /dof = 41/24)). 



Table 11: Efficiency of cuts applied and final purities (within ± 3 o„ ho ), with respect to semi-leptonic 
(e,/i) events, assuming no ^-tagging. 



Cuts 


Efficiency (%) 


W boson purity (%) 


b purity (%) 


top purity (%) 




C1+C3 


1.59 ±0.03 


49.5 ± 0.9 


46.8 ± 0.9 


58.4 ± 0.9 


3115 


C3+C4+C5 


1.48 ±0.03 


51.1 ±0.9 


49.0 ± 0.9 


58.9 ± 0.9 


2916 



The systematic uncertainties on the top quark mass measurement are summarized in Table 9. The 
obtained top quark mass values together with the pull distribution results are shown in Table 10. 

For the second set of cuts (right plot of Fig. 23), the background contribution (~ 45 % of the 
sample) is peaked exactly below the signal contribution, preventing a fit using a parametrization like 
gaussian function (signal) + polynomial function (background). An event mixing technique has been 
used to fix the background shape in the parametrization. The event mixing technique consists in 
replacing one of the two jets momentum associated to the W with a simulated jet whose energy, <f) and 
r\ distribution is randomly selected according to the global energy, 0 and X] distributions observed for 
the jets associated to the W. This technique looks very promising as shown in Fig. 24 comparing data 
and event mixing samples for the two jet and three jet invariant masses after preselection. 
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MjlGeV] " M jib [GeV] 



Figure 24: Events without ^-tagging: comparison between background and event mixing samples for 
two jets (left) and three jets (right) invariant mass. 

5 Conclusion 

Several methods have been investigated in order to perform an accurate top quark mass measurement 
with 1 fb _1 of collected data, in the tt semi-leptonic channel. The best top quark mass determination 
is achieved with two ^-tagged events and a top mass estimator taken as the invariant mass of the 
three jets from hadronically-decaying top quark; the uncertainty on the top quark mass measurement 
with this analysis will be dominated by systematics, the statistical uncertainty being already small 
(<0.4 GeV). The precision on the top quark mass relies mainly on the jet energy scale uncertainty: a 
precision of the order of 1 to 3.5 GeV should be achievable with 1 fb -1 , assuming a jet energy scale 
uncertainty of 1 to 5%. W boson sample can be extracted from the tt sample in order to constrain the 
light jet energy scale. The main uncertainty on the top quark mass measurement will come from the 
b-jet energy scale. 

Events with one or no Zj-tagged jets lead also to an interesting measurement if the background 
(physical and combinatorial) shape is constrained from data. The estimated precision on the top quark 
mass value is below 2 GeV (assuming a jet energy scale uncertainty of the order of the percent), with 
a very good signal over background ratio. These samples are thus very useful for jet energy scale or 
^-tagging studies during the commissioning phase with early data. 
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Abstract 

The ATLAS potential for the study of top quark properties and physics beyond 
the Standard Model in the top sector, is reviewed in this paper. Measurements 
of the top quark charge, the spin and spin correlations, the Standard Model 
decay (t — > bW), rare top quark decays associated to flavour changing neutral 
currents (t —> qX,X = y,Z,g) and tt resonances are discussed. The expected 
sensitivity of the ATLAS experiment is estimated for an integrated luminosity 
of 1 fb _1 at the LHC. For the Standard Model measurements the expected pre- 
cision is presented. For the tests of physics beyond the Standard Model , the 
5a discovery potential (in the presence of a signal) and the 95% confidence 
level limit (in the absence of a signal) are given. 

1 Introduction 

Several properties of the top quark have already been explored by the Tevatron experiments, such as the 
mass, charge and lifetime, the rare decays through flavour changing neutral currents (FCNC) and the 
production cross-sections. The structure of the Wtb vertex and the main top quark decay mode (t — > bW) 
within the Standard Model were also investigated together with the measurements of the W-boson he- 
licity fractions. Many of these studies were performed by reconstructing tt pairs in the semilep tonic, 
dileptonic and fully hadronic decay modes. Given the current Tevatron luminosity, most of these studies 
are limited by the statistics acquired. 

The electric charge of the top quark is one of its fundamental properties and will be probed with high 
statistics at the LHC. The measurement of the top quark charge can be performed either by identifying 
the charge of its decay products in the main decay channel t — > bW or by studying radiative top quark 
processes. At the Tevatron the DO [1] and CDF [2] collaborations have already initiated the study of 
the top quark charge and, with the available statistics, they showed that the data gives preference to the 
Standard Model top quark hypothesis (with a charge of +2/3) over the scenario with an exotic quark 
(XM) of charge —4/3 and mass 170 GeV, fully consistent with the present precision electroweak data 
[3,4]. The DO and CDF exclude the exotic quark hypothesis with 92 and 87 % confidence, respectively. 

As the top quark decays before it can form hadronic bound states, a consequence of its high mass, 
the spin information of the top quark is propagated to its decay products. This unique behaviour among 
quarks allows direct top quark spin studies, as spin properties are not washed out by hadronization. 
Through the measurement of the angular distributions of the decay products the information of the top 
quark spin can be reconstructed. Top quark spin polarization and correlations in tt events produced at the 
LHC are precisely predicted by the Standard Model and are sensitive to the fundamental interactions in- 
volved in the top quark production and decay. By testing only the top quark decay, the W-boson polariza- 
tion measurement complements top quark spin studies, helping to disentangle the origin of new physics, 
if observed. The W-boson polarization states can be measured through the longitudinal (Fq), left-handed 
(Fl) an d right-handed (Fr) helicity fractions. At the present, the most stringent limits on the W-boson he- 
licity fractions were obtained at the Run-II of the Tevatron [5-13]. Analysing 1.9 fb _1 of data, the CDF 
experiment measured [5] Fq = 0.62 ±0.11 with Fr fixed to zero and Fr = —0.04 ±0.05 with Fq fixed 
to the Standard Model expectation for m t = 175 GeV. For 2.7 fb _1 , the DO experiment measured [10] 
F 0 = 0.490 ±0.106(stat) ±0.085(syst) with F R fixed to zero, and Fr = 0.1 10±0.059(stat) ±0.052(syst) 
with F) fixed to the Standard Model value. 

Within the Standard Model, the Wtb coupling is purely left-handed (at the tree level), and its size is 
given by the Cabibbo-Kobayashi-Maskawa (CKM) matrix element V t b- In Standard Model extensions, 
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Table 1: The values of the branching ratios of the FCNC top quark decays, predicted by the SM, the 
quark-singlet model (QS), the two-higgs doublet model (2HDM), the minimal supersymmetric model 
(MSSM) and SUSY with R-parity violation are shown [30-36]. 



Process 




SM 




QS 




2HDM 


MSSM 


tjf SUSY 


t^uZ 


8> 


io- 17 


1.1 


x 10" 


4 




2> 


c 10- b 


3 x IO" 5 


t^uy 


3.7 


x 10~ 16 


7.5 


x 10" 


9 




2> 


; 10" 6 


1 x 10~ 6 


t^ug 


3.7 


x IO" 14 


1.5 


x 10" 


7 




8> 


; 10~ 5 


2 x 10~ 4 


t^cZ 


1 > 


io- 14 


1.1 


x 10" 




~io- 7 


2> 


( 10" 6 


3 x 10~ 5 


t — ► cy 


4.6 


x 10- 14 


7.5 


x 10" 


9 


~ io~ 6 


2> 


( IO" 6 


1 x IO" 6 


t^cg 


4.6 


x IO" 12 


1.5 


x 10- 


7 


~ 10- 4 


8> 


; 10~ 5 


2 x IO" 4 



departures from the Standard Model expectation V& c± 0.999 1 are possible [14, 15], as well as new 
radiative contributions to the Wtb vertex [16, 17]. These deviations might be observed in top quark 
production and decay processes at LHC. The most general Wtb vertex for on mass shell W-boson, top 
quark and b quark, containing terms up to dimension five can be written as 

5? = --*-bf(V h P L + V R P R )tW„-^=b— ^( gh p L + gR p R )tW-+h.c, (1) 
V2 V2 M w 

with q = pt~ Pb the W-boson momentum and Jr(l) the chirality projectors. Additional o^ v k v and W 1 
terms, where k = p t + p\„ can be absorbed into this Lagrangian using Gordon identities. If the W-boson 
is on its mass shell or it couples to massless fermions q^e^ = 0, and terms proportional to can be 
dropped from the effective vertex. The new constants Vr, #l and gR [18, 19], are vector like (Vr) and 
tensor like (g^ and #r) anomalous couplings, can be related to ff, f \ and f\ in Ref. [20] (and references 
therein) as ff = Vr, f\ = — and ff = —gR. If we assume CP is conserved, these couplings can be 
taken to be real. Within the Standard Model = V t b — 0.999 and the other couplings (Vr, g^, gR) vanish 
at the tree level, while nonzero values are generated at higher orders [21, 22]. Indirect limits on the Wtb 
vertex anomalous couplings can be inferred from radiative B -meson decays and BB mixing [23]. Taking 
into account the current world average [23], BR(B ->• X s y) = (3.55 ± ± 0.03) x 10 -4 , varying 

one parameter at a time, the 95% C.L. bounds on Vr, g^ and gR are in the range [—0.0007,0.0025], 
[-0.0015,0.0004] and [-0.15,0.57], respectively [22,24]. 

Flavour Changing Neutral Currents are strongly suppressed in the Standard Model due to the Glashow- 
Iliopoulos-Maiani (GIM) mechanism [25]. Although absent at tree level, small FCNC contributions are 
expected at one-loop level, determined by the CKM mixing matrix [26-29]. For the top quark within 
the framework of the Standard Model, these contributions limit the FCNC decay branching ratios to 
the gauge bosons, BR(t — ► qX,X = Z,y,g), to below 10~ 12 . There are however extensions of the SM, 
like supersymmetry (SUSY) [30], multi-Higgs doublet models [31] and models with exotic (vector-like) 
quarks [32-34], which predict the presence of FCNC contributions already at tree level and significantly 
enhance the FCNC decay branching ratios compared to the Standard Model predictions [35, 36]. The 
branching ratio for the different models are shown in Table 1 . FCNC processes associated with the pro- 
duction and decay of top quarks have been studied at colliders and the observed upper limits on the 
branching ratios at 95% C.L., from the direct searches, are shown in Table 2. 

Since the top quark mass is much larger than the other quarks, the top quark may play a privileged 
role in the electroweak symmetry breaking (EWSB) mechanism. Any new physics connected to the 
EWSB could be preferentially coupled to the top quark sector. This would lead to deviations from the 

'Three generations of quarks and unitarity of the CKM matrix are assumed. 
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Table 2: Experimental observed upper limits on the branching ratios at 95%C.L. for the FCNC top quark 
decays. 





LEP 


HERA 


Tevatron 


BR(t -> qZ) 


7.8% [37-41] 


49% [42] 


3.7% [43] 


BR(t^qy) 


2.4% [37-41] 


0.75% [42] 


3.2% [44] 


BR(t^qg) 


17% [45] 


13% [42,46,47] 0.1-1 


% (estimated from [46,48]) 



expected Standard Model ft production rate and could distort the top quark kinematics. New resonances 
and gauge bosons strongly coupled to the top quark are expected in a large variety of models, in particular 
those with strong EWSB [49-5 1]. The tt final states are also interesting for leptophobic Z' bosons which 
can appear in Grand Unification Models [52]. These new particles could reveal themselves in the tt 
invariant mass distribution. At the Tevatron experimental upper limits were set at 95 %C.L. for the 
a(pp -> Z') x BR(Z' -> tt) with Z' masses between 450 GeV and 900 GeV. A topcolor leptophobic Z' is 
ruled out below 720 GeV and the cross section of any narrow Z' decaying to a tt is less than 0.64 pb at 
95%C.L., for Z' masses above 700 GeV [53]. 

In this note the ATLAS potential for the study of the top quark properties and tests of physics beyond 
the Standard Model in the top quark sector are reviewed for an expected luminosity of 1 fb -1 at the 
LHC. The note is organized as follows: the basic event selection and the trigger used are reminded in 
Section 2. In Sections 3, 4, 5 and 6 the studies of the top quark charge, the W-boson and top quark 
polarisation studies and the Wtb anomalous couplings, the top quark FCNC decays and the production 
of tt resonances are discussed respectively. The summary and conclusions are presented in Section 7. 
The top quark mass, one of the most important top quark properties, is not investigated here as a separate 
note is devoted to this issue [54]. 

2 Basic event selection 

As most of the studies performed in this paper are related to either the semileptonic (tt — > WWbb — > lvj i j2bb 
with l = e,n) or the dileptonic (tt — > WWbb — > lvl v'bb with 1, f = e, n) decays of tt events, basic criteria 
for the event selection were defined for each one of these final state topologies. Changes to the criteria 
are to be expected depending on the type of top quark property under study. For the background studies 
several sources were considered, tt, W+jets, Wbb+jets, Wcc+jefs, Z+jets, WW, ZZ, WZ and single top 
events (see the introduction of the Top Chapter for a full list of backgrounds). The backgrounds are also 
described in more detail separatly for each section of the note. All signal and background events were 
required to pass the single lepton trigger requirements for electrons and muons. The triggers considered 
were L1.EM18I (L1_MU20,L1_MU40) for electrons (muons) at LI, e22i (mu20) for electrons (muons) 
at L2 and e22i (mu20) for electrons (muons) at the Event Filter (EF) [55]. 

Different selection criteria are applied for the top quark FCNC analyses, as the final state topology is 
different from those considered above. These criteria are explained in Section 5. 

Semileptonic topology 

In the semileptonic topology, signal events have a final state with one isolated lepton (electron or 
muon), at least four jets (two of them from the hadronization of b quarks and labelled b-jets) and large 
transverse missing energy from the undetected neutrino. More information on the signal can be found 
in the introduction of the Top Chapter. The basic selection criteria were defined by requiring that the 
events should have: 
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Table 3: Cumulative efficiencies of the standard top quark selection criteria for the semileptonic and 
dileptonic type of events with electrons and muons for the pseudorapidity range \ rj \ < 2. 



criterion 


£(%) 


criterion 


£(%) 


Semileptonic events 


100 


Dileptonic events 


100 


1 isol.lept. (p T >25/20 GeV) 


58.9 


2 isollept. (p T >25/20 GeV) 


35.5 


> 4 jets (p T >30 GeV) 


34.2 


> 2 jets (p T >30 GeV) 


31.8 


> 2 b-tagged 


10.5 


= 2 b-tagged 


8.3 


missing Ej > 20 GeV 


8.5 


missing Ej > 30 GeV 


6.5 



• exactly one isolated electron (muon) with |t] | < 2.5 and p T > 25 GeV (p T > 20 GeV); 

• at least 4 jets with | tj | < 2.5 and p T > 30 GeV; 

• at least 2 jets tagged as b-jets; 

• missing transverse energy above 20 GeV. 



Dileptonic topology 

In the dileptonic topology, signal events have a final state with two isolated leptons (electrons and/or 
muons), at least two jets (tagged as b-jets) and large transverse missing energy from the two undetected 
neutrinos. The basic selection criteria were defined by requiring that the events should have: 

• exactly two isolated electrons (muons) with \r)\ < 2.5 and pj > 25 GeV (pj > 20 GeV); 

• at least 2 jets with |tj| < 2.5 and p T > 30 GeV; 

• 2 jets tagged as b-jets; 

• missing transverse energy above 30 GeV. 

Selection efficiency 

The efficiency of the basic selection criteria was examined using « 450 000 events from the 
tt — ► bWbW — > bqqb£v,b£vb£v signal sample. The results are presented in Table 3. 

The effect of the trigger efficiency is investigated separately for the individual analysis. 

3 Top quark charge reconstruction 

There are several techniques to determine the electric charge of the top quark at hadron collider ex- 
periments [56-58]. The top quark charge measurement presented here is based on the reconstruction 
of the charges of the top quark decay products. As the dominant decay channel of the top quark is 
t — ► W + b(t — > W~b), the top quark charge determination requires the measurement of both the W boson 
and the b quark charges. While the charge of the W boson can be determined through its leptonic decay, 
the b quark charge is not directly measurable due to quark confinement in hadrons. In this note two 
possible ways to determine the b quark charge were investigated: 
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• The charge weighting technique: this approach is based on finding a con-elation between the b 
quark charge and the charges of the tracks belonging to the b-jet [59, 60]. 

• The semileptonic b-decay approach: in this case the b quark charge is determined using the 
semileptonic b-decays (b — > c , u + W , W —>■ £~ + Ve), where the sign of the soft lepton indicates 
the sign of the b quark charge. 

Two major issues have to be addressed. The first one is to find the selection criteria to perform the correct 
pairing of the lepton and the b-jet originated in the same top quark decay. In the Standard Model a b- 
jet, coming from a b quark, should be associated with a positive lepton (£ + ), while in the exotic case it 
should be associated with a negative one (£~). The second issue is the assignment of a charge to the b-jet 
selected by the pairing criterion. While the former issue is common for both the approaches, the latter 
one is tackled in different ways. 

3.1 Event generation and selection 

The standard tt — > W + bW~b samples were used as signal events in both the semileptonic and the dilep- 
tonic channels (only electrons and muons are taken as signal). For the background studies, the W+jets 
sample was used. The tt all jets channels as well as the semileptonic and dileptonic channels of X leptons 
were analysed as they can contribute to background (all jets channel) and to signal (events with the lep- 
tonic decays of % leptons). In the present analysis the common selection criteria were used, as defined in 
Section 2. In addition, for each approach, specific criteria were applied to the events. 

3.2 The lepton and b-jet pairing algorithm 

The lepton and b-jet pairing was done using the invariant mass distribution of the lepton and the b-tagged 
jet, m(l,bjet). If the assignment is correct, m(l,bj et ) is limited by the top quark mass, otherwise there is 
no such restriction, as can been seen in Figure 1 , where the signal sample with the standard cuts applied 
was analysed. To find the connection between the b-quarks and reconstructed b-jets and the parton level 
leptons and reconstructed leptons, the MC truth was used: the matching was treated as successful if the 
cone difference, AR between b-quark and b-jet was less than 0.4 (in the lepton case AT? < 0.2). For 
double b-tagged events, only the b-jets that satisfy 

m(l,b£ t ' 2) ) <m cr and m(\,b^ } ) > m CI (2) 

were accepted. In the di-lepton case both leptons should fulfill the condition (2). The optimal value 
for the pairing mass cut, wi cr = 155 GeV, is a trade-off between the efficiency (e) and purity (P) of the 
pairing method. The factor e(2P — l) 2 was maximised to find the optimum working point. As this 
criterion requires events with two b-tagged jets and one combination for the lepton and b-jet invariant 
mass must be below m cr and the other one above m cr , the efficiency of the method is small. On the other 
hand, this criterion gives a high purity sample as is shown in Section 3.5.1. In the analysis two variants 
of b-tagged events treatment were considered. In the first one exactly two b-jets were required while in 
the second one two and more b-jets were allowed (the two with the highest pj treated as true b-jets). 
Slightly better results were obtained for the former variant and the results presented here correspond to 
this case. To suppress the background some additional cuts were tried: W boson mass (Mw) window, 
top quark mass (m top ) window, etc. By using the combined W boson and top quark mass window the 
background can be reduced by factor more than 10 at the expense of a factor 2 loss in signal. The Mw 
window requires that at least one pair of non b-tagged jets should have an invariant mass within 10 GeV 
of the W boson mass. The m top window requires that the reconstructed W boson can be combined with 
a b-jet (not previously paired with a high-/?x lepton) to give an invariant mass within 40 GeV of the top 
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Figure 1: Lepton - b-jet invariant mass spectra for the lepton and b-jet pairs from the same top quark (full 
line) and from different top quarks (dashed line). 



quark mass. This extra mass cut was applied by default in the whole top quark charge analysis using the 
weighting approach. 



3.3 The jet track charge weighting approach 

The determination of the average b-jet charge was done using a weighting technique in which the b-jet 
charge is evaluated as the weighted sum of the b-jet track charges: 

Gbjet - ~ — (3) 

Zi\ji-Pi\ K 

where qj(pi) is the charge (momentum) of the / th track inside the jet and j is the b-jet axis unit vector. 
The k parameter was optimised for the best separation between b- and b-jets and the optimum value 
was found to be K = 0.5. In addition, for the charge weighting technique, it was further required, using 
only tracks with pj > 1.5 GeV, that at least two tracks must be found within a cone with AT? < 0.4 with 
respect to the jet axis. For b-jets with more than seven such tracks, only the seven with the highest pj 
are used. The parameters of the weighting procedure are the result of a maximisation of the difference 
between the mean values of the b- and b-jet charge distributions - these mean values were found for a 
set of the parameters values (fc and track pi) and compared. For the procedure optimisation the signal 
tt-sample was used. 



3.4 Semileptonic b-decay approach 

In this approach the pairing procedure described in Section 3.2 is also used. But in this case the b quark 
charge is determined through its semileptonic decay. The sign of the b-jet charge is determined by the 
lepton charge within the b-jet, 

b->c,u + ^~ + v, b^c,u + £+ + v. 

The lepton from the b-decay will be identified as a non-isolated lepton inside the corresponding b- 
jet, and its charge (<2nonis) defines the b quark charge. The non-isolated lepton is searched for among 
the tracks pointing to the treated b-jet and originating in the corresponding secondary vertex. Several 
processes can lead to an incorrect b quark charge assignment with this approach. Semileptonic decays 
of D mesons produced in the B decay chain, and the B°-B° mixing are examples of such processes. To 
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suppress the contribution from D mesons, the non-isolated lepton transverse momentum with respect to 
the b-jet axis, p™ 1 , can be used. The fact that the lepton p r j l from b-decays is, on average, higher than 
from D meson decays can be used to diminish this contamination. The p^ 1 cut was optimised using a 
sample of 555000 signal tt events and the value of 1 GeV has been found as the optimum cut. An 
additional source of wrong b quark charge assignments is mistagging, i.e. light jets incorrectly tagged as 
b-jets. 

The main drawback of this approach is that, from all the selected lepton plus b-jet pairs, only those 
with a b-jet containing a non-isolated lepton can be used in the analysis. In addition to that, due to 
difficulties in selecting a pure sample of electrons from within jets, only muons were taken as the non- 
isolated leptons. 



3.5 Results 

In both approaches, the Standard Model scenario of top quark production and decay is assumed. The 
corresponding results are discussed in the following sections. 



3.5.1 Weighting technique approach results 

As a first step, the efficiency and purity of the lepton b-jet pairing was investigated. Using the events 
which passed the selection criteria, the obtained pairing efficiency is e = 30.5% and the pairing purity 
is P = 85.6%. The purity of pairing is defined as P = N goo i/Naa, where A^ goo d (Nail) is the number of 
correctly paired lepton - b-jet pairs (all treated pairs) and the Monte Carlo truth is used to find N goo d- 

The b-jet charge spectra reconstructed using the Monte Carlo truth and invariant mass pairing pro- 
cedure for the signal tt events are presented in Figure 2, left and right respectively. From Figure 2, the 
shift of the b-jet charges associated with £ + , Q^J V and £~, Q^J t , (or with b and b quark in the Monte 
Carlo case) is clearly seen. The obtained b-jet charge purity, defined as the percentage of b-jets with the 
correct charge (Q^jet < 0 an( ^ 2bjet > 0)> * s ^ ~ 62%. In addition to that, the g C omb b-jet charge spectrum, 
defined as Q(£) x Q^ eV nas ^ een recons tructed. The influence of trigger was also investigated, namely 
the lepton level 1 and level 2 triggers as well as the event filter (EF). The results with and without trigger 
are summarised in Table 4. No significant impact of the trigger is observed. A small asymmetry in favour 
of the positive b-jet charge, as was revealed by the analysis, is due to the dominance of positive charge 
in the initial state (two colliding protons). 

Note that the peaks at ± 1 in Fig. 2 correspond to the cases when all the tracks pointing to a b-jet 
have the same charge sign - in this case the weighting procedure (2) gives <2bjet = ±1. 



Table 4: The mean b-jet charge associated with positive (Q+) and negative (<2-) lepton and combined 
b-jet charge (2 com b) without (no) and with (yes) EF trigger; two b-tags required. 



trigger 


Q+ 


Q 


I Coomb | 


Nevent 


efficiency 


no 


-0.092 ± 0.006 


0.103 ±0.006 


0.097 ± 0.004 


7129 


100.0 


yes 


-0.095 ± 0.006 


0.106 ±0.006 


0.101 ±0.004 


6130 


86.0 



The main background processes for the top quark charge measurement in the semileptonic mode are: 
W+ jets production (the most important background in this mode), QCD multi-jets, di-boson and single 
top quark production. The single top production is not a genuine background as it gives the same sign 
of the b-jet charge asymmetry as the signal. For the selection criteria that were used, the ratio of the 
accepted semileptonic tt events to the accepted single lepton ones is more than 15:1. In the dileptonic 
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Figure 2: The b-jet charge associated with positive (full line) and negative (dashed line) lepton using the 
Monte Carlo truth (left) and invariant mass criteria (right) for the ^-pairing. 

mode, the background is composed of Drell-Yan pairs (the most significant background), multi-jet QCD 
processes and di -boson production [2]. 

The background studies for this analysis require large Monte Carlo samples (in addition to the stan- 
dard cuts the invariant mass criterion is highly restrictive) not available at present. The ideal way to 
determine the basic background parameters, the S:B ratio and background charge asymmetry, is to use 
the W+jets (dominant background) samples. However, after applying the selection criteria to the avail- 
able W+jets samples, only a few events remained (20 lepton b-jet pairs). It is clear that due to poor 
statistics the samples are not suitable for a valuable background analysis. Nonetheless, combining the 
b-jet charge spectra, obtained for the individual W+jets channels (W+nxjets and Wbb, Wcc + n xjets) 
scaled according to their cross sections to 1 fb, a S:B ratio of « 38 ± 8 was obtained. To fix the S:B 
ratio we need to include other backgrounds and take a regard for the poor statistics. Taking into account 
only the standard cuts with a loose Mw window (±30 GeV) a value of 7:1 was obtained for the S:B ratio 
which is compatible with that of the CDF background studies [2]. As a result, a nominal S:B ratio of 
30:1 has been assumed, with 7:1 as a very conservative lower limit for studying systematic uncertainties 
related to the background. 

The poor statistics of the available W+jets samples does not enable the background b-jet charge 
asymmetry to be determined precisely, the obtained value being rj —0.02 ±0.05. On the other hand, as it 
was shown by CDF [2], no marked background asymmetry is expected. For this reason, as a background, 
we use the signal events but without the pairing of leptons and b-jets. As a consequence the obtained 
b-jet charge spectrum is not correlated with the high pj lepton charge and should not have any charge 
asymmetry. Assuming the nominal S:B ratio, the spectrum is normalized to 1/30 of the signal statistics. 
The analysis showed that this background exhibits practically no asymmetry. For the systematics studies, 
a background corresponding to S:B=7:1 was also considered. 

To find a realistic b-jet charge distribution, the signal and background distributions are combined. In 
Figure 3 (left) the expected b-jet charge (<2comb) distribution combining the signal with the background 
(full line) and the background itself (dashed line) are shown. From the reconstructed b-jet charge spectra 
using the two treated backgrounds, the expected mean b-jet charge (assuming the Standard Model) is: 

g comb = -0.094 ±0.0042 (stat). 

The Qcomb value is obtained as the mean value of the signal plus background (S+B) distribution 
combining signal with the background. 

In conclusion, « 6000 lb combinations could be selected for the top quark b-jet charge analysis, 
using the 1 fb -1 sample. The expected combined b-jet charge purity, N(Q\,j et < 0)/N a n, is « 0.62±0.01 
for the Standard Model case. 
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Figure 3: Left: the full S+B b-jet charge (<2comb) distribution (full line) and the background itself (dashed 
line); right: the reconstructed top quark charge (<2™ mb ) (full line) and its background (dashed line). 

Taking into account the statistical uncertainty of the obtained mean b-jet charge (<2comb) it can be 
stated that the obtained value will differ from 0 by more than 20a. Using a simple statistical treatment 
it is easy to show that for a reliable determination of <2 com b (> 5a) a sample of « 0.1 fb -1 should 
be sufficient. In addition to that the analysis has revealed that the reconstructed b-jet charge is more 
influenced by the size of the S:B ratio than by the background asymmetry: going from the pure signal b- 
jet charge spectrum to that of the 7:1 mixture of signal and background, the mean b-jet charge decreased 
by 14%, while a replacement of the symmetric background by the asymmetric one with an asymmetry 
1/4 of the signal one, leads to only 3% change of the charge at the 7:1 S:B ratio. 

The direct reconstruction of the top quark charge can be done relying on the obtained value of <2 com b 
(see above). Using the Standard Model value of the b quark charge (2b = — 1/3) and the mean re- 
constructed value of the b-jet charge (<2combX the b-jet charge calibration coefficient Q, = Gb/Gcomb is 
3.54 ±0.16 and the top quark charge then reads: 

Gt = Q(£ + ) + x C b , Qi = Q(£- ) + fig x C b (4) 

where Q(^ ± ) = ±1 is the lepton charge and Q^J t is as above. 

The reconstructed top quark charge is shown in Figure 3 (right) for the sample of 1 fb -1 . The 
absolute value of top quark charge obtained by combining Q t and Q- t for the above mentioned sample 
is Q™ mb = 0.67 ±0.06 (stat) ±0.08 (syst). The statistical error assumes that the relative error of Cb is 
the same as that of <2 com b- The systematic error of Q^ omb can be studied comprehensively only by using 
experimental data 2 . In this case the main source of the systematic error is the weighting procedure that 
influences the coefficient Cb, that should be determined independently on the investigated b-jet charge, 
as well as the mean b-jet charge. In our case only the systematics stemming from determination of the 
mean b-jet charge were taken into account. 

3.5.2 Semileptonic b-decay approach results 

The charge of the non-isolated lepton found within the b-jet provides discrimination between the Stan- 
dard Model and the exotic hypotheses on a statistical basis. Figure 4 shows the number of b-jets, which 
have been paired with positive (left) and negative (right) high pj lepton and which contain inside a non- 
isolated lepton, as a function of the charge (Ononis) of the contained non-isolated lepton. The mean values 

-The reconstructed b-jet charge or coefficient Cb for an experimental sample, e.g. dijet bb data, should be compared with 
the corresponding Monte Carlo one to look for a possible difference in the b-jet track topology between Monte Carlo and real 
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of the non-isolated lepton charge obtained from these figures are: 

(+) _ N(n-N(n _ 032±005 0 n _ ^ + )-^-) _ 030±005 

Ononis " N{£+) +N{n ~ 0.32±0.05, Q nonls - N(£+) +N{£ _ ) - 0.30±0.05, 

where Gionis (Qnoiis) * s tne mean charge of the non-isolated leptons in the b-jets paired with the positive 
(negative) high pj lepton. N{1~) (N(£ + )) is the number of b-jets with a negative (positive) charged 
lepton. The quantity Q(£) x G^nis' wnere Q{£) is the charge of the lepton paired with the b-jet containing 
a non-isolated lepton, can be used to combine both histograms. The obtained mean combined charge in 
the Standard Model is 2n™™ b ' ) = —0.31 ±0.04, showing a potential to distinguish between the Standard 
model and exotic hypothesis even with 1 fb _1 of data, as in the case of the exotic scenario the opposite 
value of 2ionis b ' * s expected. 




\ -1.5 -1 -0.5 0 0.5 1 1.5 2 °-2 -1.5 -1 -0.5 0 0.5 1 1.5 2 



Charge Charge 

Figure 4: Number of b-jets, containing a non-isolated lepton inside, associated with positive (left) and 
negative (right) high pj lepton vs the charge of the non-isolated lepton. The contribution of different 
sources of leptons are marked by different shading styles. 



3.6 Systematic uncertainties 

The systematic studies have been done following the prescription described in Section 6 of Chapter 
1. The resulting systematic errors are summarised in Table 5. The systematic uncertainty caused by 
the top quark mass was estimated from the absolute difference between the <2comb reconstructed at the 
nominal value (175 GeV) and m t =160 and 190 GeV and rescaling to an effective 2 GeV uncertainty. The 
systematic uncertainty connected with the signal to background ratio was found assuming that this ratio 
is known with 30% uncertainty. The background asymmetry systematics was estimated assuming that 
the background charge spectrum asymmetry is at a level of 10% of the signal one. 

The systematic uncertainties due to Monte Carlo modeling of tt signal were studied by comparing 
samples with different fragmentation parameters: an AcerMC/Pythia sample and the standard signal 
MC@NLO/Herwig one, giving values of 18 ± 13% and 16 ± 13% for the weighting and semileptonic 
approaches, respectively. The large uncertainties are due to a limited Monte Carlo statistics, so these 
values cannot be considered as a reliable estimate of the corresponding systematic uncertainties and are 
therefore not included in Table 5. 

Pileup background affects the weighting technique procedure, as tracks from additional minimum- 
bias interactions get included in the jet charges. The associated systematic uncertainty was evaluated by 
comparing the standard sample to a dedicated tt sample including pileup, leading to a shift of 20±18%. 
Since the Monte Carlo statistical error is large, and it is expected that any pileup effect can be minimised 
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(at least at moderate LHC luminosities) by applying track impact parameter cuts to eliminate tracks from 
pileup vertices, this value is also not included Table 5. 

Table 5: The systematic uncertainties (%) of the mean reconstructed charge, <2comb, the weighting tech- 
nique and b-decay approaches. 



Source 


Weighting (%) 


b-decay (%) 


jet scale 


0.7 


0.3 


b-jet scale 


1.9 


6 


Am t 


1.3 


7 


PDF 


0.6 




ISR 


2.8 


15 


FSR 


7.8 


8 


Pile-up 




1.8 


Background asymmetry 


1 




S/B ratio 


9 




total 


12.5 


19.3 



4 Polarization studies in tt semileptonic events 

The measurements of the W-boson and top quark polarizations in tt events provide a powerful test of 
the top quark production and decay mechanisms and are a sensitive probe of new physics. W-boson 
or top quark spin information can be inferred from the angular distributions of the daughter particles 
in the W-boson or top quark rest frame, respectively. The W-boson can be produced with right, left or 
longitudinal polarizations, with corresponding partial widths Tr, Tl, To that depend on new anomalous 
couplings [61] (Vr, Vl, gL and #r) which can appear at the Wtb vertex (see Eq. 1). 

4.1 W-boson polarization and tt spin correlation measurements 

The probability for the three helicity states of W-boson produced in top quark decay, Fq (longitudinal), 
Fl (left-handed) and Fr (right-handed), can be extracted from the *P angular distribution [61] : 

1 dN 3 [ /sin»P\ 2 /l-cos»P\ 2 /l+cos»P\ 

where *P is the angle between the W-boson direction in the top quark rest frame and the charged lepton 
direction in the W-boson rest frame obtained by a boost along the W-boson flying direction in the top 
quark rest frame. The correlation between the parameter couples {FqJ\), (Fo,Fr) and (Fl,Fr) are -0.9, 
-0.8 and 0.4, respectively. 

Although, in the Standard Model, top quarks are produced unpolarised in tt events, their spins are 
correlated [62]. The production asymmetry A in these events, measures the spin correlation and is defined 
as 

A = <r(¥t) + °(tih) ~ °(¥l) ~ g (¥t) (6) 
<t(* t F t ) + aitJi) + ff^fj + o(t L fi) ' 
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where o(ty^ty±) denotes the production cross-section of a top quark pair with spins up or down with 
respect to a selected quantisation axis. It can be extracted from the 6\ and 62 angular distributions [62] : 

1 d 2 N 1 

TtI El 7- = T(l-A|aia2|cos0 lC os0 2 ), (7) 

./V<icos0i(icos02 4 

where Q\ (62) is the angle between the t (t) direction, measured in the tt rest frame, and the direction 
of the t (t) decay product in the t (t) rest frame obtained by a boost along t (t) direction in the tt rest 
frame, a, is the spin analysing power of the top quark decay product i, which ranges between — 1 and 1 
and measures the degree to which its direction is correlated with the spin of the parent top quark. The 
parameter Ad defined in [20], used to measure the production asymmetry in another basis, is extracted 
from the <I> angular distribution [62] : 

Tf j dN *> = ^(l-A D |«i«z|cos4>), (8) 
Ndcos® 2 

where <I> is the angle between the direction of flight of the two spin analysers, defined in the t and t rest 
frames respectively. 

Table 6: Standard Model values of W-boson polarization parameters(Fo, Fl, Fr) at the next-to-leading 
order and tt spin correlation parameters (A, Ad) at leading order for a top quark mass of 175 GeV. For A 
and Ad the asterisk superscript means that m x \ < 550 GeV is applied. 



Fo 


F h 


Fr 


A* 


A D * 


0.695 


0.304 


0.001 


0.422 


-0.290 



The Standard Model predictions for the W-boson polarization (Fo, Fl, Fr), at next-to-leading or- 
der, and tt spin correlation (A, Ad), at leading order, are given in Table 6. The ATLAS sensitivity to 
these observables has been evaluated with an ATLFAST simulation [20] and a full simulation of a per- 
fect detector [63]. A precision of 1% to 5% should be achievable with 10 fb _1 of data, dominated by 
systematic uncertainties (in particular from the b-jet energy scale). This measurement requires a com- 
plete reconstruction of the tt system and a reliable Monte Carlo description to correct the distortion 
induced by trigger, cuts and reconstruction. In this section the robustness of the analysis with a realis- 
tic detector simulation including triggering is assessed. Only the semileptonic topology of the tt events 
(tt — > WWbb — > £vjj2bb with I = e,/J.) is used as signal (an analysis using tt dilepton events as signal 
can improve the results of this study, especially for the tt spin correlation measurements). In this case the 
most powerful spin analysers of the top quark are the charged lepton (0£i = 1) and the least energetic non 
b-jet in the top quark rest frame (Oj et = 0.51) [64], which are chosen afterwards for the spin correlation 
measurement. 

4.1.1 Event simulation and selection 

The largest statistics Monte Carlo sample was generated with MC@NLO which does not implement tt 
spin corelations, so it can only be used to study W-boson polarisation. Spin correlations were studied 
using the smaller AcerMC sample. In both cases, non-semileptonic tt, semileptonic tt which decay to T, 
W-boson+jets and single top quark events were considered as background. Semileptonic signal events 
are characterised by one (and only one) isolated lepton, at least 4 jets, of which at least 2 jets are tagged 
as b-jets, and missing transverse energy. Following the common criteria for the semileptonic selection, 
all kinematic cuts are summarised in Table 7 together with the corresponding signal efficiencies. 

After kinematic cuts are applied, the event is fully reconstructed, as described below. The angles 
d\ , 62 and are computed and the polarization parameters are extracted. 
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Table 7: Selection cuts used and corresponding efficiencies for semileptonic tt MC@NLO events and tt 
AcerMC events. 



Variables 


Cuts 


Efficiency (%) 






MC@NLO 


AcerMC 


Lepton 


exactly 1 identified 


57.8 


58.7 


Jets 


at least 4 selected jets 


59.3 


62.1 


b-tagging 


at least 2 are tagged as b 


32.0 


31.0 


Missing energy 


p™ ss > 20 GeV 


92.0 


92.2 


Cumulative efficiency 


9.4 


9.8 



4.1.2 W-boson and top quark reconstruction 

The energies of all jets are calibrated according to the comparison with the energies of corresponding 
partem level quarks before selection and reconstruction. Then, in the event reconstruction, the light jet 
pair with invariant mass mjj closest to the known W-boson mass, raw, is selected to reconstruct the W- 
boson which decays hadronically. This W-boson is then combined with one of the b-jets to reconstruct 
the top quark. As there are several possible combinations, the one which gives the mass closest to the 
top quark mass ra t is assumed to be the correct one. The b-jet which is closest to the lepton in AR among 
the remaining b-jets, is reserved for reconstructing the other top quark whose daughter W-boson decays 
leptonically. To reconstruct the W-boson which decays leptonically, the neutrino pj is taken as the 
missing transverse energy. Its longitudinal component p z is determined by constraining m\ v to raw [63]. 
When two solutions for p z are found, the one giving closer to m t is kept. 

Quality cuts |rajjb — m t \ < 35 GeV, |ra^ V b — m t\ < 35 GeV and |mjj — mw| < 20 GeV are applied to 
reject badly reconstructed events. At this stage, 2.8% of the signal events are kept for MC@NLO events 
and 2.7% for AcerMC events, corresponding to 7000 signal events for 1 fb _1 of data (Table 8). After this 
event selection, the main background comes from the tt — > % + X events, where the tau decays to electron 
or muon. The number of events from W-boson+jets and single top quark channels is less than 3% of the 
selected number of signal events, so we neglect them in the following sections. Due to this cancellation of 
the background, the S/B ratio can be affected as much as 20%, which is taken as a systematic uncertainty 
(see Table 10). 

After event reconstruction, the angle *F (W-boson polarization) as well as the angles Q\ , 62 and <J> 
(tt spin correlations) are computed using the prescription descibed at the beginning of Section 4. The 
measured distributions of cos *J/, cos d\ x cos 82 and cos <I> are distorted, compared with their distributions 
at parton level. The detector resolution results in much smaller smearing effect on the final particles than 
that coming from particle radiation, quark fragmentation-hadronization and final event reconstruction. 
That is to say, the latter effects dominate the resolution of the reconstructed objects from top quark de- 
cay [65]. A correction function, taken from the ATLFAST simulation, is used to recover the distributions 
at parton level. With ATLFAST data, the correction function is obtained from the ratio between the two 
normalized distributions of the cos*? (i.e. after reconstruction of the signal and main background events 
and at parton level of the pure signal). To correct for the distortion, a weight derived from this function 
is applied on all the reconstructed cos*}/ of full simulation, on an event by event basis, allowing to re- 
cover, as much as possible, the shape of the distribution of the cos *P of the pure signal at parton level. A 
detailed description of the method can be found in [20]. 
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Table 8: The number of signal (MC@NLO) and the most important background events before (left) and 
after (right) selection for 1 fb~ 1 . * : in the single top quark decay W^e//i/r + v mode 





Events for 


Selected events 




1 fb-^xlO 3 ) 


full simulation 


Signal (tt semileptonic) 


250 


7000 


tt^T + X 


130 


710 


W(-»Zv)+jets 


800 


[10,55] 


Single t (Wt channel) 


25* 


90 


Single t (t channel) 


80* 


55 



4.1.3 Impact of the trigger on the analysis 

Semileptonic tt events are characterised by a single isolated lepton, which can be used to trigger the 
events with high efficiency. Applying the trigger selection to events passing the standard selection cuts, 
15% of well reconstructed events are lost. The measurement results with and without trigger applied on 
the data, while keeping all other aspects of the measurement unchanged, were compared. The effect of 
the trigger on the measurements of the W-boson polarization is almost zero But the effect on the tt spin 
correlations is not negligible, and is taken as a systematic error, shown in Table 10. 

4.1.4 Measurement of the W-boson polarization 

Figure 5 shows the correction function (left) and the reconstructed cos*!' distribution (right) after applying 
the correction function. This distribution is fitted to Eq. (5) varying Fq, Fl and Fr, but constrained by 
F) + Fl + Fr = 1. The fit is restricted to the region —0.9 < cos*!' < 0.8, which is the most extended 
region where the correction is varying slowly. The results are shown in Table 9. 




0.5 - 



-0 0.2 0.4 0.6 0.8 1 -0.8 -0.6 -0.4 -0.2 -0 0.2 0.4 0.6 0.8 



Figure 5: Left: Correction function taken from ATLFAST simulation fitted with a third order polynomial 
function. Right: Normalised reconstructed and corrected distribution of cos x ¥, the full line corresponds 
to the fit to Eq. (5). The sample has an integrated luminosity of 730 pb _1 . 

Two complementary methods to extract the W-boson helicity fractions, using the observed angular 
distribution between the charged lepton direction in the W-boson rest frame and the W-boson direction in 
the top quark rest frame, are currently under development at ATLAS inspired on Tevatron methods [5,66]. 
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4.1.5 Measurement of tt spin correlation 

At partem level, before any phase space cut, the two estimators C = — 9 x cos d\ cos 62 and D = — 3 x 
cosO are unbiased [20]. Figure 6 shows the reconstructed distributions of —9 x cos d\ cos 02/0.51 and 
—3 x cos<I>/0.51 after collection. It should be stressed that, in the evaluation of the tt spin conelation 
parameters, the theoretical value for the spin analyzing power for signal events (0.51) was assumed for 
both cases. The means of the distributions are unbiased estimators of A and Ad, provided corrections for 




"-15 -10 -5 0 5 10 15 -4 -2 0 2 4 

-9*cose l *cose 2 /0.51 -3*cos*/0.51 



Figure 6: Left: Reconstructed and corrected distribution of —9 x cos 61 cos 02/0.51. Right: Recon- 
structed and corrected distribution of — 3 x cos<I>/0.51. The integrated luminosity of the sample is 220 
pb~ 1 in each case. 

physics effects and detector effects are fully carried out. 

Due to the fact that the ATLFAST was used to extract the correction function, an additional system- 
atic uncertainty of 0.25 was derived for the A spin correlation parameter. This uncertainty reflects the 
different parametrizations of the Monte Carlo simulations. Preliminary studies suggest that this shift can 
be removed by using high statistics full simulation samples to derive the correction functions and for this 
reason this uncertainty was not included in Table 10. 

Table 9: W-boson polarization and top quark spin correlation parameters extracted after triggering. The 
indicated errors are statistical and systematic, respectively. 



W-boson polarization 


F L 

0.29 ±0.02 ±0.03 


F 0 

0.70 ±0.04 ±0.02 


0.01 ±0.02 ±0.02 


tt spin correlation 


A 

0.67 ±0.17±0.18 ±0.25 


A D 

-0.40 ±0.11 ±0.09 





4.1.6 Systematic uncertainties 

The systematic uncertainties were estimated with ATLFAST simulation for the factorisation scale, struc- 
ture function, ISR, FSR, b-fragmentation, top quark mass, hadronization scheme and pile-up effects, and 
with full simulation for the b-tagging efficiency, b-jet energy scale, light jet energy scale and signal to 
background ratio (S/B scale) as listed in Table 10, for the measurement of the W-boson helicity fractions 
and top quark pair spin correlations respectively. 

With data, tt events can be reconstructed without requiring b-jet tagging on the side of top quark 
whose daughter W-boson decays leptonically. This can provide a pure sample of b-jets which can be 
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Table 10: Systematics for W-boson polarization and top quark spin correlation measurement. 



Source of uncertainty 


Fl 


— ^2 — 


Fr 


— - — 


Ad 


Factorisation 


0.000 


0.001 


0.001 


0.029 


0.006 


Structure function 


0.003 


0.003 


0.004 


0.033 


0.012 




0.001 


0.002 


0.001 


0.002 


0.001 


FSR 


0.009 


0.007 


0.002 


0.023 


0.016 


b-fragmentation 


0.001 


0.002 


0.001 


0.031 


0.018 


Hadronization scheme 


0.010 


0.016 


0.006 


0.006 


0.008 


Pile-up (2.3 events) 


0.005 


0.002 


0.006 


0.001 


0.005 


Input top quark mass (2 GeV) 


0.015 


0.011 


0.004 


0.028 


0.013 


b-tagging efficiency (5%) 


0.007 


0.002 


0.005 


0.027 


0.07 


b-jet energy scale (5%) 


0.02 


0.002 


0.02 


0.07 


0.015 


light-jet energy scale (5%) 








0.11 


0.017 


S/B scale (20%) 


0.004 


0.002 


0.001 


0.000 


0.004 


Trigger 








0.10 


0.03 


TOTAL 


0.03 


0.02 


0.02 


0.18 


0.09 



used to check the uncertainty on the b-tagging efficiency [67]. The b-jet energy miscalibration can also 
be obtained from other control samples, such as Z+b-jet events [68]. 

4.2 Anomalous couplings at the Wtb vertex 

The W-boson polarisation is sensitive to new anomalous couplings (Vl, Vr, gL and #r) associated with 
the Wtb vertex. Although the W-boson helicity fractions (Fq, Fl and Fr) depend on these couplings, the 
helicity ratios Prl = Tr l/Fo = Fr l/F), were found to be more sensitive to Vl, Vr, gL and gR. The 
Pr and Pl observables are independent quantities and take the LO values Pr = 5.1 x 10~ 4 , Pl = 0.423 
in the Standard Model. General expressions for Prl in terms of the new anomalous couplings can 
be found in Ref. [19]. As for the helicity fractions, the measurement of helicity ratios sets bounds on 
Vr, gL and gR. A third and simpler method to extract information about the Wtb vertex is through 
angular asymmetries involving the angle of the charged lepton in the W-boson rest frame and the W- 
boson direction in the top quark rest frame (introduced in the previous section). For any fixed z in the 
interval [— 1, +1], one can define asymmetries as the difference between the number of events above and 
below z, normalised to the total number of events. The most obvious choice is z = 0, giving the forward- 
backward asymmetry Afb [18,69]. The forward-backward asymmetry is related to the W-boson helicity 
fractions by Afb = f [Fr — Fl]. Other convenient choices are z = T(2 2 / 3 — 1). Defining /3 = 2 1 / 3 — 1, 
we have A+ = 3/3 [F 0 + (1 + J3)F R ] and A_ = -3/3 [F 0 + (1 + /3)F L ]. Thus, A+ (A_) only depend on F 0 
and Fr (Fl). The LO Standard Model values of these asymmetries are Afb = —0.223, A + = 0.548, 
A_ = —0.840. They are sensitive to anomalous Wtb interactions, and their measurement allows to probe 
this vertex without the need of a fit to the distribution of the angle between the charged lepton direction 
in the W-boson rest frame and the W-boson direction in the top quark rest frame. In the present analysis 
the ATLAS sensitivity to the Pl, Pr, Afb, A + and A_ observables is studied. 

4.2.1 Event selection 

In this section an alternative two-level likelihood analysis is explored for the semileptonic channel, i.e. 
tt — ► W + bW~b where one of the W-bosons decays hadronicaly and the other one decays in the leptonic 
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channel W —> i\t (with £ = e,n). Any other process constitutes a background to this signal. In particular, 
it should be noticed that the fully hadronic, dileptonic and semileptonic (with one of the W-bosons 
decaying into TV) tt channels are considered backgrounds to the present analysis. Additionally, the 
following Standard Model processes were considered as background: single top production, W+jets, 
Wbb+jets, Wcc+jets, Z -> e+e", Z -> jU+/i , Z -> T+T~, WW, ZZ and WZ. The likelihood analysis 
is based on the construction of a discriminant variable which uses distributions of some kinematical 
properties of the events. In the first analysis level (called the pre-selection), the common selection criteria 
for the semileptonic topology, with the exception of the b-tagging requirement on jets, was applied to the 
event. The full event reconstruction was performed using a x 2 , defined by 

9 _ {miv\,-m t ) 2 (m jb j c j d -m t ) 2 (m ev -m w ) 2 (m jcjd -m w ) 2 

X — 9~r ?~r 9~r ?> 

where m t = 175 GeV, m w = 80.4 GeV, a t = 14 GeV and a w = 10 GeV are the expected top quark 
and W-boson mass resolutions 3 , I represents the selected electron or muon, m,£ V is the invariant mass 
of the electron (muon) and the neutrino, and y a .b,c,d corresponds to all the possible combinations among 
the four jets with highest pj (with m^ V j a , «ij b j c j d and my d being the corresponding invariant masses). 
The neutrino was reconstructed using the missing transverse energy and allowing the pi to vary in the 
range [—500, +500] GeV. The solution corresponding to the minimum % 2 was chosen. The jets used 
to reconstruct the hadronic W-boson will be labelled "non-b" jets and the remaining two are labelled 
"b-jets". It should be stressed that no b-tagging information was used so far. The pre-selection was 
completed by requiring x 2 < 16. In the second level (the final selection), signal and background-like 
probabilities were constructed for each event (^ Slgnal and £?f ack \ respectively) using probability density 
functions (p.d.f.) built from relevant physical variables: the cosine of the angle between the leptonic top 
quark and the leptonic "b-jet", the transverse momentum of the hadronic W, the hadronic and leptonic 
top quark masses, the transverse momentum of the two "b-jets", the transverse momentum of the lepton 
and the ^/x 2 distribution. It should be stressed that the objective is to test the sensitivity for new physics 
exclusion, under the hypothesis that the Standard Model holds, and the simulation was done assuming 
no anomalous couplings. Signal (Jz?s = H" =l ^-' 8naI ) and background (Jz?g = Tl" =l £Pf acL ) likelihoods 
(with n = 8, the number of p.d.f.) are used to define a discriminant variable Lr = log 10 (j£?s / ■ The 
distribution of this variable is shown in Figure 7(a). The final event selection is done by applying the 
cut Lr > 0. 1 on the discriminant variable. The number of signal and background events (normalised to 
L = 1 fb _1 ) after the pre-selection and final selection are shown in Table 11. After the final selection 
(including the trigger), the dominant backgrounds are W+jets and semileptonic tt with taus in the final 
state (corresponding to 49%, and 29% of the total background, respectively, as shown in Table 12). 

The effect of the single lepton trigger on the event selection was studied. The results are summarised 
in Table 1 1 . In what follows, only events passing the trigger are considered. 

Once b-tagging is well understood, additional information can be used. In this case only jets with 
a positive b-tagging weight were considered as "b-jet" candidates for the x 2 minimisation method. In 
addition, the b-tagging weights of these jets were considered as p.d.f.s for the discriminant variable 
evaluation. In this case, the number of selected signal and background events for L = 1 fb _1 is expected 
to be (6.6±0.1) x 10 3 and (0.9±0.1) x 10 3 , respectively. The tt background is expected to be dominant 
(72% of the total background, mainly due to the semileptonic channel with taus in the final state) and the 
W+jets and single top processes correspond to 15% and 13% of the total background, respectively. The 
discriminant variables corresponding to the analysis with and without b-tagging are shown in Figure 7. 

3 These resolutions are taken from the top quark mass measurement analyses [70]. It should be noticed that a t and o\y can 
be interpreted as weights for each term of the x 2 definition. By changing their values by a factor ~ 2, the obtained observables 
(Pl, PrA+ and A_) are the same within the statistical error. 
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Table 11: Number of signal tt £vbbqq' and background events (and corresponding statistical error), 
normalised to L = 1 fb -1 , after the pre-selection and final selection for the analysis without b-tagging. 
The effect of the trigger on the event selection is also shown. 







e + jl sample 


e sample 


jX sample 




presel. 


(15.8 ±0.4) x 10 3 


(7.2 ±0.3) x 10 3 


(8.6±0.3) x 10 3 


Total background 


final sel. 


(5.2 ±0.2) x 10 3 


(2.5 ±0.2) x 10 3 


(2.8 ±0.2) x 10 3 




trigger 


(4.0 ±0.2) x 10 3 


(1.8±0.2) x 10 3 


(2.1 ±0.2) x 10 3 




presel. 


(27.4 ±0.2) x 10 3 


(12.0±0.1) x 10 3 


(15.5 ±0.1) x 10 3 


Signal 


final sel. 


(15.2±0.1) x 10 3 


(6.5 ±0.1) x 10 3 


(8.8±0.1) x 10 3 




trigger 


(12.6±0.1) x 10 3 


(5.8±0.1) x 10 3 


(6.9±0.1) x 10 3 



Table 12: Background composition and corresponding statistical error, normalised to L = 1 fb after 
the final selection, including the effect of the trigger, for the analysis without b-tagging. 



e + ji sample e sample jit sample 



W±jets, Wbb±jets, Wcc±jets 


(19.6± 1.9) x 


10 2 


(8.8 ±1.4) > 


<10 2 


(10.3 ±1.4) > 


<10 2 


Z±jets 


(1.6±0.4) x 


1() 2 


(1.2 ±0.4) > 


< 10 2 


(0.5 ±0.3) > 


< 10 2 


WZ, ZZ, WW 


(0.4 ±0.2) x 


10 2 


(0.3±0.1) > 


< 10 2 


(0.2±0.1) > 


< 10 2 


tt (except signal) 


(13.1 ±0.6) x 


10 2 


(5.4±0.3) > 


< 10 2 


(7.9 ±0.4) > 


< 10 2 


single top 


(5.3 ±0.3) x 


10 2 


(2.5 ±0.2) > 


<10 2 


(2.7 ±0.2) > 


<10 2 



4.2.2 Measurement of the angular distribution and asymmetries 

The experimentally observed angular distribution, which includes the tf signal as well as the Standard 
Model background, is affected by detector resolution, tt reconstruction and selection criteria [71]. In 
order to recover the Standard Model distribution, it is necessary to subtract the background and correct 
for the effects of the detector, event selection and reconstruction. For this purpose, two different sets 
of signal and background event samples were used: one "experimental" set, which simulates a possible 
experimental result, and one "reference" set, which is used to parametrise the effects mentioned, and cor- 
rect the previous sample. The procedure is as follows. After subtracting reference background samples, 
the full "experimental" distribution is multiplied by a correction function f c in order to recover the Stan- 
dard Model one. This correction function is determined by assuming that the charged lepton distribution 
corresponds to the Standard Model. In case that a deviation from Standard Model predictions (corre- 
sponding to anomalous couplings) is found, the correction function must be modified accordingly, and 
the expected distribution recalculated in an iterative process. These issues have been analysed in detail in 
Ref. [20], where it was shown that this process quickly converges. The correction function is calculated, 
for each bin of the angular distribution, by dividing the number of generated events by the number of 
selected events, using the reference sample. In order to avoid non-physical fluctuations due to the limited 
amount of Monte Carlo statistics, a smoothing procedure was applied to the obtained correction function. 
The value of f c is in the range [0.2,1.4]. Other methods of correcting the angular distribution are under 
investigation. 

The procedure of correcting for detector and reconstruction effects in the asymmetries is similar to 
that used with the full angular distribution, but using only two or three bins. This has the advantage 
that the asymmetry measurements are less sensitive to the extreme values of the angular distributions, 
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Figure 7: Discriminant variables for the Standard Model background (shaded region) and the tt signal 
(full line), normalised to L = 1 fb -1 corresponding to the (a) analysis without b-tagging and (b) analysis 
using the b-tagging weights of the "b-jets" selected by the % 2 minimisation method. 



where correction functions deviate from unity. Moreover, it should be noticed that the extreme bins of 
the angular distribution have a very significant impact on the measurement of the Pl and Pr helicity 
ratios. The values obtained from a fit to the corrected distribution, as well as the angular asymmetries 
Afb, A±, are collected in Table 13, with their statistical uncertainties. Although the statistical errors of the 
observables obtained for the analyses without and with b-tagging are similar, the use of this tool allows to 
improve the signal to background ratio, leading to smaller systematic uncertainties, as discussed below. 



Table 13: Expected values and corresponding statistical errors for the helicity ratios and angular asym- 
metries. The results for an integrated luminosity of 1 fb~ ! (analyses with and without b-tagging) are 
shown. 



PL 


Pr ^fb 




A_ 


e + Ll 0.402 ±0.050 


Analysis without b-tagging 
-0.008 ± 0.008 -0.220 ± 0.025 


0.560 ± 0.024 


-0.845 ± 0.012 


e + Li 0.453 ±0.048 


Analysis with b-tagging 
-0.004 ± 0.007 -0.229 ± 0.026 


0.542 ± 0.028 


-0.830 ± 0.014 



4.2.3 Systematic uncertainties 

The study of the systematic uncertainties considered possible errors from different sources: jet energy 
scale, luminosity, top quark mass, background level, ISR and FSR, Monte Carlo generator and pile-up. 
The jet calibration used in the present analyses is described in Ref. [70]. As for the reference analyses, 
full simulation Monte Carlo samples were used for the study of all the systematic sources of uncertainty. 
Only the simulated sample used as the "experimental" set (which fakes the data) was changed for each 
systematic source of uncertainty. The correction function and the Monte Carlo sample used to perform 
the background subtraction were kept unchanged. The impact on the measurements is summarised in 
Tables 14 and 15. As the background subtraction is based on Monte Carlo simulation, the background 
estimation required a luminosity value and therefore the corresponding systematic uncertainty was con- 
sidered. Once the cross-sections for known backgrounds are measured with data, a data-driven normali- 
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sation will be possible and the luminosity systematic error should be reduced. Moreover, with data it will 
be possible to compare the discriminant variable distributions for selected events obtained from data and 
Monte Carlo. This comparison will allow the correction of systematic uncertainties caused by inaccurate 
description of data by the Monte Carlo simulation. 

Table 14: Sources of systematic uncertainties in the determination of the helicity ratios and angular 
asymmetries (analysis without b-tagging). 



Source 


PL 


PR 




A+ 


A_ 


Jet energy scale 


0.02 


0.003 


0.004 


0.006 


0.002 


Luminosity 


0.02 


0.002 


0.006 


0.005 


0.001 


Top quark mass 


0.02 


0.002 


0.009 


0.006 


0.004 


Background 


0.01 


0.002 


0.005 


0.003 


0.002 


ISR+FSR 


0.13 


0.009 


0.044 


0.046 


0.011 


MC generator 


0.18 


0.013 


0.039 


0.042 


0.001 


Pile-up 


0.14 


0.004 


0.053 


0.039 


0.017 


Total 


0.27 


0.017 


0.080 


0.074 


0.021 



Table 15: Sources of systematic uncertainties in the determination of the helicity ratios and angular 
asymmetries (analysis with b-tagging). 



Source Pl Pr ^fb A+ A_ 

Jet energy scale 0.04 0.001 0.010 0.004 0.002 

Luminosity 0.01 0.000 0.006 0.005 0.001 

Top quark mass 0.03 0.003 0.013 0.008 0.006 

Background 0.01 0.000 0.003 0.002 0.004 

ISR+FSR 0.05 0.006 0.024 0.028 0.015 

MC generator 0.01 0.008 0.009 0.011 0.000 

Pile-up 0.15 0.006 0.012 0.041 0.022 

Total 0.16 0.012 0.033 0.052 0.027 



4.2.4 Constraints on the anomalous couplings 

Using the parametric dependence of the observables on Vr, and gR (and considering the correlations 
between them, which are shown in Table 16), constraints can be set on the anomalous couplings. The 
helicity ratios Prl and the asymmetries A± were used as input for the program TopFit [19]. The 
expected 68% CL allowed regions on the Wtb anomalous couplings for L = 1 fb _1 (analyses with and 
without b-tagging) are shown in Figure 8. In addition to the allowed regions of Figure 8, additional 
solutions can be found at #r ~ 0.8. Such solutions are due to a large cancellation between &{g-g) and 
&(gR 2 ) terms and can be excluded by the measurement at the LHC of the single top cross-section [72]. 
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Table 16: Correlation matrix for the A±, Pr l observables. 





A + 


A_ 


PL 


PR 




1 


0.16 


-0.73 


-0.14 


A_ 


0.16 


1 


-0.10 


0.55 


PL 


-0.73 


-0.10 


1 


0.42 


PR 


-0.14 


0.55 


0.42 


1 




Figure 8: Expected 68% CL allowed regions on the Wtb anomalous couplings for luminosities of 1 fb 1 
(with and without b-tagging), obtained from the Prl and A± observables using TopFit. 



5 Rare Top Quark Decays and FCNC 

This section discusses the study of rare top quark decays via FCNC (t —> qX,X = y,Z,g) using tt events 
produced at the LHC. These decays are strongly suppressed in the Standard Model at tree level due to 
the GIM mechanism. In the effective Lagrangian approach [73,74] the new top quark decay rates to the 
gauge bosons can be expressed in terms of the xf q , K? q , (\vf q \ 2 + \af q \ 2 ) and fc^ anomalous couplings to 
the g, y and Z bosons respectively, and A, the energy scale associated with the new physics. 

5.1 Event samples 

The signal event samples used in this analysis correspond to tt — >• blvqX, where X = y, Z — ► U, g and i = 
e,Li. The following common Standard Model samples were considered as background: fully hadronic, 
fully leptonic and semi-leptonic tt, all from MC@NLO; Wt, s- and t- channels of single top production, 
all from AcerMC; W+jets, Wbb+jets and Wcc+jets, all from ALPGEN; Z -> e+e~, Z -> and 
Z ->■ T+T-, all from PYTHIA; WW, ZZ and WZ, all from HERWIG. 

5.2 Event selection 

The tt final states corresponding to the different FCNC top quark decay modes lead to different topologies 
according to the number of jets, leptons and photons. There is however a common characteristic of all 
channels under study, i.e. in all of them one of the top quarks is assumed to decay through the dominant 
Standard Model decay mode t — > bW and the other is forced to decay via one of the FCNC modes t —* qZ, 
t — > qy or t — > qg. The QCD backgrounds at hadron colliders make the search for the signal via the fully 
hadronic channels (when the W and the Z bosons decay to quarks) difficult. For this reason only the 
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Table 17: Selection cuts applied to the FCNC analyses. For the t —> qg channel, £ v is> PT g and m qg 
represent the total visible energy, the transverse momentum of the jet associated with the gluon and the 
re constructed mass of the top quark with FCNC decay, respectively (see text for details). 



Channel tt -> bWqy 


tt -> bWqg 


tt -► bWqZ 


Pre-selection = U (p T > 25 GeV) 


= U Ot > 25 GeV) 


= 3^(p T >25,15,15GeV) 


>2j(p T >20 GeV) 


= 3j(p T > 40,20,20 GeV) 


>2j(p T > 30,20 GeV) 


= l7(p T >25 GeV) 


= 0y (p T > 15 GeV) 


= 0y Ox > 15 GeV) 


fa > 20 GeV 


in > 20 GeV 


/* T > 20 GeV 



p Ty > 75 GeV £ vis > 300 GeV 2 £ same flavour, 



PT g > 75 GeV oppos. charge 

mqg > 125 GeV 
mqg < 200 GeV 

e22i, mu20 or g55 e22i or mu20 e22i or mu20 



Table 18: The trigger efficiency, in percentage, for the background and signal events after all the other 
cuts of the FCNC analyses. 



t -> 


qy 


t -> 


qZ 


t -> 


qg 


Sig. 


Back. 


Sig. 


Back. 


Sig. 


Back. 


Trigger 99.6 


99.5 


99.2 


95.0 


83.2 


82.2 



leptonic decays of both W and Z to e and jX were taken into account. Only isolated muons, electrons 
and photons separated by AT? > 0.4 from other reconstructed objects, were considered. Specific pre- 
selection and selection cuts were applied for each FCNC channel, as outlined in Table 17. For the 
t — > qy channel, exactly one reconstructed lepton and one reconstructed photon where required in the 
events. Additionally, at least two jets were required. For the t — > qg channel, the events had to have 
exactly one reconstructed lepton, three jets and no reconstructed photon. Finally, for the t — ► qZ channel, 
only the events with at least two jets, exactly three reconstructed leptons and no reconstructed photons 
were accepted. Therefore, the selection criteria for these channels are orthogonal. The expected number 
of background events and signal efficiencies after the final selection are shown in Table 19. The effect of 
the trigger on the background and signal events, after all the other cuts, is shown in Table 18. It should be 
noticed that the events of the t — > qy (t — > qZ) channel can be triggered by the lepton or the photon (one of 
the three leptons), which results on higher trigger efficiencies, when compared with the t —>■ qg channel 
with just one lepton. For the t —> qy channel, the dominant backgrounds are tt, Z+jets and W+jets 
events, which correspond to 38%, 30% and 29% of the total background. The total background for the 
t — > qZ channel is mainly composed of ft and Z+jets events (59% and 28% of the total background, 
respectively), while for the t — >• qg it is mainly composed of W+jets and ft events (which correspond, 
respectively, to 64% and 25% of the total background). 

For all the channels, the top quark with Standard Model semileptonic decay (t —>■ b£v) cannot be 
directly reconstructed due to the presence of an undetected neutrino in the final state. The neutrino four- 
momentum was estimated with a method similar to the one used in Section 4.2, by finding the p v z value 
and the jet combination (and the lepton combination in the case of the qZ topology) which minimizes the 
following expression: 

2 _ (mf CNC -m t ) 2 (m w -mt) 2 (m 4v -m w ) 2 (m 44 -m z ) 2 
a t 2 a t 2 cr| 
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Table 19: The expected number of background events and signal efficiencies after the final selection level 
(the trigger is included) of the analyses for each FCNC channel. The corresponding statistical errors are 

also shown. T he expected background numbers are normalised to L = 1 fb~ 1 . 

e jj i 

tt -> bWqy: 

Total (4.4 ± 0.6) x 10 2 (2.2 ± 0.6) x 10 2 (6.5 ± 0.7) x 10 2 

Signal % 3.6±0.2 4.1 ±0.2 7.6±0.2 

tt-» bWqZ: 

Total (0.3 ± 0.6) x 10 2 (0. 1 ± 0.6) x 10 2 ( 1 .3 ± 0.6) x 10 2 

Signal % 1.4 ±0.1 2.5 ±0.1 7.6 ±0.2 

tt— > bWqg: 

Total (11.0±0.3)xl0 3 (8.3 ±0.2) x 10 3 (19.3 ±0.4) x 10 3 
Signal % 1.3±0.1 1.5±0.1 2.9±0.1 



where mf CNC , me aV j, mg aV and me b e c are, for each jet and lepton combination, the reconstructed mass of 
the top quark decaying via FCNC, the top quark decaying through the Standard Model, the W-boson from 
the top quark with Standard Model decay and the Z boson from the top quark FCNC decay, respectively. 
The last term of Eq. 10 was only used in the t — > qZ channel. The following values are used for the 
constraints: m t = 175 GeV, m w = 80.42 GeV, m z = 91.19 GeV, a t = 14 GeV, a w = 10 GeV and a z = 
3 GeV 4 . No b-tag information was used to reconstruct the event kinematics. The jet chosen to reconstruct 
the top quark with Standard Model decay is labeled as b quark. For the t — »• qy and the t — > qZ channels, 
the other jet, which was used to reconstruct the top quark with FCNC decay, is denoted by q quark. For 
the t — > qg channel, it is assumed that the jet created by the gluon is the most energetic from the two 
which reconstruct the top quark with FCNC decay and the other is created by the light quark. 

Following the selection cuts, a likelihood-based type of analysis was applied, as described in sec- 
tion 4.2. Due to the small statistics of the available full simulation samples, ATLFAST samples were 
used to obtain the probability density functions (p.d.f.s). The p.d.f.s of the t — »• qy channel were built 
based on the following variables: the mass of the top quark with FCNC decay (mf CNC ); the recon- 
structed mass of the photon and the b quark (mt, r ) and the transverse momentum of the leading photon 
(Pj). For the t — > qZ channel, the p.d.f.s were based on the following physical distributions: the mass of 
the top quark with FCNC decay (mf CNC ); the minimum invariant mass of the three possible combinations 
of two leptons (m™ n ); the reconstructed mass of the Z and the b quark (mbz); the reconstructed mass of 
the two quarks (m q b); the transverse momentum of the third lepton (pj) and the transverse momentum 
of the light quark (p^). The following variables were used to build the p.d.f.s of the t —> qg channel: the 
mass of the top quark with FCNC decay (mf CNC ); the mass of the top quark with Standard Model decay 
(me aV j); the reconstructed mass of the light and the b quark (m q b); the transverse momentum of the b 
quark (pj); the transverse momentum of the light quark (pj) and the angle between the lepton and the 
gluon (OCig). The distributions of the discriminant variables are presented in Figure 9. It can be seen that 
ATLFAST describes the fully simulated distributions fairly well, when there are sufficient statistics to 
tell. 

4 These resolutions are taken from the top quark mass measurement analyses [70]. The effect of changing their values was 
considered as one of the systematic source of uncertainties. Its impact on the observables is below the statistical error. 
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— Signal ATLFAST + Signal FullSim Background ATLFAST A Background FullSim 




Figure 9: Distributions of the normalised discriminant variables for the expected background and signal 
after the final selection level of the t — > qy channel when the isolated lepton is identified as (a) an electron, 
(b) a muon and (c) electron or muon; of the t — > qZ channel when the 3 isolated leptons are identified as 
(d) electrons, (e) muons and (f) electrons or muons and of the t — ► qg channel, when the isolated lepton 
is identified as (g) an electron, (h) a muon and (i) electron or muon. 
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Table 20: The expected 95% confidence level limits on the FCNC top quark decay branching ratio, in the 
absence of signal, are shown for a luminosity of L = 1 fb _1 . The central values are represented together 
with the la bands, which in clude the contribution from the statistical and s ystematic uncertainties. 
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5.3 Results and systematic uncertainties 

In the absence of a FCNC top quark decay signal, expected limits at 95% CL were derived using the 
modified frequentist likelihood method [75] and the discriminant variables obtained with the full simu- 
lation samples. No cuts on the discriminant variables were used. Using the Standard Model tt production 
cross-section these limits were converted into limits on the branching ratios. The central values of these 
limits are shown in Table 20. The branching ratio sensitivity for each FCNC channel, assuming a sig- 
nal discovery with a 5a significance, is on average 3.0 times larger than the central values presented in 
Table 20. 

Several sources of systematic uncertainties were studied following the common criteria used for this 
note and the results are shown in Table 21. The jet energy scale (the jet calibration used in the present 
analyses is described in Ref. [70]), the luminosity, the influence of the cross-section values, ISR/FSR, 
Monte Carlo generator and pile-up effects were studied. For the top quark mass, the full simulation 
samples were analysed using the values of 170 and 180 GeV (i.e. p.d.f.s, Eq. 10 and limit computation). 
The systematic error was taken from a linear fit of the values found, for a top quark mass error of 
±2 GeV. To study the effect of the mass resolutions in Eq. 10, and since no cut is applied to the X 2 
distribution, the ratio a t /aw was changed by a factor 2. The total systematic uncertainties, computed as 
the quadratic sum of these individual contributions, are also shown in Table 21. The analysis stability was 
also cross-checked by varying the kinematic cuts by about 10%, and the maximum relative change on the 
expected 95% CL limits was 3%, 9% and 5% for the t — > qy, t — > qZ and t — > qg channels, respectively. 
Table 20 shows the obtained central values for the BR limits. The ±la contributions from statistical and 
systematic uncertainties (added in quadrature) are also shown. The contribution from the luminosity and 
the absolute value of background level may be reduced with data, by normalising to measured processes. 

Figure 10 shows the ATLAS 95% CL expected sensitivity for the first ft)- 1 in the absence of signal, 
for the t — ► qy and t — »• qZ channels taking into account the contributions from both the statistical and 
systematic uncertainties. 
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Figure 10: The present 95% CL observed limits on the BR(t -> qy) vs. BR(t -> qZ) plane are shown 
as full lines for the LEP, ZEUS and CDF collaborations. The expected sensitivity at ZEUS, CDF and 
ATLAS (together with the statistic plus systematic la band) is also represented by the dotted and dashed 



Table 21: Maximum changes (with respect to the central values of Table 20) of the expected 95% CL 
limits for each FCNC top quark decay branching ratio for different systematic error sources. 
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6 tt resonances 

New resonances or gauge bosons strongly coupled to the top quark could manifest themselves in the tt 
invariant mass distribution. Due to the large variety of models and their parameters, studies have been 
done in a model-independent way searching for a "generic" narrow resonance decaying into tt (semilep- 
tonic channel) [51,76]. The ATLAS discovery potential is assessed using the latest available simulations 
and detector description, performing the tt event reconstruction along the same lines as done for the 
high precision measurement of the top quark mass in the semileptonic decay channel [54]. Alternative 
methods are currently under investigation to study the production of high mass resonances (m>2 TeV), 
where the decay products of the top quark are close together, so requiring a modified selection. 

6.1 Event generation and selection 

In this study, tt resonances were produced with Pythia. The Z' — ► tt channel without interference in 
the Z' production has been chosen [77]. Only the semileptonic decay (electrons and muons) of tt pairs 
is considered. Five samples of Z' resonances with mass 700 GeV (the lower limit from CDF and D0 
measurements [78,79]), 1000 GeV, 1500 GeV, 2000 GeV and finally 3000 GeV have been produced. 

The common tt semileptonic selection criteria were used to accept signal events. Once these criteria 
are applied, the main source of physical background to be considered in the search for tt resonances 
comes from Standard Model tt events. The other sources of background, dominated by W+jets events, 
are negligible [54]. 

6.2 Data characteristics 

6.2.1 tt reconstruction 

Several ways to reconstruct the W-bosons and the top quarks have been investigated [54]. The simplest of 
them is used here. This method selects, among all jets from light quarks (u,d,c and s), the two jets which 
are closest in AR to build the hadronic W-boson. The hadronic top quark is reconstructed combining the 
nearest b-jet to the hadronic W-boson. For the leptonic side, the constraint on the W-boson mass is used 
to compute the longitudinal momentum of the neutrino, identifying the missing transverse energy as the 
neutrino transverse momentum. Among the 2 p z v solutions, the one providing the leptonic top quark 
mass closest to the mean value of the hadronic top quark mass is chosen. To reduce both the physical 
background and the combinatorial background (which dominates), cuts are applied on the hadronic W- 
boson mass spectrum, and on both top quark mass spectra. The resulting tt mass distribution, given in 
Figure 11, is used as a starting point for the discovery potential determination. 

This method does not need a well understood jet energy scale and it is also one of the most efficient. 
More sophisticated methods to select the top quarks, such as a kinematic fits of the tt-pairs (used by the 
D0 experiment [79]), or a matrix-element motivated method (used by CDF [78]), are not described here. 
The method used could be considered as a baseline for the tt resonance search. 

6.2.2 Event yield 

The reconstruction efficiency of Standard Model tt pairs corresponding to the reconstruction scheme and 
cuts described above is shown in Figure 13. The fact that the produced particles are closer together when 
the generated tt mass is higher explains the drop of efficiency observed as a function of the tt mass. Part 
of the efficiency can be recovered if another jet finding algorithm, better at resolving nearby jets (such as 
a cone jet algorithm with smaller cone radius), is used. 

Figure 14 gives the reconstruction efficiency of Z' resonances as a function of their mass. This 
reconstruction efficiency is of the same order as for the Standard Model tt pairs. Nevertheless, some 
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Figure 11: Standard Model tt mass spectrum. 
The black area represents the combinatorial back- 
ground where at least one of the selected jets or 
leptons does not match the corresponding 
parton. 



Figure 12: A 700 GeV Z' reconstructed mass 
spectrum. The black area represents the combi- 
natorial background where at least one of the se- 
lected jets or leptons does not match the corre- 
spondent parton. 




700 800 900 



Figure 13: Reconstruction efficiency of Standard Figure 14: Reconstruction efficiency of Z' — > tt 
Model tt pairs as a function of the tt mass. resonances as a function of the Z' mass. 
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Figure 15: Z' mass resolution as a function of the 
Z' mass. 




Mass [GeV] 



Figure 16: 5a discovery potential of a generic nar- 
row tt resonance as a function of the integrated lu- 
minosity. 



small differences arise due to the different production mechanism and spin. Thus, the final result is 
not completely model-independent. The purity (fraction of well reconstructed events among all selected 
events) in each Z' sample is of the order of 80-85%. 

6.3 Discovery potential 

6.3.1 Method and results 

The method used to extract the 5a discovery sensitivity consists of counting the number of Standard 
Model tt events in a sliding mass window over the invariant mass spectrum. The 5a sensitivity means 
that an effect is seen over the expected background with a deviation at least 5 times the background 
fluctuation in the mass window. The width of the window is twice the detector resolution for a given 
resonance mass. Then, the lowest cross section times branching ratio a x Br(Z' — > tt) is computed for 
the discovery of a resonance at a given mass. 

The produced resonances are expected to have a width smaller than the resolution, leading to a 
gaussian shape for the reconstructed invariant mass. The discovery potential is thus estimated here only 
for narrow tt resonances. 

This method, explained in detail in [76], requires as input the Z' mass resolution (Figure 15), the 
reconstruction efficiency of both Standard Model tt events and resonance events, and the purity of the 
final samples. 

The resulting sensitivity is shown on Figure 16. For example, a 700 GeV Z' resonance produced with 
a a x Br(Z' — > tt) of 11 pb should be discovered with a 5a significance after 1 fb _1 of data taking. The 
tt mass spectrum associated with such a case is shown in Figure 17. 

6.3.2 Measurement uncertainties 

Uncertainties on the sensitivity arise from: 

• the reconstruction efficiency for the Z' signal and tt background. The main contribution arises 
from the expected error on the b-tag efficiency, which is set to ±5%. 

• the background contribution (Standard Model tt). The main contribution comes from the tt cross 
section uncertainty +f?j %. 
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Figure 17: Expected tt invariant mass spectrum for the discovery threshold of a 700 GeV Z' resonance 
a x Br(Z' -> tt) = 1 1 .07 pb with 1 fb" 1 of data. 

Table 22: Sources of uncertainties considered on the 5a discovery potential computation. 



Source of uncertainty Error (%) Effect on the discovery potential (%) 



Reconstruction efficiency 16.6 


8.3 


Background contribution +47 


3.1 


tt mass resolution ± 1 (7 reS oiution 


2 to 11 


Luminosity 5 


2.5 


Jet energy scale 5 





• the tt mass resolution (Figure 15). The effect on the discovery potential increases with the reso- 
nance mass. 

• the uncertainty on the integrated luminosity. 

The 5 a discovery potential is given as a function of the tt resonance mass. The position of the 
resonance mass peak depends on the accuracy of the jet energy scale. This will be constrained by the 
W-boson and top quark events themselves. 

All these quantities have been varied within expected errors. The impact on the sensitivity is reported 
in Table 22. 



7 Summary and conclusions 

The ATLAS sensitivity to the measurement of several top quark properties was reviewed in this paper 
for an expected luminosity of 1 fb -1 at the LHC. The precision of several measurements was estimated, 
using the full simulation of the ATLAS detector. For the tests of physics beyond the Standard Model 
associated with the production of top quarks, the 95% CL limit (in the absence of a signal) was presented. 
In Table 23 a summary of the expected sensitivity on these observables is shown, for a luminosity of 
1 fb -1 . Several sources of systematic errors were considered using an approach common to all studies 
appearing in this note. 
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Table 23: Expected ATLAS sensitivity for the top quark properties, for a luminosity of 1 fb _1 . For the 
Standard Model measurements the sensitivity is given as the total error divided by the expected Standard 
Model value, for the searches the absolute value is presented. 



Observables 


Expected Precision 


Top quark charge (2/3 versus -4/3) 


> 5a 


Spin Correlations: 




A 


50% 


A D 


34% 


W-boson Polarisation: 




F 0 


5% 


F L 


12% 


Fr 


0.03 


Angular Asymmetries: 




^FB 




A+ 


11% 


A_ 


4% 


Anomalous Couplings: 




v R 


0.15 


8l 


0.07 


8r 


0.15 


Top quark FCNC decays (95% C.L.): 




Br(t^q7) 


io- 3 


Sr(t-qZ) 


io- 3 


Br(Wqg) 


10~ 2 


it Resonances (discovery): 




a x Br (m t t=700GeV) 


> llpb 



The sensitivity of the ATLAS experiment to the top quark charge measurement was evaluated. The 
analysis shows that using the weighting technique, already with 0.1 fb -1 it is possible to distinguish 
with a 5a significance, between the b-jet charges associated with leptons of opposite charges, which 
allows to distinguish the Standard Model from an exotic scenario. For the semileptonic b-decay method 
the required luminosity is ~ 1 fb~ 1 . The top quark charge itself was reconstructed relying on a Monte 
Carlo calibration of the b-jet charge. Although the reconstruction of the numeric value of the top quark 
charge using the weighting technique seems possible with ~ 1 fb -1 , it will be necessary to check the 
performance of the method with real data e.g. di-jet bb events, once available. A more realistic treatment 
of the background processes will be required for a full understanding of the top quark charge issues. 

A study of the W-boson polarisation fractions {Fq, Fl and Fr) and tt spin correlation parameters (A 
and Ad) has been performed in the semileptonic tt channel. Reconstructed angular distributions were 
used to set the ATLAS sensitivity to the measurement of the W-boson polarisations and top quark spin 
correlation parameters. 

The W-boson polarisation ratios (Pr and pO and the angular asymmetries (A + and A_) dependence 
on the anomalous couplings (Vr, §l and #r) was used to find the sensitivity to the Wtb anomalous 
couplings (for the analyses with and without b-tag). 

Top quark rare decays through FCNC processes (t — > qZ,qy,qg) were studied in this note using tt 
events produced at the LHC. Expected limits on the branching ratios were set at 95% CL in the absence 
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of signal. 

The discovery potential of the ATLAS experiment for narrow tt resonances decaying in the semilep- 
tonic channel, was studied as a function of the resonance mass. 
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1 ATLAS 5-physics programme 

The ATLAS B-physics programme covers many aspects of beauty flavour physics. First, by mea- 
suring production cross-sections of beauty and charm hadrons and of the heavy-flavour quarkonia, 
J/y and T, ATLAS will provide sensitive tests of QCD predictions of production in proton-proton 
collisions at the LHC. Secondly, ATLAS will study the properties of the entire family of B mesons 
(B®, B + , BP, B c and their charge-conjugate states) and B baryons, thereby broadening our knowledge 
of both the spectroscopic and dynamical aspects of B-physics. However, the main emphasis will be 
on precise measurements of weak B hadron decays. In the Standard Model, all flavour phenomena of 
weak hadronic decays are described in terms of quark masses and the four independent parameters 
in the Cabibbo-Kobayashi-Maskawa (CKM) matrix [1]. Enormous quantities of data collected in the 
past decade by the experiments BaBar, Belle, CDF and DO allowed very precise measurements of 
flavour and CP-violating phenomena. Whilst the analysis of the remaining data of these experiments 
may still push the boundaries, no evidence of physics beyond the Standard Model, nor any evidence 
for CP violation other than that originating from the CKM mechanism, has yet been found. At the 
LHC, thanks to the large beauty production cross-section and the high luminosity of the machine, the 
sensitivity of B decay measurements is expected to substantially improve. Whilst direct detection of 
new particles in ATLAS will be the main avenue to establish the presence of new physics, indirect 
constraints from B decays will provide complementary information. In particular, precise measure- 
ments and computations in B-physics are expected to play a key role in constraining the unknown 
parameters of any new physics model emerging from direct searches at the LHC. 

In ATLAS, the main B-physics measurements will be made with an instantaneous luminosity 
of around 10 33 cm~ 2 s _1 ; however, the B-physics potential begins during early data taking at low 
luminosity (10 31 cm~ 2 s _1 ). With an integrated luminosity of 10 pb _1 ATLAS will already be able to 
register about 1.3 • 10 5 events containing J /iff — > jJ. + jJ.~ selected by the low luminosity trigger menu 
[2]. Recorded events will contain J / y — > fx~ produced both directly in proton-proton interactions 
as well as indirectly from decays of B hadrons. With these statistics, beauty and quarkonia studies 
will play an important role in the early data-taking period. 

In this document we present studies for several periods of beauty measurements in ATLAS. First, 
there will be a period of integrated luminosity of order of 10 - 100 pb _1 when B-physics and heavy- 
flavour quarkonia signatures will serve in helping to understand detector properties and the muon 
trigger, as well as measuring production cross-sections. The physics analyses in this period will deal 
with prompt // y and T events, along with inclusive B hadron decays to muon pairs via // y . Further 
on, the exclusive decays of B+ J/yK + , B® ->■ J/yK°* and B° s ^J/y§ will be studied. 

During the next period, from about 200 pb _1 to 1 fb -1 , we expect to collect the same or higher 
statistics as are currently available at the Tevatron. During this period we will start to improve upon 
current measurements of B hadron properties and set new decay rate limits or possibly give evidence 
for rates above Standard Model predictions for rare decays (e.g. in the channel B° — >• 

In the most important period for ATLAS B-physics we expect to achieve about 10 - 30 fb -1 at 
an instantaneous luminosity of mostly around 10 33 cm~ 2 s -1 . It is expected that ATLAS can achieve 
this integrated luminosity in about three years. We are preparing to study a large variety of B-physics 
topics covering both the production and decay properties of B hadrons. In this document we give 
examples of performance studies for this period with polarization measurements of heavy-flavour 
quarkonia and of the baryon A b , by the oscillation phenomena of the B° -B° system, and the rare 
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decay measurement B° — > . We expect to achieve sensitivities allowing the confirmation of 

possible contributions of physics beyond the Standard Model. 

2 Trigger 

All B-physics studies reported in the current document include trigger reconstruction. The ATLAS 
trigger comprises three levels and the selection of B-physics events is initiated by a di-muon or a 
single-muon trigger at the first level trigger (LI). At 10 31 cm~ 2 s _1 the lowest possible threshold of 
about p T > 4 GeV will be used, rising to 6 - 8 GeV at 10 33 cm" 2 s -1 for at least one of the LI muons. 
The muon is confirmed at the second trigger level (L2), first using muon-chamber information alone, 
and then combining muon and inner-detector (ID) information. The use of the more precise muon 
information available at L2 allows rejection of below-threshold muons that passed the LI trigger. 
Combining information from the muon chambers and ID track segments gives rejection of muons 
from K and K decays. 

Following the confirmation of the LI muon(s), cuts on invariant mass and secondary vertex re- 
construction of B decay products are used to select specific channels of interest. Channels such as 
B + — > / / V( i u + p~)K + and B° s — > are triggered by requiring two muons fulfilling J/y or 5° 

mass cuts. At 10 31 cm~ 2 s _1 a single muon is required at LI, with the second muon either originating 
from an additional LI trigger or found at L2 in an enlarged Region of Interest (Rol) around the trigger 
muon. At luminosities above about 10 33 cm~ 2 s _1 , a LI dimuon trigger is used, giving an acceptable 
rate while keeping a low pj threshold for both of the two muons. 

For hadronic final states such as B® — >• D s n, ID tracks are combined to reconstruct first the 
§^K + K~, then the D S ^^>K , and finally the Two different strategies are used for finding the 
tracks, depending on luminosity. Full reconstruction of the whole ID can be performed at 10 31 cm~ 2 s _1 . 
At higher luminosities, reconstruction will be limited to regions of interest defined by the LI calorime- 
ter jets. 

In all studies reported in the current document the trigger decision was used to accept or reject a 
given event. The trigger menu used to produce the datasets contained low-pj- LI muon thresholds of 
4, 5 and 6 GeV. These were used to initiate selections at L2 of decays containing J / 'y —>■ ;U~ , like 
B + — > J /y({X + pi~)K + , and B° s —> jU + /x~. The lowest thresholds for J/y or B® required two 4 GeV 
muons. In addition there were selections for fi° — > D s n based either on full ID reconstruction or on 
a Rol-based reconstruction in LI Jet Rols with a 4 GeV threshold. More detailed information on the 
B-physics trigger is presented in three studies in this document. Specific questions on the LI di-muon 
trigger performance for B-physics are addressed in Section 1 of this chapter. L2 triggering on muons 
and di-muons is presented in Section 2. Finally, triggers for B decays to purely hadronic final states 
have been studied in the last part of this chapter, Section 8. 

If not stated otherwise, the trigger efficiencies in di-muon events were calculated with respect to 
Monte Carlo samples generated with cuts of pj > 6 GeV and pj > 4 GeV for the first and second 
muon respectively and with pseudorapidity cuts of \r] \ < 2.4 on both muons. 

3 Simulation of the events 

Some 1 million B hadron events, along with around 400 000 prompt J/y and T(1S) as well as cc 
events, were produced using PYTHIA 6.4 [3]. The simulations were performed with the CTEQ6L set 
of parton distribution functions. For quarkonium production, the NRQCD matrix element parameters 
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in PYTHIA were tuned to fit Tevatron data [4]. In the production of non-resonant bb and cc flavour- 
creation, PYTHIA models describing flavour-excitation and gluon-splitting were included. The frag- 
mentation of b quarks and c quarks to hadrons was simulated according to the Peterson fragmentation 
function with parameter 0.006 and 0.05 respectively. The choice of parameters was motivated by 
Tevatron measurements [5]. Kinematic selections on final state particles from B decays were applied 
so that most of the generated B events passed the trigger threshold at the reconstruction stage. 

The total bb or cc cross-section is not well defined in PYTHIA when one includes processes other 
than those of the lowest order for bb (cc ) production, since PYTHIA takes the partons to be massless 
and therefore the cross-section diverges when the transverse momentum approaches zero. However, 
only part of the cross section is relevant for our studies - in the phase space of events passing B 
triggers. 

Table 3 summarises the predicted single and di-muon cross-sections from charm, beauty and onia 
production expected at ATLAS from PYTHIA with cuts on the transverse momenta of the muons of 
6 or 4 GeV (as appropriate) and pseudorapidity cuts on both muons of \r}\ < 2.4. The cuts reflect 
the trigger thresholds and were selected to allow most of the simulated events to be accepted by the 
trigger. 



Process (jj.6 threshold) 


Cross-section 


Process (/x4 threshold) 


Cross-section 


bb -► jU6X 


6.1 


Aib 


bb -► /x4X 


19.3 


Aib 


cc -> jx6X 


7.9 


Aib 


cc -> \i4X 


26.3 


Aib 


bb -► /i6ju4X 


110.5 


nb 


bb -► \i4\i4X 


212.0 


nb 


cc — > /x6/x4Z 


248.0 


nb 


cc — > lu.4jj.4X 


386.0 


nb 


pp -► J/y{ii6n4)X 


23.0 


nb 


pp J/Y(^ji4)X 


28.0 


nb 


pp^T(ix6ix4)X 


4.6 


nb 


PP-T(ai4ai4)X 


43.0 


nb 


bb^J/xi/(n6n4)X 


11.1 


nb 


bb^J/xi/(n4n4)X 


12.5 


nb 



Table 1: Predicted PYTHIA cross-sections for various muon and di-muon sources. The numbers fol- 
lowing each symbol }X denote the pj thresholds that were used in generating the events with PYTHIA. 

After PYTHIA simulation, the events were passed through detector simulation (based on GEANT4 
[6]) which first modelled the behavior of the particles as they passed through the detector, and simu- 
lated the response of the active detector components to the energy deposition by these particles. The 
layout of the detector used by the simulation code was published in Ref. [7]. The output of the simu- 
lation was then reconstructed as if it were real data. Following this, reconstructed muons and hadrons 
were analysed using the dedicated B-physics analysis software. 

4 Organization of the 5-physics chapter 

The following reports on B-physics present our best current understanding of the ATLAS B-physics 
programme. As the B trigger plays a role in all the studies presented here, the chapter starts with 
two reports, in Sections 1 and 2, dealing with LI and L2 muon triggers respectively. There follow 
three physics studies typical for the early data period: physics of heavy-flavour quarkonia (Section 3), 
measurements of beauty cross-sections (Section 4) and early physics and performance measurements 
with decays of B° d and B° s mesons in Section 5. Finally there are Sections 6, 7 and 8, which give typical 
examples of ATLAS B measurements that can only be achieved during the more advanced data taking 
period; they are measurements of the polarization of the baryon A b , the rare decay B° — >• and 
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finally the oscillation measurement of the — system. 
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Abstract 

An event with two muons in the final state is a distinctive signal and can be 
triggered efficiently with the use of the level- 1 di-muon trigger. Nevertheless 
triggering is still an issue if these muon tracks are fairly soft and fake di-muon 
triggers originating from muons that traverse more than one region of the trig- 
ger chambers increase the trigger rate. It is important to provide an acceptable 
trigger rate, while keeping high trigger efficiency to study low-pr B-physics 
such as rare B hadron decays or CP violation in the fi-events, especially in a 
multi-purpose experiment like ATLAS. In this note, the level- 1 di-muon trigger 
and its expected performance are described. 

1 Introduction 

ATLAS is a multi-purpose experiment and its main focus is the direct search for and study of physics 
beyond Standard Model. However the indirect search for new physics will also play an important role in 
revealing the (flavor) structure of a non-trivial Higgs sector, which cannot be obtained by direct searches. 

The B® d — > H + H~ rare decay is one of the interesting signatures which are sensitive to new physics 
at the TeV energy scale. The precise measurement of the forward-backward asymmetry [1] or branch- 
ing ratio in the semi-muonic B decay B — > [i + n~X requires good understanding of the possible biases 
introduced by event selection. Also important are precise measurements of Standard Model parameters, 
including CP-violation in B-events, such as B s ° -^J/yty and B® -^J/\yK®. The key to the detection 
of these B signals in ATLAS is to achieve a high trigger efficiency for low-pr di-muon events, keep- 
ing an acceptable trigger rate. It is also essential to understand the acceptance and efficiency using 
pp -^J/Y(^ + ^-)X or pp - T(p+p-)X [2]. 

First, the level- 1 muon trigger is briefly explained in Section 2, then trigger simulation and MC 
samples used in this note are described in Section 3. The performance of the level-1 single- and di-muon 
trigger as well as the effect of an algorithm to resolve muons traversing more than one trigger chambers 
are described in Section 4. Finally, the performance and the impact of various level-1 muon trigger 
configurations on some example B signal events are discussed in Section 5. 

2 Level-1 muon trigger 

The ATLAS trigger system architecture is organised in three levels; level-1, level-2 and event filter. The 
level-2 and event filter triggers provide a software-based event selection after the level-1 trigger and 
events accepted by this trigger chain are finally reconstructed and analyzed offline. 

The ATLAS level-1 muon trigger is based on dedicated, fast and finely segmented muon cham- 
bers (RPC and TGC) [3] and a trigger logic implemented in hardware. The muon trigger system provides 
acceptance in pseudo-rapidity up to |t]| ~ 2.4 and in the full azimuthal angle (<j>) range. A muon track 
is triggered by the coincidence of two or three detector stations (consists of chamber doublet or triplet) 
within a certain road (coincidence window). The transverse momentum of a muon candidate is deter- 
mined by its deviation from the trajectory of an infinite-momentum track (i.e. a straight line). A three 
station coincidence is required for any pj threshold in the endcap and forward regions (|t]| > 1.05) to 
avoid high trigger rates caused by accidental coincidences due to background. 1 As a result, the accep- 
tance at large \r\ \ becomes small for low-p^ muons, as shown in Section 4. 

^he number of stations used for coincidence is programmable giving the possibility of lowering a pj threshold at low 
luminosity if background conditions allow it. 
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The granularity of the level- 1 muon trigger, the size of a Region of Interest (Rol), is At] x A(j> = 
0.1 x 0.1 in the barrel region (| 17 1 < 1.05) and ~ 0.03 x 0.03 in the endcap regions (1.05 < |tj| < 2.4). 
In short, if two muons leave tracks in the same Rol, they are counted as a single muon candidate by the 
trigger system. The level- 1 muon trigger decisions from different regions are sent to the Muon Central 
Trigger Processor Interface (MuCTPI) which combines the information and calculates the multiplicity 
of muon candidates for each pj threshold over the whole detector. In forming the multiplicities, care 
is taken to avoid double counting of single-muon tracks, reconstructed separately in adjacent trigger 
sectors, while retaining a high efficiency for genuine di-muons. The overlap handling is carried out 
using Lookup Tables (LUT) to give flexibility and programmability. The LUTs are generated based on 
Monte Carlo simulations of single-muon events, selecting the regions of the level- 1 muon system that 
can be traversed by a single muon. 

The overlap handling is mandatory to avoid unacceptably high trigger rates caused by doubly-counted 
single muons, so called fake di-muon triggers. Most of the overlaps are resolved by the MuCTPI, which 
takes the muon candidate with the higher pj into account when calculating the muon multiplicity and 
finding overlapping muon candidates. The MuCTPI consists of independent 16 octant modules and 
overlap resolving is performed in each module. Therefore, overlapped regions connected to two different 
octant modules cannot be solved in MuCTPI and should be treated in each subdetector (RPC and TGC) 
by masking channels or flagging a overlap bit. The parameters used in the overlap handling, such as the 
combination of Rols and masked channels are programmable. More details are described in Refs. [3, 4]. 

3 Monte Carlo samples and trigger configuration 

Single-muon Monte Carlo samples with various, fixed pj values are used for performance tests of the 
level- 1 single muon trigger logic. Single-muon events are generated uniformly over the full azimuthal 
angle range and |t] | < 2.7 with a fixed pj value. The detector response is simulated using Geant4 [4] with 
the ATLAS geometry in the Athena framework [5]. Trigger simulation is performed using the trigger 
configuration for luminosity of 10 31 -10 33 cm _2 s _1 running, with a set of pj thresholds of 4, 5, 6, 11, 
20 and 40 GeV. The production vertex is smeared with a Gaussian distribution with o x = o y = 15 ;Um 
and o z = 56 mm. The level- 1 single- and di-muon trigger menu items are named as MUx (single- 
muon trigger) and 2MUx (di-muon trigger with the same threshold for both muons) respectively, where 
x represents the pj threshold. The exceptions are MU0 and 2MU0, which correspond to completely 
opened coincidence windows. However, even in this case, the acceptance of the coincidence windows is 
limited by connectivity as well as by the dimension of the coincidence matrix ASICs [6]. 

The trigger efficiencies of various level- 1 trigger configurations are also studied using B-physics 
Monte Carlo events. Event generation is done using PYTHIA [7]. Only interesting events associated 
with at least two muons with pr > 6 GeV (the leading muon) and 4 GeV (the second muon) within 
|tj| < 2.5 are studied. The B-physics samples used in this note are B s ° — > (47,650 events), 

B s ° -> ti+jj,- (47,450 events) and B + -> J/y{il + >l~)K + (49,250 events). The last sample is chosen 
partly as a control sample. The opening angle of the two muons is smaller in this sample than in the 
other two samples. 

In this note, only the level- 1 muon trigger is simulated and events triggered by level- 1 are studied. 

4 Level- 1 muon trigger efficiency in single-muon events 

Detailed performance studies of the single muon trigger were performed in the barrel and endcap regions 
separately and the results are described in Ref. [8]. The single muon trigger efficiencies as a function 
of the transverse momentum of the simulated muon, over the whole muon trigger system are shown in 
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Figure 1. The single muon trigger efficiency is defined by: 

£(MUx) - # of events triggered ^ 

# of single muon events with |tj| < 2.5 

The range of pseudo-rapidity applied in the efficiency calculation is |t]| < 2.5 and not 2.4 as used in 
Ref. [8], since |tj| < 2.5 is applied for muons in the B-physics samples (Muons with |t]| > 2.4 can be 
triggered if their momentum is relatively low and the track is bent towards smaller |t]|). The efficiency, 
e(MUx), includes all contributions from geometrical acceptance and coincidence window coverage. The 
trigger efficiency of MU5 and MU6 is lower than MUO even for high pj muon tracks. This is because all 
regions don't have full (100 %) acceptance except MUO which accepts all muons within station coinci- 
dence windows. Figures 2 (a) and (b) show the trigger efficiency as a function of the | rj | of the simulated 




Figure 1: Trigger efficiency as a function of pj with each low-p^ threshold; MUO (filled circles), 
MU5 (filled triangles) and MU6 (open squares) for muons with \r\\ < 2.5 at the interaction point. 



muon with pj at threshold and with pj=\9 GeV for MUO and MU6 trigger selections, respectively. The 
efficiency at the plateau is predominantly determined by geometrical acceptance. Efficiency losses at 
| T] | ~ 0, 0.4 and 0.7 are caused by cracks and inactive material, like ribs, detector support structures and 
the elevator hole. Around the transition region between the barrel and the endcap, \r\ \ ~ 1.05, some effi- 
ciency is lost because the station coincidence cannot be satisfied. A small efficiency gap at the boundary 
between the endcap and forward regions (\r\ \ ~ 2) is also due to the station coincidence, since the endcap 
and forward systems are treated separately. However, this effect is smaller than in the transition region 
between the barrel and the endcap. Poor MUO efficiency at pj =4 GeV is seen in the endcap region 
due to the requirement of three station coincidence, not two as required in the barrel. The trigger was 
originally designed for pj > 6 GeV as the lowest threshold, but can be applied below this threshold at 
very low luminosity. 

Figures 3 (a) and (b) show the trigger efficiency as a function of <j> (0=0 and ±71 denote directions 
perpendicular to the beam axis in the horizontal plane) of the simulated muon with pj at threshold and 
with pr=l9 GeV for MUO and MU6 trigger selections, respectively. The efficiency loss can be clearly 
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Figure 2: The | Tj | dependency of the trigger efficiency for MUO (a) and MU6 (b) trigger selections. 
Solid lines correspond to the simulated muons with pt=19 GeV, dotted lines are for muons with 
Pt at threshold (in the case of MUO this threshold is set to 4 GeV). 



seen around 0 ~ —1.2 and —2 at the magnet's feet, especially for high pj muons. The geometrical 
structure is in general more visible in efficiency using higher pj muons, as they produce straighter 
tracks. 




Figure 3: The 0 dependency of the trigger efficiency for MUO (a) and MU6 (b) trigger selections. 
Solid lines correspond to the simulated muons with pj-Y) GeV, dotted lines are for muons with 
pj at threshold (in the case of MUO this threshold is set to 4 GeV). 



5 Level- 1 muon trigger performance in 5-physics events 

A detailed level- 1 trigger simulation is mandatory to find effective trigger menus and thresholds for 
certain physics processes and to determine the trigger efficiency and rate as well as to study the trigger 
bias. In this section, di-muon and single-muon trigger efficiencies with different pj thresholds and the 
dependency of di-muon trigger efficiencies as a function of the opening angle between simulated muons 
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are discussed using fi-physics Monte Carlo samples. 

5.1 Efficiency with various trigger configurations 

The efficiency of the level-1 muon trigger for three different B-physics processes, B s ° — > jJ. + jJ.~ , B s ° —> 
and B + — > J n~)K + is studied with various trigger configurations. The efficiencies of 
the single muon triggers (MUO and MU6) and the di-muon triggers (2MU0 and 2MU6) are summarized 
in Table 1. The efficiency of the single muon trigger is high, about 95 %, since multiple muons have 
a higher probability to be triggered by a single muon trigger. The 2MU0 trigger gives better efficiency 
than 2MU6 by ~ 16 % for di-muon events with pj above 6 GeV (4 GeV) for the fastest (second fastest) 
muon. The efficiency loss due to the MuCTPI overlap handling is seen in all physics processes, although 
the relative efficiency loss is small, ~ 0.5 %. 





Efficiency [%] 


Trigger Menu / Process 


B s ° jU+jU" 


B+ ^J/y(n+ii-)K + 


B s ° -> H + H~(j> 


MUO 


97.0±0.1 


96.8+0.1 


97.0+0.1 


MU6 


93.0±0.1 


92.9+0.1 


93.1+0.1 


2MU0 


67.9±0.2 


68.8+0.2 


69.0+0.2 


2MU6 


51.6±0.2 


52.9+0.2 


53.2+0.2 


2MU0 (w/o MuCTPI overlap handling) 
2MU6 (w/o MuCTPI overlap handling) 


68.2+0.2 
52.0+0.2 


69.1+0.2 
53.4+0.2 


69.4+0.2 
53.7+0.2 



Table 1: The level-1 trigger efficiency of various configurations. The efficiency is calculated with 
respect to the number of generated events (not to the number of muons) of three different physics 
processes. The errors are statistical only. 



5.2 Opening-angle dependency 

In case of measuring some quantity as a function of a certain parameter, one must ensure that the trigger 
efficiency is independent of that parameter, or correct the measurement to avoid a bias from th e trigger. 
Figure 4 shows the 2MU0 and 2MU6 trigger efficiencies as a function of AR (=y / AT7 2 + A(j) 2 ) between 
the two leading muons in B s ° — »■ events. The r\ and 0 are true parameters of muon flight direction 

at the production vertex. No offline selection is applied in order to see see the bias from the level-1 
trigger. The trigger efficiency clearly depends on the opening angle of the two leading muons. The 
efficiency is lower at large opening angles because in this case the B s system is not boosted i.e. momenta 
of the muons are lower. Since overlap removal between the muon candidates doesn't play a role at large 
opening angles, this effect is purely due to kinematics. Figure 5 (a) shows the pj distribution of the 
leading muons for three different AR ranges: AR < 0.1, 0.2 < AR < 0.4 and AR > 0.5. The muons at 
large opening angles clearly have a softer pj spectrum. The effect is even more clearly visible for higher 
threshold di-muon trigger items, as the probability of having muons at large AR with transverse momenta 
high enough to trigger a high threshold di-muon trigger item is very small, but non-negligible at small 
opening angles. 

The small efficiency loss at very small opening angles (AR < 0.1) is however due to the trigger 
system. In the case that the muons both leave hits in the same Rol, only one muon can be triggered by 
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• — 2MU0 efficiency with MuCTPI 
a — 2MU6 efficiency with MuCTPI 
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Figure 4: (a) Trigger efficiency as a function of AT? in the semi-leptonic rare B decay T? $ ° — > jU + /i~ 0 
using the 2MU0 (filled circles) and 2MU6 (open squares) with MuCTPI. (b) Effect of MuCTPI as 
a function of the opening angle, e(without MuCTPI)/e(with MuCTPI) for 2MU0 (filled circles) 
and 2MU6 (open squares). 




Figure 5: (a) The truth pj distribution of the two leading muons is different opening angles range 
in B s ° -> ^i+jU 0 events: AT? < 0. 1 (filled circles), 0.2 < AT? < 0.4 (filled triangles) and AT? > 0.5 
(filled squares), (b) The efficiency of 2MU0 as a function of opening angles for two cases: the AT? 
separation of the two muons is either larger (filled circles) or smaller (open circles) at the muon 
trigger system compared to that at the primary vertex. 
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the system. Figure 5(b) shows the 2MU0 efficiency as a function of the opening angle, but events are 
divided into two classes: In one case we select the muon pairs whose separation is expected to shrink 
when reaching the muon spectrometer (filled circles), while in the other case (open squares) we select 
the muon pairs whose separation is expected to grow. 




Figure 6: (a) Scaled di-muon invariant mass-squared distribution in B s ° — > ii + n~<\> events: no 
level- 1 selection (open histogram), triggered by 2MU0 (hatched histogram) and by 2MU6(cross 
hatched histogram), (b) Efficiencies of 2MU0 (filled circles) and 2MU6 (open squares) as a func- 
tion of di-muon invariant mass-squared. The two lines show the relative efficiency with respect to 
the corresponding average value for 2MU0 (solid line) and 2MU6 (dotted line). 

One example B-physics study is to look at the differential decay rate dBr(B — > /i + ;U~0) / ds or 
forward-backward asymmetry Afb as a function of di-muon invariant mass to discriminate between 
Standard Model and new physics contributions. The § is m^/ntg and the Afb(s) is defined as 

where z = cos0 (6 is the angle between the }X + and 0 in the IX + }X~ rest frame) and dT/dz is the differ- 
ential event rate of —> /i + ;U~0. 

Figure 6 (a) shows the invariant mass distribution with and without level-1 triggers in B s ° — > 

H + H~(f) events. Trigger efficiency with 2MU0 and 2MU6 are shown as a function of jJ. + jJ.^ invariant 
mass in Figure 6 (b). The relative trigger efficiency, normalized to the efficiency averaged over a whole 
m jin/ m B ran g e > is a l so shown to illustrate the trigger bias. The invariant mass dependency of the level-1 
di-muon trigger efficiency is weak compared to the opening angle dependency and almost independent 
of the trigger threshold. A efficiency drop can be seen at small m w , which is also because the two muons 
are triggered as a single muon by the muon trigger system. 

Similarly, the forward-backward asymmetry Afb is shown as a function of s in Figure7(a). The dif- 
ference (Ap b[2MUx] - Afb [no selection]) is also shown in Figure 7(b). It should be noted that significant 
differences between forward and backward samples in terms of momentum, pseudo-rapidity and 
opening angle distributions are observed. Nevertheless, as illustrated in Figure 7, the forward-backward 
asymmetry is more robust to the effects of level- 1 di-muon triggering than di-muon mass distributions 
(see Figure 6). 
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Figure 7: (a) Forward-Backward asymmetry in B s ° — > events: no level-1 selection (filled 

triangles), triggered by 2MU0 (filled circles) and by 2MU6(open squares), (b) deviation between 
Afb's with and without triggers (Afg[2MUx] - A/r B [no selection]: 2MU0 (filled circles) and 
2MU6 (open squares). 



6 Summary 

The level-1 di-muon trigger is essential for selecting rare B decays that have low-pr muons in the final 
state. The rate for a single-muon trigger at the relevant thresholds would be unacceptably high. A detailed 
level-1 muon trigger simulation is implemented and used for the study of the trigger efficiency and its 
bias for B-physics events. The level-1 di-muon trigger efficiencies of 2MU0 and 2MU6 are high enough 
(about 70 % and 50 %, respectively) in the interesting B-physics events. The fake level-1 di-muon trigger 
rate is 2.3 kHz [8] while the genuine di-muon event rate is ~ 600 Hz (muons with pj > 4 GeV from cc 
and bb) at = 10 33 cm~ 2 s _1 . The efficiency loss from the MuCTPI overlap handling is found to be 
negligible. 

The trigger bias in regard to the opening angle and invariant mass of two muons was also studied. 
An opening angle dependence is clearly seen, of the order of ±10 % and ±15 % for 2MU0 and 2MU6, 
respectively. It is explained by the single-muon efficiency curves and muon kinematics in signal events. 
This dependency is stronger for higher pj thresholds. No clear bias is seen due to the presence of the 
MuCTPI overlap handling. The trigger efficiency is rather flat over the invariant mass and less dependent 
on the trigger threshold since the invariant mass is not as strongly correlated to muon momenta as the 
opening angle is. 
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Abstract 

Muon pairs from J/yr decay are a clear signature of b hadrons. As a large 
fraction of b hadrons are produced at low-pr, a low rate, efficient di-muon 
trigger for low-pr muons and a good understanding of the trigger efficiency 
are essential. Di-muon final states will also play a key role in calibration, 
alignment and determination of the trigger efficiencies. The performance of 
the level-2 dimuon trigger algorithms is discussed, together with a method 
for reducing backgrounds from decays-in-flight. A strategy for calculating 
the single muon and di-muon trigger efficiencies at level- 1 and level-2 using 
J/yr from the data themselves is presented. 

1 Introduction 

B-physics is one of the areas of the physics programme of the ATLAS experiment. It includes the 
study of production cross sections, searches for rare b decays and measurements of CP violation 
effects. These studies make use of the large bb production cross section at the LHC where bb pairs are 
abundant in the low transverse momentum (pr) region. On the other hand, one must extract signals 
from amongst the large QCD background, mostly composed of light quarks. For this purpose, one of 
the main channels for B physics study involves decay channels with one or more muons in the final 
state, especially the channel J /iff — > jJ. + jJ.^. 

The output rate of the first level trigger at a luminosity of 10 33 cm~ 2 s _1 is expected to contain 
20 kHz of events where one muon passed the pj threshold of 6 GeV. Early running is envisioned to 
include even lower pj thresholds, down to the lowest threshold achievable in the hardware. 

At the second level trigger this rate of events must be reduced to 1-2 kHz, of which 5-10% are 
available for channels of interest only to B-physics. Currently this goal is achieved for level- 1 muon 
triggers by first confirming that a muon over the nominal threshold is reconstructed in the muon 
spectrometer (MS), and then confirming that there is a matching track in the inner detector (ID). 
This selection criterion removes many muons from K and % decays, but does not by itself produce 
the required rate reduction. To achieve the required rate pj thresholds need to be raised and many 
interesting b events are likely to be filtered out. We therefore focus also on di-muon final states. 

We developed an algorithm, TrigDiMuon, which achieves high efficiency at level-2 for the golden 
CP channels (// yr), using the identification of relatively low-py muons from // yr decay. TrigDiMuon 
searches for di-muon pairs from // yf or other resonant sources, when only one of the muons passed 
the level- 1 or level-2 single muon selection. The use of TrigDiMuon can enhance J/yr efficiency at 
\ow-p T compared to the trigger based on two muons found at level- 1, with an acceptable increase in 
the fake rate. 

In this note, we present the performance of the TrigDiMuon algorithm, that looks for a second 
low-pr muon partner to a single muon triggered at level- 1. We compare it to the performance of an 
algorithm that requires a di-muon level- 1 trigger. The comparison is based on a sample ofj/yf which 
decayed into muons with low-py, such that the second muon of each decay may be below the level- 1 
threshold. 

In addition to the foreseen specialised trigger strategies for di-muon signatures, it is important 
to optimize the rejection of muons from K and n decays for the standard single muon selection in 
order to have the lowest possible threshold on the inclusive single muon trigger. This is achieved 
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by extrapolating the muon track from the MS back to the interaction vertex and requiring a good 
match between this track and the associated track from the ID. Muons from light hadron decays do 
not match accurately the ID track, which in this case is the track produced by the parent hadron (or 
from a mixture of hits from the parent and the daughter muon) and not by the muon track. This 
note presents a method implemented at the level-2 trigger for rejecting muons from K and % decays. 
We summarize the rejection power of this method, show that the efficiency loss is minimal for direct 
muons and estimate the efficiency loss for low-py muons from J/y decay. 

Cross section measurements or searches for rare decays require a good understanding of the ef- 
ficiency of the event selection. As we are interested in events with rather low-pj, the understanding 
of the trigger efficiency is crucial and we must have a strategy for measuring it from data with high 
precision. The tag-and-probe method for measuring the single muon trigger efficiency using J/y 
events is presented, and we demonstrate that the obtained trigger efficiency can be applied to calcu- 
late the di-muon trigger efficiency. A calibration trigger is proposed to collect an unbiased sample of 
single muons with an enhanced J/y fraction, and the expected performance of the trigger efficiency 
measurement in the early days of the data-taking is discussed. 

Particles from additional collisions in the same bunch-crossing (pile-up) are not simulated in the 
samples used for this paper, and therefore this additional background is not taken into account in the 
analysis. 

2 Simulated datasets used and production tools 

Since the subject of this note is low-p r muons, we use simulated samples that were produced with 
especially low-pj- cuts at the event generation level. Samples were generated using the PYTHIA 
event generator [1]. Except for the minimum bias events, a generator level filter was applied to 
pre-select efficiently the events in the sample. The di-muons sample passed a filter which required 
the existence of at least two muons with the appropriate pj and r\ cuts. For the inclusive muon 
samples the filter required the existence of a single muon passing the corresponding selection. Events 
were processed with the full simulation of the ATLAS detector based on the GEANT package [2]. 
The level- 1 simulation, High Level Trigger (HLT) selection and offline event reconstruction were 
performed. 

The following di-muon samples were used: 

• An 8000 event sample of the channel A& — > J/yA wifh//i//^ Ai + jU~, where one of the muons 
is required at the event generation level to have p T > 4 GeV and the second muon is required to 
have p T > 2.5 GeV. 

• Simulated samples of direct J/y production and bb — >• J/y production, with the generator 
level filter requiring the existence of at least two muons with pj > 6 GeV for the highest pj 
muon and pj > 4 GeV for the second highest pj muon. For both processes, 150000 events 
were generated. 

• Two samples of inclusive muon from b decays were used. A 200000 event sample of bb — > \lX 
with a generator level filter requiring one muon with pj > 4 GeV, and a sample of 250000 bb 
events with at least one muon with pj > 6 GeV in the final state. 

• Since this note deals specifically with methods of rejecting muons from K and % decays, a large 
sample of such decays was required. A minimum bias sample where pions and kaons were 
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forced to decay inside the inner tracker volume was produced for this purpose. The method 
developed for producing this "forced-decay" sample is described in detail below. 

• Events containing a single muon each were used to study the effect on prompt muons of the 
method for rejecting muons from K and n decays. Some of the single muon samples were 
simulated with a perfectly aligned detector geometry, while some other samples were simulated 
with a misaligned detector description. 

The trigger and reconstruction software used for some of the samples are of a later version than 
that used in other notes, because the level-2 and reconstruction software used here have improved 
significantly in recent version. The software modification will be explained where each package is 
described. Table 1 summarizes the simulated Monte Carlo samples used in this note. 



Samples 


Generator level filter 


Statistics 


Signal 
samples 


Direct J/y^ jU + ;U 


Pr 


>6,4GeV 


150 k 




A- 


>6,4GeV 


150 k 


A h ^J/yA(J/\i/^n + n-) 


Pt 


> 4,2.5 GeV 


7.6 k 




bb->n+X 




>6GeV 


250 k 


Background 


bb^n+X 


Pt 


>4GeV 


185 k 


samples 


Minimum bias (forced K 1%) 






114k 




Minimum bias (standard) 






500 k 



Table 1: Summary of MC samples. 



2.1 Production tools and samples employed for K and n decays 

Minimum bias events are the most copious source of pions and kaons. These particles are produced 
mainly with very low-p^, and typically the muons coming from their in flight decays do not escape the 
ATLAS hadronic calorimeter. On the other hand pions and kaons with high-^y have a low probability 
to decay before the calorimeter because of their high energy. As a consequence it is not efficient to 
use minimum bias events as a source of decays in flight to muons. Estimates made for this analysis 
indicated that the simulation of 5000 minimum bias events would be needed to provide one muon 
from K or % decay capable of passing the 6 GeV threshold of the ATLAS level- 1 muon trigger. 

To increase the statistics of events with charged pions and kaons decaying in flight, a special 
simulation tool is applied. Since we are interested in decays which happen inside the detector, the 
decays cannot be made on generator level but must happen in the GEANT simulation. The program 
to force the K In to decay in the detector is an extension to GEANT which runs before the simulation 
itself. 

In each event a list of all charged pions and kaons with pj > 2 GeV is compiled, and one of the 
particles in the list is randomly selected. Events with no charged pions or kaons with pj > 2 GeV are 
dropped at this stage, such that the events are not further simulated and nothing is written to output. 

The point of decay is selected by first computing the trajectory length from the production vertex 
to where the particle would exit the ID (neglecting curvature in the magnetic field). The decay position 
is then determined by taking a random fraction of this maximum track length. Since particles assigned 
a late decay have a larger probability to be stopped through hadronic interactions, this procedure may 
introduce a weak bias towards shorter decay lengths. 
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For this study a minimum bias sample of 1 14 k events was simulated with forced decays. The 
transverse momentum cut of 2 GeV applied to the light hadrons reduces the cross section of the 
sample to (43.41 ±0.07)% of the minimum bias cross section. In addition to this a minimum bias 
sample of 500 k events, simulated with the standard GEANT simulator, was also employed to provide 
a cross-check from a single source for both the muons from K Ik decay and the prompt muons. 

3 The muon trigger 

The ATLAS trigger system [3] reduces the event rate from the 40 MHz beam crossing rate to ~200 Hz 
for mass storage, keeping the events that are potentially the most interesting for physics. The first 
level trigger [4] selection is performed by custom hardware and identifies a detector region for which a 
trigger element was found. The second level trigger [5] is performed by dedicated software, making its 
decision based on data acquired from the Region of Interest (Rol) identified at level- 1. Eventually, the 
event filter [5] uses the complete event data, and algorithms adapted from the offline reconstruction, 
to refine the selection of level-2 and further reduce the trigger rate by about a factor 10. An event must 
pass all trigger levels to be kept for analysis. 

A detailed description of the ATLAS muon trigger and the estimated trigger rates are presented 
in [6]. Here we give a very brief summary of the principles of the level- 1 and level-2 muon triggers. 

3.1 The level-1 muon trigger 

The level-1 muon trigger is based on dedicated fast detectors: the Resistive Plate Chambers (RPC) in 
the barrel and the Thin Gap Chambers (TGC) in the end-caps [7]. The basic principle of the algorithm 
is to require a coincidence of hits in the different trigger stations within a predefined angular region, 
called a "road", from the interaction point through the detector. The width of the road is related to the 
bending of the muon in the magnetic field and thus to the pj threshold to be applied. 

The trigger in both the barrel and the end-cap regions is based on three trigger stations at different 
distances from the interaction point [4]. The low-pr triggers (4 to 10 GeV) are derived from a coin- 
cidence in two stations, while the high-py triggers (over 10 GeV) require an additional coincidence 
with the third station. In the end-cap, there is an option to use all three stations also for the low-pr 
trigger. This option is the one used in the trigger performance studies in Reference [6]. 

The level- 1 trigger provides for each muon candidate the region where it was found, called the 
region of interest (Rol). For the muon trigger, the size of the level-1 Rol is At] x A<p = 0.1 x 0.1 in the 
barrel and Atj x A(j) = 0.03 x 0.03 in the end-cap region, respectively. 

3.2 The level-2 single-muon trigger 

The level-2 trigger is a software-based trigger and uses the information of the Region of Interest 
provided by the level-1 trigger. Level-2 algorithms only process data around the Rol, using the full 
granularity of the detector readout within the Rol. 

The HLT trigger selection proceeds in "trigger chains". A chain consists of a series of reconstruc- 
tion and decision (hypothesis) algorithms that process the data in a Rol identified by level-1. The role 
of the level-2 muon trigger is to confirm muon candidates flagged by the level-1 and to give more 
precise track parameters for the muon candidate. 

The level-2 muon selection is performed in two stages. The first stage is performed by the muFast 
algorithm [8], which starts from a level-1 muon Rol and reconstructs the muon in the spectrometer, 
using the more precise Monitored Drift Tubes (MDT) to perform a new pj estimate for the muon 
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candidate and creating a new trigger element. The hypothesis algorithm cuts on the estimated pj and 
passes the validated trigger elements to the next algorithm. 

Track finding in the inner detector is based on the region of the candidate found by muFast. The 
muFast candidate and ID tracks are passed to the next algorithm, muComb, which matches an ID 
track with the trigger element from the muon spectrometer and refines the p T estimate [5]. 

3.2.1 muFast 

For level- 1 Rol's flagged by the RPC (barrel region) and the TGC (end-cap region), muFast performs 
global pattern recognition, a local segment fit in each muon station and a fast pj estimation. The 
global pattern recognition is designed to select clusters in MDT tubes belonging to a muon track 
without using the drift time measurements. It is divided into two steps, firstly the pattern recognition 
in the trigger chambers seeded by the level- 1 Rol, and the subsequent MDT pattern recognition seeded 
by the result of the previous step. In the MDT pattern recognition muon roads are opened in selected 
MDT chambers, and the loactions of hit tubes are collected. A contiguity algorithm is applied on the 
selected hits to remove the background. 

The track reconstruction approximates a muon track as a series of segments built separately in each 
MDT chamber. Segments are reconstructed using the drift time measurements and an approximate 
calibration to obtain hit radii from them. The fitted segments provide a precision measurement of the 
point where the fitted line crosses the middle of the MDT chamber, called the super-point. 

The track bending is measured in a different way in the barrel and the end-cap. In the barrel, the 
sagitta is computed from the three super-points found in the three stations. In the end-cap, the track 
bending is measured by the angle a between the track direction measured by the muon chambers in 
the middle and outer stations of the muon spectrometer, and the direction obtained by connecting the 
nominal interaction vertex with the mean hit position in the middle station. 

The muon transverse momentum is estimated using an inverse linear relationship between the 
measured sagitta (in the barrel) or a (in the end-cap) and pj. The detector region is divided into bins 
in tj and 0 and the parameters of this inverse linear function are estimated in each bin. 

3.2.2 muComb 

The muComb algorithm matches the muon track found by muFast to ID tracks reconstructed at level- 
2 citelD-CSC. In reconstructing the level-2 ID tracks, only hits from the pixel and SCT detectors were 
used, for speed. 

The matching between muFast and ID track segments proceeds as follows. First, a preselection of 
ID tracks is made based on the difference in tj and 0 between muFast and ID track segments. In the 
barrel, this preselection also makes use of the difference in the Z of the extrapolated track segments at 
the radius of the barrel calorimeter. A weighted combined pj, and a matching % 2 , are calculated for 
each ID track passing the preselection in combination with the muon track information, and the ID 
track giving the lowest % 2 is selected as the best match to the muFast track. 

The version of the algorithm used in this study is improved with respect to that used in [6]. The 
resolution of muFast tracks assumed in combining with the ID tracks have been retuned, using the 
correct pj resolutions for the different end-cap regions, and for the misaligned detector geometry. A 
specific tuning of the matching windows is used to improve the resolution for very low pj muons. 
This algorithm will be referred to as "the baseline muComb selection" in Section 5, where a modified 
algorithm with better rejection for muons from K and n decays is also described, and the performance 
of the two algorithms is compared. 
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3.2.3 Level-2 muon hypotheses 

The pr cuts corresponding to each nominal threshold are set so that 90% of the muons at the nominal 
threshold would pass the selection. The actual cuts used depend on the resolution of the pr estimation. 
Therefore the cuts are different for different regions of the detector as well as different between muFast 
and muComb. Thus for a nominal threshold of 6 GeV the muFast hypothesis cuts at estimated pj 
values between 4.5 and 5.4 GeV in the different tj regions, while the muComb hypothesis cuts at 
estimated pj of 5.8 GeV in the barrel and end-cap and 5.6 GeV in the forward region. Since the pj 
resolution of muComb is better than that of muFast, the cuts are closer to the nominal thresholds, and 
reject more muons with pj below the threshold. A special case is the 4 GeV nominal threshold which 
is meant to accept lower pj muons and the cuts are set at 3 GeV in the barrel and 2.5 GeV in the 
end-cap for both muFast and muComb. 

3.3 The level-2 di-muon triggers 

There are two approaches at level-2 for selecting di-muon events from a resonance such as J/y and 
T. The first approach is to start from a di-muon trigger at level- 1 which produces two muon regions of 
interest. In this approach, reconstruction of a muon is confirmed separately in each Rol as described 
above and the two muons are subsequently combined to form a resonance and to apply a mass cut. 
We will refer to this trigger as the "topological di-muon trigger". 

An alternative approach is to start with a level- 1 single muon trigger and search for two muons 
in a wider tj and <j> region. This approach starts from reconstructing tracks in the inner detector and 
extrapolating the track to the muon spectrometer to tag muon tracks. Since this method does not 
explicitly require the second muon at level- 1, it has an advantage for reconstructing J/y at low-p^. 
This is implemented in the TrigDiMuon algorithm. The two approaches using either two or one muon 
Rol are illustrated in Figure 1 . 



Figure 1: A schematic picture of Rol based di-muon trigger, using two RoFs (left) and seeded by a 
single muon Rol (right). 
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3.3.1 The TrigDiMuon algorithm 

TrigDiMuon is a level-2 trigger algorithm based on associations established between ID tracks and MS 
hits. Each time a pair of oppositely charged ID tracks, above a minimal invariant mass, is successfully 
associated with the MS hits, a muon pair object is created with the parameters of these ID tracks. 
Later, in the hypothesis step, an additional invariant mass selection can be applied, thus selecting 
interesting physics objects such asJ/y or B. 

The motivation for developing this algorithm comes from the fact that while di-muon final states 
exist in many interesting B-physics channels, the cross sections for di-muon final states are orders 
of magnitude smaller than those for single muons of the same pj. An additional advantage is that 
resonant final states can also be used to calibrate trigger efficiencies as will be shown in Section 5. 

First, the initial muon Rol is extended in order to search for a second muon, which was not 
triggered by level- 1. The input muon may be from a region of interest identified at level- 1, but the 
input rate to the algorithm can be reduced if, prior to the Rol extension, the level- 1 Rol would be 
confirmed in the level-2 trigger by the muFast and (possibly also) the muComb algorithms. The 
performance of these options is studied in Section 4. The size of the extended Rol is based on the 
distribution of angular distance in r\ and 0 between two muons from J/y decay. Figure 2 shows 
the probability of including the second muon from J/y decays Rol as a function of the extended Rol 
size for different samples. The current default region size in TrigDiMuon is At] x A0 = 0.75 x 0.75. 
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Figure 2: Probability of including the second muon from J/y decays Rol as a function of the extended 
Rol size for different samples. Open squares are from J/y decays where one muon has pr > 6 GeV 
and the other pj > 3 GeV. Full squares are from J/y decays where one muon has pj > 4 GeV and 
the other p T > 2.5 GeV. 

The ID tracks in the search region are found using the trigger-tracking program IdScan or SiTrack [9], 
and are selected if they form a pair of oppositely charged tracks with invariant mass M > 2.8 GeV. 
Each selected track is extrapolated to the different stations of the MS using a formula parameterizing 
the expected track bending in the magnetic field. The bending parameterization is calculated sep- 
arately for different regions of the muon spectrometer to account correctly for the inhomogeneous 
toroidal field in the end-cap region. Figure 3 shows the difference, At? , between tj measured in the 
inner detector and that measured in the middle station of the muon spectrometer. The lines indicate 
the choice of tj regions for the parameterization. The parameterization was also subdivided in 0. 

The algorithm then searches for muon hits within a road around the extrapolated track. The road 
size also differs for different tj regions. If a sufficient number of muon hits are found in the MS, the 
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Figure 3: The r\ direction of muons at the intercation point vs. the difference in Vj position between 
the inner detector and the middle station of the muon spectrometer, for muons with pj = 6 GeV. The 
lines indicate the choice of 77 regions for the parameterization. 



track is identified as a muon. If both tracks from the pair are identified as muons, a muon pair object 
is created. The two tracks are fit to a common vertex, and the vertex x 2 is calculated to allow a later 
selection of only the pairs with a good quality vertex. 

4 Performance of the level-2 di-muon triggers for J/\jf 

In this Section the efficiency and fake rates resulting from the two approaches to selecting di-muons 
at level-2 will be presented and compared. For the TrigDiMuon algorithm we calculate the efficiency 
and fake rates in three different trigger chains. 

In the first configuration, TrigDiMuon runs directly after the level- 1 trigger based on the Rol 
produced by level- 1. The performance of this trigger is compared to the efficiency and fake rate of the 
level- 1 di-muon trigger. 

The other two trigger chains confirm a single level-2 muon before calling TrigDiMuon. The 
purpose of these chains is not to reduce the fake di-muon trigger rate, but rather to reduce processing 
time by reducing the input rate to TrigDiMuon. Since TrigDiMuon starts from ID track reconstruction 
in an extended region around the muon region of interest, the tracking in the inner detector requires 
three times longer than if only reconstructing ID tracks in the narrow road used by muComb. Thus, this 
time consuming process can be avoided for candidates for which the level- 1 trigger is not confirmed 
at level-2. 

In the second chain TrigDiMuon runs after muFast. The input to TrigDiMuon in this case is a 
muon confirmed in the MS, with a cut on the pj estimated at this stage. The efficiency of this trigger 
is compared with that of a topological di-muon trigger based on a level- 1 di-muon with two oppositely 
charged muons confirmed in the MS at level-2. 

In the third chain TrigDiMuon runs after muComb. The input to TrigDiMuon in this case is a 
muon confirmed in the MS and the ID, with a cut on the pj estimated at this stage. The efficiency 
of this trigger is compared with that of the topological di-muon trigger with two oppositely charged 
level-2 combined muons, within the same invariant mass window. The invariant mass window can be 
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applied in this chain, and not the second chain, because only after muComb the momentum resolution 
is sufficient for a reasonable selection on invariant mass. 

The efficiency and fake rates of these trigger sequences were studied for two different trigger 
thresholds, a 4 GeV threshold that is envisioned to run at initial luminosity, and a 6 GeV threshold that 
will be the lowest threshold for running at a luminosity of 10 33 cm~ 2 s _1 . 

A selection requiring the two muons to be oppositely charged, and the pair to have invariant mass 
between 2.8 GeV and 3.4 GeV was applied to both TrigDiMuon and the topological di-muon trigger. 
For the topological algorithm the invariant mass selection was only applied after muComb. The fake 
rates were calculated as follows: the probability of each of the level-2 strategies to find a di-muon 
pair in events which contain only a single muon was estimated separately for the b events and the 
minimum-bias events. This probability was multiplied by rates which were estimated independently 
from [6], but are consistent with it. The fake probability in b events was taken to be representative 
of that in all events with prompt muons. For the topological di-muon trigger, the probability was 
calculated relative to the number of di-muon level- 1 triggers, and multiplied by the level- 1 fake di- 
muon trigger rate from [6]. 

4.1 Efficiency relative to events accepted at level- 1 

Table 2 gives the efficiency, relative to level- 1, for the two di-muon trigger algorithms for a trigger 
threshold of 4 GeV. Table 3 gives the efficiencies, relative to level- 1, for a trigger threshold of 6 GeV. 
The efficiencies in these tables are calculated with respect to the J/y events accepted by the corre- 
sponding level- 1 single muon trigger. In parenthesis we give the efficiency relative to J/y events at 
the starting point of the di-muon algorithm. 

One can see that the TrigDiMuon efficiency is significantly higher than the topological di-muon 
trigger in all cases. As a matter of fact, the topological di-muon trigger, which applies pj cuts on both 
muons, can have only a limited acceptance for J/y events passing a single muon trigger because the 
second muon is very frequently below the trigger threshold. To pass the topological di-muon trigger 
the second, lower p T muon also has to pass the level-2 selections. 

Because TrigDiMuon can reconstruct muons below the level- 1 thresholds, the TrigDiMuon ef- 
ficiency, shown in brackets, remains nearly the same with the different chains. However, the total 
efficiency is reduced when starting from the single muons accepted by muFast or muComb, because 
of the pj cut imposed by those algorithms. In particular, in Table 3 muFast and muComb reject suc- 
cessively more of the muons below the nominal threshold and this explains the big drop from row to 
row for TrigDimuon and even bigger drop for the topological trigger. 

The loss of efficiency in the single muon triggers is smaller for the 4 GeV threshold, because the 
level-2 cuts for the 4 GeV threshold are quite loose, as mentioned above. When calculating the 4 
GeV trigger efficiency using only muons from J/y with generated pj above the 4 GeV threshold, 
TrigDiMuon has an efficiency of 90% and the topological trigger has an efficiency of 64%. 

As discussed earlier, in spite of some loss of efficiency, the trigger chains with TrigDiMuon run- 
ning after a level-2 confirmed muon might be more suitable to an overall planning of the ATLAS 
trigger menu due to the reduced input rate they have to sustain. 

4.2 Fake rates 

Table 4 gives the expected fake rates for TrigDiMuon with the different chains described above, for 
a trigger threshold of 4 GeV at a luminosity of 10 31 cm~ 2 s _1 . Table 5 gives the rates for a trigger 
threshold of 6 GeV at a luminosity of 10 33 cm~ 2 s _1 . 
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Chain 
starting from 


TrigDiMuon 

(%) 


Topological 
trigger 

(%) 


level- 1 


73 (73) 


51 


muFast 


71 (73) 


43 


muComb 


70 (74) 


33 



Table 2: Efficiency, relative to level- 1, of the two di-muon trigger algorithms for a trigger threshold 
of 4 GeV. In parenthesis is the efficiency calculated relative to J/y events that passed the single 
muon trigger that selects the input to TrigDiMuon. To estimate the efficiency we used a sample of 
A b -> J/yA, where J/y \i(p T > 2.5 GeV)jj,(p T > 4 GeV). 



Chain 
starting from 


TrigDiMuon 

(%) 


Topological 
trigger 

(%) 


level— 1 


75 (75) 


56 


muFast 


67 (77) 


25 


muComb 


60 (78) 


15 



Table 3: Efficiency, relative to level- 1, of the two di-muon trigger algorithms for a trigger threshold 
of 6 GeV. In parenthesis is the efficiency calculated relative to J/y events that passed the single 
muon trigger that selects the input to TrigDiMuon. To estimate the efficiency we used a sample of 
A h ^J/yA, where J /y ^ n(p T > 2.5 GeV) n(p T >4GeV). 



Source 


Chain 
starting from 


Input rate 
(Hz) 


Fake 
acceptance 

(%) 


Fake rate 
(Hz) 




level- 1 


460 


0.42 


1.9 


b + c 


muFast 


380 


0.42 


1.6 




muComb 


340 


0.43 


1.5 




level— 1 


620 


0.07 


0.43 


Kin 


muFast 


270 


0.11 


0.29 




muComb 


170 


0.09 


0.15 




level — 1 


1080 


0.22 


2.3 


Total 


muFast 


650 


0.29 


1.9 




muComb 


510 


0.32 


1.6 



Table 4: Fake rate of the TrigDiMuon algorithm for muons from different sources and total fake rate 
using a trigger threshold of 4 GeV, at a luminosity of 10 31 cm _2 s _1 . The b and c components were 
estimated from a sample of bb — > }i +X with pj > 4 GeV and the K In component from the minimum 
bias sample with forced decays 

Fake rates can be further reduced by reconstructing the J/y decay vertex from the two muon 
tracks. Selecting J/y with a good quality vertex fit will reduce fake rates from unrelated track 
combinations. Figure 4 shows the distribution of the vertex x 2 for true J/y decays and for fake 
di-muon triggers. A cut of % 2 < 30 reduces the fake rate by 20-30% and only reduces the trigger 
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Source 


Chain 
starting from 


Input rate 
(Hz) 


Fake 
acceptance 

(%) 


Fake rate 
(Hz) 




level- 1 


21500 


0.48 


103 


b + c 


muFast 


12500 


0.60 


76 




muComb 


10300 


0.75 


76 




level- 1 


15800 


0.22 


34 


Kin 


muFast 


5000 


0.30 


15 




muComb 


3500 


0.41 


14 




level- 1 


37400 


0.37 


137 


Total 


muFast 


17500 


0.51 


91 




muComb 


13700 


0.66 


90 



Table 5: Fake rate of the TrigDiMuon algorithm for muons from different sources and total fake rate 
using a trigger threshold of 6 GeV, at a luminosity of 10 33 cm~ 2 s _1 . The b and c components were 
estimated from a sample of bb — > fl +X with > 4 GeV and the K In component from the minimum 
bias sample with forced decays 

efficiency by 1-2%. The vertex position can also be used to reject J/y produced at the primary 
interaction and accept only J/y from b hadron decays, but a study of this is outside the scope of this 
note. 




Figure 4: Distribution of the vertex % 2 for true J/y decays (shaded) and for fake di-muon triggers 
(open histogram). 

Finally Table 6 compares the total rates and efficiencies of the two level-2 di-muon algorithms. 
When TrigDiMuon runs after muComb, the signal to background ratio is worse than when it runs after 
muFast. Using the input rates from Table 4 and 5, one can calculate that the time needed to reconstruct 
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the ID tracks is also the smallest when TrigDiMuon runs after muFast, reduced by 1/3 for the 4 GeV 
threshold and by 1/2 for the 6 GeV threshold with respect to running after LI. 

It can be seen from this table that the fake rate for the topological di-muon trigger is small. The 
fake rate from the TrigDiMuon algorithm is much higher but these output rates from level-2 should 
be acceptable for the gain in efficiency, at the initial low luminosity. 



Threshold 


Chain 


TrigDiMuon 


Topological 


(Luminosity) 


starting 


Efficiency 


J/Y 


Total 


Efficiency 


J/Y 


Total 




(%) 


(Hz) 


(Hz) 


(%) 


(Hz) 


(Hz) 


4 GeV 




71 


1.17 


3.1 


51 


0.8 


24 


(lO^cm-V 1 ) 


muFast 


70 


1.15 


2.7 


43 


0.7 






muComb 


69 


1.14 


2.4 


33 


0.5 


0.6 


6 GeV 




74 


43 


151 


56 


32.5 


357.5 


(lO^cm-V 1 ) 


muFast 


66 


38 


114 


25 


14.5 






muComb 


59 


34 


109 


15 


8.7 


9.3 



Table 6: Total rate and efficiency relative to level- 1 of the TrigDiMuon algorithm including the vertex 
cut x 2 < 30, and of the topological di-muon trigger. The efficiency is estimated from a sample of 
A h -► J/yA, where J/y -> \i(p T > 2.5 GeV)/i(p r > 4 GeV). 



The ^-physics trigger rate from TrigDiMuon requires further reduction for L = 10 33 . This can be 
achieved by introducing an additional trigger with a cut on the J/y decay length. Then the trigger 
without decay length cut will be prescaled to an acceptable rate for calibration and alignment purposes, 
as for example in Section 6. An event filter algorithm will further reduce the rates for a luminosity of 

10 33 cm- 2 s~ 1 . 

4.3 Efficiency relative to reconstructed events 

Our goal is to maximize trigger efficiency at the level-2 trigger for the muons that can later be identi- 
fied offline. The efficiencies for the two algorithms and thresholds were re-estimated with respect to 
the J/y events reconstructed with the muon identification program MuGirl [10], which is efficient 
for low-pr muons. The resulting efficiencies are given in Table 7 for a trigger thresholds of 4 GeV 
and 6 GeV respectively. Figure 5 shows the efficiency of TrigDiMuon relative to muons identified by 
MuGirl for the higher pj muon (left) and the second muon (right). 



Threshold 
(Luminosity) 


Chain 
starting 
from 


TrigDiMuon 
Efficiency (%) 


Topological Trigger 
Efficiency (%) 


4 GeV 


level— 1 


84 


58 


(10 31 cm- 2 s- 1 ) 


muComb 


81 


42 


6 GeV 


level— 1 


81 


45 


(10 33 cm" 2 s- 1 ) 


muComb 


66 


17 



Table 7: Efficiency of the TrigDiMuon and Topological di-muon algorithms for J/y reconstructed 
by MuGirl. To estimate the efficiency we used a sample of A/, — > J/yA, where J/y —>■ /J.(pt > 
2.5GeV)/i(pr>4GeV). 
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Figure 5: Efficiency of TrigDiMuon relative to muons identified by MuGirl for the higher pj muon 
(left) and the second muon (right). 



5 Rejection of muons from K and n decays 

The lowest single muon pj threshold in the original ATLAS HLT design [5] was chosen to be 6 GeV. 
This is because below this pj value the rate of muons from K and n decays becomes higher than that 
from b and c decays. Nevertheless, during the low luminosity phase, it is desirable to collect muons 
with lower pr, both for detector and trigger calibration and for initial physics studies. If thresholds are 
lowered, K and % decays become the dominant source of single muon triggers. These decay muons 
must be rejected as early as possible so as not to dominate the single muon trigger rate, thus ensuring 
we can achieve the physics and calibration studies with prompt muons. 

5.1 Description of the method 

The method we describe rejects K and n decays based not on their pj but on the topology of the 
decay. The track position of a prompt muon, extrapolated from the MS to the interaction vertex, is 
a gaussian distributed around the ID track position. The corresponding distribution of a decay muon 
around the light hadron from which it decayed is broader because of the contribution of the decay 
kink in addition to the multiple scattering effect. Thus if the track seen in the inner detector is that of 
the K or n, this discrepancy with prompt muons can be used to reject some of the muons from light 
hadron decays. 

With this method, we can reject the muons from K and % decays by using a matching window 
tuned for prompt muons, with the window width varying according to the track pj. Due to time 
constraints, a precise propagation of tracks in the magnetic field can not be done at level-2, so instead 
the muon track from the MS is extrapolated back to the interaction vertex using a parameterization 
that is a function of measured r\, <j> and pj (exploiting the linear relationship between the bending and 
1 / pj). Multiple scattering effects can be parameterized in the same way to estimate the corresponding 
errors of the back extrapolation, which determine the window size. 

Three main regions are identified for the field parameterization used to propagate the muon tracks, 
one in the barrel and two in the end-cap. To account for the relative inhomogeneity of the magnetic 
field inside each region, the parameters of the back extrapolation are computed as a function of T7, (j>, 
muon charge and spectrometer side (z or — z). A different tuning is used for high and low-pr tracks, 
to take into account the fluctuations of the energy loss in the calorimeter which are important for the 
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propagation of the latter. In the end-cap, the innermost station of the MS does not provide a complete 
geometric coverage, thus for the muons with no hits in the innermost station, a seed from the middle 
station is used to extrapolate the track back. This is the most difficult case since an extrapolation 
through the toroids must be performed and the resulting precision is spoiled by a factor of two with 
respect to the other cases. Therefore the back extrapolation in the end-cap is treated separately for 
regions with inner station coverage and region without inner station coverage. 

The strategy for the level-2 combined muon reconstruction described in [5] is to use only the pixel 
and the SCT data. With this setup, the decays in flight happening near or after the last SCT layer are 
reconstructed using mainly the hits of the decaying K or %. Some rejection of these events can be 
achieved by checking the 77 and 0 position of the extrapolated MS track relative to the ID track with 
a matching window whose size is based on the position spread coming out from the muon multiple 
scattering. Some of the decays between the pixel and the SCT can be rejected by applying a cut on 
the x 2 of the Inner Detector fit. 

The cuts studied, ordered in terms of increasing rejection are: 

• A loose-window cut, using the muon track position from muFast, and the track position from 
the ID reconsrtruction. The window size was tuned to recover almost 100% of the multiple 
scattering for a muon pj equal to the threshold value (4 GeV and 6 GeV); 

• A tight-window cut, refining the muon back-extrapolation by exploiting the measurement of 
the interaction vertex from the ID reconstruction. The window size was tuned to 2.7 a of the 
multiple scattering spread for that muon p T ; the combined p T estimation is used to tune the 
window width; 

• The normalized % 2 of the ID track fit is required to be less than 3.2. 

5.2 Performance of the method 

5.2.1 Rejection results for K and % decays 

The 6 GeV threshold is used as a benchmark to estimate the K and n rejection achieved by the various 
cuts. The forced-decay minimum bias sample has been used to study and tune the cuts to reject muons 
from K and K decays. The results shown in Table 8 are expressed in terms of trigger rate, computed 
using the cross section of these events. 



Cut 


n/K rate (Hz) 


Baseline muComb 


3470 ±380 


Loose window cut 


2920 ±430 (-16%) 


Tight window cut 


2800 ±440 (-20%) 


Tight window + x 2 cut 


2550 ±440 (-26%) 



Table 8: Expected rate with a 6 GeV single muon threshold from the muComb algorithm for the K 
and K decays. The rejection with respect to the baseline muComb algorithm is shown in parenthesis. 
These rates were estimated from the forced-decay minimum bias sample 

Figure 6 shows the trigger efficiency for muons from K and % decays as a function of pr for the 
baseline muComb selection, compared to the tight window selection described above. 
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Figure 6: Efficiency for muons from i<r and % decays as a function of pj for the baseline muComb se- 
lection, compared to the optimized muComb selection described in Section 5. 1 for the 4 GeV threshold 
(left) and the 6 GeV threshold (right). 
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Figure 7: Efficiency of the tight window match on single muons simulated with the aligned detector 
setup. The efficiency drop for the 6 GeV threshold is shown according to different o cuts. 



5.2.2 Efficiency loss for prompt muons 

The efficiency loss for prompt muons due to the matching window cuts has been estimated with a 
single muon sample. Figure 7 shows the relative efficiency obtained for the aligned detector setup. 
The relative efficiency is seen to be almost constant in the pj range of 4-40 GeV and its value for 
the cut at 2.7 a is about 98%. The detector misalignment reduces the efficiency plateau to 95% as 
shown in Figure 8, but the relative efficiency can be recovered by 1% with a specific tuning of the 
back extrapolator. 

The rate reduction for b decays is calculated reliably from the b — > /x(4) +X sample. The results 
are shown in Table 9. A good agreement between b events and single muons is found for the efficiency 
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Figure 8: Efficiency of the tight window match at 2.7 o on single muons simulated with the misaligned 
detector setup. The relative efficiency is shown for the 6 GeV threshold. 



loss due to the tight window match. 



Cut 




rate (Hz) 


Baseline muComb 


4850 ±20 


Loose window cut 


4780 ±20 (-1.5%) 


Tight window cut 


47 10 ±20 (-3%) 


Tight window + % 2 cut 


4560 ±20 (-6%) 



Table 9: Expected rate with a 6 GeV single muon threshold from the muComb algorithm for the b 
component. In parenthesis is the percentage rejection with respect to the baseline muComb. 



5.2.3 Resulting muon trigger rates 

A coherent description of the full trigger rate after the muComb algorithm is obtained using the stan- 
dard minimum bias sample and is shown in Table 10 and in Table 11. All the cuts mentioned were 
applied. A very good agreement is found with both the forced sample for the K 1% component, and 
the b —>■ ji + X sample for the b component. 

The optimized version of muComb improves the rejection of muons from decays in flight by about 
30% at the 4 GeV threshold and of about 20% at the 6 GeV threshold with respect to the baseline 
muComb algorithm. The rejection of muons from b events is 20% at the 4 GeV threshold and 7% at 
the 6 GeV threshold. Thus while the total trigger rate is reduced the purity of the sample increases. 
Given the uncertainties on the estimation of the production cross section for K and n this optimization 
is crucial for the low-pr single muon trigger. 
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Sample 


muFast rate (Hz) 


muComb rate (Hz) 


muComb + nIK cuts rate (Hz) 


KlK 


224 ±15 


210±9 


145 ±9 


b 


145 ±7 


140 ±6 


114±6 


c 


234 ±8 


200 ±8 


168 ±8 



Table 10: Expected output rate of muFast and muComb for a 4 GeV threshold at the 10 31 cm 2 s 1 
Luminosity. 



Sample 


muFast rate (Hz) 


muComb rate (Hz) 


muComb + izIK cuts rate (Hz) 


KlK 


5050 ±760 


3530 ±380 


2860±410 


b 


5550 ±600 


4900 ±400 


4550 ±430 


c 


6900 ±700 


5390 ±420 


5050 ±450 



Table 11: Expected output rate of muFast and muComb for a 6 GeV threshold at the 10 33 cm 2 s 1 
Luminosity. 

Checking the effect of this method to reject muons from K and n decays on the two level-2 di- 
muon strategies showed that both efficiency and trigger rates are reduced. There is no significant gain 
in purity from this method for the di-muon selections, because the rejection is achieved by using the 
J I y reconstruction and mass cuts, and most of the fake rate comes from b and c decays. 

6 Measuring trigger efficiency for low -pr muons from ATLAS data 
6.1 Method description 

Cross section measurements require a good understanding of the efficiency of the event selections. A 
precise understanding of the trigger efficiency is crucial and we must have a strategy for measuring 
it from data with high precision. We study the performance of measuring the trigger efficiency from 
data with the tag-and-probe method which uses di-muon final states for measuring the single muon 
trigger efficiency. In this method, a single triggered muon from a reconstructed di-muon decay of a 
specific particle identified by mass cuts provides the tag that allows us to probe the trigger efficiency 
of the second muon. 

For B physics we are interested in events with rather low-p T muons, so the tag-and-probe method 
for measuring the single muon trigger efficiency using J/xy events is presented. We demonstrate 
that the obtained trigger efficiency can be applied to calculate the di-muon trigger efficiency. This 
principle can also be applied to Z decays to calibrate the high-pj- trigger efficiency [11]. 

6.1.1 Measuring single muon efficiency 

We use the tag-and-probe method to measure the muon trigger efficiency, using as the calibration 
sample events collected by a single muon trigger where the J f\\f is found in the offline reconstruction. 
In this sample, one of the muons forming the J j\\f is triggered, while the other one may or may not 
be triggered, thus providing an unbiased sample of muons to study the single muon trigger efficiency. 

First, the triggered muon is matched to one of the reconstructed muons from an identified J/y . 
This muon is called the tagged-muon. Once the tagged-muon is identified, the other muon, the probe- 
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muon is used to check whether it is also triggered or not. A matching between the reconstructed muon 
and the one found at the trigger level is needed here too. 

At the high level trigger, the position of the muon found at the trigger level is stored, and can be 
compared precisely to the reconstructed muon. However, at level- 1 the position granularity is that of 
the Rol. Due to the limited precision of the location of the Rol, care must be taken when matching a 
muon Rol to the reconstructed muon. The bending of the muon tracks in the magnetic field and the 
fact that high-pr decay muons have small opening angles between them also introduces an ambiguity 
in the matching. 

The single muon trigger efficiency, e^, is calculated as the ratio between the number of probe 
muons which were triggered (N pro b e &triggered) to the total number of probe muons (N pr „b e ), 

Nprobc&iriwml \ 

£lM = — w~ h — ' (1) 

Mprobe 

The single muon efficiency can be obtained in detail as a function of kinematic variables (pr, r\, 
0) of the muons using as fine a binning as the statistics allows. A fine binning is, in fact, necessary 
since the efficiency depends on these variables, especially at level- 1 where there are sharp changes in 
the efficiency due to structural features such as the experiment's support structures. Because of this 
the overall efficiency depends on the distribution of the muons produced. We call the detailed map of 
efficiencies in each region of the phase space a trigger efficiency map. 

Our primary goal is to demonstrate that it is possible to obtain the trigger efficiency map from 
data alone. The di-muon trigger efficiency can then be calculated from it, given the distribution of the 
parent particles and the decay angular distribution. 



6.1.2 Calculating di-muon trigger efficiency 

The di-muon trigger efficiency can be calculated using the obtained single muon efficiencies, taking 
into account the dependence on kinematic variables of the muons. For example, the efficiency of J/y 
particles are different depending on the kinematic distribution of the two decay muons. The J/y 
efficiency, s J ^ ¥ can be calculated using the single muon trigger efficiency map as 

£j/ v (Pt /V ^ J/v ,<I> J/v ) = ^JJ s i M l ^\^')siM 2 ^ 2 ,^ 2 )f(co & e*)dcosd*dr- (2) 

Here, /(cos d*) is the angular distribution of the decay muon from the J/y where 6* represents the 
decay angle of the muon in the J/y rest frame with the z-axis taken as the direction of the J/y 
in the laboratory frame. The variable 0* is the azimuthal angle of the decay muon in the J/y rest 
frame, normalized as / /(cos 6*)dcos 6* = 1. Kinematic variables of the decay muons (pj , Pj 2 ,^^ 1 , 
r]^ 2 , 0 W , <j)^ 2 ) are functions of the // y variables, cos 6* and 0*. To get the overall efficiency of J / y 
events, the integration oiJ/y variables must be performed in the kinematic region of the cross-section 
definition. 

Note that this formula is universal and can be applied to other resonances such as B s j —>■ jijiX 
using the same single muon efficiency map. The cos 6* distribution depends on the polarization state 
of the J/y which reflects the J/y production mechanism. For the unpolarized case, this distribution 
is flat. In certain analysis, the production mechanism of the parent particle could be of interest, so 
we cannot assume the distribution to be flat. In such cases, it is necessary to be able to calculate the 
efficiency as a function of cos 6* as well. In these cases, Equation 2 would become, 

ej^p^^y^, cose*) = ^J eiM (^7^^M^^W O) 
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without the cos 6* integration. Equation 3 is simply expressing the efficiency of events where J / V is 
produced with a fixed momentum and the decay angle is also fixed, so the decay muon momenta are 
also fixed. Some variables could be integrated out, but the important thing is that the efficiency with 
respect to cos 6* can also be obtained from the single muon efficiency map, £\ tl (pj, 7]^, 0^). 

6.2 Performance studies 

In order to emulate the efficiency measurement from data, we use J/y events with two muons in 
the offline reconstruction passing the single muon threshold of 6 GeV and a di-muon invariant mass 
between 2.88 GeV and 3.3 GeV. 

6.2.1 Matching of muons at trigger and reconstruction 

The first step of the tag-and-probe method is to find out which of the two offline muons was triggered. 
This is done by finding the best match using AR = \/At] 2 + A0 2 , where Atj and A0 are the difference 
of tj and 0 between the offline muon and the triggered muon. For the matching with level-2 muons, 
track parameters at the perigee are used. On the other hand, for level- 1, the position of the triggered 
muon is taken as the center of the Rol. In this case, the matching must be done carefully as the opening 
angle of the two muons from the J fry is small and the bending, of the rather low-pr muons, in the 
magnetic field is non-negligible. The offline muon tracks are extrapolated to the plane of the RPC or 
TGC chamber which defines the Rol. 

Figure 9 shows the AR distrbution between the level- 1 Rol and offline track with and without 
using the extrapolation. The improvement obtained by the extrapolation is significant and as a result 
a good matching is established by requiring AR < 0.15. To further reduce the possibility of having 
a wrong match, in this study we only use events where the opening angle between the two muons is 
AR > 0.4. 

Figure 10 shows the AR distribution between the level-2 muon and the offline track. The resolution 
of AR becomes an order of magnitude better than at level- 1 since the level-2 muon tracks use the 
measurements from the ID. The condition AR < 0.005 is used for the matching between the offline 
track and the level-2 track. 




Figure 9: The distribution of AR between the level- 1 Rol and the offline muon track, (a) using the 
offline track parameters at the perigee and (b) by extrapolating the offline track to the Rol position. 
The dashed line shows the value where the cut was applied for the matching. 
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Figure 10: The distribution of AR between the level-2 muon and the offline muon track. The dashed 
line shows the value where the cut was applied for the matching. 



6.2.2 Level- 1 efficiency 

Using the method described in Section 6.1 and the matching criteria, we obtain the single muon 
efficiency as a function of the pj of the muon. Figure 11 shows the single muon efficiency as a 
function of pj by evaluating how often the probe-muon was triggered. The points are fitted with the 
function 

Also shown in Figure 11 is the efficiency measured directly in the single muon Monte Carlo sample 
which provides an unbiased value of the efficiency. The agreement of the efficiencies obtained by the 
two methods is around 5% in the turn-on region (4 GeV < pj < 8 GeV) and becomes smaller as the 
Pt increases, becoming within a few percent at pj > 10 GeV. 

To calculate the di-muon trigger efficiency using Equation 2, the efficiency curve must be mea- 
sured in each r\ and 0 region. For this, we divided the detector into 10 x 10 regions for the barrel 
(—1.05 < T/ < 1.05). For the end-cap region (1.05 < |tj| < 2.4), we assumed that there is a complete 
symmetry between octants and divided one octant into 8x6 regions. Efficiency curves as a function 
of pj are obtained in each of the regions. Figure 12 shows the three fit parameters, A, a and b in 
Equation 4 as a function of tj and 0 . The figure shows that the efficiency curve behaves differently 
for different regions but in a smooth way, except for a small region around tj = 0.725 and 0 = — 1.6. 
In this region the efficiency is very low due to the MS layout and as a result the fit is unreliable. These 
results are used to calculate the di-muon efficiency. 

Distributions of offline J/y variables, p J j V , r\ J l w and cos0* are shown in Figure 13 after ap- 
plying the selection criteria of muons (p£ > 6 GeV and \t] il \ < 2.4). The open histograms are for all 
J/y in the MC sample and the filled histograms are for events where the level- 1 di-muon trigger, 
requiring at least two muons with pj > 6 GeV, has fired. The generator level filter with the pj cut for 
the highest (second highest) p T muon of pr > 6(4) GeV was applied for the MC sample. 

The cos 6* distribution reflects the polarization state of the J/y and is flat for the non-polarized 
case. Since the cos(9* distribution is an interesting quantity to measure in its own right, we do not 
make any assumptions about this distribution but instead try to measure the efficiency as a function 
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Figure 11: The overall efficiency of the level- 1 single muon trigger with respect to the offline selection 
obtained by the tag-and-probe method (filled circles) and the efficiency estimated from a single muon 
Monte Carlo sample (open circles). The curve is a fit to the efficiency obtained by the tag-and-probe 
method from Equation 4. 



of cos0*. Although the generated cos0* distributions were flat, reconstructed distributions will be 
biased by the pj cuts applied at the generator level. At |cos0*| = 1, one of the decay muons flies 
in the opposite direction to the J /\\f and has low transverse momentum, therefore these events are 
more likely to be rejected by the pr cut for the highest (second highest) pj muon of pj > 6(4) GeV 
requirement. 

Figure 14 shows the di-muon trigger efficiencies calculated in two methods. The open circles 
are obtained by checking the decision of the level- 1 di-muon trigger for each event. The efficiencies 
calculated using the parameterization in Figure 12 are shown by the filled circles. They are calculated 
by assigning an efficiency for each event according to the kinematics of the two muons in the final 
state using Equation 3. The same technique may be used once data are collected by the experiment. 
The effect of the systematic uncertainty of the method has been estimated by changing the procedure 
to obtain the trigger efficiency map, namely by using 16 x 16 bins in r\-§ in the barrel and 15 x 10 in 
the endcap. In addition, the fitting function has been changed by adding a linear function c(pj —d) to 
the original function for pj > d. This was done to better describe a drop of efficiency for higher pj. 
c and d are additional fit parameters. Both statistical and systematic errors are a few % in most of the 
region, but the systematic uncertainty increases where the change of efficiency is rapid. In Figure 14, 
the statistical and systematic errors are added in quadrature. 

Results of the two methods agree within a few % in most regions. Efficiency losses at r\ ] lv 
around -1,0 and +1 are due to the layout of the muon trigger chambers. These plots confirm that the 
requirement of pj > 6 GeV on two muons introduces an effective cut of pj > 12 GeV for the J/y/ 
and the efficiency with respect to cos(9* is flat across the region between —0.8 < cos(9* < 0.8. The 
overall efficiencies calculated in the kinematic region of p J T /¥ > 12 GeV and \t] j Iv\ < 2 are 76.1% and 
77.0% using the trigger decision and the trigger efficiency map, respectively, which is in agreement 
within the statistical and systematic errors. The size of the systematic errors may be improved by 
creating the efficiency map with finer granularity, which requires more statistics. 
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Figure 12: Fit parameters (A, a and b in Equation 4) of the efficiency curve in different X] and 0 
regions. 
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Figure 13: Distributions of J /iff variables, pj , 77 and cosd*. The open histograms are for all 
reconstructed J/ys with the generator level cut of pj 1 > 6 GeV and p^ 2 > 4 GeV. Filled histograms 
are distributions of events passing the level- 1 di-muon trigger. 
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Figure 14: Di-muon trigger efficiency with p^>6 GeV. Open circles are the result obtained from 
decision of the level- 1 di-muon trigger and filled circles are the efficiency obtained using the parame- 
terization shown in Figure 12. 
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6.2.3 Level-2 efficiency 

The single muon level-2 efficiency can be defined in two ways: level-2 efficiency with respect to 
offline reconstruction (L2/rec) and level-2 efficiency with respect to level- 1 (L2/L1). The L2/rec ef- 
ficiency is obtained in the same way as the efficiency with respect to level- 1. The only difference is 
that the probe muon must have both level- 1 and level-2 trigger objects associated to be considered as 
triggered. 

For the calculation of the L2/L1 efficiency the set of probe muons is restricted to only those that 
have an associated level- 1 Rol. This way the efficiency with respect to level- 1 is obtained. Overall 
efficiencies as a function of pj are shown in Figure 15. The points were fitted with the functional 
form of Equation 4. Efficiency curves calculated using all offline muons matched to generated muons 
are plotted in the same figure to check that the selection of probe muons is unbiased. It is clear that 
there is a good agreement between the measured efficiency and the direct Monte Carlo efficiency. 



* 1 1- 
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Figure 15: The overall efficiency of the level-2 single muon trigger, (a) with respect to offline recon- 
struction, (b) with respect to level- 1. 

To calculate level-2 di-muon efficiency the trigger efficiency map was created in the same way as 
for level-1. The tj-0 plane was divided into regions as described in Section 6.2.2 and in each region 
an efficiency curve was constructed and fitted with Equation 4. This was done for L2/rec as well as 
for L2/L1 efficiencies. 

Using the trigger efficiency map for the L2/rec efficiency, we can calculate the level-2 trigger 
efficiency. To check that the map was created correctly, the single-muon efficiency curves were con- 
structed using level-2 muons associated to the offline muons with the AR matching criteria explained 
in Section 6.2.2. To get an agreement between the two methods one must use the same data sample. In 
Figure 16, open triangles represent a straightforward calculation of the efficiency using matched trig- 
ger objects while the solid circles are the efficiencies calculated using the map. To each offline muon 
from the sample the probability that it would be triggered was assigned using the map. In each p T 
bin the efficiency was calculated as an average of these probabilities. Systematic uncertainties were 
estimated in the same way as the level-1 efficiency by changing the procedure to calculate the trigger 
efficiency map. The same method was used for other two variables Y\ and <j>. The two methods give 
an agreement within 6% in most regions while the difference gets as large as 15% at some regions 
(0 ~ - 1 , -2) where the efficiency is low and therefore the precision of the fit to create the trigger 
efficiency map was poor. Also, since the average efficiency is calculated in each bin of the map this 
causes discrepancies in the regions where efficiency changes rapidly. 
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Figure 16: The overall L2/rec single-muon trigger efficiency as a function of pj , r\ and 0. Efficiency 
is calculated using the trigger efficiency map (black circles) and it is compared to the one calculated 
using matched level- 1 and level-2 trigger objects (open triangles). 
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The efficiency of the level-2 di-muon trigger for J/y was calculated in the same way as for level - 
1. Again two methods were used: firstly direct calculation using the trigger decision for the di-muon 
trigger and secondly using the efficiency map. Since a simple di-muon trigger without any cut on 
invariant mass was used, all the muons in the sample must be used in calculation of the efficiency (not 
just those from J/y). To each offline muon in the event the probability £,• that it would be triggered 
was assigned. The probability that the whole event will be triggered by the di-muon trigger is then 

(5) 



e 2 " = ^-W- £ i)-L e iW- e j) 



where the products and sums run over all decay muons used in the measurement. 

The di-muon efficiency is calculated in the same way as the single-muon efficiency, as an average 
of probabilities £ 2 ^ in each bin of a given variable. In our case it is either the pj ,T\,ty or the cos 8* 
of the J/y reconstructed in the event. Figure 17 shows a comparison of the J/y efficiency curves. 
Like the single muon efficiency results, the agreement between the two methods is within 6% in most 
regions except for some regions where the available statistics was low. The overall J/y efficiency 
(L2/rec) in the kinematic region, p J J v > 15 GeV, \r\ J lv\ < 2 with the cuts on the muons (pj > 6 GeV, 
|tt/| < 2.4) is 69.2% and 69.4% using the trigger decision and the trigger efficiency map, respectively. 
We have a good agreement on the overall efficiency integrated in the above phase space. 
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Figure 17: The overall di-muon J/y trigger efficiency as a function of pj ,f],^> and cos 6*. Efficien- 
cies from the trigger decision bit (open triangles) and the ones calculated from the trigger efficiency 
map (black circles) are shown. 



In Figure 18 the efficiency as a function of the distance, AR, in the rj-(j) plane between the two 
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J/y muons is shown. 



J/iC efficiency from trigger decision 
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Figure 1 8 : The overall di-muon J/y trigger efficiency as a function of AR. Efficiencies from the trigger 
decision bit (open triangles) and the ones calculated from the trigger efficiency map (black circles) are 
shown. 



6.3 Requirements on the trigger menu 

In order to use the method developed here for real data, we need a large sample of J/y events with at 
least one muon unbiased by the trigger selection. The following selection criteria at each trigger level 
will satisfy this requirement: 

Level-1 Single muon trigger above a certain threshold; 

Level-2 J / y reconstruction within one Rol using the TrigDiMuon algorithm to enhance // y events; 
Event Filter (EF) No further selection is imposed so as to avoid biasing the sample; 

In this Section, we give some estimates of the statistics of the J / y sample using this trigger 
selection and show the precision on the trigger efficiency with a certain luminosity. Taking into 
account the limit on the EF output rate of 200 Hz it is plausible to use a few Hz of the bandwidth for 
this calibration trigger. The parameters to optimize are the prescale factor, to reduce the rate when it 
is too high, and the threshold value. 

The level-1 single muon trigger rate has been studied in detail taking into account contributions 
from different sources. At low-pr , the main contribution is from K In in-flight decay and muons 
from b and c quarks. The rejection of these non-J /y events by the topological di-muon trigger at 
level-2 has been studied using a genererated sample of bb — > }l +X, where the efficiency of non-J/y 
events to be selected was found to be 0.8%. This factor is used to estimate the rate reduction by the 
level-2 selection. 

Table 12 shows the expected rate of the level-1 single muon trigger and the contribution of the 
J/y —> jJ. + jJ.~ to the rate assuming a luminosity of 10 31 cm~ 2 s _1 . The rate of J/y events is the sum 
of J/y direct production and from bb production. If we could allocate a bandwidth of 1 Hz to this 
trigger chain with 6 GeV level- 1 threshold, we must apply a prescale factor of 3 to reduce the rate 
down to around 1 Hz. Given the fraction of J/y — > jJ. + jJ.^ events among this rate is 6%, we get a rate 
of 0.06 Hz for collecting the calibration sample. 
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rate (Hz) 


J/yr fraction 


level- 1 


380 (0.21) 


0.05% 


level-2 


3 (0.19) 


6% 



Table 12: The rates after level-1 and level-2 using the proposed calibration trigger for a luminosity of 
10 31 cm _2 s _1 with the threshold of p T > 6 GeV. The contribution of //V — > jU + M~ process to the 
rate is also shown in parentheses. 

With one year of data, we expect to collect about 300 kJ/yr events according to this triggering 
strategy, which is comparable to the statistics used in this study (150 k direct J/yr and 150 k bb — > 
J j yr). Therefore, we expect a similar performance (for ~ 1 Hz calibration trigger rate) to that shown 
in Section 6.2 after the first year of data-taking. The number of events may increase if we allocate 
more than 1 Hz for the calibration trigger. 

7 Conclusions 

In this note we presented methods to efficiently select J/yr events at the second level trigger, reject 
muons from K and n decays and measure the trigger efficiencies from the ATLAS data. 

TrigDiMuon is a second-level trigger algorithm that selects efficiently at level-2 events which 
include J/yr or other di-muon states, starting from a single level-1 muon trigger. The efficiency of 
TrigDiMuon for events accepted by level-1 and the level-2 single muon trigger is between 73% for the 
4 GeV trigger threshold and 60% for the 6 GeV threshold. This may be compared with the topological 
di-muon trigger efficiencies of 33% for the 4 GeV threshold and 15% for the 6 GeV threshold. The fake 
trigger rates of TrigDiMuon are estimated to be 2 Hz for a trigger threshold of 4 GeV at a luminosity 
of 10 31 cm _2 s _1 , and 90 Hz for a trigger threshold of 6 GeV at a luminosity of 10 33 cm~ 2 s _1 . For the 
B-physics trigger at L = 10 33 cm _2 s _1 the rate will have to be further reduced by means of a decay 
length cut on the J/yr decay vertex. 

Extrapolating the muon track from MS back to the interaction vertex improves the single muon 
trigger selection at the level-2 stage and allows increased rejection of muons from K and % decays 
without a significant loss of efficiency for b events. The back extrapolator provides a further reduction 
factor of about 20% at the trigger threshold of 4 GeV and of about 10% at the trigger threshold 
of 6 GeV with respect to the output trigger rate of the baseline muComb selection. Despite this 
good performance, its use for triggering on low pj di-muon objects is not recommended, since the 
background to TrigDiMuon is dominated by muons from b and c rather than by muon from K and % 
decays. However, because the rate of muons from K and % decays may be even higher than in our 
simulation, it is important to have it available for the single muon triggers. 

The J /iff trigger efficiency can be measured from ATLAS data using the tag-and-probe method. 
As the efficiency of the single muon trigger depends on the tj and 0 regions, it is necessary to measure 
the efficiency as a function of these variables. With 300k J/yr events, it is possible to measure the 
efficiency at level-1 and level-2 with better than 5 % precision for each region. The uncertainty comes 
mainly from the lack of statistics to measure the efficiencies for each region of tj and 0 and will 
improve as more J/yr events become available. To collect an unbiased J/yr sample which can be 
used for the trigger efficiency measurement, we plan to use a level-1 single muon trigger with a J/yr 
reconstruction using the inner detector at level-2. With this trigger we can collect around 300k events 
for an integrated luminosity of 100 pb _1 while keeping the rate of this calibration trigger around 1 Hz. 
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With the methods provided it is possible to collect a large number of J /\\r events at luminosities 
of around 10 31 cm~ 2 s _1 , without an overly high rate from K and n decays. Furthermore, these low-pr 
J I y events can also be used to calibrate trigger efficiencies from the actual ATLAS data, and provide 
the trigger efficiencies for analyses that use low-p^ muons. 
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Abstract 

Results are reported on an analysis on simulated data samples for production 
of heavy quarkonium states J '/yf—> and corresponding to an inte- 

grated luminosity of 10 pb _1 . It is shown that the pj dependence of the cross- 
section for both J/yr and T should be measured reasonably well in a wide 
range of transverse momenta, pj ~ 10 — 50 GeV. The precision ofj/y polari- 
sation measurement is expected to reach 0.02 — 0.06, while the projected error 
on T polarisation is around 0.2. Observation of radiative decays of % c states, 
and the feasibility of observing Xb —> J/yJ/Y decays are also discussed. 

1 Introduction and theoretical motivation 

The number of J/yr — > }X + }X~ and T — > }X + }X~ decays produced at the LHC is expected to be quite large. 
Their importance for ATLAS is threefold: first, being narrow resonances, they can be used as tools for 
alignment and calibration of the trigger, tracking and muon systems. Secondly, understanding the details 
of the prompt onia production is a challenging task and a good testbed for various QCD calculations, 
spanning both perturbative and non-perturbative regimes. Last, but not the least, heavy quarkonium 
states are among the decay products of heavier states, serving as good signatures for many processes of 
interest, some of which are quite rare. These processes have prompt quarkonia as a background and, as 
such, a good description of the underlying quarkonium production process is crucial to the success of 
these studies. 

This note mainly concentrates on the capabilities of the ATLAS detector to study various aspects of 
prompt quarkonium production at the LHC. The methods of separating promptly produced J/yr and T 
mesons from various backgrounds are discussed, and strategies for various measurements are outlined. 

1.1 Theory overview 

Quarkonium production was originally described in a model where the quark pair was assumed to be 
produced endowed with the quantum numbers of the quarkonium state that it eventually evolved into [1]. 
This approach, subsequently labelled as the Colour Singlet Model (CSM), enjoyed some success before 
CDF measured an excess of direct J/yr production [2], more than an order of magnitude greater than 
predicted (see Figure 1(a)). 

The Colour Octet Model (COM) [5] was proposed as a solution to this quarkonium deficit. COM 
suggests that the heavy quark pairs produced in the hard process do not necessarily need to be produced 
with the quantum numbers of physical quarkonium, but could evolve into a particular quarkonium state 
through radiation of soft gluons later on, during hadronisation. This approach isolates the perturbative 
hard process from the non-perturbative long-distance matrix elements, which are considered as free pa- 
rameters of the theory. However, their universality means that their values can be extracted independently 
from a number of different processes, such as deep inelastic scattering, hadro- and photoproduction. 

Hence the good description of the Tevatron data by the Colour Octet Model shown in Figure 1(a) 
is, at least in part, due to the fact that the values of some parameters were determined from the same 
data. Tests of other COM predictions have not been so successful: Figure 1(b) shows the polarisation 
coefficient in T —> jijX decay as a function of its transverse momentum, where the COM prediction 
disagrees with the data. 

A model based on kj factorisation in QCD showers [6] claims to be able to describe both the lack 
of transverse polarisation in J/yr decays [4,7] and the high cross-section oiJ/yr production. Another 
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Figure 1: (a) Differential cross-section of J/yr production at CDF, with predictions from CSM 
and COM mechanisms (from [3]). (b) T polarisation measured as a function of pj at D0 (black 
dots) and CDF (green triangles), compared to the limits of the kj factorisation model (dashed and 
dotted curves [6]) and COM predictions [5], depicted by a shadowed band (from [4]). 



model [8] argues that the deficit in the cross-section as predicted by CSM can be largely explained by 
the production of a quarkonium state in association with an additional heavy quark, and also predicts 
lower levels of polarisation. 

In the following, we show that ATLAS is capable of detailed checks of the predictions of various 
models by measuring not only pj and r\ distributions of onium states in a wide range of these variables, 
but also the degree of polarisation and the production of C-even states. In the absence of a comprehensive 
Monte Carlo generator capable of simulating all aspects of all theoretical models, we used the Pythia 
6.403 generator [9] incorporating the Colour Octet Mechanism, with model parameters fixed through 
a combination of theoretical and experimental constraints [10]. Inevitably, this simulation is unable to 
reproduce adequately some features of the data, notably the polarisation angle distributions and hadronic 
accompaniment of the quarkonium states. However, the simulated samples allowed us to study the 
acceptance and efficiency of ATLAS to detect all required particles and measure their parameters, across 
the whole range of the accessible phase space. 

1.2 Classification of production mechanisms in the simulation 

In the following, we will use a simple classification of the quarkonium production mechanisms based on 
the model implemented in the Pythia generator. 

A sample diagram describing the leading colour-singlet subprocess g + g—>J/\}f + gis shown in 
Figure 2(a). In the accessible range of transverse momenta of J/y its contribution is expected to be 
small. The dominant contribution at the lower pj comes from the subprocess shown in Figure 2(b), 
where both singlet and octet cc states with various quantum numbers contribute to J/xy production, 
through Xcj —> J /W+ / decays and/or soft gluon emission. 

At high pj, the gluon fragmentation subprocess shown in Figure 2(c) becomes increasingly domi- 
nant. According to COM, this is unlikely to produce anything other than 3 S\ quarkonium states. Hence, 
the fraction of J/yr mesons produced from % c j decays should decrease with increasing pj. The pro- 
duction mechanisms for the radially excited y/"'(3686) meson follows the same pattern, except for the 
absence of respective %' cJ contributions, thus one should expect different pj distributions for J ' j\i and 
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(a) (b) (c) 



Figure 2: Some example diagrams for the singlet and octet J/y production mechanisms imple- 
mented in Pythia. 



The overall picture is expected to be similar for bottomonium production, except the number of 
radial excitations below the open beauty threshold is now three, and many more radiative transitions are 
possible between the various n 3 Pj and n 3 Si state. However, compared to J/yr, the accessible range of 
Pt for T is significantly extended towards smaller transverse momenta. This opens up the range of pj 
dominated by the colour singlet contribution, which may make it directly observable for T . 

1.3 Xb^J/yJ/yfecay 

Despite much higher production cross-sections, C-even states of quarkonia are far more difficult to ob- 
serve than their vector counterparts. The usual way of studying %c,b (and r\ c ^) states has been so far 
through radiative decays of or into respective vector states. However, in the high energy hadronic colli- 
sion environment, observation of the photon in %b — ¥ 1° + 7 ma y be problematic (see Section 5. 1). 

We have performed a feasibility study to assess the capability of ATLAS to observe Xb^J/W/Y^ 
ix + fl~ix + ll~ decay with the standard di-muon trigger. The results are presented in Section 5.2. The 
observation and measurement of these final states will give a valuable insight into the heavy quark bound 
state dynamics from several separate viewpoints. 

2 Trigger considerations 

Details of the triggers to be used in ATLAS B physics programme can be found in [11]. This section 
discusses the trigger signatures relevant for quarkonium production at ATLAS, the implications they 
have on the measured cross-section, and the expected effects they have on our ability to make various 
physics measurements. 

Two specific types of di-muon triggers dedicated to quarkonium are: the topological di-muon trig- 
gers, which require two level- 1 regions of interest (Rols) corresponding to two muon candidates with pj 
thresholds of 6 and 4 GeV, and di-muon triggers that only require a single level- 1 Rol above a threshold 
of 4 GeV and searches for the second muon of opposite charge in a wide Rol at level-2. They are dis- 
cussed in Section 2.1. An additional trigger scenario is based on a single muon trigger with a higher pj 
threshold of 10 GeV, discussed in Section 2.2. 

2.1 Di-muon triggers 

Being able to determine the trigger efficiency of measured J jyr and T is crucial to correctly infer the 
production cross-section of quarkonium at the LHC. Indeed, using J/yr and T (as well as the Z boson) 
to construct a trigger efficiency map is a necessary step in order to perform cross-section measurements 
in ATLAS. 
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Studies are being conducted in ATLAS into developing a calibration method to obtain the low-pj- 
single muon trigger efficiency, and henceforth the di-muon trigger efficiency of events using real data by 
virtue of the so-called tag-and-probe method (see Ref. [11] for details). In the absence of data, we have 
performed our own studies of trigger efficiencies, based on Monte Carlo simulation. 

If not stated otherwise, the quoted trigger efficiencies have been calculated with respect to the Monte 
Carlo samples, generated with the cuts Pt{H\) > 6 GeV, pr(H2) > 4 GeV, where jiti (/I2) is the muon 
with the largest (second largest) transverse momentum in the event. 

The level- 1 trigger is a hardware trigger that uses coarse calorimeter and muon spectrometer infor- 
mation to identify interesting signatures to pass to the level-2 and Event Filter stage. Figure 3 shows the 
various individual level- 1 trigger efficiencies as a function of the pj of the di-muon system. The total 
trigger efficiency at level-1, running over direct J/yr events, is 87%. 




0 5 10 15 20 25 30 

P T (GeV) 



Figure 3: Efficiency of various level-1 triggers for prompt J/y events versus pj of the di-muon 
system. Only the triggers with efficiencies greater than 2% in some region of pj are displayed. 
The relevant triggers are the single muon pj threshold triggers labelled L1_MUXX (where XX indi- 
cates the pj threshold in GeV) and the di-muon trigger L1_2MU06. Each pj range of these triggers 
is exclusive. The efficiency curve labelled LVLl_Muon is the sum of all level-1 single muon effi- 
ciencies (excluding the di-muon trigger L1_2MU06). 



The level-2 trigger is software-based and is designed to reduce the output rate of the data, passed to 
it from level-1, by two orders of magnitude. Within regions of interest defined by the level-1 trigger, full 
granularity of the detector is accessible. The efficiency of the level-2 triggers for prompt J/y events is 
plotted in Figure 4 as a function of di-muon pj. The total level-2 trigger efficiency in the reconstructed 
prompt J/y events (relative to level-1) is 97%. The di-muon trigger scenario /i6/i4, considered in the 
majority of this note, uses all the above trigger signatures. 

2.1.1 Effect of di-muon trigger cuts on quarkonium rates 

Figures 5(a) and 5(b) illustrate the distribution of cross-sections across the values of the pj of the harder 
and softer muon from the quarkonium decay without any muon cuts applied at generator level. The lines 
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Figure 4: Efficiency of various level-2 triggers for prompt J/yr events versus pj of the di-muon 
system. Only the triggers with efficiencies greater than 2% in some region of pj are displayed. 
The level-2 trigger signatures of interest include single muon pj threshold triggers L2_MUXX and 
TrigDiMuon' triggers L2_BJpsimuXmuY which are specialised for searching for J/y [11]. The 
efficiency curve labelled LVL2_Pass in (b) is the sum of all level-2 efficiencies. 



overlaid on the plots represent various nominal muon pj thresholds: (6 GeV, 4 GeV) and (4 GeV, 4 GeV), 
as well as the nominal thresholds (10 GeV, 0.5 GeV) corresponding to the single muon trigger /xlO (see 
below). 

For J/y the bulk of the cross-section lies near the (4 GeV, 1 GeV) region, far from the low-pj- muon 
trigger thresholds proposed for ATLAS, and we see only a small increase in accessible cross-section 
by lowering the cut on the harder muon from 6 GeV to 4 GeV (although this reduction in the effective 
J/y pj threshold is useful from a physics standpoint). The situation for T is significantly different 
however, as the relatively large mass of the T shifts the bulk of the production to the region near muon 
pj thresholds of (5 GeV, 4 GeV). This means that by lowering the di-muon trigger cuts from (6 GeV, 
4 GeV), which sits just above the highest density area of T production, to (4 GeV, 4 GeV), a much 
higher fraction of the produced T can be recorded, leading to a predicted order-of-magnitude increase 
in the accessible cross-section. The predicted cross-sections for the processes pp — > J /y[jx + ji^)X and 
pp — > Y(jU + jU~)X (before incorporating trigger and reconstruction efficiencies) for a number of trigger 
scenarios are presented in Table 1. Although no higher y and T states have been simulated for this 
analysis, their expected cross-sections are also shown in Table 1, as estimated using Tevatron results on 
their relative yields [12]. Numbers include feed-down from % states and higher radial excitations to 
lower ones. Due to the expected ATLAS mass resolution for the T states, however, it is unlikely that 
the higher state resonances will be separable. These predictions have been obtained by extrapolating the 
Colour Octet Model, tuned to describe the Tevatron results, to the LHC energy. Although every care have 
been taken to ensure stability of this extrapolation, inevitably there is an uncertainty in the overall scale 
of the predicted cross-sections (linked to the uncertainties in the partem distribution functions at small x), 
which we estimate at the level of ±50%. 
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(a) J/y (b) T 

Figure 5: Densities of J/yf^ MM ( a ) an -d MM 0 3 ) production cross-section as a function of 
the two muon transverse momenta. No cut was placed on the generated sample. The overlaid lines 
represent the nominal thresholds of observed events with various trigger cuts applied: }x6}xA (solid 
line), ^4^4 (dashed line) and /1 10+track (dash-dotted line). 



It is likely that the cross-section accessible by ATLAS will be higher than the values quoted in 
Table 1, as during early running the low pj muon trigger will run with an open coincidence window 
in 77 at level- 1 and no requirement of an additional level-2 di-muon trigger. This trigger item has a 
turn-on threshold at around 4 GeV, giving the (4 GeV, 4 GeV) trigger scenario described above, but in 
practice there is a non-zero trigger efficiency below 4 GeV, which, combined with the large rate of low 
Pt onia, may add a significant extra contribution to the overall observed cross-section. Even including 
this contribution, the overall rate of signal events from all quarkonium states is likely to remain below the 
rate of 1 Hz at a luminosity of 10 31 cm _2 s _1 , which is a small fraction of the available trigger bandwidth. 



Quarkonium 




Cross-section 
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\1A\1A 


IX6/J.4 
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/x6^4n/xl0 


J/y 


28 
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0.8 


0.8 


0.2 


T(1S) 


48 


5.2 


2.8 


0.8 


T(2S) 


16 


1.7 


0.9 


0.3 


T(3S) 


9.0 


1.0 


0.6 


0.2 



Table 1 : Predicted cross-sections for various prompt vector quarkonium state production and de- 
cay into muons, with di-muon trigger thresholds and jU6/i4 and the single muon trigger 
threshold /^10 (before trigger and reconstruction efficiencies). The last column shows the overlap 
between the di-muon and single muon samples. 
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2.1.2 Effect of trigger cuts on analysis of octet states 

As discussed above, the quarkonium cross-section is composed of three main classes of processes: direct 
colour singlet production, colour octet production and singlet/octet production of % states. Figure 6 
illustrates the contributions of these three classes to the overall production rate for T , once the pj trigger 
cuts of 6 and 4 GeV are applied to the muons. Lower pj trigger cuts will strongly enhance the T rate and 



Figure 6: Expected pj -distribution for T production, with contributions from direct colour singlet, 
singlet X production and octet production overlaid. 



allow for analysis of colour singlet production, which is expected to dominate for T with pj < 10 GeV. 
Lower trigger cuts available during early running, such as the \i4\i4 trigger described above, will allow 
the opportunity to extend the low-pj region down to pj ~ 0 in the case of T and help separate octet and 
singlet contributions. 

2.1.3 Acceptance of cos 6* with di-muon triggers 

An important consideration for calculating the di-muon trigger efficiencies ofj/y and T is the angular 
distribution of the decay angle 6*, the angle between the direction of the positive muon (by convention) 
from quarkonium decay in the quarkonium rest frame and the flight direction of the quarkonium itself in 
the laboratory frame (Figure 7). 




Figure 7: Graphical representation of the 6* angle used in the spin alignment analysis. The angle is 
defined by the direction of the positive muon in the quarkonium decay frame and the quarkonium 
momentum direction in the laboratory frame. 



The distribution in cos d* may depend on the relative contributions of the various production mech- 
anisms, and is as of yet not fully understood. Crucially, Monte Carlo studies have shown that different 
production mechanisms (and thus different angular distributions) can have significantly different trigger 
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acceptances, and without the measurement of the spin-alignment of quarkonium it will be difficult to be 
sure that the full trigger efficiency has been calculated correctly. 

It is clear that cos 6* ~ 0 corresponds to events with both muons having roughly equal transverse 
momenta, while in order to have cos0* close to ±1 one muon's pj needs to be very high while the 
other's pj is very low. In the case of a di-muon trigger, both muons from the J/\jf and T decays must 
have relatively large transverse momenta. Whilst this condition allows both muons to be identified, it 
also severely restricts acceptance in the polarisation angle cos 6*, meaning that for a given pjofJ/yov 
T a significant fraction of the total cross-section is lost. 

Examples of the polarisation angle distributions for the trigger are shown by solid lines in 

Figure 8. Here, the samples for both J/yr and T were generated with zero polarisation, so with full 
acceptance the corresponding distribution in cos0* should be flat, spanning from —1 to +1. Clearly, 




(a) J/y (b) T 

Figure 8: Reconstructed polarisation angle distribution for di-muon triggers (solid line) and 
a jx 10 single muon trigger (dashed line), for J/y (a) and T (b). The distributions are normalised 
to unit area. The generated angular distribution is flat in both cases. 



narrow acceptance in | cos 6* | would make polarisation measurements difficult. 
2.2 Single muon trigger 

Another possibility for quarkonium reconstruction is to trigger on a single identified muon. The non- 
prescaled level- 1 single muon trigger L1_MU10 with a 10 GeV pj threshold is expected to produce man- 
ageable event rates at low luminosities [11]. Once this muon triggers the event, offline analysis can 
reconstruct the quarkonium by combining the identified muon with an oppositely-charged track in the 
event. In Figure 5 this trigger corresponds to the dash-dotted lines, with the predicted cross-sections 
also shown in Table 1. With this trigger (referred to as /xlO in the following) one removes the need for 
the other muon to have a large pj, i.e. one has a fast muon, which triggered the event, and one track, 
whose transverse momentum is only limited by the track reconstruction capabilities of ATLAS, with the 
threshold around 0.5 GeV. 

Thus, the onium events with a single muon trigger typically have much higher values of | cos 0*|, as 
illustrated by the dotted lines in Figure 8, complementing the di-muon trigger sample. So, the single- 
and di-muon samples may be used together to provide excellent coverage across almost the entire range 
of cos 6* in the same pj range of onia. 

It's worth noting that the di-muon and single muon samples have comparable cross-sections and 
similar pj dependence. They are not entirely independent: at high transverse momenta the two samples 
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have significant event overlap (see Table 1 for more details), which could be useful for independent 
calibration of muon trigger and reconstruction efficiencies. 

3 Reconstruction and background suppression 
3.1 Quarkonium reconstruction with two muon candidates 

In each event which passes the di-muon trigger, all reconstructed muon candidates are combined into 
oppositely charged pairs, and each of these pairs is analysed in turn. The invariant mass is calculated 
and, if the mass is above 1 GeV, the two tracks are refitted to a common vertex. If a good vertex fit is 
achieved, the pair is accepted for further analysis. If the invariant mass of the refitted tracks is within 
300 MeV of the nominal mass in the case of J/y, or 1 GeV in the case of T , the pair is considered as a 
quarkonium candidate. The values quoted by the Particle Data Group [13], 3097 MeV and 9460 MeV for 
J/y and T respectively, are used throughout this paper, and the widths of the mass windows are chosen 
to be about six times the expected average mass resolution (see Table 2). 

For those pairs for which the vertex fit is successful (more than 99% for both J/y and T ), the 
invariant mass is recalculated. The invariant mass resolution depends on the pseudorapidities of the two 
muon tracks. To illustrate this effect, all accepted onia candidates are divided into three classes depending 
on X] of the muons, and Gaussian fits are performed to determine the resolutions and mass shifts. The 
results are presented in Table 2. It is found that the mass resolution is the highest when both tracks are 



Quarkonium 


M rec — MpDG, MeV 


Resolution a, MeV 
Average || Barrel | Mixed 


Endcap 


J/w 
r 


+4±1 
+ 15±1 


53 
161 


42 
129 


54 
170 


75 
225 



Table 2: Mass shifts and resolutions for di-muon invariant mass distributions after the vertex fit, 
for J/y andT candidates. 

reconstructed in the barrel area, |tj| < 1.05, degrades somewhat if both tracks are reconstructed in the 
endcap regions, |t] | > 1.05, and is close to its average value for the mixed tj events, with one muon in the 
barrel and the other in the endcap. It should be noted that no significant non-gaussian tails are observed 
in either of these mass distributions, and the fit quality is good. Also shown in the table are the shifts of 
the mean reconstructed invariant mass from the respective nominal values. The observed mass shifts are 
due to a problem with simulation of material effects in the endcap, which has since been understood and 
corrected. 

The reconstructed muon pairs that remain after vertexing cuts are considered to be good quarkonium 
candidates, and further analysis is done using these pairs only. The transverse momentum distributions 
of these candidates are shown in Figure 9. 

As can be seen from the Figure 9(a), prompt // y are mainly selected with pj above around 10 GeV, 
due to the di-muon trigger cuts applied to the events. The decay kinematics of T is somewhat different 
due to its larger mass, thus allowing T to be selected with pj as low as 4 GeV. Even at these, relatively 
low, statistics one expects to see significant numbers of both types of quarkonia at large pj, which will 
allow statistically significant high-/^ analyses beyond the reach of the Tevatron. 

Figure 10(a) presents the J/\jf acceptance as a function of the J/y transverse momentum, relative 
to the Monte Carlo generated dataset, which requires the two muons to be within \r\\ < 2.5 and have 
transverse momenta greater than 6 and 4 GeV, respectively. Geometric acceptance of the detector and 
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Figure 9: Transverse momentum distribution of triggered reconstructed quarkonium candidates, 
also shown separately for quarkonia found in the barrel and endcap regions of the detector. Statis- 
tics shown in the figures correspond to integrated luminosities of about 6 pb _1 and 10 pb _1 for 
J I y and T , respectively. 



P T (J/y) (GeV) 
(a) J/\j/ acceptance with pj 



Figure 10: Acceptance of reconstructed prompt J/\j/ as a function of J/\j/ transverse momentum 
and pseudorapidity (relative to the MC generated dataset with cuts). 



reconstruction efficiency losses due to vertexing, as well as trigger efficiencies have been taken into 
account. When J/y are produced with a transverse momentum above 10 GeV, we see a sharp rise in the 
acceptance as J fry above this threshold are able to satisfy the muon trigger requirements within a certain 
kinematic configuration. 

The structure in the plot of the tj -dependence of J/y reconstruction efficiency, shown in Fig- 
ure 10(b), highlights the configuration necessary in order for muons from the J/y to be able to pass 
the di-muon trigger, described below. The distribution of reconstructed quarkonium candidates with the 
angular separation of the two muons, described by the variable AR = A(j) 2 + At] 2 , is shown in Fig- 
ure 11. On average, muons from reconstructed J/y (^6/^4) candidates are separated by AR ~ 0.47, and 
are restricted from being detected with separations larger than around 0.7. 
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(a) AR with fl6fl4 cuts (b) AR with ju 10 cuts 

Figure 11: Distribution of AR separation of the two muons from J/y and T candidates with 
di-muon generator-level cuts (left) and single muon /ilO cuts (right) applied. 



In comparison, the higher mass of T requires the muons in the /i6/i4 case to have a much larger 
opening angle, with a broad disuibution in AR peaking at around 1.8 and spanning up to 2.6. One can 
see that for the single /ilO case in Figure 11(b) the distributions are much broader, and generally with 
smaller separation in AR, reflecting the lower pj constraint on the second muon. 

The small separation of muons in AR for the // \ff (/i6/^4) case has consequences for the J/y recon- 
struction efficiency as a function of pseudorapidity, shown in Figure 10(b). Significant dips in efficiency 
are seen near 7] ±1.2 and T] = 0, due to the muon spectrometer layout [14]. As the muons from J/\jf 
are on average separated by only AR = 0.47, they are subject to similar material and detector effects, 
and so these effects are carried over into the J/y reconstruction with very little smearing. Hence, this 
distribution has a similar shape to the individual muon reconstruction efficiency distribution in ATLAS. 



TT 



(a) Y acceptance with pj 



(b) T acceptance with r\ 



Figure 12: Acceptance of reconstructed prompt T as a function of transverse momentum and 
pseudorapidity of the quarkonium state (relative to the Monte Carlo generated dataset with 
cuts). 



This contrasts with the T reconstruction efficiency dependence on pseudorapidity, shown in Fig- 
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ure 12(b), which is much smoother than in J/y case: the two muons have large angular separation and 
the detector layout effects are smeared over a broader range of r\ values. Figure 12(a) shows the variation 
of acceptance with the T transverse momentum, and reflects the fact that with the jx6}xA trigger T can be 
reconstructed with a lower pj threshold. In the absence of a dedicated topological trigger for T , trigger 
efficiency at low pj suffers due to the differing decay kinematics between J/y and T as only specialised 
J /iff triggers exist in reconstruction software used in this analysis. At larger pj both acceptances reach 
a plateau at around 80-85%. 

3.2 Offline monitoring using quarkonium 

The di-muon decays of J /iff and T will be used in both online and offline monitoring at ATLAS. Mass 
shifts for the reconstructed quarkonium states, plotted versus a number of different variables, have been 
proposed to monitor detector alignment, material effects, magnetic field scale and its stability, as well as 
to provide checks of muon reconstruction algorithm performance. The CDF collaboration extensively 
and successfully used this method, although it took many years at the Tevatron to collect sufficient 
statistics to allow for the disentanglement of various detector effects [15]. 

The expected rate of quarkonium production at ATLAS is such that we can expect to be able to 
perform meaningful monitoring and corrections online. There are many examples of where monitoring of 
quarkonium mass shifts can be useful in data-taking. Mass shifts in quarkonia as a function of transverse 
momentum can reveal problems with energy loss corrections and the muon momentum scale. As a 
function of pseudorapidity this can be a good probe of over- or under-correction of material effects in 
the simulated detector geometry and of magnetic field uniformity. J/y mass shifts in Monte Carlo 
simulations have already helped to improve muon reconstruction algorithms in ATLAS. 

An example of a reconstructed J/y mass shift measurement at ATLAS with the statistics correspond- 
ing to 6 pb _1 is presented in Figure 13. This is the dependence of AM on the difference in curvatures 



1/p T (n+)-1/p T 0r) (TeV- 1 ) 



Figure 13: J/y mass shift plotted versus the difference of curvature between the positive and 
negative muons. Statistics corresponds to the integrated luminosity of about 6 pb _1 . 



of positive and negative muons, which allows for checks of a potentially important effect seen at CDF: 
horizontal misalignments in some detector elements may result in a constant curvature offset that can 
lead to significant charge-dependent tracking effects. A misalignment may be such that a negative track 
has a higher assigned curvature (and hence lower momentum) than is truly the case, whilst a positive 
track would be affected in the opposite way. The sample shown in the figure is simulated with ideal 
geometry and does not show any significant effects of this kind. 
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For detector alignment and data monitoring purposes, quarkonium provides a low pj point for cali- 
bration, complementary to the Z boson sample, and allows for the possibility to identify any systematic 
variations that may develop at higher p T . 

In order to be able to analyse mass shifts due to two-variable correlations and disentangle various 
detector effects, significant statistics of // yf and T di-muon decays have to be accumulated. A dedicated 
study is being performed in ATLAS to optimise the strategy of real time and offline monitoring using 
this method, but these results lie beyond the scope of this note. 

3.3 Background suppression in di-muon case 

The expected sources of background for prompt quarkonium with a di-muon jj,6/J,4 trigger are: 

• indirect J /yf production from bb events; 

• continuum of muon pairs from bb events; 

• continuum of muon pairs from charm decays; 

• di-muon production via the Drell-Yan process; 

• decays in flight of %^ and ^ mesons. 

The most important background contributions are expected to come from the decays b — >• J /yr+X, 
and the continuum of di-muons from bb events. Both of these have been simulated and analysed. The 
estimated total contribution from charm decays is higher than that from bb events. However, this back- 
ground has not been simulated, as it is not expected to cause problems for prompt quarkonium recon- 
struction because the transverse momentum spectrum of the muons falls very steeply and the probability 
of producing a di-muon with an invariant mass within the range of interest is well below the level ex- 
pected from bb events. Only a small fraction of the Drell-Yan pairs survive the di-muon trigger cuts of 
jll6jU4 in the J / yf— Tmass range, which makes this background essentially negligible, as estimated from 
generator-level simulation. Muons from decays in flight also have a steeply falling muon momentum 
spectrum, and in addition require random coincidences with muons from other sources in the quarko- 
nium invariant mass range. This is estimated to be at the level of a few percent of the signal rate, spread 
over a continuum of invariant masses. 

All background di-muon sources mentioned above, apart from Drell-Yan pairs, contain muons which 
originate from secondary vertices, which makes it possible to suppress these backgrounds by remov- 
ing such di-muons whenever a secondary vertex has been resolved, based on the pseudo-proper time 
measurement. The pseudo-proper time is defined as 

Pseudo-proper time = — - — ' , (1) 

where Mj/y and px{J/Y) represent the mass and the transverse momentum of the J /yf candidate, c is 
the speed of light in vacuum, and is the measured radial displacement of the two-track vertex from 
the beamline. Once the two muons forming a J /yf candidate are reconstructed, the pseudo-proper time is 
used to distinguish between the prompt J /yf, which have a pseudo-proper time of zero, and J/yr coming 
from B-hadron decays and hence having an exponentially decaying pseudo-proper time distribution, due 
to the non-zero lifetime of the parent B-hadrons. 

The dependence of the resolution in radial decay length Lyy on di-muon pseudorapidity r\ is shown 
in Figure 14, while the variation of the expected resolution in the pseudo-proper time with di-muon 
Pt is shown in Table 3. An improvement in the resolution is seen with increasing pj of the J / yf and 
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Figure 14: Radial position resolution of secondary vertex for J/y decays as a function of the J/y 
pseudorapidity. 



// y transverse mo- 
mentum (GeV) 


9-12 


12-13 


13-15 


15-17 


17-21 


>21 


Pseudo-proper time 
resolution (ps) 


0.107 


0.103 


0.100 


0.093 


0.087 


0.068 



Table 3: Pseudo-proper time resolution of direct J/y events as a function of J/y pj. 



decreasing \r\\. Here a perfect detector alignment is assumed, with the resulting average resolution 
estimated at around 0.1 ps. 

Figure 15(a) illustrates the pseudo-proper time distribution for both the prompt and indirect J/y 
samples. By making a cut on the pseudo-proper time, one can efficiently separate most of the indirect 
J/y from a prompt J /iff sample (or vice-versa). The efficiency and purity of the pseudo-proper time 
cuts for prompt J/y are presented in Figure 15(b). A pseudo-proper time cut of less than 0.2 ps allows to 
retain prompt // y with the efficiency of 93% and the purity of 92%. Note that the distribution shown in 
Figure 15(a) is, in a sense, self-calibrating: the part to the left of the maximum can be used to determine 
the resolution a, and an appropriate cut of 2a can be applied to remove the 'tail' of secondary J/y 
candidates on the right hand side. 

The background levels of beauty and Drell-Yan production under the T peak are similar to those 
for the J/y , except that here one does not have to contend with sources of non-prompt quarkonia 
from B-decays. However, the bb —> jj,6jJ,4 background continuum under the T is more problematic: 
higher invariant masses around the T mean that the two triggered muons will necessarily come from two 
separate decays, meaning that the pseudo-proper time cut is far less effective. 

Fortunately, flags associated to individual reconstructed muon tracks provide further vertexing infor- 
mation, which could be used for suppressing of the bb — >• /x6/x4 continuum background. Reconstructed 
tracks are assigned to either come from the primary vertex, a secondary vertex, or are left undetermined. 
By requiring that both of the muons combined to make a. J/y or a T candidate are determined to have 
come from the primary vertex, background from the bb — > jj,6/J,4 continuum can be reduced by a factor 
of three or more, whilst reducing the number of signal events by around 5% in both cases. 

Figure 16 illustrates the quarkonium signal and main background invariant mass distributions in the 
mass range 2—12 GeV, for those events which satisfy the jj,6jJ,4 trigger requirements, with reconstruction 
efficiencies and background suppression cuts taken into account. Peaks from the J/y and T(1S) clearly 
dominate the background. As no higher y and T states were simulated for this analysis, their peaks are 
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Pseudo-Proper time (ps) Pseudo-Proper time (ps) 



(a) (b) 

Figure 15: (a) Pseudo-proper time distribution for reconstructed prompt J/\\f (dark shading) and 
the sum of prompt and indirect J/y candidates (lighter shading), (b) Efficiency (solid line) and 
purity (dotted line) for prompt J/y candidates as a function of the pseudo-proper time cut. Statis- 
tics correspond to the integrated luminosity of 6 pb _1 . 



not shown. The dotted line indicates the level of the background continuum before the vertexing cuts. 

In conclusion, we find that the level of the backgrounds considered for both J/y and T do not 
represent any serious problem for reconstruction and analysis of direct quarkonia with the di-muon ^6/^4 
trigger. 

3.4 Reconstruction and background suppression with a single muon candidate 

By using the /xlO trigger, one selects events with at least one identified muon candidate with pj above 
10 GeV. In this part of the analysis, each reconstructed single muon candidate is combined with oppositely - 
charged tracks reconstructed in the same event. For both J/y and T reconstruction, we insist that any 
other reconstructed track to be combined with the identified trigger muon has an opposite electric charge 
and is within a cone of AR = 3.0 around the muon direction, so as to retain over 99% (91%) of the signal 
events in the J/y (T) case. As in the di-muon analysis, we require that both the identified muon and the 
track are flagged as having come from the primary vertex. In addition, we impose a cut on the transverse 
impact parameter do, \do\ < 0.04 mm on the muon and \do\ < 0.10 mm on the track, in order to further 
suppress the number of background pairs from B-decays. 

The invariant mass distribution for the remaining pairings of a muon and a track is shown in Fig- 
ure 17(a) for J/y with pj larger than 9 GeV and in Figure 17(b) for J/y with pj larger than 17 GeV. 
The distributions are fitted using a single gaussian for the signal and a straight line for the background. 
Clear J/y peaks can be seen, with statistically insignificant mass shifts and the resolution close to that in 
the di-muon sample. It's worth noting that the signal-to-background ratio around the J / y peak improves 
slightly with increasing transverse momentum of J/y. At higher pj the cos 6* acceptance also becomes 
broader, which should help independent polarisation measurements. 

For T the situation is less favourable, due to the combination of a lower signal cross-section and 
a higher background. Although the T peak can be seen above the smooth background, its statistical 
significance is rather low. Hence, with this statistics, the use of the single muon sample for T cannot be 
justified, and in the following we will only rely on the di-muon sample. 

In conclusion, we expect that the single muon trigger with a 10 GeV threshold can be successfully 
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Figure 16: The cumulative plot of the invariant mass of di-muons from various sources, recon- 
structed with a jU6;U4 trigger, with the requirement that both muons are identified as coming from 
the primary vertex and with a pseudo-proper time cut of 0.2 ps. The dotted line shows the cumu- 
lative distribution without vertex and pseudo-proper time cuts. 



used to select prompt J/yr events. The expected background here, although much larger than in di- 
muon case, is well under control. For T however, the single muon sample is only likely to be useful at 
significantly higher statistics and higher transverse momenta. 

3.5 Summary of cuts and efficiencies 

Table 4 summarises the efficiencies of all the selection and background suppression cuts described above, 
for both the di-muon and single muon trigger samples. Not all cuts are applicable to all samples; those 
which are not are labelled accordingly. Numbers in italics are estimates in cases where no adequate 
fully simulated sample was available. The efficiencies for }x6}xA samples are calculated relative to the 
Monte Carlo sample with generator-level cuts on the two highest muon transverse momenta of 6 and 4 
GeV. For the /xlO samples, the generator-level cut of 10 GeV was applied to the pj of the highest-^r 
muon. Expected yields Ns of quarkonia for 10 pb _1 are given at the bottom of the table, along with 
background yields Nb within the invariant mass window of ±300 MeV for J/y and ±1 GeV for T , and 
the signal-to-background ratios at respective J/\\r and T peaks for each sample. 

For higher, excited quarkonium states with vector quantum numbers the efficiencies are expected to 
be similar, but not necessarily identical. The biggest differences are expected for where the produc- 
tion mechanisms as well as decay kinematics are significantly different. 

4 Polarisation and cross-section measurement 

The Colour Octet Model predicts that prompt quarkonia produced in pp collisions are transversely po- 
larised, with the degree of polarisation increasing as a function of the transverse momentum. Other 
production models predict different pj dependencies of the polarisation and so this quantity serves as an 
important measurement for discrimination of these models (see Figure 1(b)). 

Quarkonium polarisation can be assessed by measuring the angular distribution of the muons pro- 
duced in the decay. The relevant decay angle 6* is defined in Figure 7. The spin alignment of the parent 
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(a) y/ywith p T > 9 GeV 



(b) //V with p T > 17 GeV 



Figure 17: Prompt quarkonium signal and bb — > /iX background events selected with the jUlO 
trigger, in the mass range around J/y with (a) pr above 9 GeV, and (b) pj above 17 GeV, 
corresponding to 10 pb _1 of data. The background from B decays is shown in light grey. Cuts 
described in the text have been applied. The distributions were fitted using the sum of a linear 
background and a gaussian peak centered at M = 3097 MeV +AM with resolution a. 



vector quarkonium state can be determined by measuring the polarisation parameter a in the distribution 

The choice of parameters in Equation 2 is such that the distribution is normalised to C. The parameter 
a, defined as a = (oj — 2ol) / (<Jr + 2<7l), is equal to +1 for transversely polarised production (helicity 
= ±1). For a longitudinal polarisation (helicity = 0), a is equal to —1. Unpolarised production consists 
of equal fractions of helicity states +1,0 and — 1 , and corresponds to a = 0. 

The difficulty of quarkonium polarisation measurements is evidenced by the discrepancies between 
D0 and CDF results shown in Figure 1(b). The problem can be traced to the limited acceptance at high 
| cos and hence difficulties in separating acceptance corrections from spin alignment effects (see, 
e.g., [7]). 

Note that the feed-down from % state and ^-hadron decays may lead to a different spin alignment 
and hence to a possible effective depolarisation which is hard to estimate. In addition, due to the lim- 
ited statistics, the polarisation measurements at the Tevatron cannot reach the region of high pj, where 
theoretical uncertainties are expected to be smaller. 

At ATLAS we aim to measure the polarisation of prompt vector quarkonium states, in the transverse 
momentum range up to ~ 50 GeV and beyond, with extended coverage in cos d* which will allow for 
improved understanding of efficiency measurements and thus reduced systematics. The promptly pro- 
duced J/yr mesons and those that originated from B-hadron decays can be separated using the displaced 
decay vertices, as explained above. With a high production rate of quarkonia at LHC, it will be possible 
to achieve a higher degree of purity of prompt J/yr in the analysed sample and reduce the depolarising 
effect from B-decays, whilst retaining high statistics. 

As explained in Section 2.1.3, with the di-muon trigger signature such as jU6/i4, the acceptance 
at large values of | cos 0* | (where the difference between various polarisation states is the biggest) 
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Quarkonium 


J/Y 


J/w 


r 


r 




Trigger type 


jU6;U4 


jUlO 


jU6jU4 


jUlO 




MC cross-section 


23 nb 


23 nb 


5.2 nb 


2.8 nb 


£li 


Level- 1 trigger 


87% 


96% 


84% 


96% 


S L 2 


Level-2 trigger 


97% 


>99% 


66% 


>99% 


£Rec 


Reconstruction 


89% 


96% 


93% 


96% 


£vtx 


Vertex fit 


99% 


99% 


99% 


99% 


£1 


£L1 • £L2 • ZRec m &Vtx 


75% 


90% 


51% 


90% 


£?o 


Pseudo-proper time cut 


93% 


93% 


n/a 


n/a 


£Flg 


Only primary vertex tracks 


96% 


92% 


95% 


92% 


£ar 


Second track inside cone 


n/a 


99% 


n/a 


91% 


£d0 


Impact parameter cut 


n/a 


90% 


n/a 


90% 


£2 


£f0 • £f/g • £a« • £d0 


90% 


76% 


95% 


75% 


£ 


Overall efficiency £1 • £2 


67% 


69% 


49% 


68% 




Observed signal cross-section 


15 nb 


16 nb 


2.5 nb 


2.0 nb 




^forlOpb- 1 


150000 


160000 


25 000 


20000 




Nb in mass window for 10 pb _1 


7000 


700000 


16000 


2 000000 




Signal/Background at peak 


60 


1.2 


10 


0.05 



Table 4: Predicted and observed cross-sections for prompt vector quarkonia, and efficiencies of 
various selection and background suppression cuts described in Section 3. 



is strongly reduced, especially at low transverse momenta of quarkonium. The kinematic acceptance 
s/ (pt, cos 6*) of the /x6/x4 cuts applied at generator level, with respect to the full generator-level sample 
with no cuts on muon transverse momenta, is shown by the solid lines in Figure 1 8 for various pj slices 
of J/\ff ■ The acceptance is seen to be quite low at J/ljf pj below 12 GeV, but in higher pj slices there 
is an area in the middle of cos 6* range with essentially 100% acceptance, which becomes broader with 
increasing p T of the J/y, but does not go beyond | cos 6* | ~ 0.5. 

The acceptance for the single muon trigger sample, shown with the dashed lines in Figure 18, is 
different: here the areas of 100% acceptance are at high | cos 9*\, and the dip in the middle gradually fills 
up with increasing pj. This sample essentially has a full acceptance at pj > 20 GeV, apart from the drop 
at I cos 6* I > 0.95 due to the cut of 0.5 GeV on the p T of the track of the second muon. 

The plots in Figure 18 were obtained using a dedicated generator-level Monte Carlo sample. The er- 
ror bars shown in the figure reflect both statistical errors and the uncertainties due to possible dependence 
on Tj coverage. 

The simulated 'raw' measured distributions dN raw / d cos 6* , for the same slices of J/y transverse 
momenta, are shown in Figure 19. Again, solid and dashed lines represent the events selected by the 
di-muon /x6/^4 and the single muon /xlO triggers, respectively. The sample was generated with zero 
polarisation. The raw numbers of measured events in the ji 10 sample were obtained by fitting the invari- 
ant mass distributions with a gaussian peak and a linear background, for each bin of cos 6* in each p T 
slice. With the estimated signal-to-background ratios shown in Figure 17(a), this causes an increase in 
the statistical errors, typically by a factor of 2. 

The corrected distributions dN cor /d cos 6* are calculated according to the following formula: 

^ cor = 1 dN mw 

dcosd* s?/(p T ,cosd*)-Si-S2 dcosd* 
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Figure 18: Generator-level kinematic acceptances of the (solid lines) and }X 10/^0.5 (dashed 

lines) cuts, calculated with respect to the sample with no muon pj cuts, in slices ofj/y transverse 
momentum: left to right, top to bottom 9-12 GeV, 12 - 13 GeV, 13 - 15 GeV, 15 - 17 GeV, 
17-21 GeV, above 21 GeV. 



Here £\ stands for the trigger and reconstruction efficiency, while £2 denotes the efficiency of background 
suppression cuts for each sample, as defined in Table 4. Their values have been averaged over the 
accessible phase space within the relevant pj slice. Studies have shown that while £1 depend on pj (cf. 
Figure 10(a)), £2 remain essentially constant over the phase space of interest. The efficiencies £1 and £2 
for both samples are listed in Table 5, while the acceptances stf (pr,cos 6*) are shown in Figure 18. 



p T , GeV 


9-12 


12-13 


13-15 


15-17 


17-21 


>21 


ei(/x6ju4),% 
e 2 (/i6/i4), % 


67 ±1 
90±1 


75 ±1 
90±1 


77 ±1 
90±1 


78 ±1 
90±1 


79±1 
90±1 


80±1 
90±1 


£l(/ilO),% 

£2 (At 10), % 


86±1 
76±1 


89±1 
76±1 


90±1 
76±1 


90±1 
76±1 


90±1 
76±1 


90±1 
76±1 



Table 5: Efficiencies for the /i6/^4 and /ilO samples, averaged over each of the six pj slices. 

At high pj the two samples increasingly overlap, thus allowing for a cross-check of acceptance and 
efficiency corrections. However, for measurement purposes the \xb\xA samples are used whenever pos- 
sible, complemented by jUlO samples at high cos0*. In order to achieve this, the distributions shown 
in Figure 19 were appropriately masked and combined. The combined distributions dN C01 / d cos 6* , 
corrected according to Equation 3, are shown in Figure 20. The errors shown in the plots include the sta- 
tistical errors on the raw data, as well as the uncertainties on the acceptance and efficiencies. These cos 0* 
distributions are fitted using the Equation 2, with a and C as free parameters for each pj slice. The fit 
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Figure 19: Measured distributions for ^6^4- (solid lines) and /1 10- (dashed lines) triggered events, 
in the same pj slices of the J/yr candidate as in Figure 18. The simulated data sample is unpo- 
larised. Statistics correspond to 10 pb _1 . 



results are presented in Table 6, with constant C rescaled to the measured cross-section a, corresponding 
to the integrated luminosity of 10 pb _1 . 

To further check our ability to measure the spin alignment of J/y , the raw distributions shown in 
Figure 19 were reweighted to emulate transversely polarised (a gen = +1) and longitudinally polarised 
(a gen = — 1) J/Y samples, and the analysis described above was repeated. The results are shown in 
Figure 21 and in the middle two sections of Table 6. 

A similar analysis can be done for measuring the polarisation and cross-section of T , but at the 
integrated luminosity of 10 pb _1 these measurements are expected to be far less precise than in J/y 
case. The main reasons are lower T cross-sections at high transverse momenta, and higher backgrounds 
for the jitlO sample. The latter reason, as explained in Section 3.4, means that with these statistics the 
/^10 sample is essentially unusable, and the limited acceptance of the /i6/i4 sample at high | cos 0* | 
makes a precise measurement difficult. 

The corrected | cos 0* | distributions for unpolarised T from the /^6/i4 sample are shown in Fig- 
ure 22. The results of the fit using Equation 2, with normalisation matched to the integrated luminosity 
of 10 pb -1 , are shown in the last section of Table 6. With the integrated luminosity increased by an 
order of magnitude, the /^10 sample should become useful and the estimated errors on T polarisation 
measurement could be reduced by a factor of 5. 

The errors shown in Figures 20 — 22 and Table 6 include the statistical uncertainties on the measured 
numbers of events as well as various systematic errors stemming from the uncertainties on acceptances 
and efficiencies described above. 

The overall uncertainty on the integrated luminosity needs to be added to all measured cross-sections, 
and is expected to be rather large during the initial LHC runs. This uncertainty will not, however, affect 
the relative magnitudes of the cross-sections measured in separate pj slices, or the measured values of 
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Figure 20: Combined and corrected distributions in J/y polarisation angle cos0*, for the same 
Pt slices as in Figure 18. The data sample is unpolarised (a gen = 0). The lines show the results of 
the fit using Equation 2, where the fitted values of a are given in Table 6. Statistics correspond to 
10 pb- 1 . 



Sample 


p T , GeV 


9-12 


12-13 


13-15 


15-17 


17-21 


>21 


J/y, a gen = 0 


a 


0.156 
±0.166 


-0.006 
±0.032 


0.004 
±0.029 


-0.003 
±0.037 


-0.039 
±0.038 


0.019 
±0.057 


a, nb 


87.45 
±4.35 


9.85 
±0.09 


11.02 
±0.09 


5.29 
±0.05 


4.15 
±0.04 


2.52 
±0.04 


J/y, a gen = +1 


a 


1.268 
±0.290 


0.998 
±0.049 


1.008 
±0.044 


0.9964 
±0.054 


0.9320 
±0.056 


1.0217 
±0.088 


a, nb 


117.96 
±6.51 


13.14 
±0.12 


14.71 
±0.12 


7.06 
±0.07 


5.52 
±0.05 


3.36 
±0.05 


J/y,a sea = -l 


a 


-0.978 
±0.027 


-1.003 
±0.010 


-1.000 
±0.010 


-1.001 
±0.013 


-1.007 
±0.014 


-0.996 
±0.018 


a, nb 


56.74 
±2.58 


6.58 
±0.06 


7.34 
±0.06 


3.53 
±0.04 


2.78 
±0.03 


1.68 
±0.02 


T,a gen = 0 


a 


-0.42 
±0.17 


-0.38 
±0.22 


-0.20 
±0.20 


0.08 
±0.22 


-0.15 
±0.18 


0.47 
±0.22 


a, nb 


2.523 
±0.127 


0.444 
±0.027 


0.584 
±0.029 


0.330 
±0.016 


0.329 
±0.015 


0.284 
±0.012 



Table 6: J/y and T polarisation and cross-sections measured in slices of pj, for 10 pb 
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Figure 21: Combined and corrected distributions in polarisation angle cos$*, for longitudinally 
(otgen = — 1, dotted lines) and transversely (a gen = 1, dashed lines) polarised J /if/ mesons, in the 
same pj slices as in Figure 18. The lines show the results of the fit using Equation 2, where the 
fitted values of a are given in Table 6. Statistics correspond to 10 pb _1 . 



the polarisation coefficient a. Additional systematic effects have also been studied, such as the influence 
of finite resolution in pj and cosfT, changes in binning, details of the functions used for fitting the 
invariant mass distributions, and variations of cuts used for background suppression. Their respective 
uncertainties on the measured values of a and o have been found not to exceed a small fraction of the 
quoted errors, and have thus been deemed negligible. 

In conclusion, with the integrated luminosity of 10 pb _1 it should be possible to measure the polari- 
sation of J/yr with the precision of order 0.02 — 0.06, depending on the level of polarisation itself, in a 
wide range of transverse momenta, pj ~ 10 — 20 GeV and beyond. In case of T , the expected precision 
is somewhat lower, of order 0.20. In both cases, however, the pj dependence of the cross-section should 
be measured reasonably well. 

5 Analysis of % production 

Quarkonium states with even C parity, such as r) c j, and Xc,b, have a strong coupling to the colour-singlet 
two-gluon state, and hence a significantly higher production cross-section than vector quarkonia. Their 
dominant production mechanism for the phase space area accessible in ATLAS is via the subprocess 
shown in Figure 2(b) in Section 1. Their detection, however, is rather more difficult due to the absence 
of purely leptonic decays. 

About 30 to 40% of //!//■ and Tare expected to come from decays % c — > J /yry and Xb —* Yy. Un- 
fortunately, the energies of the radiated photons tend to be quite small. The ability of ATLAS to detect 
these photons and resolve various % states is analysed in Section 5.1. Another possibility of observing %b 
and possibly Tft, states is considered in Section 5.2, where the reconstruction of these states is attempted 
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Figure 22: Corrected distributions in polarisation angle cosO*, for unpolarised T mesons, in 
the same slices of T transverse momentum as in Figure 18. Only /i6/i4 sample has been used. 
Statistics correspond to 10 pb _1 . 



through their decay into a pair ofj/y, both of which subsequently decay into jX + jl . 
5.1 Radiative decays of % c , % b states 

Reconstructing % c candidates requires associating a reconstructed J/\\f with the photon emitted from the 
%c decay. The transverse momentum distribution for all identified photon candidates in events with a 
prompt J/y, as measured by the ATLAS electromagnetic calorimeter, is shown in Figure 23(a) (light 
grey histogram). 

For Xc reconstruction, each selected quarkonium candidate is combined with every reconstructed 
and identified photon candidate in the event, and the invariant mass of the system is calculated. No 
explicit cut is applied to the pj of the photon. The \L\iy system is considered to be a % candidate, if the 
difference AM between the invariant masses of the \i\iy and systems lies between 200 and 700 MeV, 
and the cosine of the opening angle a between the J/\jf and y momenta is larger than 0.97. The last 
requirement comes from the observation that for the correct jit/i y combinations, the angle a is usually 
very small (see reconstructed distribution in Figure 23(b)). By analysing Monte Carlo information, it was 
found that all photons from generated x decays were found in the peak near cos a = + 1 , with the long 
tail in the reconstructed distribution representing the combinatorial background. The transverse energy 
distribution for those photon candidates which satisfy the above requirements is presented in Figure 23(a) 
by the dark histogram. With these cuts, the combinatorial background is strongly reduced. 

Figure 24 shows the distribution in AM for the selected Xc decay candidates. The expected mean 
positions of the peaks corresponding to Xo, %\ an d X2 signals (318, 412 and 460 MeV, respectively) are 
indicated by arrows. The grey histogram shows the contribution from the background process of // y 
production from B-hadron decays, some of which survive the pseudo-proper time cut. 
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(a) (b) 



Figure 23: (a) Transverse momentum distribution of photons reconstructed in prompt J/y events, 
(b) Distribution of cos a for each reconstructed 7 in an event. On both plots, the light grey (dark 
grey) histograms show the distributions before (after) the cut on the opening angle a between the 
photon and the J/y momentum direction. All photons from % — ► J/yy decays have cos a > 0.97, 
while the vast majority of background combinations fall outside the range shown in plot (b). The 
sample corresponds to the integrated luminosity of 6 pb~ 1 . 



The solid line in Figure 24 is the result of a simultaneous fit to the measured distribution, with the 
three peak positions fixed at their expected values, and the common resolution function a (AM). The 
resolution in a (AM) is expected to increase with increasing AM, and was empirically parameterised 
as a(AM) = a- AM + b. The dashed lines show the shapes of individual peaks and of the background 
continuum. The fit parameters are the heights of the three gaussian peaks h 0 ,hi,h 2 , the constants a 
and b, and the three parameters describing the smooth polynomial background. The systematic studies 
include the variation of the background parameterisation and the introduction of a mass shift common 
for the three resonances. The true amplitudes of the peaks (15, 123 and 87, respectively) are reproduced 
reasonably well: 

ho = 15±3(stat.)±10(syst.), 

h { = 101±4(stat.)±12(syst), (4) 
h 2 = 103±4(stat.)±9(syst), 

with a strong negative correlation between the last two. The resolution is found to increase from about 
35 MeV at Xo to about 48 MeV at X2, while the overall reconstruction efficiency of Xc states is estimated 
to be about 4%. It may be possible to significantly improve the resolution by using photon conversions, 
but this is unlikely to yield a big increase in efficiency. 

The procedure of reconstructing % b decays into T+ 7 is the same as in the charmonium case, except 
the di-muon pair is required to be an T candidate. However, the higher di-muon mass and hence smaller 
expected boost makes the photon much softer and hence more difficult to detect. With the available 
simulated statistics (50000 events corresponding to 10 pb _1 ), only 20 Xb candidates have been found in 
the appropriate mass window, which gives an efficiency estimate of 0.03%. In order to reliably observe 
Xb — ► T+ 7 decays, an integrated luminosity of at least 1 ftr 1 will be needed. 
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Figure 24: Difference in invariant masses of \L\iy and systems in prompt J/y events (light 
grey) with bb — > /i6/^4X background surviving cuts (dark grey). The arrows represent the true 
signal peak positions, and the lines show the results of the fit described in the text. Event yields 
correspond to an integrated luminosity of 10 pb _1 . 



5.2 Analysis of Xb^J/YJ/V 

Another possibility for measuring %b production is through the decay %b — ► J /wJ/V^ /IjU/IjU. The use 
of this decay for %b detection was proposed in [16], while in [17] the corresponding branching fraction 
was calculated to be Br(% b o —> J/y J/y) =2 x 10~ 4 . 

The predicted total inclusive cross-section of Xbo production at LHC is estimated at around 1.5 jlib 
[17], yielding the following theoretical estimate (without any momentum cuts on muons): 

<*{pp -» Xbo+X)Br{xm -» J/w J/w) = 330pb (5) 

We use this cross-section in our study. It should, however, be considered as a lower bound, with higher 
order QCD corrections expected to increase it significantly, especially within the COM approach. This 
cross-section also does not include other C-even states (f]bi%b2 and radial excitations), meaning that the 
overall combined cross-section of resonant J/y J/y production in the T mass region can be at least an 
order of magnitude higher. 

The Pythia Monte Carlo generator, used to simulate this process, was modified to include this 
particular decay. Events for this study are triggered with a di-muon trigger /x6/i4, as for the 7/i^and 
T di-muon analysis. Out of 50000 generated % b — > J j y{\i\i)J j y{\i\L) events 815, or 1.6%, passed 
the ^6/14 trigger cuts. Taking into account di-muon branching fractions of the two J/y mesons, this 
corresponds to the cross-section a = 20 fb after trigger. 

The two triggered muons have the highest pj of the four. The two remaining muons, in many cases, 
have transverse momenta too low to be identified as muons (i.e. below 2.5 GeV), and sometimes too low 
to be even reconstructed (below 0.5 GeV). 

Two classes of events, remaining after the trigger cuts, have been considered to be useful: 

a) events where the two trigger muons came from the same J/y . Then, the third muon has to be 
identified by the muon system, while the fourth must at least be reconstructed as a track (124 
events); 
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b) events where each trigger muon came from a different J/yr . The remaining two muons may or 
may not be identified, but their tracks still need to be reconstructed (330 events). 

Thus, taking the trigger, muon identification and track reconstruction efficiencies into account, one ex- 
pects about 50% of triggered % b — > J /yr[^^i)J /yr{}X}x) decays to be observed, which amounts to 0.8% 
of the generated sample, corresponding to the cross-section of 10 fb. Hence, the observed statistics is 
expected to be around 100 events for the integrated luminosity 10 fb -1 . 

Once the two J/yr candidates in the event have been reconstructed, a simultaneous fit of the four 
muon tracks to the common vertex is performed, with J/yr mass constraints applied to the respective di- 
muon invariant masses. The resulting distribution is presented in Figure 25(a). The resolution on the Xb 




1.5 2 2.5 3 3.5 



(a) 



(b) 



Figure 25: (a) Reconstructed % h invariant mass, with J /yr mass constraints applied on the respec- 
tive di-muon pair masses, (b) Higher di-muon invariant mass plotted versus the lower di-muon 
invariant mass in Xb —> J /vKMMV VvCMAO events. 



mass is estimated to be as good as 40 MeV. Similar resolution should be expected for the reconstructed 
invariant mass in the decays of other XbJ states, while the resolution for Tjf, — > J /yr{jXjx)J /yr{jx^i) should 
be slightly better. 

With two pairs of muons in each signal event, there are two possible pairings of oppositely charged 
muons. The plot of the invariant mass of one di-muon pair versus the invariant mass of the other is shown 
in Figure 25(b), using generator-level information. All correct pairings, and none of the incorrect pairings 
of di-muons fall within the circle of radius 200 MeV (about 3-4 a) from the point with coordinates 
Mjjy.Mjjy. The incorrect pairings are scattered over the whole area, so by selecting the pairings from 
the circle defined above, the combinatorial background can be strongly reduced. 

The main expected sources of background to Xb — > J /'KMMy /'KMAO decays are the processes of 
bottom quark production, pp — > bbX, with each b either decaying into J/yr + X, or into a muon with 
additional charged tracks. These backgrounds have been analysed with the same Monte Carlo samples 
used in our study of backgrounds for single J /yr and T production. Within the available statistics, very 
few background events have survived the signal selection cuts described above, and the background sup- 
pression cuts on pseudo-proper time on secondary vertices. Extrapolating these results to the integrated 
luminosity of 10 fb -1 shows that the statistically significant Xb J /vIMMV VvG^AO signal P ea k (or 
peaks) should be visible on top of the combinatorial continuum, with the expected signal-to-background 
ratio of 10-20% or above. 

In short, so far we have seen no major obstacles in an attempt to search for narrow resonances in the 
J /yr{jl^l)J /yr{jXjx) invariant mass distributions. However, dedicated high statistics Monte Carlo samples 
are needed to draw more reliable conclusions. 
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6 Physics reach with early data 

During the initial run of the LHC, the integrated luminosity of 1 ptr 1 with the ^6^4 trigger would mean 
about 15 000 J /yr^ MM and 2500 /x/x recorded events. If the /^4/x4 trigger is used, these numbers 
would increase up to 17 000 and 20000 respectively, with these additional events mainly concentrated at 
the lower end of the quarkonium transverse momenta. 

Additional, largely independent statistics will be provided by the /xlO trigger: 16000 J/yr and 
2000 T with transverse momenta above about 10 GeV, with distributions similar to those from the 
jU6;U4 samples. Quite separate from these, another 7 000 of J/yr — ► jiji events are expected from b- 
decay events. All these events should be perfectly usable for detector alignment, acceptance and trigger 
efficiency studies, as well as for understanding tracking and muon system performances. 

At the integrated luminosity of about 10 pb _1 recorded numbers of J/yr^ jUjll and jiji will be 
roughly equal to the statistics used in this note. With these statistics, the pj dependence of the cross- 
section for both J/yr and T should be measured reasonably well, in a wide range of transverse momenta, 
pj ~ 10 — 50 GeV. The precision of J/yr polarisation measurement can reach 0.02 — 0.06 (depending 
on the level of polarisation itself), while the expected error on T polarisation is unlikely to be better than 
about 0.2. At this stage, first attempts may be made to understand the performance of the electromagnetic 
calorimetry at low photon energies, and to try and reconstruct % c states from their radiative decays. 

With an integrated luminosity of 100 pb -1 , the transverse momentum spectra are expected to reach 
about 100 GeV and possibly beyond, for both J/y/ and T . With several million J/yr^ and more 
than 500000 of jiji decays, and a good understanding of the detector, high precision polarisation 
measurements, at the level of few percent, should become possible for both J/yr and T. Xb —> Ty decays 
could become observable, while other measurements mentioned above will become increasingly precise. 

Further increase of the integrated luminosity should make it possible to observe the resonant produc- 
tion of J/yr meson pairs in the mass range of the T system. During the future high luminosity running, 
the need to keep event rates manageable will mean an increase of thresholds of relevant single- and di- 
muon triggers, and the prescaling of lower threshold triggers. The higher luminosity will further expand 
the range of reachable transverse momenta and allow further tests of the production mechanisms, as well 
as make % b reconstruction easier. 
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Production Cross-Section Measurements and Study of the 
Properties of the Exclusive B + — > // \j/K + Channel 

Abstract 

In the initial phase of the LHC operation at low luminosity several Standard 
Model physics analyses will be performed in order to validate the ATLAS 
detector and trigger system. The B + ^J/yK + channel can be observed with 
the first ATLAS data at LHC and can be used for detector performance studies. 
This channel will provide a reference in the search for rare B decays. It will 
also be used to estimate the systematic uncertainties and efficiencies of flavor 
tagging algorithms, needed for CP violation measurements. The prospects to 
measure the B + mass, its total and differential production cross sections and 
lifetime with the first ATLAS data, are described in this note. 

1 Introduction 

The expected large hadronic cross-section for ft-quark production and the high luminosity at the LHC 
leads to copious &-quark production, with the presence of a bb pair in about one percent of the collisions. 
Quantitatively, the expected inclusive production cross-section for pp — >• bb + X at LHC is estimated 
to be o b i « 500 }lb leading to more than 10 5 bb pairs per second at the LHC design luminosity of 
Jz? k, 10 33 cm~ 2 s _1 . However, the extrapolation of the bb cross-section measurement from the Tevatron 
energy of 1.8-1.96 TeV [1,2] to the LHC energy of 14 TeV suffers from large uncertainties. The theo- 
retical predictions are based on NLO QCD calculations with uncertainties smaller than 20 % [3] in the 
kinematical region of the LHC, originating mainly from scale uncertainties [4], as well as uncertainties 
due to the parton density functions and the ^-fragmentation. 

A precise measurement of the bb inclusive cross-section at the LHC can be used to constrain these 
theoretical uncertainties. In addition, the large production rate allows for exclusive cross-section mea- 
surements shortly after the LHC start up, which have different systematic uncertainties and model de- 
pendencies (fragmentation models) from the inclusive ones. Furthermore, the bb represents the largest 
physics background for many processes, therefore its measurement is a prerequisite to any discovery. 

In this note the exclusive channel B + —>■ J /\yK + is studied extensively and the procedure to measure 
the differential and total cross-sections with the first 10 pb _1 is presented, with event selection based on 
the identification of the J/yr decay to two muons. 

The exclusive B + —> J /\yK + decay can be measured during the initial luminosity phase of the LHC, 
because of the clear event topology and rather large branching ratio. It can serve as a reference channel 
for rare B decay searches, whose total and differential cross-sections will be measured relative to its 
cross-section, thus allowing the cancelation of common systematic uncertainties. Furthermore, it can be 
used to estimate the systematic uncertainties and efficiencies of flavour tagging algorithms, which are 
needed for CP violation measurements. Finally, the relatively large statistics for this decay allow for 
initial detector performance studies. In particular, the precise measurement of the well-known mass and 
lifetime [5] can be used for inner detector calibration and alignment studies. 

In Section 2 of the note the Monte Carlo data sets used for this study are described. In Section 3 the 
J/ljf selection procedure is presented. The B + ^J/yK + mass, cross-section and lifetime measurements 
can be found in Section 4. The expected statistics during the early LHC luminosity phase are discussed 
in Section 5 together with estimates of the systematic uncertainties. 

All the following studies have been done for luminosity of Jz? = 10 32 cm _2 s _1 . However, since pile- 
up does not play any role at this luminosity, it is straightforward to rescale the results of these studies to 
JSf = 10 31 cm 2 s _1 , in case this will be the luminosity at startup. 
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2 Monte Carlo Samples 

All Monte Carlo (MC) data sets used for the studies presented in this note have been produced using 
Pythia-6.4 [6] without overlaying pileup events. 



Process 


N gen 


JS? [pb-'] 


N gen {B+ ^J/yK+) 


bb^J/\jf{n6n4)+X 


145 500 


13.2 


7 072 



Table 1: Monte Carlo data set used for the B + study. The N gen (B + — > J/yK + ) events are the ones 
contained in the whole generated sample. 



At the generator level, the ATLAS specific Pythia implementation for B-physics which provides an 
interface to Pythia- 6 [7] was used. The study of the B + — »■ J/yK + channel is done using the inclusive 
production cross-section of a(bb — >• //v(/i6/i4)Z), where the numbers in the bracket denote the cuts 
applied on the muons from the J/y decay in order for the generated event to be accepted (one muon 
with pr > 6 GeV and the other with pj > 4 GeV). The cross-section at the generation level, after 
implementing these cuts to the muons from the J/y decay is 11.1 nb. The total number of generated 
events and the number of the B + —> J /yK + decays found in the sample are given in Table 1. All 
efficiencies presented in this note, are calculated relative to the generated number of events. 



3 J /iff Identification Procedure 

A reliable identification of the J/y meson in the decay channel J/y — > H + H~ , as well as the recon- 
struction of the primary and secondary vertices, are the prerequisites for the B + — > J/yK + cross-section 
measurement. For the selection of the B + candidates a further requirement of a positively charged track 
(K + ) originating from the J/y secondary vertex is imposed. 

The distance x between the pp interaction vertex and the secondary vertex of the B-decay in the 
transverse plane is used for the J/y identification. In the ATLAS Inner Detector TDR [8], the deter- 
mination of the position of the primary vertex on an event-by-event basis was demonstrated, and for the 
B + —>■ J/yK + decay, a vertex resolution of o x = 29 jim and o y = 27 jim was estimated. For up-to-date 
information on the average primary vertex resolution with the staged ATLAS detector see [9]. 

The vector x = x pr j m — xg from the primary vertex jcpnm to the secondary B-decay vertex xb in the 
plane normal to the incoming proton beam [10] is used to define the transverse decay length L^, which 
is actually the projection of x onto the direction of the transverse momentum of the B meson: 

\Pt\ 

The transverse decay length L^. is a signed variable, which is negative if the particle appears to decay 
before the secondary vertex of its production and positive otherwise. For a zero lifetime sample, a 
Gaussian distribution peaked at = 0 is expected. For exclusive decays, the proper decay length is 
given by: 

p T 

For the uncertainty of the transverse decay length L^., only the contribution arising from the uncertainties 
on the primary and secondary vertex coordinates are taken into account: 

< = (<7M) 2 + 2^* + <^ (3) 
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where a x , a xy , and a y are the covariance matrix elements of the secondary vertex fit, o x \ is the resolution 
of the primary vertex in x, a y \ is the resolution of the primary vertex in y, pj is the transverse momentum 
of the B + meson and finally p x and p y are the x and y components of B + momentum. 

Since the J/y reconstruction relies on its decay into two muons: J/y^ , the first step in the 

event selection procedure is the identification of the decay muons, which in general have low pj . 

If a muon track reconstructed by the muon spectrometer has an inner detector track associated to 
it, it is considered to be a muon and is used to form the J/y candidate. An inner detector track may 
also be declared a muon candidate and used in the J/y mass reconstruction, if it is has hits or track 
segments in the innermost stations of the muon spectrometer. In either case the J/y mass is calculated 
using the momentum of the muon candidate provided by the inner detector, in order to exploit the better 
momentum resolution of the inner detector in this pj region. The main J/y selection cuts are as 
follows: 

• All possible di-muons with p? \ > 3.0 GeV and pr,2 > 6.0 GeV are formed; 

• The tracks of each muon pair are then fitted to a common vertex; 

• From the vertices found, only the ones with # 2 /ndf < 10 are retained; 

• To select J/y mesons originating from the decay of a B + , a cut on the proper decay length, 
A > 0.1 mm, is imposed to reduce combinatorial background from prompt J/y . If this cut is 
not imposed, the algorithm identifies all possible combinations consistent with J/y decaying to 
muons in the event; 

• J/y candidates inside a mass window of 120 MeV around ntj/y are retained. 

The efficiency for all previously mentioned cuts is presented in Table 2, where the efficiency after each 
cut is computed with respect to the previous. 

Given that the sample used does not contain any prompt J/y , the effect of the cut on the proper 
decay length X in the table indicates the loss in signal events. As it is explained in the following section, 
this cut is not applied for the lifetime measurement. The J/y reconstruction efficiency is also given for 
the case of no cut on the J/y proper decay length X. 

The J/y invariant mass distribution without a cut on X is shown in Figure 1. The shape can be 
described by a Gaussian with exponential tails. The J/y mass and its resolution, obtained from a 
Gaussian fit, is 3098 MeV and 57.4 MeV respectively. 



cut 


with X cut 


no X cut 


J/y cut 


N J/¥ 


£j/w [%] 


N J/¥ 




after vertexing 


123 489 


84.8 


123 489 


84.8 


after vtx % l cut 


115 156 


L 933 


115 156 


93.3 


after X cut 


84 829 


73.6 






after mass cut 


81293 


95.8 


105 827 


91.9 


Total eff 




55.8 




72.7 



Table 2: J/y reconstruction efficiencies with and without X cut. 



In order to study the effect of misalignment, displaced magnetic field and incorrect material map on 
the J/y observed mass position and resolution, a systematic study using different ATLAS geometry 
configurations was performed, with different combinations of possible misalignments of the calorimeter 
and the muon spectrometer, as well as a displaced magnetic field map and distorted material in the inner 
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Figure 1: J/y invariant mass distribution with a Gaussian fit superimposed. 

detector and the calorimeter. These studies were performed using a dedicated 5° — > J/y<j> dataset. It 
was found that the width of the J/y mass fit a(mjj^) is rather stable, varying between 51 and 59 MeV. 

The trigger efficiency is about 99 % and was computed from the J/y candidates that have at least 
one muon with pj > 6 GeV in the trigger. This is expected, since at the generation level it is required 
that both muons from the J/\j/ have pr > 6 GeV and pj > 4 GeV. 

4 Analysis of the B + — ► J / yK + Channel 

The analysis that follows for the selection of B + events can equally well be applied for the charge con- 
jugate state. Negligible direct CP violation is expected in the B ± — > J /yK^ because for b — > c +cs 
transitions the standard model predicts that the leading and higher order diagrams are characterized by 
the same weak phase. A measurement of the asymmetry is given in [1 1]. The main source of asymmetry 
is the different interaction probabilities for K + and K with the detector material. Other non-CP- 
violating sources of asymmetry are expected to lead to a lepton energy asymmetry and estimated to be 
negligible [12]. 

4.1 Event selection 

The B + mesons are reconstructed from a J/y and a K + candidate. The J/y selection is described in 
Section 3 and the K + candidates are identified using information from the inner detector. Specifically, 
the procedure comprises the following steps: 

• The original collection of tracks is scanned once again (excluding those already denoted as muons) 
and those with pj > 1.5 GeV and \r\ \ < 2.7 are retained; 

• From this collection, the tracks with positive charge and inconsistent with coming from the primary 
vertex at one standard deviation level (\do\/o c j Q > 1, where do is the impact parameter of the track) 
are considered to be K + candidates; 

• The }A + }A~ pair considered to be originating from the J / y — ► jU + /i~ decay and the K + candidate 
are fitted to a common vertex. The vector defined by the sum of the J/y and K + momentum 
vectors is required to point to the primary vertex, and the two muon tracks are constrained to mjiy, 
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• Only combinations with vertex # 2 /ndf < 6 , Pt(h) > 5 GeV and X > 0.1 mm are retained; 

• In case that more than two B + candidates were found in the same event, the one with the smallest 
vertex # 2 /ndf is accepted. 




Figure 2: Invariant mass M{K + n + [i ) distribution with the B + mass peak for signal (red) and combina- 
torial bb -background (blue). 



4.2 Mass fit 

The B + mass determination has been performed using the sample of bb — ► J/yrX decays. The B + in- 
variant mass distribution m(K + jx + of the candidates fulfilling all cuts is presented in Fig. 2. In the 
same figure the signal and background events can be seen separately, where the distinction between them 
is made using the Monte Carlo truth information. The fit to the mass distribution is done by using the 
maximum-likelihood method, where the probability density function is a Gaussian for the signal region 
and a linear function for the background: 

L = a/ sig ±(l-a)/ bkg 

/sig = -=^-i(^) 2 , (4) 
\JlKO 

/bk g = Km-\) + ^, 

where a is the fraction of signal events in the fitted region, m the B + mass, b is the slope of the back- 
ground distribution and w defines the range of the fit. The mass range of the fit is taken from 5.15 GeV 
to 5.8 GeV. This is done in order to reduce contributions from partially reconstructed B meson de- 
cays that populate the left side of Fig. 2. The background at the right of the mass peak originates from 
misidentified n + from B + — > J jyn + decays. 

The result of the B+ mass fit is: M{B+) = (5279.3 ±1.1) MeV with a width of o{B+) = (42.2 ± 
1.3) MeV. The relative errors scaled properly for an integrated luminosity of 10 pb 1 are about 0.02% 
and 3.5% respectively. The corresponding fit is presented in Fig. 3. The slight shoulder to the left of the 
mass distribution is due to the background shape. 
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Figure 3: B + mass fit with the both signal (red) and background (blue) contributions shown separately. 



4.3 Differential and total production cross-section 

The feasibility of the measurement of the B + — > J/\(fK + total and differential production cross-sections 
at LHC, with the first 10 pb -1 , is explored in this section. The dataset used is the bb — ► J/yX and 
the reconstructed B + — ► J/yK + candidates were selected based on the event selection described in Sec- 
tion 4. 1 . The B + mass fit method described previously was then used to extract the efficiencies in bins of 
Pt as well as the total efficiency. 

The differential cross-section do/dpr can be obtained from: 

d<T(fl + ) = A^ig 
dp T Ap T -^-£/-BR K) 

where N s i g is the number of reconstructed B + mesons obtained from the mass fit. The size of the pr 
bin is denoted with Apr- Furthermore, is the total luminosity and srf the overall efficiency. The 
branching ratio BR is the product of the world average [5] branching ratios of BR(B + — > J/yK + ) = 
( 10.0 ± 1.0) x 10~ 4 and BR(J/\jf -> jU+^i ) = (5.88 ± 0.10) x 10~ 2 . The invariant mass spectra of the 
B + candidates are fitted in each pj range using an extended unbinned maximum likelihood fit. The 
probability density function is a Gaussian for the signal and a linear function for the background region: 

Nsig Motal-Nsig 

L=-^-f slg + Slg -/bk g (6) 

Wtotal ^Vtotal 

where / s ; g and /bkg are the fit functions as described in Equation 4. For this fit, the B + mass has been 
fixed to the value obtained from the mass fit in the previous Section 4.2, m = 5279.3 MeV. The results 
for the overall efficiencies and the mass widths of the fits, for the individual pj bins, are summarized in 
Table 3. The mass fits in the various pj regions are presented in Figure 4 whereas the fit over the full 
pj range is shown in Fig. 3. 

To measure the B + total cross-section a similar procedure to the one used for the calculation of the 
differential cross-section is followed, but in this case all B + with pj > 10 GeV are used to calculate the 
total efficiency s$ ' . The B + mass distribution is shown in Fig. 3. The results of the total efficiency and 
the mass width from the fit, for the B + total cross-section measurement, are presented in Table 4. 
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p T range [GeV] 


Pre [10,18] 


Pt& [18,26] 


Pt € [26,34] 


PT G [34,42] 


si [%) 
a{B+) [MeV] 


20.1±1.0 
38.5±2.0 


37.3±1.7 
42.3±2.1 


45.0±3.1 
46.1 ±3.2 


51.6±4.7 
46.6±4.0 



Table 3: Efficiency si and B + mass width o(B + ) for the various pr bins. 




(c) 26 < < 34 GeV (d) 34 < p T < 42 GeV 



Figure 4: Fit of the B + mass in various ranges: pj G [10,18] GeV (a), pj G [18,26] GeV (b), 
p r G [26,34] GeV (c), p T G [34,42] GeV (d). 



total cross-section 


si [%} 


29.8±0.8 


a(B+) [MeV] 


42.2±1.3 



Table 4: Overall efficiency and B + mass width for all B + with pj > 10 GeV. 



4.4 Lifetime measurement 

The measurement of the lifetime % of the selected B + candidates is a sensitive tool to confirm the beauty 
contents in a sample, in particular the number of the reconstructed B + — > J/\ffK + decays obtained in the 
bb — ► J/yX dataset. The proper decay time is defined as f = X/c. For this analysis, no cut on the proper 
decay length A (Equation 2) of the J/y candidate or the B + candidate should be applied. 

The proper decay time distribution in the signal region B + ^J/yK + can be parametrised as a convo- 
lution of an exponential function with a Gaussian resolution function, while the background distribution 
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parametrisation consists of two different exponential functions, where each is convoluted with a Gaus- 
sian resolution function. In the bb — > J /yX no additional zero lifetime events are expected because there 
are no prompt J/y produced. In the realistic case, where zero lifetime events will be present, an extra 
Gaussian centered at zero is needed in order to properly describe those events. With the model used here, 
the Gaussian resolution functions depend on the reconstructed uncertainties on an event-by-event basis. 
In addition, it is assumed, that the distribution of the uncertainties per event are different for the signal 
and the corresponding background probability density functions (pdf) [13]. The use of conditional pdfs 
was required in order to take into account the proper decay time error per event. The exponential part of 
the lifetime distribution has the usual form: 

F t (t)=e^, (7) 
where t is the proper decay time and T is the lifetime. Accordingly, the convoluted function is then 

F c (t) = eT®G{t,il,s-CJ i ), (8) 

where /I is the mean value of the Gaussian resolution function which parametrises the average bias in 
each proper decay time measurement. The scale factor of the error is s and (7, is the per event proper 
decay time error. The conditional pdf on the per event uncertainty is then: 

F t (t)=F c (t\a i )-P((J i ), (9) 

where P(a,) is the distribution of the proper decay time error. The distribution of the proper decay time 
uncertainty is approximated by a superposition of Gaussian functions. In order to separate between the 
signal and the background, the proper decay time pdf is multiplied with the B + mass pdf, described in 
Section 4.2. A two-dimensional fit to the B + proper decay time and B + mass is then performed. 

The results of the lifetime fit are presented in Table 5 and shown in Figure 5. The background can 
be best described with the two lifetime components (Ti and T2) which are also shown in Table 5. For 
the events in the mass region of the signal within M{B + ) G [5. 15, 5.8] GeV the proper decay time found 
from the decay length is compared to the generated B + lifetime. The result of the comparison is shown 
in Fig. 5. The differences are well centered at zero with a Gaussian distribution and sigma 0.088 ps. It 
should be noted that the resolution as well as its a in 77 bins of 0.25 is found to be independent of 77. 




Figure 5: B + lifetime fit (left) with the signal (dashed red) and the background (dashed black) contribu- 
tions shown separately and B + lifetime resolution (right). 
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Signal lifetime x [ps] 


1.637±0.036 


BG lifetime X\ [ps] 


1.320±0.24 


BG lifetime T2 [ps] 


0.370±0.067 



Table 5: Results for the lifetime fit. 



5 Statistical and Systematic Uncertainties 

From the analysis presented above, the expected number of reconstructed B + candidates amounts to 160 
per pb _1 . This implies that sufficient statistics can be collected for a reliable cross-section measurement, 
after just a few months of data taking at the initial low luminosity phase of the LHC. This scenario is valid 
for a luminosity less than J£ = 10 32 cm _2 s _1 , since the analysis was performed without pileup events 
and contains no special trigger requirements or prescaling other than a single muon with pj > 6 GeV at 
level- 1. 

For the measurements presented in this note the main sources of systematic uncertainties are the 
same. The uncertainty from the luminosity in the initial phase is estimated to be 10 % and will be 
reduced to about 6.5 % after 0.3 fb _1 of data. The uncertainty from the PDF's is estimated to be 3 %, 
while the scale uncertainty of the NLO calculations is about 5 %. Finally, the uncertainty originating 
from the muon identification is about 3 %. Assuming Gaussian distributions for the above mentioned 
uncertainties, the total systematic uncertainty of the signal varies from 9.2 % to 12 % and is dominated 
by the uncertainty in the luminosity. 

Given that a statistical precision of G{\ %) will be reached with an integrated luminosity of 0.1 fb -1 , 
the contribution of the systematics will dominate the uncertainties of the first measurements. This is 
the case even for the differential cross-section measurement. Although the statistics in each pj bin 
is limited, the total uncertainty is dominated by the systematic uncertainties in the branching ratio of 
the B + — > J /yK + and in the luminosity, which are of the same order. For the exclusive cross-section 
measurement in the B + ^J /yK + channel, the relative uncertainties of the differential and total cross- 
sections are given in Table 6. Therein, the first row of the table contains the quadratic sum of the 
statistical uncertainty corresponding to an integrated luminosity of 0.01 fb -1 and the uncertainty in the 
efficiency. The latter is based on the statistics of the Monte Carlo dataset used. The second row is 
calculated by adding in quadrature the above uncertainty to the systematic uncertainty of the luminosity 
and the branching ratio for every pj bin. 

For the high statistics p T bins as well as for the total cross-section, the total relative uncertainty 
is dominated by systematic errors, originating mainly from the uncertainty in the luminosity, which is 
assumed to be 10 % for the initial phase, and the 10 % uncertainty in the branching ratio of B + — > J / yK + . 
The effect of the assumed background shape on the measurements is estimated to be less than 1 %. 
Finally, the precision of the lifetime measurement, for the same integrated luminosity is 2.5 %, where no 
systematic effects are taken into account. 



p T range [GeV] 


pre [10, 18] 


PT e [18,26] | p T e [26,34] 


p T € [34,42] | p T G [10,inf) 


stat. + s/ [%] 


7.7 


6.9 


10.5 


13.9 


4.3 


total [%] 


16.1 


15.8 


17.6 


19.8 


14.8 



Table 6: Statistical and total uncertainties for the B + — > J/yK + differential and total cross-section mea- 
surements for an integrated luminosity of 0.01 fb _1 . Total uncertainties include luminosity and BR 
systematic uncertainties. 
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6 Summary and Conclusions 

In this note, the B + ^J/\j/K + channel using an inclusive bb^>J jyi{]X^\x\)X dataset has been studied by 
developing the J/\\r selection methods and understanding their efficiencies. A method for measuring 
the B + mass using a likelihood fit, in order to separate signal and background, is established. The B + 
selection efficiency stf , both in pj bins and in the whole pj region, needed for the calculation of the 
differential and total cross-sections from real data, is extracted with fit methods similar to those used in 
the B + mass measurement case. Finally a likelihood fit, which takes into account the per-event primary 
vertex error, is performed for the measurement of the B + lifetime. 

The total B + ^J /yK + production cross-section can be measured with a statistical precision better 
than 5% with the first lOpb -1 of data. The differential cross-section with precision of the order of 10%. 
With the same statistics, adequate detector performance studies can be realised using the B + mass and 
lifetime measurements. 
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Physics and Detector Performance Measurements with the 
Decays B° s J/yQ and B° d J/y/K 0 * with Early Data 

Abstract 

The decay processes B d —> J/yK Q * and B° s —> J/yty are expected to be ob- 
served in large numbers with the ATLAS experiment. During the early data 
taking period, with an integrated luminosity of around ~ 10 — 150 pb 1 , it will 
be possible to measure the masses and proper lifetimes for these decays with 
sufficient precision to allow them to be used for detector performance checks. 
Methods for the determination of the mass and lifetime when the performance 
of the detector and reconstruction software will not be fully understood are 
presented. A powerful simultaneous fitting technique is used. Understanding 
the potential for flavour tagging methods will be one of the important goals 
for B-physics with early data. The performance of the jet charge tagger for the 
self-calibrating decay, B° d — ► J/y/K 0 *, is presented. The implications for the 
jet charge tag in the B { ] -^J/ycj) decay are also discussed. 

1 Introduction 

The decay — > J/y(j) is one of the most promising channels at the LHC due to its rich physics potential. 
This will begin with the earliest data taken by ATLAS. Due to the high bb cross-section [1] and dedicated 
J/y trigger in ATLAS [2], large statistics data will be quickly accumulated. This will allow the channel 
to be used for basic measurements of the B mass and lifetime, which will provide a sensitive test of the 
understanding of the tracking system after only 150 pb -1 of data. After collecting only 1 fb _1 of data, 
ATLAS will begin to improve world precisions for these measurements. A similar analysis will also be 
performed for the channel B d — > J/yK 0 *, where the expected statistics are higher by about a factor of 
15. 

The analysis methods to be used will evolve with increasing statistics and understanding of the detec- 
tor and backgrounds. This paper concentrates on the early phase of data taking. More advanced studies 
of the angular dependence of the decays and CP violation are not covered. In the very early data tak- 
ing period, the low statistics will not allow an investigation of the full list of theoretical parameters, but 
rather will concentrate on the mass and lifetime. As the backgrounds will not be well understood either, 
no hard cuts will be made to reject them, but rather backgrounds topologically similar to the signal will 
be admitted. This will also reduce the dependence on reconstruction algorithms and trigger behaviour, 
neither of which will be thoroughly tested when ATLAS starts to take data. In particular, no secondary 
vertex displacement cuts will be applied, and the dominant background admitted will be from direct J/\ff 
production. 

The simulation of the decays and their backgrounds is described in Section 2, and the reconstruction 
of the events in Section 3. The methods developed to extract the B hadron mass and lifetime as well as the 
precision expected to be reached with early data are described in Section 4. A study of the possibilities 
for flavour-tagging with early data, an important initial step for the CP violation measurements to be 
done later, is presented in Section 5. 

2 Monte Carlo production 

Table 1 lists the Monte Carlo data samples used in this study. The beauty events were generated by 
PYTHIA 6.4 [3] using a method described in [4]. For the direct J/\jf decays a special tuning of the 
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Colour Octet Model was prepared within PYTHIA [5]. In order to make the simulation studies more 
efficient the initial cuts on the transverse momentum, pr, and the pseudorapidity, 7], were applied at the 
generator level. To ensure that most of the generated events passed the trigger at the reconstruction stage, 
only events containing decays of J/y into dimuons, with pj larger than 6 GeV and 4 GeV, both detected 
within \r\ | < 2.4, were retained for detector simulations. 



Table 1: Monte Carlo samples used in this study. Cross sections are given by PYTHIA after applying 



Process 


MC Statistics 


Cross section 


bb^J/yX 


150 000 


11.1 nb 


pp^J/yX 


150 000 


21.7 nb 


B° s -> J/y<j) 


50 000 


0.02 nb 


B° d -» J/yK°* 


30 000 


0.24 nb 



3 Analysis of the decays B° s ^ J/ and B° d ^ J/ y/K 0 * 

3.1 Strategy for analysis of early data 

The strategies deployed during the early period of the experiment will differ from those used later. In 
particular, the low statistics available will not allow a determination of the complete list of physics 
variables that can in principle be determined from the B® — > J/yty and B d — > J/yK°* decays [6]. During 
the early phase, the compositions of the backgrounds will not be well understood. Furthermore, at 
this time, the detector and reconstruction software performance will also not be fully understood, and 
restrictive selection cuts to remove backgrounds may bias the signal in an uncontrolled way. The strategy 
in these early stages will therefore be to use loose cuts, which will admit more of the background decays. 
In particular, omitting vertex selections allows a statistically meaningful contribution from prompt J/y 
events. Most of these events fall outside the signal region of the study, and allow a better determination 
of the vertex resolution, which in turn allows a better overall B lifetime determination. This approach 
is consistent with the B trigger strategy for early data where no cut on secondary vertex displacement is 
required. 

3.2 Reconstruction 

Monte Carlo events of signal and background processes, as described in Table 1 , were passed through 
full detector simulation and reconstruction. Trigger algorithms were applied during the reconstruction. 
Only events accepted by the J/y —> trigger [2] (with thresholds of p T > 6 GeV and p T > 4 GeV 

for the fastest and second fastest muon) were retained for offline analysis. The reconstructed data objects 
were then processed as follows. 

J/y — > jU + jU~ candidates were sought by forming all possible pairs of oppositely charged muon 
tracks passing the cuts pr > 4 GeV and |t]| < 2.4. Pairs containing at least one muon track with pj > 
6 GeV were fitted to a common vertex. Pairs were assumed to be muons from J/y decays if the vertex 
fit resulted in a fit % 2 /n.d.f < 6 and the invariant mass of the muon pair fell within a 3 a window around 
the nominal J/y mass, with o = 58 MeV. This window was chosen by fitting a Gaussian distribution 
to the invariant mass of the muon pairs in the events pp — > J/yX and bb — » /i + jU _ X, see Figure 1. The 
background from non resonant /i + i u _ pairs in the 3 a window is 10%. 
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Figure 1: Reconstructed invariant mass distributions of J/y — > /i + jU (left), 0 — ► iC'+A' (middle) and 
^o* _^ ^ijj-T ( right ) can didates. 

The 0 — > K + K candidates were reconstructed from all pairs of oppositely charged tracks, not iden- 
tified as muons, with pj > 0.5 GeV and |t] | < 2.5, which were fitted to a common vertex. These tracks 
were assumed to be kaons from 0 decays if the vertex fit resulted in a % 2 /n.d.f < 6, and the invari- 
ant mass of the track pairs (under the assumption that they were left by kaons) fell within the interval 
1009.2 — 1029.6 MeV. This interval is based on a fit to the invariant mass distribution of the reconstructed 
<f> — > K + K decay candidates shown in Figure 1 . The signal fit used a Breit-Wigner correctly accounting 
for phase space convoluted with a Gaussian to represent the detector resolution. The background was 
approximated by a linear function. (Additional terms up to quadratic have no significant influence on the 
fit.) 

The K°* — > K^it 11 candidates were reconstructed by selecting all tracks that had pj > 0.5 GeV and 
|tj| < 2.5 that had not been previously identified as muons, forming them into oppositely charged pairs 
and fitting them to a common vertex. These pairs were assumed to be from K°* decays if the fit 

resulted in a % 2 /n.d.f < 6, the transverse momentum of the K°* candidate was greater than 3 GeV, and 
the invariant mass of the track pair fell within the interval 790-990 MeV, under the assumption that they 
were left by ^71^ hadrons. In Figure 1, the signal has been fitted to a Breit-Wigner function convoluted 
with a Gaussian and the background has been fitted to a second degree polynomial function. 

To find the B° d —> J/ yK Q * candidates, the tracks from each combination of J/yr —> /i + ;U~ and K°* —> 
K^K 11 candidates were fitted to a common point. The two muon tracks were constrained to the PDG 
J /yr mass. These quadruplets of tracks were assumed to be from B d — > J/yfK°* decays if the transverse 
momentum of the B d candidate was greater than 10 GeV and the fit resulted in a % 2 /n.d.f < 6. In the 
case of more than one candidate per event, the candidate with the lowest % 2 /n.d.f was retained. 

B Q S — ► J /\f§ candidates were sought by fitting the tracks from each combination of J/\\r — > jU + /i _ 
and (j) — ► K + K candidates fitted to a common vertex. The two muon tracks were constrained to the PDG 
J/yf mass. These quadruplets of tracks were assumed to be from B® — > J/lffQ decays if the transverse 
momentum of the B° candidate was greater than 10 GeV and the fit resulted in a % 2 /n.d.f < 6. If there 
was more than one candidate per event then the candidate with the lowest % 2 /n.d.f was chosen. 

Accepted B® and B d candidates contain a negligible background from non-resonant /i + i u _ pairs, 
0.1% and 0.2% respectively, and therefore the background from non-resonant bb — > /^ + /i _ X events are 
not considered in this analysis. 

Events were accepted in a wide invariant mass window of ±12 • a around B hadron mass, where 
the mass resolution a was determined from reconduction of the B d and B° masses for the two signal 
channels. The mass resolutions were obtained from fitting a single Gaussian to the Monte Carlo signal. 
Table 2 shows the number of events that can be expected using the above procedure for an integrated 



1123 



5-Physics - Performance Measurements for B° d J/ipK°* and B° J/V</> ■ 



Table 2: Signal and background statistics of and B® candidates expected with 10 pb 1 . 





Selected candidates 




expected with 10 pb -1 


Signal B d ^ J/ yK u * 


1024 


pp — > //i/aX background 


1419 


bb ^J/yX background 


3970 


Signal B u s 7/^ 


76 


— > J/yX background 


2449 


bb—>J/ yX background 


1660 


All events satifying B d or B i] s selections 


10323 



luminosity of 10 pb -1 . 

By the time the LHC reaches a luminosity of 10 33 cm~ 2 s _1 and the detector is better understood, 
it will be safe to apply displaced secondary vertex cuts, which will remove most of the backgrounds. 
In the studies of exclusive channels of B decays, vertex displacement selections are replaced by cuts on 
the B hadron decay time. This method avoids any bias on the proper decay time measurements. Table 
3 shows the reconstruction efficiences with and without decay time cuts. In particular, by requiring that 
the proper decay time of the candidate is greater than 0.5 ps, additional rejection by a factor of 260 
for the pp — > J/yX can be achieved while losing 25% of the signal. 

Table 3: B d — > J/yK°* (B® — >J/y<j)) signal and background reconstruction efficiencies before and after 
the cut on B d (fi*') decay time t. The applied cut was t > 0.5 ps. 





efficiency 


%] 




before time cut after time cut 


Signal B d -► J/yK u * 


42.0 


30.4 


pp — > J/yX background 


0.67 


0.0064 


bb—>J/ yX background 


3.05 


1.52 


Signal B u s J/y(j) 


40.5 


30.0 


pp — > J/yX background 


1.5 


0.0058 


bb J /yX background 


1.1 


0.8 



4 Simultaneous fit of mass and lifetime of B° d and with early data 

We now turn to methods for extracting physically interesting parameters from the decays of the B® and 
B d mesons. The first measurements with early data will comprise the mean lifetimes and masses of these 
mesons. 

We perform a simultaneous maximum likelihood fit for each B° s and B d mass and proper decay time 
distributions. The likelihood function L is defined by: 

L = fl [ffi + ^ x Phkg M,m i ) + »-"*-"**> xpwhm)] (i) 
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where the index i runs over the events, N = n s i g + ribck\ + «& C /t2 is the total number of reconstructed events 
in the fit and n S i g , Ub c k\ and ribcia are the numbers of signal and background events. The terms p s j g , pbk g \ 
and pbkgi are products of two probability density functions that model the mass m and proper decay time 
t of the signal and the prompt and non-prompt backgrounds respectively (see Section 2). The number of 
expected events for the prompt background is ribcki and the corresponding probability density function 
in formula 1 is Pbkgi- The probability density function for the non-prompt background is pbkgi- 

For the signal, the mass distribution is modeled by a Gaussian distribution, whose mean value is the 
B hadron mass m(B) and its width a m is given by the detector mass resolution. Both m(B) and o m are 
determined from the fit. The reconstructed proper decay time distribution for the signal is parameterised 
by the function: 

Psi8{i} ~ i:Mo^- Tt pif-mw {) 

where the decay time resolution function p(t — tj) was approximated by a Gaussian of width o which is 
a free parameter of the fit. 

For the background, the mass distribution of the prompt component is assumed to follow a flat dis- 
tribution as observed in simulated data (see Figure 2). The non-prompt component is modeled with a 
second order polynomial function where the coefficient of the linear (quadratic) terms, denoted as c\ (c 2 ) 
in Table 4, are determined from the fit. 

The decay time distribution of the prompt background component is parametrised by a Gaussian of 
width a. The non-prompt component was modeled by the sum of two exponential functions, convoluted 
with the decay time resolution function p. The two exponential functions are denoted as F\ and F2, the 
constant coefficient between them is b\. 

,. v _ So (He- r ' ? + bi x T 2 e- r ^) pif- U) At 

4.1 B° d ^J/y/K°* decay 

The likelihood function, —2lnL is minimised to extract the B d lifetime x = 1/T and mass m(B) from the 
reconstructed events containing a B d — »• J/\f/K 0 * candidates and backgrounds. This fit corresponds to an 
integrated luminosity of 10 pb~'. The distributions of the reconstructed masses and lifetimes are shown 
in Figure 2. Table 4 summarises the results of the likelihood fit. The values obtained from the fit agree 
with the input values used in the simulation (given in the first column) within the statistical errors of the 
fit. The average lifetime of the B® can be measured with an uncertainty of 10% for 10 pb~ x . 

4.2 B° s -> J/\ff(j) decay 

The B®, B® system exhibits two mass eigenstates with two lifetimes; the lifetime difference Ars/T^ is 
expected to be <^(10 -1 ). However, with early data (a few hundred pb _1 ), the statistics are insufficient to 
determine both lifetimes. For the initial period of LHC running, it is assumed that AF S = 0. The method 
for the B° fit is the same as for the B° d case, the main difference being the smaller fraction of signal events, 
as shown in the mass and lifetime distributions for the reconstructed events after cuts selecting the B { ] 
signal (Figure 3). 

Statistics of reconstructed events corresponding to an integrated luminosity of 150 pb -1 enables 
measurements to be made with relative precisions on the B® lifetime of 10% (Table 5). In the fit the 
background events are weighted by factor of 15, since Monte Carlo statistics were limited to the equiva- 
lent of 10 pb _1 for the current study. 
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Table 4: Results of the fit to reconstructed B d candidates corresponding to 10 pb l . The first column 
shows input values used in simulation. 



Parameter 


Simulated value 


Fit result with statitical error 


r, ps- 1 


0.651 


0.73 ± 0.07 


m(B), GeV 


5.279 


5.284 ± 0.006 


a, ps 




0.132 ±0.004 


o m , GeV 




0.054 ± 0.006 


n sig /N 


0.16 


0.155 ±0.015 


"bcki/N 


0.062 


0.595 ± 0.017 


bi 




1.08 ±0.27 


r uP s- 1 




0.67 ± 0.05 


r 2 ,ps-' 




2.4 ± 0.3 






-2.75 ± 0.28 


C2 




4.7 ± 1.4 




4900 5000 5100 5200 5300 5400 5500 5600 5700 
Mass (MeV) 



-2 0 2 



§|bb->J/ ¥ X 

□ pp^j/ ¥ x 

■ B d -> J/\|/ K° 



l\ 



6 8 10 
Decay time (ps) 



Figure 2: Distributions of the reconstructed B d mass and decay time expected with integrated luminosity 
of 10 pb" 1 . 



5 The performance of the jet charge tagger with early data 

Most studies of CP-violation and mixing require the identification of the flavour of the neutral B mesons; 
this is known as flavour tagging. Understanding the potential for flavour tagging methods will be one of 
the important goals with early data. In studies of CP-violation and mixing of neutral B mesons, one must 
know the flavour of a B meson both at the time of production (t = 0) and at the time of decay. 

In a small number of cases, the flavour at production can be inferred from the charge of the highest 
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Table 5: Results from the fit to reconstructed fi' 1 candidates corresponding to 150 pb l . 





Input 


Fit result with statistical error 


J — 




A lA'X -X- f> fK1 


m(fl), GeV 


5.343 


5.359 ± 0.006 


a, ps 




0.152 ±0.001 


o m , GeV 




0.061 ± 0.006 


n sig /N 


0.018 


0.031 ±0.005 


nbcki/N 


0.397 


0.379 ± 0.006 


h 




0.023 ± 0.01 


Tups- 1 




1.35 ± 0.02 


T 2 ,ps- 1 




0.44 ± 0.08 


c\ 




-1.44 ±0.07 






2. 14 ±0.49 




Q bb -» J/\|f X 
Q pp -» J/\j/ X 
■ B s ^ J/\|f 



5000 5100 5200 5300 5400 5500 5600 5700 
Mass (MeV) 




Decay time (ps) 



Figure 3: Plots to show the distributions of the reconstructed B® mass and decay time expected with 
150 pb -1 . Background distributions constructed from simulated events corresponding to 10 pb -1 were 
scaled by a factor of 15. 



Pt lepton unassociated with the signal decay, with the assumption that this tagging lepton originates 
from a semi-leptonic decay of the other B hadron in the event. For the majority of the events, one must 
use the jet charge tagging method. According to fragmentation models, the particles are ordered in the 
momentum component parrallel to the original quark direction, while charge conservation also imposes 
charge ordering [7]. These two facts may be used to form a jet charge, which is related to the b-quark 
charge at production. The jet used in this method consists of all tracks that are unassociated with the 
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signal decay with pj > 500 MeV, \r\ | < 2.5, inside a cone of opening angle AR around the I 
the laboratory frame. The opening angle of the jet cone, AR, is defined: 



AR = ^/A^f + A^ (4) 

where AT7 and Aq> are the differences in pseudorapidity and azimuthal angle between the cone wall and 
the B meson. The jet charge, <2jet, tends to be positive for ft-jets and negative for b-jets, thus allowing the 
B° meson flavour at production to be inferred. The jet charge is defined as: 

Q) *~Li\Pi\ K () 

where the qi is the charge of the i th track in the jet and p\ is a measure of the tracks momentum that can 
be, for example, the transverse momentum of the track or a projection of the track's momentum along the 
axis of the B meson's direction. These are referred to as the pj method and the pi method respectively. 
The parameter K controls the relative contribution of the hard and soft tracks in the jet charge. One 
possible improvement in the algorithm is to remove ambiguous cases such as events with <2jet close to 
zero; the smallest allowed value of \Qj et \ is called the "exclusion cut". The opening angle of the jet 
cone, the exclusion cut and K are free parameters and must be tuned to get the best performance from the 
tagger. 

5.1 Quantifying the performance of a flavour tagger 

The effectiveness of the discrimination between B° and B° mesons at production time is characterized by 
two quantities: its efficiency, £ tag , and the dilution, D tag . The efficiency is the fraction of B mesons that 
were tagged either correctly or incorrectly and is described by: 

N r +N w 

£ta g = — ^— (6) 

where N r and N w are the numbers of correctly and incorrectly tagged B mesons respectively, and N t is 
the total number of reconstructed B mesons. The dilution, also known as the purity, is given by: 

r, _ N r ~N w 



Nr + Nw 

where w tag is the wrong tag fraction: 



= l-2w tag (7) 



In a typical CP violation study, where the aim is to identify a difference in some property between a 
particle and its anti-particle, the relationship between the true asymmetry of this property, A true , and the 
asymmetry as measured in the data, A meas , will be 

Kue = ^~A meas (9) 

which is derived in, for instance, [8]. For the small asymmetries expected in the B decays, the statistical 
uncertainty on A true is, to a good approximation: 

(10) 



\l^l g N t 

The tag algorithm effectiveness is indicated by the quality factor or tagging power, Q tag : 

Gtag — £ tag^tog 

The quality factor is used as a measure of success when optimising the flavour tagger. 
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5.2 Understanding the jet charge tagger using B° d — > J/yK°* decays 

During the early data taking phase, there will be too few B° s —> J/y<f) decays reconstructed to allow a 
detailed comparison between the jet charge distribution obtained from the data and that predicted by the 
Monte Carlo. However, there will be a sufficient number of the analogous B d — > J/yK l) * decays to allow 
such a comparison to be made. Additionally, the final state of the B° d —> J/yK 0 *, with a subsequent decay 
of K°* to charged mesons, allows the initial flavour to be determined in a statistical way, and therefore 
the decay mode is considered as self-calibrating. The jet charge tagger thus produced will be important 
for the CP violation studies with B d —> J/yK s , and the study of B d — ► J/yK°* will allow us to gain 
confidence in the tagging performance for fi' 1 — > J/y(j>. 

For this study, the signal decays were reconstructed as described in the Section 3 for both B d — > 
J/yK°* and — ► J/y<j> decays. A reconstructed sample of 15000 decays was used for each channel, 
corresponding to 150 pb _1 for B d — »• J / yK Q * and 1.5 fb _1 for B® —>J/y<f>; this defines our working point 
for the two channels in this study. The quality factor was then maximised by systematically varying the 
jet charge tagger input parameters AR, K and exclusion cut. It was found that optimal results for both 
B® and B d mesons were obtained using the projection of the track momentum in the direction of the B 
meson (the pi method) as the measure of momentum in Equation 5. The other optimal parameters are 
shown in Table 6. Using these optimised parameters, the jet charge distribution for both B d — »• J/yK°* 
and B® — > J / yty are shown in Figure 4. 

Table 6: The optimised parameters of the flavour tagging algorithm for both B d — »• J/yK Q * and B® — > 
J/y(j>. 



Parameter 


B d ^J/yK Q * 


B° s -> J/y(f) 


K 


0.9 


0.8 


AR cut 


0.7 


0.6 


Exclusion cut 


0.05 


0.2 




Figure 4: Plots of <2jet for B d — »• J/yK Q * (left) and B® —>J/y<f> (right) using their optimised parameters 
of Table 6 and the equivalent luminosity of Table 7. 

One might expect that the different flavour content in the formation of the B mesons will result in a 
different jet charge behavior for B d — > J/yK°* and B® — > 7 This is indeed what is observed, both 
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in the optimisation of these jet charges and their distributions. However, one should also note that both 
the shapes and the optimised parameters, except the exclusion cut, are similar. 

The numbers in Table 7 characterise the expected performance of the jet charge tagger. With an 
integrated luminosty of 150 pb~ ! , it will be possible to calibrate the jet charge tagger for the B d , from 
the data, with an efficiency of 87.0 ±0.3% and a wrong tag fraction of 38.0 ±0.4%. Calibrating with 
real data for the B° s is more challenging as there is no readily availible and clean self-tagging mode. In 
this case, the Monte Carlo dependent calibration will be used, but the agreement of the Monte Carlo with 
real data will be tested indirectly though the B° d — > J/yK 0 * channel. 

Table 7: Performance of the flavour tagging algorithm for the optimised values given in Table 6. The 
eiTors given in the table are statistical. 



Parameter 


B d ^J/ ¥ K°* 


B^J/yQ 


Equivalent luminosity 


150 pb- 1 


1.5 fb- 1 


Number of Reconstructed Events 


13948 


15784 


Efficiency, £ tag 


0.870 ±0.003 


0.625 ±0.005 


Wrong Tag Fraction, w tag 


0.380 ±0.004 


0.374 ±0.005 


Dilution, D tag 


0.240 ±0.009 


0.251 ±0.010 


Quality, Q tag 


0.050 ±0.004 


0.039 ±0.003 



6 Summary and conclusion 

With the early data, the decays B d — » J/yK°* and 5° —> J / \ff(j) can be used to measure B hadron masses 
and lifetimes with sufficient precision to permit sensitive tests of the detector performance. In particular, 
the B d lifetime can be determined with a relative statistical error of 10%, with an integrated luminosity of 
1 0 pb~ 1 , and the same precision will be achieved for the lifetime with 1 50 pb 1 . The proposed method 
of a simultaneous fit of background and signal events allows a sensitive determination of the masses and 
decay times of B mesons. With early data, the optimal overall precision will be obtained with no cuts 
on the secondary vertex displacement. This is appropriate for the early data when the performance of 
the detector and reconstruction algorithms may not be well understood. This strategy is consistent with 
that of the early B-physics triggers, where no displacement cuts on the J/y will be applied at the trigger 
level. 

In the early data taking phase, the self-tagging decay B d — > J/\ffK Q * will be used to calibrate the 
jet charge tag for jets containing a B d . This will be of use for physics studies involving B d decays, but 
also this good understanding for the tagging performance for B d — > J/yK°* will allow the fragmentation 
modelling for 5° — > J /yij) decays to be improved. 
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Plans for the Study of the Spin Properties of the A*, Baryon 
Using the Decay Channel A b —> //i//( i u + i u~)A(/?7r") 

Abstract 

This note summarizes the results of a study of the feasibility of measuring 
certain spin properties of A& baryon in the ATLAS experiment. We present an 
assessment of approaches for extracting the inclusive Ab polarization and the 
parity violating 0C\ b parameter for the decay A/, — ► J /\y(ix + fX~)A(pn~) from 
the reconstructed four final state charged particles. As a key test, we generated 
Monte Carlo samples of Ab events of fixed polarization in the ATLAS detector 
and evaluated our ability to precisely extract the input polarization from the 
reconstructed events. The physics motivation for the planned measurements 
in ATLAS include the search for an explanation of the anomalous spin effects 
in hyperon inclusive production observed at lower energies, tests of various 
decay models based on HQET, tests of CP in an area not yet directly explored, 
and the development of Ab polarimetry as a possible tool for spin analysis in 
future SUSY and other studies. 

1 Introduction 

We report here plans for the measurement of spin parameters of the Ab hyperon. We utilize the decay 
mode Ab — > J }A~)A(pK~) to extract the Ab signal from what is expected to be a low background 
environment, given that the final state has four charged particles and a displaced secondary vertex. The 
polarization and parity violating a\ b parameter will be determined from the relevant angular correla- 
tions between the final state particles. We expect to accumulate approximately 13000 Ab events (and a 
similar number of A&) with an integrated luminosity of 30 fb _1 . This estimation is based on the latest 
reconstruction software and trigger simulation for the ATLAS experiment. 

The Ab is the lightest baryon containing a b quark, and since its discovery in 1991 by the UA1 Col- 
laboration [1] it has created a great deal of interest. Besides the so— called A& lifetime puzzle [2], the 
Ab has been the subject of various theoretical studies ranging from proposed tests of CP violation [3], 
T violation tests and new physics studies [4], measurement of top quark spin correlation functions [5] 
and the extraction of the weak phase y of the CKM matrix [6]. Specific physics interest in the A& parity 
violating a\ b parameter studies derives from its ability to serve as a test for various heavy quark factor- 
ization models and perturbative QCD (PQCD). Ab studies are also of interest because of the continuing 
mystery of why hyperons have consistently displayed large polarizations when produced at energies even 
up to several hundred GeV and at large pj where most models predict zero polarization. It is not known 
if these effects can be explained by some not yet understood effect of existing physics or if they point 
to new physics altogether. Ab polarization holds the possibility of illuminating just how polarized b 
quarks are produced and, indeed, it may have relevance to how fermions are produced in all pp induced 
processes. 

Interest in the studies of the Ab lifetime parameter derives from the current controversy from Tevatron 
experiments concerning the question of how much longer the b quark lives in a meson vs. in a hyperon. 
With an expected increase of a factor of 100 in the statistics at the LHC, we expect to make a definitive 
statement on this puzzle. Again, this will further constrain the theoretical models which have as their 
basis PQCD and the Heavy Quark Model. Lifetime measurements will not be examined in this article, 
since it is not the focus of the current study, though many of the event selection issues, discussed here, 
might be applicable in the Ab lifetime studies. 



1132 



5-Physics - Plans for the Study of the Spin Properties of the A b Baryon Using . 



We have examined the primary technical challenges in the measurement of A& polarization in ATLAS 
by generating large samples of Ab baryons with various known polarizations, allowing them to decay in 
the detector using model— predicted amplitudes, and then reconstructing these events using standard 
ATLAS packages. These samples have permitted us to test our ability to reconstruct events and to 
confirm that we can recover the input polarization and the decay amplitudes. They also have allowed us 
to compare various polarization extraction methods and to assess the impact of detector collections and 
detector resolution effects. We provide here a report on the results of these studies, and on the work we 
undertook to adapt the EVTGEN [7] decay package to produce polarized A/, within the ATLAS software 
framework. 

2 Theoretical overview 

In the quark model the Ab is a fermion consisting of a b quark accompanied by a di— quark (ud) of 
total spin zero. In this model the polarization of the At is thus expected to be totally due to the b quark 
polarization. QCD calculations suggest that the b quark polarization would be small. However, there 
are models of quark scattering [8], in which spin effects are expected to scale with the mass of the 
heavy quark, and where the possibility exists for Ab polarizations to be quite large. We further note that 
QCD has not been able to predict the very large polarizations that have been observed in the inclusive 
production of A hyperons at energies of several hundred GeV. It is hoped that the huge mass difference 
in the b and s quarks will help elucidate the origin of these unexplained spin effects. 

Interest in the a\ b parameter for the Aj, stems from the fact that HQET models [9] purport to calculate 
this quantity from rather basic principles of PQCD and factorization. We have an interest in comparing 
our ultimate measurements of this quantity with these predictions and assessing what constraints they can 
provide for these models. We provide below a brief overview of the theoretical basis for the polarization 
and dA b measurements. 

2.1 Heavy quark polarization in QCD 

In the Standard Model heavy quark production is dominated by gluon— gluon fusion and qq annihilation 
processes. A non— zero polarization requires an interference between non— flip and spin— flip helicity 
amplitudes for the A/, production, with the latter containing an imaginary part. In QCD this complex part 
can only be generated through loop corrections, so that the relevant diagrams for polarized quarks are 
&(0Cs). The polarization expected from all QCD sub— processes (g — g fusion, qq annihilation and q — 
q, q — g scattering) have been calculated [10]. The formulae for the polarization for each one of the four 
processes is directly proportional to ce s , and it depends just on the ratio x Q = yhq/ p Q and the scattering 
angle 6q, (all defined in the center of mass frame), and are thus valid for any final— state quark Q. The 
expected polarization in single b quark production by gluon— gluon fusion and qq annihilation has been 
found to be a maximum of 5% for gluon— gluon fusion, and a maximum 10% for qq annihilation. When 
these predictions are compared to the observed A polarization (due to the s quark polarization) [11], they 
are found to be an order of magnitude too small. One might not be surprised if the A& polarization is, as 
well, greater than predicted in QCD. 

An important result in [10] is the dependence of the polarization on the quark mass. The heaviest 
quark produced is the most polarized, and the maximum polarization is reached around x Q ~ 0.3. The b 
quark polarization is predicted to be an order of magnitude greater than the s quark polarization, which 
from A polarization measurements has been found to reach values over 20% at 400 GeV [12]. 

The measurement of the At polarization in ATLAS in the exclusive channel A& — > J / yM. proposed 
here would cover pr(Ab) > 8000 MeV (because of trigger and reconstruction constraints on the trans- 
verse momentum of the final— state particles, see Table 2) and x F (Ab) < 0.1. It could make a significant 
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contribution to testing different models of production of polarized baryons in this new kinematic region. 

An idea that the heavy quark pre— exists in the incoming proton before scattering and becomes po- 
larized through a direct scattering from an incoming quark provides another pathway for the Ab to be 
polarized. This possibility has been discussed by Neal and Burelo [13]. If polarizations are observed in 
inclusive A& production that exceed a few percent, such a mechanism should be given careful attention, 
since no other existing models can account for such large values. 

2.2 A b —> J/\ir(n + n~)A(pn~) decay and angular distributions 

The proposed study of A& polarization would probe not only the production process but also explore the 
decay of A&. Decay models predict values for various quantities that can be experimentally observed, 
thus providing a test of specific HQET/Factorization model [14] assumptions. 




Figure 1 : The weak decay of A&: Ab — > // yA . 

The fact that At, has a significant lifetime suggests that it decays weakly. The dominant decay process 
would involve the emission of a W~ boson, as illustrated in Figure 1. The spin and parity of the particles 
involved in the Ab — > J/y(n + ii~)A(pn~) decay are well known. The Ab with J F = \ decays to A with 
J p = \ + and J/x\r with J p = 1 . The general amplitudes for the decay of A b (j + ) — > A(^ + )//i//-(l~) is 
given by: 

^ = A(p A ) e;(p J/v ) \A 1 y*>f+A 2 %±r i +B 1 >f+B 2 %± \A b (p Ab ), (1) 

m A b m A b 

which is parameterized by the four complex decay amplitudes A x , A 2 , B 1 , B 2 and where is the polar- 
ization vector of the J/y . 

Given the general amplitude, we may compute the helicity amplitudes. We use helicity amplitudes, 
because they have a direct physical relationship to the spin parameters we wish to study. Four helicity 
amplitudes are required to describe the decay completely. We will use the notation Hx A x for the 
helicity amplitudes of the decay A& — »• J /\lf{jl + jX~)A{pn~) , where Aa = ±1 /2 is the helicity of A and 
Xj/y = +1, 0, -1 is the helicity of J/\\r . These four helicity amplitudes: a + = #1/2,0, a_ = H_i/ 2 Q , 
b + = //_ 1/2,1, b_ = #1/2,-1 are normalized to unity: 

|a+| 2 + |a_| 2 +|fc+| 2 + |fe_| 2 = l. (2) 

In this notation, the A& decay asymmetry parameter a\ b is given by [15]: 

\a + \i-\a_\i + \b + \i-\b_f 
aAb \a + \ 2 + \a_\ 2 + \b + \ 2 + \b_\ 2 - ^ 
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The helicity amplitudes a + , a_ , b + , b_ are computed directly from the decay amplitudes A x , A 2 , 
B V B 2 according to the following equations: 



where £2± = (jti\ b ± aha) — and m\ b and »za are the Ab and A masses respectively [16, 17]. 

The polarization of the A& can be determined from the angular correlations between the Ab — * J / \\fA 
final decay products. The Ab polarization reveals itself in the asymmetry of the distribution of the angle 
6. This angle is defined as the angle between the normal to the beauty baryon production plane and the 
momentum vector of the A decay daughter, as seen in the A^ rest frame. The decay angular distribution 
can be expressed as: 

w~ l + a A6 Pcos(0), (5) 

where a\ b is the decay asymmetry parameter of Ab and P is the A^ polarization [18]. 

Using the method described in [17], it can be shown that the full decay angular distribution is: 

w(G,A,P) = ^jf/^ / 2l -6P,a A ) Fi0) (6) 

where the fa (A) are bilinear combinations of the helicity amplitudes and A = (a + ,a_,b + ,b_). fji stands 
for Pcc\, P, «a, or 1, where OCa is A decay asymmetry parameter. Ft are orthogonal angular functions 
defined in Table 1. The A& decay asymmetry parameter a\ b is related to the helicity amplitudes as 
defined in Equation 3. The five angles d = (d,d 1 ,d 2 ,(p l ,(P2) (see Figure 2) in this probability density 
function (p.d.f.) have the following meanings: 

• 6 is the angle between the normal to the production plane and the direction of the A in the rest 
frame of the Ab particle; 

• d\ and 0i are the polar and azimuthal angles that define the direction of the proton in the A rest 
frame with respect to the direction of the A in the Ab rest frame; 

• 62 and 02, define the direction of }X + in the J / y rest frame with respect to the direction of the 
J/y in the A^ rest frame. 

There are nine unknown parameters in Equation 6. They are the polarization P and four complex 
helicity amplitudes: a + = \a + \e M+ , a_ = \a_ \e' a - , b + = \b + \e'P+ , b_ = \b_\e'P~ . Using the normalization 
condition (see Equation 2) and using the fact that the overall global phase is arbitrary, we can reduce the 
number of unknown independent parameters to seven. 



3 Monte Carlo samples 

In order to determine if it is feasible to detect polarized A//s in the ATLAS experiment and to mea- 
sure their polarization, Monte Carlo samples of polarized A^ particles were generated using the standard 
ATLAS software packages. The generation of polarized Ab particles and the propagation of their polar- 
ization in the decay process required a special treatment, and EVTGEN was adapted for this puipose. 
The next sections describe how this was implemented in the framework of the ATLAS experiment. 
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hi 


hi 


F- 


0 


a + a* + + a_a*_ + b + b* + + b_b*_ 


1 


1 



1 


a + a\ - a 


_u"_ 


-+KK- 


b_b*_ 


P 


COS0 


2 


a + a\-a 


_a*_ 


■ -KK + 


-b_b*_ 


a A 


cosd l 


3 


a + a* + +a_ 


_a*_ 


-KK- 


b_b*_ 


Pa A 


cos d cos 6 l 


4 


-a + a* + -a_ 




+ \b + b%- 


f \b_b*_ 


1 


l/2(3cos 2 0 2 -l) 


5 


—a + a* + +a_ 


a 


+\kk- 
-\kk- 


- U_b*_ 
f \bjf_ 


P 


l/2(3cos 2 0 2 - 1) cos0 


6 


-a + a* + +a_ 




a A 


l/2(3cos 2 0 2 -l)cose i 


7 


—a + a* + — a_ 


a 


-\kk- 


-\b_b*_ 


P,a A 


1/2(3 cos 2 d 2 - 1 ) cos d cos 0j 



8 -3Re(a + a*_) P,a A sin 6 sin d { sin 2 d 2 cos <Pj 

9 Vm{a + a*_) Pa A sin 0 sin 6^ sin 2 0 2 sin <Pj 

10 -\Re(b_b* + ) Pa A sinesin©! sin 2 0 2 cos (<?>!+ 2 <p 2 ) 

11 \lm{b_b\) Pa A sin0 sin^ sin 2 0 2 sin(<P[ +2<p 2 ) 
~~ 12 -^Re(b_a* + + a_b* + ) Pa A sin 6 cos 6 l sin 6 2 cos 6 2 cos (p 2 

13 ^Im(b_a* + + a_b\) Pa A sin 6 cos 6^ sin d 2 cos 6> 2 sin <p 2 

14 -^Re(b_a*_+a + b* + ) Pa A cosd sind l sin0 2 cos0 2 cos(<p 1 + <p 2 ) 

15 4=Im(b_a*_+a + b* + ) Pa A cos 0 sin 6>j sin 0 2 cos 0 2 sin((p l + (p 2 ) 



16 




P 


sin d sin 0 2 cos 0 2 cos <p 2 


17 




P 


sin 0 sin d 2 cos 0 2 sin <p 2 


18 




a A 


sin 6y sin 0 2 cos d 2 cos(<Pj + <p 2 ) 


19 


— -^Im(b_a*_ —a + b* + ) 




sin 6y sin 0 2 cos 0 2 sin((p 1 + <p 2 ) 



Table 1: The coefficients f u , f 2j and F i of the probability density function in Equation 6. 




3.1 The generation of polarized A h particles 

To generate A/, particles, the PYTHIA 6.4 generator [19] is used. Since PYTHIA does not incorporate 
polarization information from the decay of At particles, EVTGEN was used to generate the A^ decay. 
EVTGEN provides a general framework for implementation of B hadron decays using spinor algebra 
and decay amplitudes. This framework permits the proper management of spin correlations of very 
complicated decay processes. EVTGEN is a Monte Carlo generation package itself, but in this case it is 
used only to decay the A^ particles produced by PYTHIA. 
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3.1.1 Re-hadronization process and cuts at PYTHIA level 

PYTHIA provides mechanisms to produce b quarks, referred to as gluon— gluon fusion, q-q annihilation, 
flavor excitation, and gluon splitting. If all these processes are taken into account, beauty quark events 
would constitute only 1% of the total number of generated events. In addition, the fraction of b quarks 
hadronizing to A& is less than 10%. These make the process of A^ generation computationally slow. To 
optimize the generation process, a re— hadronization step of the same event in the bb pairs production is 
used. In order to avoid repetition of A/, events due to the re-hadronization process, a Ab pre— selection 
is implemented at this stage to filter on average only one of the re— hadronized copies of the same event. 
An additional reason that the At generation process is slow is that around 95% of final state particles 
(two muons, a proton, and a pion) of the generated Ab events are outside of the tj limits (|tj| < 2.5) 
of the ATLAS detector. In addition, all events must pass the level— 1 trigger of the ATLAS trigger 
system and some pre— reconstruction requirements, such as having a minimum reconstructable transverse 
momentum. We could not apply these cuts in the PYTHIA step since the kinematics information of 
the Ab children is available only at a later stage, when EVTGEN decays the Ab particles. However, by 
analyzing the pj and tj distributions of A& particles before and after cuts (emulating level— 1 and level— 2 
triggers, and requiring \r}\< 2.5) on the final state particles, we estimated pj and tj limits, below which 
the Ab can not be selected and then applied these cuts in the PYTHIA selection. Figure 3 shows the pj 
and tj distributions from which the pr(Ab) > 6000 MeV and |tj(A^)| < 3 cuts were selected to filter A^ 
particles in PYTHIA. 




Figure 3: Distributions of pj (left) and tj (right) for Ab particles generated using PYTHIA, without 
cuts (hollow circle), applying tj cuts only (cross) and applying all cuts (solid circle) from Table 2. 



3.1.2 Setting A b polarization in EVTGEN 

To set the polarization of Ab particles we used the spin density matrix description of EVTGEN. For the 
case of spin— 1/2 particles like A& the density matrix is defined as: 

p= l -(I + p.a) (7) 

where P is the polarization vector, and a = (01,02,03), where o ( is z'-fh Pauli matrix. In our case P is 
defined as: 

? = p( f (8) 

\\ZX Plab{A b )\J 

where P is the magnitude of the polarization, p\ ub is the momentum of the Ab in the laboratory frame, 
and z is the z - axis (along the beam direction) in the ATLAS reference system. 
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To decay polarized A& we use the HELAMP model of EVTGEN. This model is capable of simu- 
lating a generic two body decay with arbitrary spin configuration, taking as input the helicity amplitudes 
describing the process. In the case of the decay A& — ► J n~)A(pn~) , as it has been shown in the 
previous section, there are four complex helicity amplitudes: a + , a_, b + , and b-. 

The A decay into a proton and a pion has been simulated with the same model, using as input 
parameters the two helicity amplitudes Ha a ,A p defined in terms of the A helicity A A and the proton X p 
helicity as 

h_=H_i ; _i, h + =H + i >+ i. (9) 

The choice of h± is constrained by the experimentally well known A — > p%~ asymmetry parameter [20] 

a A = | h + | 2 - | h- | 2 = 0.642 ± 0.013. (10) 

Finally, the decay J/yr—y jJ. + H~ has been described with the EVTGEN VLL (Vector into Lepton Lep- 
ton) model [7]. 

3.1.3 Filtering of A;, ->■ J n~)A(pK~) events 

As a last step in the generation process, we apply kinematic cuts on muons, pion and proton to emulate the 
fiducial acceptance, level— 1 trigger, and pre— reconstruction requirements. These cuts are summarized 
in Table 2. 



Particles 


Minimum pj [MeV ] 


Maximum Vj 


Protons and k's 


500 


2.7 


Most energetic muon 


4000 


2.7 


Other muon 


2500 


2.7 



Table 2: Cuts applied at the particle level. 



3.2 Monte Carlo samples and input model for A b decays 

As input to the HELAMP class of EVTGEN, the result obtained within the framework of PQCD for- 
malism and the factorization theorem [9] has been used to model the A^ decay. From the complex 
amplitudes calculated in this model, A x , A 2 , B x , B 2 in Equation 1, the helicity amplitudes a + , a_ , b + , 
b_ are calculated by using Equation 4. This is summarized in Table 3. In this model, the A^ decay 
asymmetry parameter, defined in Equation 3, is OL\ b = -0.457 l . 



A \ 


=-18.676- 


185.036/ 




=-0.0176 


-0.4229/ 


A 2 


= -7.461- 


351.242/ 


a_ 


= 0.0867 


f 0.2425 i 




= 15.818- 


162.663/ 


b__ 


=-0.0810 


-0.2837/ 


8 2 


= -4.252 + 


266.653/ 


b_ 


= 0.0296 


f 0.8124/ 



Table 3: PQCD model amplitudes A, and B,, are given in units of 10 10 and helicity amplitudes a± and 
b± are normalized to unity. 

By using this decay model as input, two Monte Carlo samples were generated with polarizations 
of -25% and -75%. These Monte Carlo samples were generated, simulated, and fully reconstructed by 
using the Athena framework [21]. 

1 There is an ERRATA in [9] in the reported value of a Ah [14]. 
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To show how the angular distributions behave, fast Monte Carlo samples (see section 3.3) were 
generated using an accepted— rejected method based on the p.d.f. defined in Equation 6. Figure 4 shows 
the distributions of the five angles for helicity amplitudes from Table 3 and polarizations of 40%, 0%, 
-40%. 




Figure 4: Distributions of the five angles characterizing the decay A& — > J j V(jU + n~)A(pn~) for differ- 
ent polarization values. For cos{6\), cos(02), and 02, all three distributions for the different polarization 
values look similar, thus only one polarization case is presented. 



3.3 Fast Monte Carlo generation 

In order to do fast tests of different Af, decay models and different polarization values, we need to generate 
large Monte Carlo samples. This represents a problem due to the computer time required to produce a 
full chain simulated Monte Carlo data. In order to address this problem a fast Monte Carlo generator was 
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developed. This generator uses Equation 6 to generate angular distributions for the daughters of the 
in the rest frame, and then uses a (p, rj) distribution derived from phase space of generated events in 
PYTHIA to compute the kinematic variables of the daughter particles in the laboratory frame. Detector 
effects are incorporated by using pj and Vj cuts on final state particles to mimic di-muon triggers and 
pre— reconstruction requirements. Figure 5 illustrates the strong agreement between angular distributions 
produced by using PYTHIA and EVTGEN Monte Carlo events and fast Monte Carlo events. 



ATLAS 




Figure 5: Comparison of Monte Carlo events (PYTHIA + EVTGEN) with fast Monte Carlo generated 
events. Solid dots represent the Monte Carlo events. 



4 A b reconstruction 

The reconstruction of A^ candidates begins with a search for events with J/\ff candidates. Among these 
events we search for A — > pn~ candidates, which are then combined with the // \ff to reconstruct the At. 
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4. 1 Selection of//y^/i + /i candidates 

We search for J/\\r candidates which satisfy the following selection criteria: 

• The jJ. + jJ.~ candidates must originate at the same reconstructed vertex and the x 2 of the vertex 
must be lower than 20; 

• The invariant mass of candidates M(/i + jU~ ) should be within 2800 MeV and 3400 MeV. 
The invariant mass distribution of candidates before applying the invariant mass cuts to select A^ 
is shown in Figure 6. 




Invariant mass (MeV) 

Figure 6: Invariant mass of jJL + jJL~ candidates. The dark color represents all J/y candidates after 
reconstruction and vertexing requirement. The circles represents J/\\r candidates when level— 1 and 
level— 2 trigger signature are required. 



4.2 Selection of A — > p% candidates 

From the previously selected events containing a J/\\r , A candidates are selected by applying the follow- 
ing requirements: 

• Two opposite charged tracks originating from the same reconstructed vertex. 

• The invariant mass of two tracks M(pK~) should be within 1 105 MeV and 1 128 MeV range, where 
for computing M(pn~), the track with the highest transverse momentum was assumed to be the 
proton, as observed in 100% of the times in Monte Carlo generations, while the other track was 
assumed to be a pion. 

Many of the A particles decay outside of the high— precision part of the Inner Detector, which covers 
a radius of about 40 cm from the beam line, and thus are lost in reconstruction. The decay vertex position 
of A's in the RZ plane is presented in Figure 8. If the A decays outside the 40 cm radius, the number 
of reconstructed space points (hits in the pixel or silicon layers) is not sufficient for a successful track 
reconstruction. This effect reduces the fraction of reconstructible A to around 60%. Figure 7 presents 
the invariant mass distribution of the pn~ candidates before the invariant mass cuts have been applied. 
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4.3 Selection of A b — > 7/ )A(pn ) candidates 

A previous study [22] based on early ATLAS simulation software estimated that the number of A b and 
At events which we expect to collect for the integrated luminosity of 30 fb _1 is 75000. Using the new 
fully reconstructed sample we made a new estimation. We used the following expression to calculate the 
number of events: 

jV ' = &o{A b )g, (11) 

where Jz? is the integrated luminosity, o(Ab) = 7.4 pb is the cross section of A b — > J /y(fx(pT > 
4000 MeV)n(p T > 2500 MeV))A(p(p r > 500 MeV)7t(p T > 500 MeV)) , see details of the calculation 
in Table 4, and <§ is an overall A^ acceptance, which includes the level— 1 and level— 2 acceptance for 
A b ^J/y{p + p-)A{ P K-) . 

For selecting events with b hadrons at a luminosity below about lO^cm -2 .? -1 , the first level trigger 
will require the presence of a muon with pj > 6000 MeV within the trigger geometric acceptance of 
|tj| < 2.4. The effect of the level— 1 trigger threshold on muon pj is not a sharp cut and a fraction of 
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a(pp^A h X) 


0.00828113 mb 


BR ( A^ — > J/\jfA ) 


(4.7 ±2.8) x 10~ 4 [20] 


BR ( A — > pn~ ) 


(63.9 ±0.5) x 10~ 2 [20] 


BR( j/y^m+pr ) 


(5.93 ±0.06) x 10~ 2 [20] 


Including cuts 


0.05 


Overall cross-section 


7.4 pb 



Table 4: The cross-section calculation of A b — > J/y(/jL + ij, )A(pn ) decay. 

muons with pj lower than 6000 MeV will be collected. Figure 9 shows the efficiency of the level— 1 
simulation with nominal pj threshold of 6000 MeV as function of pj . Around 69% of events with 
J/ljf — > H + /J.~ , where one muon has pj > 4000 MeV and the second muon has pj > 2500 MeV, 
passed the level— 1 trigger simulation. Therefore a signal dataset with pj less than 6000 MeV has been 
chosen to study all possible triggered events with low pj muons instead of a usual shaip 6000 MeV cut. 




Figure 9: The level— 1 trigger simulation efficiency as a function of muon pj , obtained from the A& 
signal sample over the whole detector volume. 

Further selections in the high level trigger are based on the Region of Interest (Rol) identified at 
level— 1, as follows: a search for a second muon close to the trigger muon is used to select channels 
containing two final state muons, for example from J/yr . It is based on expanding the level— 1 muon Rol 
to find a second muon which was not triggered by level— 1. This increases the efficiency of the di— muon 
trigger by extending the pj acceptance for the second muon down below 6000 MeV. The size of the 
increased Rol is based on the distribution of angular distance in Y\ and 0 between two muons decayed 
from J/ y . The Inner Detector tracks which are reconstructed within these Rol, are then extrapolated to 
the muon system to find the corresponding hits within the window. The Inner Detector tracks associated 
with the muon spectrometer hits can be identified as muons. The level— 2 trigger efficiency is found 
to be around 78% for A,, -► J/y{Li{p T > 4000 MeV)/i(pr > 2500 MeV))A(p(p r > 500 MeV)n(pr > 
500 MeV)) . 

We reconstruct the A b by performing a constrained fit to a common vertex for the two muon tracks 
and A , with the two muon tracks constrained to the J j\\f mass of 3097 MeV [20]. The reconstruction ef- 
ficiency depends on the cuts which will be applied on all Inner Detector tracks in the reconstruction stage 
to reduce the fake rate. The overall efficiency is found to be around 6.1% if the pj threshold is 500 MeV, 
see Table 5. Figure 10 shows the invariant mass distribution of A b candidates. Simulation of the level— 1 
trigger with level— 1 pj thresholds of 6000 MeV and 4000 MeV and level— 2 trigger, explained above, 
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included in the analysis. We expect to collect around 13500 (13100) — »• J / y/"(M + jl )A(pn ) events 
using 4000 MeV (6000 MeV) level- 1 muon threshold for the integrated luminosity of about 30 ftr 1 . 



level— 1 trigger: 
with pr threshold 


one muon 


two muons 


4 GeV | 6 GeV 


4GeV 


6GeV 


level— 2 trigger: 


TrigDiMuon 


Topological trigger 


J/\ff reconstruction efficiency 
including level- 1 and 
level-2 triggers 


42% 


39% 


27.5% 


10% 


A reconstruction efficiency 




15% 


At overall efficiency 


6.1% | 5.9% 


5.4% | 3.5% 



Table 5 : The overall Ab efficiency depending on the trigger strategy. 




Invariant mass [MeV] 

Figure 10: jU + jU~ A invariant mass distribution. The dark color represents all A/, candidates after 
reconstruction and vertexing requirement, and the light color represents the case when a level— 1 and 
level— 2 trigger signature is required in addition. Filled circles represents data after all selection cuts. 
The fit is the result of using double Gaussian and Polynomial functions. 

We need to acknowledge that there are other inefficiencies that will appear when we analyze the real 
data. For example, even if the individual track reconstruction efficiency is as high as 98%, we will have 
an overall reduction in event rate of about 10%. Even if such reductions occur, we still expect the final 
sample to be sufficient for a meaningful measurement of the At polarization. 

4.4 Angular distributions and angular resolutions 

The reconstruction efficiency modifies the angular distributions used in the polarization determination. 
Figure 1 1 shows how the angular distributions change due to detector acceptance for a Monte Carlo 
sample with polarization of -75%. 

The angular resolution of the five angles is presented in Figure 12. We used this angular resolution 
in the statistical uncertainty study. 
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100r 



°-3 -2-10123 °-3 -2 -1 0 1 2 | 

Figure 11: Comparison of fast Monte Carlo events without kinematics and detector acceptance cuts 
(open circles) and Monte Carlo events after full detector simulation and reconstruction (solid circles). 



4.5 Background 

Due to its production rate the main background source for our Ab reconstruction will be the prompt pro- 
duction and decay of J /iff — > jJ. + IJ.~ which are then combined with A candidates in the event. However, 
the long lifetime of the Ab allows us to reduce significantly this kind of background by applying a life- 
time cut. After a A/, lifetime cut (a cut of 200 /im on the proper transverse decay length), this background 
was found to be negligible and it is not considered in this study. 

In order to investigate the different contributions of long— lived background particles not removed by 
the lifetime cut mentioned above, we used a inclusive J '/iff Monte Carlo sample of bb — »• J/iffX requiring 
in addition to a J/y , a A in each event (bb — > J/yAX). This A could be produced along with the J / y 
from a B hadron decay or just be part of the event, and the invariant mass of the J/y + A combination 
should be within 5100 - 6100 MeV. Figure 13 shows the invariant mass distributions of Ab candidates 
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Figure 12: Angular resolution from fully simulated Monte Carlo data. The fit is the result of using double 
Gaussian distributions. 



reconstructed in this Monte Carlo sample. The observed level of background under the A& signal is of 
few percents, and it is considerably reduced after extra cuts like the lifetime cut mentioned above. In 
Figure 13 another wider distribution due to At — > J /\ylP{Ay) is observed very close to our At —> J / yA 
signal. This is due to the branching ratios of both decays channels being the same as set by default in 
PYTHIA. This behavior has not been observed at Tevatron experiments where hundreds of A/, — > J/yA 
events are reconstructed. Therefore we expect the branching ratio of the Ab — > //i//E°(Ay) decay to be 
considerably smaller than the branching ratio of the A b — > J /yA, and that the resulting background will 
be much smaller than shown. 
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1200 




Figure 13: Invariant mass distribution from A& candidates identified in b — > J/\ffAX Monte Carlo sample. 
Composition at generation level with smearing from reconstruction (left) and fit to the fully reconstructed 
events (right) after vertexing requirement are shown. 



5 Extracting A b polarization and decay parameters 
5.1 Fitting method 
5.1.1 Likelihood function 

To extract polarization and decay amplitudes we performed an un— binned maximum likelihood fit to the 
angular distributions. The log— likelihood function Jzf is defined by: 

X = -2 £ logos' AP)), (12) 



w„ t ,(g-,i, P)= .^me, eye ^ 

f fw(d',A,P)T(d,d')dddd' 
w(6' ,A,P) is the p.d.f defined in Equation 6, 6' are the measured angles, 6 are angles without detector 
effects, and T(d, 6') is defined as 

T(e,e') = e(e)R(e,e'), (U) 

where e(6) is the efficiency function and R (6, 6') is the resolution function. 
In the ideal case the resolution function is: 

R(d,d') = 80-6'), (15) 

then we have 

w*.0.lf)= , (16) 

where ^ = J F i (6)e(d)d6 are the acceptance corrections values, which have to be calculated in advance 
to perform the fit. 

The final log— likelihood may be re— written as a sum of two terms: 

^- 2 l M igl£;^ )+io8(e(?))i - 
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Since the second term does not depend on the parameters we want to measure, the main challenge is to 
find the acceptance function. 

5.1.2 Detector acceptance corrections 

The acceptance corrections integral ^ i = J ' F i {0)e{0)dO can be approximated by the following form, 
using the Monte Carlo integration techniques 



7 gen j = q G(6) 



where N gen is the number of generated events, N acc is the number of accepted events after the simulation 
of the fiducial acceptance and pj cut and G is the p.d.f which has been used to generate the 0. 

If the generation of the events is done using certain p.d.f (w), the acceptance can be calculated by the 
simple expression: 

F0) 



I 



Ngen p 0 W(6,A,P) 



We used this expression to calculate the acceptance in the case when w is the p.d.f from Equation 6. 

This method can be used under the assumption that the acceptance does not depend on the measured 
parameters, and that the angular resolutions are close enough to the ideal resolutions. In order to check 
the first assumption we plotted the ratio 

Jw(e,A,P)e(d,A,P)de 
Jw(d,A,P)e(d,A,P = 0)dd 

for the different polarization values (see Figure 14). No significant dependence of the acceptance on the 
polarization is observed in this test. 




Figure 14: Ratio defined in Equation 20 as a function of polarization. 



The angular resolutions are shown in Figure 12. To test the effect of these resolutions, Monte Carlo 
fits were performed including a smearing of the data based on the Gaussian fits in Figure 12. Fit results 
with and without smearing are consistent within the statistical uncertainty. Figure 15 shows, as an ex- 
ample, a comparison of fit results for a sample of 2000 events with polarization of -75% when fits are 
performed on the sample of generated Monte Carlo events. 
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Figure 15: Comparison of fit outputs from generation level Monte Carlo with and without Gaussian 
smearing due to finite angular resolution. Error bars are statistical uncertainties from the fit with Gaussian 
smearing included. 



5.2 Fits to fully simulated Monte Carlo data 

In order to extract polarization and decay parameters from the Monte Carlo data samples, final se- 
lection cuts were applied. A proper transverse decay length greater than 200 \im is required to remove 
contamination from prompt produced J / y events. The proper transverse decay length for the Ab candi- 
date is given by: 

A = -^r=L w ^, (21) 

03y)r* '■ p t 

where (j3y)£* and M\ b are the transverse boost and the mass of the Ab, and L xy is a transverse decay 
length. The transverse decay length is defined as = ■ p T /pr where is the vector that points 
from the primary vertex to the A^ decay vertex and p T is the transverse momentum vector of the A;,. A 
minimum pj of 500 MeV is required for any track used in the A^ reconstruction. In addition, a pj > 
4000 MeV is required for the muon with larger pj , and pj > 2500 MeV for the second muon. These 
cuts reduce the A^, sample by 21%, mainly due to the lifetime cut. 

Table 6 shows the results of performing a likelihood fit to our fully simulated Monte Carlo data, for 
a sample of 2000 Ab events, corresponding to around 5 fb _1 of collected data. Figure 16 shows the 
difference between the input values in Monte Carlo and the extracted values of polarization and decay 
parameters by the likelihood fit. We used as fitting parameters: \a + \, |a_|, \b+\, a+ — j3_, a_ — j3_, 
j8 + — j6 , and the polarization P. 

Detector acceptance corrections in Equation 19 were computed separately from the two Monte Carlo 
samples with different polarizations which are used in this study. Collections computed in the Monte 
Carlo sample of -75% polarization were used in the fit of the Monte Carlo sample of -25% polarization, 
and vice versa. Due to the limited statistics in the Monte Carlo samples used to calculate the acceptance 
corrections defined in Equation 19, a bagging (from bootstrap aggregating) technique [23] was used to 
generate multiple samples in order to avoid the effect of statistical fluctuations. This technique consists 
of generating replicates of a data set by selecting at random events from the original data set allowing 
repetition of events. We generated 1000 bootstrap replicates of the fully simulated Monte Carlo data 
sample. The J^- factors (Equation 19) were computed from each generated data sample and the average 
was taken as the value for each of the twenty correction factors. Systematic uncertainty due to the 
width of the correction factors distributions in these 1000 generated data sets was estimated by repeating 
the fit to fully simulated Monte Carlo using the J^- values from the each of the generated samples, and 
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Parameter 


Value ± Uncertainty 


Value ± Uncertainty 


Value 




(Polarization = -25%) 


(Polarization = -75%) 


(Input at generation level) 


Polarization 


-0.213 ± 0.069 


-0.882 ± 0.064 


-0.25/-0.75 


\a + \ 


0.461 ±0.051 


0.413 ± 0.023 


0.429 


|o_| 


0.289 ± 0.058 


0.161 ±0.035 


0.260 


\b+\ 


0.259 ±0.071 


0.370 ± 0.027 


0.295 


a + -p 


-0.991 ± 0.640 


-2.050 ±0.134 


-1.612 


a -p 


0.856 ± 0.364 


0.681 ±0.342 


1.231 


P+-P- 


-1.442 ± 0.666 


-2.624 ±0.187 


-1.849 



Table 6: Fit results from fully simulated and reconstructed Monte Carlo events with input polarization of 
-25% and -75%. 



assigning the width of the distribution of fitted parameters as a systematic uncertainty. This systematic 
eiTor (also shown in Figure 16) can be reduced with more Monte Carlo statistics for the ^ i calculation. 




Figure 16: Comparison of fit results for polarization of -25% (left) and -75% (right) with respect to input 
values from Monte Carlo generation. The statistical and systematic uncertainties are included. 



5.3 Estimate of statistical uncertainties 

To estimate statistical uncertainties as a function of polarization, we used a fast Monte Carlo probabilistic 
approach to generate polarized particles. The fast Monte Carlo includes angular resolution from the 
fully reconstructed samples and detector acceptance simulation. We generated a large number of samples 
with different values of polarization. A maximum likelihood fit was used to extract the decay parameters 
and the polarization. Detector acceptance corrections were calculated from high statistics fast Monte 
Carlo data simulated without polarization. Figure 17 presents the expected statistical uncertainty in the 
polarization P and in a\ b as a function of the polarization value for the integrated luminosity of 30 fb _1 
. The study was done for a^ b = -0.457 with the same input model as used in fully simulated Monte 
Carlo. In Figure 17 also the correlation between OL\ b and P is shown as a function of polarization. The 
correlation values were extracted from the Maximum Likelihood fit results. 
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In our study we used specific set of decay amplitudes, presented in Table 3, to demonstrate our ability 
to extract these parameters. To insure that the success of our analysis techniques did not depend on the 
amplitudes chosen, we conducted a fast Monte Carlo study using a different model with 0C\ b = 0. 1 [24] 
to test our procedure in a case of smaller 0C\ b value. We found that it is possible to satisfactorily extract 
CCA b and the polarization even with such a change in the amplitude values. 



Figure 17: Expected statistical uncertainty on polarization (top) and on cca,, (center) as a function of the 
polarization P. Bottom plot shows the expected correlation between 0C\ b and the polarization P. All 
plots show results from the fast Monte Carlo study, obtained for the expected number of events in 
data sample of 30 fb _1 . 



6 Results and conclusions 

In this note we have presented the results from a series of studies to determine if polarized A& baryons 
can be reconstructed in ATLAS and have their polarization and a\ b parameter measured. Our results 
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indicate that the answer is affirmative. Ab events should be identifiable through the reconstruction of 
their four charged final state particles, and the angles between these particles can be measured with suf- 
ficient accuracy to determine the parent's polarization. With trigger constraints and detector cuts fully 
specified, our more complete analysis suggests that the number of events we should expect after 30 fb _1 
of data will only be 13,000, compared to the 37,500 noted in the ATLAS -TDR [22]. Different additional 
detector and background effects, which are difficult to model at the current level of detector description, 
can further reduce the signal sensitivity. These effects could include the detector and trigger inefficiency, 
misalignment, pile-up events and increased combinatorial background due to e.g. the fake tracks. Never- 
theless, even with a reduction of 50%, a polarization measurement with a statistical uncertainty of several 
percent should be possible in a regime where polarization is larger than 25% as experimentally measured 
at lower energies. Efforts will continue to develop algorithms to improve the various reconstruction and 
trigger efficiencies and in consequence providing an enhanced yield of reconstructed particles in data 
samples. 

We note that almost all models predict that the A& polarization at the LHC at small Feynman x should 
be vanishingly small. Measurement of a significant polarization would have to be regarded as a signal of 
an unexplained effect, either from the domain of existing physics, or of new physics altogether. 

We further note that the development of Ab polarimetry as a tool for studying spin effects at the LHC 
could be important. For example, members for the SUSY community are quite interested in knowing 
what fraction of the b quark polarization ends up in the polarization of a At, since this could provide a 
way to test if b quark SUSY partners have the correct handedness. Only a few hundred Ab decays would 
be required to, for example, determine if its polarization were 100% or -100%. Challenges clearly exist, 
however, in determining the polarization transfer fraction, which requires a source of b's such as from 
Z — > bb, and in dealing with the fact that only 10~ 5 of Z?'s generate decay into the A& channel we have 
described here. Our work on this topic will continue. 

Other related studies that should continue include mechanisms for comparing the 0C\ b parameters 
from Ab and its antiparticle as a test of CP. We will accumulate data on both. If CP is conserved, the two 
parameters should be equal in magnitude but opposite in sign. While the precision of this test will not be 
high, and while models predict that any CP violation would be small in this sector, nevertheless, such a 
test would be unique in this domain and should be made. 

Finally, as noted in Section 1, the lifetime of the A/, remains a topic of significant interest. Such a 
measurement will be a natural by-product of our efforts to extract the Ab spin parameters. 
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Study of the Rare Decay B° s 

Abstract 

We investigate the feasibility of measuring of the rare decay B® — > jJ. + jJ.^ in 
ATLAS. The contribution of inclusive and of the most important non-combinatorial 
background is studied. 

1 Introduction 

The rare decays, B® — ► £ + £~ with ^ ± = e ± ,jJ, ± , or x ± , are mediated by flavour-changing neutral currents 
that are forbidden in the Standard Model at tree level. The lowest-order contributions in the Standard 
Model involve weak penguin loops and weak box diagrams that are CKM suppressed. Examples of the 
lowest-order diagrams are shown in Figure 1. Since the B ( s ' meson is a pseudoscalar that has positive C 
parity and the transition proceeds in an £ = 0 state, the electromagnetic penguin loop is forbidden. The 
two leptons are either both right-handed or both left-handed leading to additional helicity suppression. 
Thus, branching fractions expected in the Standard Model are tiny. 




Figure 1: Lowest order Standard Model contributions to B Q S —>■ . 



The early searches for rare B meson decays started with radiative penguin decays, first observed by 
CLEO in 1993, where they presented evidence for the exclusive decay B — > K*y and for the inclusive 
decay B ^X s y a year later [1,2]. 

The B factory experiments, BaBar and Belle, have measured these decay modes with more precision. 
The present world average for the inclusive mode is 3&{B -»• X s y) = (3.55 ±0.26) x 1(T 4 [3]. BaBar 
and Belle also observed the decays B — > K^£ + £~ and B — >• X s £ + £~ that are two orders of magnitude 
smaller than B — > X s y [4, 5]. The decay B {) s — > jJ. + jJ.~ is expected to be further reduced by three orders of 
magnitude. 

In extensions of the Standard Model, the —> jJ. + jJ.^ branching fraction may be enhanced by 
several orders of magnitude. Thus, several experiments have searched for these decays. The largest B ( s ' 
samples have been collected by CDF and DO corresponding to a luminosity of 2 fb _1 but no signal has 
been observed. The lowest branching fraction upper limit was set recently by CDF yielding 3$(B° S — ► 
jU + ;U~) < 5.8 x 10~ 8 @95% confidence level [6]. This is still about an order of magnitude higher than 
the Standard Model prediction. As ATLAS has an elaborate muon system extended over a large region of 
the solid angle, the dimuon final state is expected to be reconstructed with high efficiency and good mass 
resolution. Thus, there are good prospects for observing this decay in the dimuon channel and measuring 
its branching fraction with reasonable precision. 

2 Theoretical description 

The Standard Model amplitude for the process B s ^ — > £ + £~ is calculated from the effective Hamiltonian 

h& = -% ,°L v; h v tq (cM^)+c s ^)^)+c P ^)M^)+h.c., (i) 

V2 nsm dw 
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where C,-(/i) are Wilson coefficients that present the perturbatively calculable short-distance effects and 
are local operators that describe the non-perturbative long-distance effects of the transition. The 
scale parameter }X is of the order of the ft-quark mass (~ 5 GeV), dw is the weak mixing angle, a is 
the electromagnetic coupling constant and V* b V tq are CKM matrix elements for t — >• b and t ^ q = s,d 
transitions, respectively. 

The dominant contribution results from the axial- vector operator {?iq, [7]: 

^10 = (hfqL)(£Yn75£). (2) 
The Wilson coefficient C\q has been determined in the next-to-leading order (NLO) of QCD. The NLO 
corrections are in the percent range and higher-order corrections are not relevant [8]. In NLO an excellent 
approximation in terms of the MS mass of the top quark, m t , is given by: 

The measurements of the top quark mass at the Tevatron, mf ole = 17 1 .4 ± 2. 1 GeV [9] , yield an MS mass 
of m, = 163.8 ± 2.0 GeV and the world average of the W-boson mass is m w = 80.403 ± 0.029 GeV. The 
accuracy of this approximation is 5 x 10~ 4 for masses of 149 GeV < m t < 179 GeV. 
The other two operators represent scalar and pseudoscalar couplings to the leptons: 

G s = m b (b R q L )(U),0 P = m b (b R q L )(iy 5 £). (4) 

The Wilson coefficients, C s and C P , are determined from penguin diagrams that involve the Higgs boson 
or the neutral Goldstone boson, respectively. Although they are not helicity suppressed, their contribu- 
tions are tiny in the Standard Model and they may be safely neglected in Standard Model calculations. 
The B q — > jl + lJL~ branching fractions including the scalar and pseudoscalar contributions are given 

by: 



647T- 3 sin 4 dw \ 



4m^ 
~ Ml 



[(' 



where Mg q , Tg ? , and fs q respectively are mass, lifetime and decay constants of the B q meson. The decay 
constant is determined in different models, including quark models, QCD sum rules and unquenched 
lattice theory. The accuracy is presently of the order of 10 - 15%. Evaluating a at the Z-mass scale, 
a(Mz) = 1/128, the following predictions were made for the B q — > jJ. + jJ.^ branchings fractions in the 
Standard Model [8]: 

= (3.86 ± 0,5) x ^^^L x 10-', (6, 

Br(B S^V) - (..06 ± 0^04) x Jg_ x 10-». 

In extensions of the Standard Model, such as supersymmetry (SUSY), Higgs doublet models or 
models with extra gauge bosons, scalar-current, pseudoscalar-current or axial-vector current interactions 
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may arise with new particles in the loop. This yields new contributions in the Wilson coefficients C\o,Cs, 
and Cp. Since the scalar and pseudoscalar operators are not helicity suppressed, they may give rise 
to a large enhancement of the branching fraction. Furthermore, the contribution of the pseudoscalar 
operator may produce destructive or constructive interference with the axial vector operator. Thus, new 
physics may increase or decrease the branching fraction with respect to the Standard Model value. For 
example, in the minimal supersymmetric Standard Model (MSSM), the B® — > jU + ;U~ branching fractions 
are proportional to tan 6 (/3) l . The branching fraction of — > jJ. + jJ.~ is expected to be a factor of 40 
lower than that for B s — > jJ. + jJ.~ , hence, the latter is the focus of this note. 

3 ATLAS strategy for B° s -> ~ study 

Measurements of the properties of B decays with such extremely low branching fractions in ATLAS 
is possible namely due to the large beauty cross-section and luminosity of the LHC machine. Thus at 
luminosity 10 33 cm - 2 s~ l 10 12 fihadron pairs will be produced each year. It is expected that ATLAS 
will record 10 8 events with B decays each year by using B-physics triggers [10]. Triggers dedicated to 
rare dimuon B® — > jJ. + jJ.~ decays will be described in the Section 4.2 of this document. 

Since the branching fraction is so small in the Standard Model, semileptonic B decays and even some 
rare B decays may yield substantial backgrounds. The key issue for B® — > }X + IX~ discovery at the LHC 
is the suppression of the backgrounds. The ATLAS strategy for observing B® — > jJ. + IJ.~ is as follows. 

The first step is to trigger on events containing a8j^ M + M candidate using dedicated trigger 
algorithms which are described in this document. In the offline analysis the selections will be refined to 
reduce backgrounds. To achieve final separation of signal from background we will employ statistical 
methods based on several variables. Both parts of the offline selection are described in this paper. 

Once recorded data are available, the background in the signal region will be estimated using side- 
bands in the distribution of the muon pair invariant mass. In the current study the background was 
estimated using simulated events. Two categories of backgrounds were simulated: the so called com- 
binatorial background from bb pairs producing two muons in the final state; and the exclusive back- 
grounds, coming from two-body hadronic B decays and from the process B® — »■ K~n + v. The exclusive 
backgrounds contribute to the signal region and the lower mass sideband only. They do not occur in the 
higher mass sideband, so their contribution to the signal is estimated separately. 

After the number of background events in the signal region has been determined, the number of signal 
events Nb can be determined from a comparison of the total number of events found in the signal region, 
and the estimated background. For low statistics an upper limit on Nb corresponding to certain confi- 
dence level is determined using appropriate statistical methods. Once Nb is determined the B° s — > jJ. + jJ.~ 
branching fraction, {%{B Q S — > can be calculated using a relative normalisation to the reference 

channel B + -> J /v(M 4 V~) K+ ■ 

This document presents a Monte Carlo simulation study which follows the strategy described above. 
We start from the trigger level in Section 4.2. This is followed by the offline analysis, optimisation 
of discriminating variables and finally the determination of background and signal contribution in the 
signal regions, in Section 4.4. Systematic uncertainties are analysed in Section 5, followed by the start- 
up strategy in Section 6. 

It should be stressed that due to large uncertainty in the predictions of the bb production cross-section 
at the LHC energy this paper cannot derive a precise sensitivity to 8$(B° S — > at ATLAS but rather 

to show the ATLAS potential for this study and its discovery capability under some assumptions. 

'tan(/3) is the ratio of vacuum expectation values for charged and neutral Higgs bosons. 
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4 Monte Carlo study 

4.1 Simulation and event selection 

The Monte Carlo simulation samples used in the analysis have been generated as part of the central AT- 
LAS Monte Carlo data production runs, and details of this simulation have been given in the introduction 
to this chapter. 

The list of generated signal and background events is given in Table 1. To ensure that most of the 



Process 


# Events 


fi' 1 — > jU + jU~ 


47.5k 


bb^\i+\i~X 


146.5k 


B {) ^K- 71+ 


50k 




50k 



Table 1: List of processes and number of events analysed 

generated dimuon events passed the trigger, only events containing two muons with p T larger than 6 
GeV and 4 GeV, were retained for detector simulation. For the signal channel 5° — >• jU + ;U~, multiply- 
ing the cross-section reported by PYTHIA with the branching ratio 3.42 x 10~ 9 gave a cross-section of 
15 fb. 47.5k events have been generated and passed through the full detector simulation and reconstruc- 
tion. Simulation of pileup has not been available, thus it was not simulated for either the signal or the 
background events. 

The sample of dominant background process events, bb decaying semileptonically giving two muons 
in the final state, were simulated with the same versions of the software, and with the same kinematic 
cuts as for the signal. The PYTHIA cross-section for such a sample is estimated to be 110 nb and a total 
of 146.5k background events that passed the reconstruction stage were used in the physics analysis. 

In addition to the combinatorial background, there are several B backgrounds that may contribute to 
the signal region. These include two and three body decays where two of the final state particles are K+, 
71+ or jx+. Although the rate for misidentification of kaons or pions as muons, due to punchthrough or 
decay in flight, is only of the order 0.5%, the small 3§(B Q S — ► fJL+fX~) requires investigation of the other 
rare B decays. The decay modes which we consider to be most important are summarised in Table 2. 



process 


branching fraction 


Ref. 


5° K+Ti- 


(1.82±0.08) x 10~ 5 


[11] 


5° - TC+Tt- 


(4.6 ±0.4) x 10~ 6 


[11] 


fi u -► K+K 


< 3.7 x 10 _/ @90%CL 


[11] 


B» - 71+71- 


< 1.7 x 10~ 4 @90%CL 


[11] 


B'l -► 71+ K- 


<2.1 x 10~ 4 @90%CL 


[11] 


B ( l -► K+R- 


<5.9x 10- 5 @90%CL 


[11] 


B^K-jx+v 


~ 1.36 x 10- 4 


* 2 


B u -f n-jx + v 


(1.36±0.15) x 10- 4 


[11] 



Table 2: B meson decays contributing to the non-combinatorial background 



2 An estimation based on isospin symmetry and the measurement of B° — > 7t il + v. 
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We have studied one of the two-body and one of the three-body decays with full simulation, namely 
B® — > K~n + and B° s — > A' /x + v. Contributions from the other channels were estimated to be similar 
or smaller. To enable the production of a sizable event sample with decay in flight, a special GEANT 
simulation option which forces kaons and pions from selected B mesons to decay in the inner detector 
volume was developed [12]. The position of the decay is randomly selected from a uniform distribution 
between the origin and the exit point from the inner detector. 

4.2 Trigger strategy 

We describe trigger methods for selection of the B () s —>■ jJ. + jJ.~ channel developed by the B trigger group 
as part of the project presented in this document. The full description of the ATLAS B-physics triggers 
is given in the introduction to the B-physics chapter [10]. The first level trigger (LI) performance for 
dimuon channels can be found in [13] and details of the second level (L2) dimuon trigger implementation 
in [12]. 

At the LHC start-up, the luminosity level is expected to be of order 10 31 cm _2 s _1 and a pj threshold 
as low as 4 GeV can be used at LI. The dimuon rate after LI is expected to be only a few Hz. This 
admits the possibility of applying L2 track reconstruction in the full volume of the inner detector. This 
detailed approach allows the study of dimuon background features to understand of their composition. 
With the subsequent rise of luminosity the LI pj threshold will increase to 6 GeV. The dimuon rate 
after LI is expected to rise to about 360 Hz at L = 10 33 cm _2 s _1 and the L2 track reconstruction in the 
full volume of inner detector will be replaced by the Region of Interest (Rol) guided mode, documented 
in [12]. 

The simulation of the trigger in the current study is performed by applying the strategies for lumi- 
nosity L = 10 33 cm _2 s _1 . At LI the threshold of pj > 6 GeV has been applied and events containing 
two LI muon signatures are analysed further using the L2 topological dimuon trigger algorithm with a 
threshold of pj > 6 GeV. Following the Rol defined at LI, the muon candidates are reconstructed in 
the muon spectrometer, then matched to the tracks reconstructed in the inner detector (inside the Rol) 
and combined into one track. The invariant mass of two opposite sign muons is required to be less than 
7 GeV. These muons should also be successfully fitted to a common vertex. Only loose selection cri- 
teria are used at this step {% 2 < 10). Implementation of the third level trigger, the event filter (EF), is 
not finalised yet and the offline reconstruction efficiency is used to estimate the EF efficiency (the same 
reconstruction algorithms are supposed to be used at EF). 

Results on the LI and L2 efficiencies as well as an estimated EF efficiency for signal B® — > jx + ix~ are 
given in Table 3. The LI efficiency is defined as the ratio of the B ( s ' — > jJ. + jJ.~ events passing the LI trigger 
and the input events generated with p T > 6 GeV and |t]| < 2.5 for both muons from the B ( s ' — ► }X + }X~ 
decay. The L2 efficiency is defined as the fraction of events accepted by LI satisfying the above L2 
reconstruction and selection cuts. The efficiency of the event filter is estimated as the fraction of events 
that both satisfy L2 and also are successfully reconstructed at EF. These values of the efficiency have 
been used in the subsequent analysis. 

Various types of trigger algorithms and trigger thresholds will be used in the real experiment depend- 
ing on the luminosity achieved, dedicated computing resources available for the online event processing 
and the actual beauty yield at LHC energies. 

Table 3: Trigger efficiency of signal B® — > fl + H~. The methods of calculating efficiencies at LI, L2 and 
EF levels are given in the relevant place in the text. 



L1*L2 efficiency 


EF w.r.t L2 


Overall trigger eff. 


0.52 


0.88 


0.46 
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4.3 Event reconstruction and analysis. 

Muon reconstruction quality is of high importance for the B ( s ' —> jJ. + jJ.~ channel. The muon candidates 
produced by the STACO [14] method were used. This method combines the independently reconstructed 
inner detector and muon spectrometer tracks. Figure 2 shows the muon reconstruction efficiency as a 
function of a true muon pj. The efficiency is defined as the number of muon candidates reconstructed and 
matched to the Monte Carlo particle tracks in the corresponding p T bin divided by number of generated 
muons. The p T spectrum of generated muons is superimposed on the efficiency plot. 




Figure 2: Muon offline reconstruction efficiency as a function of p T . The superimposed histogram - p T 
spectrum of muons in the signal events (right scale). 

For the physics analysis, we select events containing identified muon pairs with opposite charges. 
Aside of the kinematic cuts (pj 1 ^ > 6(4) GeV and |t] w ,^ 2 | < 2.5) no additional cuts have have been 
applied. These two muons then constitute a B meson candidate. The VKalVrt vertexing package [15] is 
used to fit tracks into a vertex. We require the vertex quality to have % 2 < 10. The momentum resolution 
is important as a narrow mass search window reduces the background contribution. Figure 3 shows 
the dimuon mass distribution for the cases when both muons are in the barrel region (|7]jti,ju 2 | < 1-1) or 
in the end-cap (I^^J > 1.1). The Gaussian fit (using bins with contents > 10% of maximum) gives 
(7 = 70 MeV for the barrel and o = 124 MeV for the end-cap. We used 90 MeV as an estimate of the 
invariant mass resolution for the signal events. 

In this document we present a cut-based method for signal extraction and background rejection. For 
the future, we are investigating another method using a boosted decision tree [16]. 

In the cut based analysis a set of discriminating variables is chosen and using the signal and back- 
ground simulated events the optimal set of cuts is determined. The signal events are identified by re- 
quiring that the dimuon invariant mass is consistent with the mass of B s meson. To reduce background 
events where two muons originate from different sources (e.g. independent semileptonic decays of b and 
b quarks), the following discriminating variables were chosen (values used in the final analysis are given 
in parentheses): 

• Transverse decay length of the B s candidate (L xy >0.5 mm ) 
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lnv.Mass(p.n) [GeV] lnv.Mass(n(i) [GeV] 



Figure 3: Reconstructed B s mass in barrel - when both muons have |rj| < 1.1 (left) and in the end-cap - 
both muons have | X] | > 1.1 (right) 

• The pointing angle a between the dimuon pair summary momentum and the direction of the decay 
vertex as seen from the primary vertex (a <0.017 rad ) 

• Isolation 1^^ = p^ /(pj 11 +'Lip' T (AR < 1)) , where the sum is over all tracks with pj > 1 GeV 
(excluding the muon pairs) within a cone of AR < 1 , where AR = \J (AT7) 2 + (A0) 2 and At] and 
A(j) are the pseudorapidity and azimuthal angle of track i with respect to the momentum vector of 
the muon pair ( 7^ > 0.9 ). 

• An asymmetric search window for M^, £ [M B o — o,M B o + 2a] , is used to avoid a possible contri- 
bution from B° d ^ n + n~ decay. 

Figures 4, 5 and 6 show distributions of discriminating variables for signal and background events. 
Due to the low Monte Carlo statistics, it is not feasible to perform a cut analysis as in a real experiment 
(i.e. applying all cuts simultaneously) since this will leave no events for the analysis. However, some 
of the discriminating variables show no (or small) correlations between each other. This allows the 
estimation of the rejection power of such variables separately. Then the product of all efficiencies can 
provide a reasonable estimate of the total rejection. Table 4 shows a correlation matrix for the variables 
used in this study. The correlation between the pointing angle a and the transverse decay length L^. is 
higher than among other variables so the pointing angle and the transverse decay length are examined 
simultaneously to take their correlation into account. The systematic uncertainty due to this correlation 
is estimated as +50%. 

Figure 7 illustrates the rejection power of each cut. One cut is applied at a time to the combinato- 
rial background events and to the B Q S — > K ~ fXV and BP S — > K~n + events where one or two hadrons are 
misidentified as a muon. The combinatorial background is effectively suppressed by these cuts while the 
non-combinatorial events are less well rejected. 

Table 5 summarizes the output of this cut-based analysis. For the bb — ► MJ.X background in the 
left column the efficiencies are given separately for each cut, whilst in the right column the combined 
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Transverse decay length [mm] 



Figure 4: Transverse decay length of reconstructed candidates. The signal is shown as closed circles, 
background as opened circles. Distributions are normalised to 1. The vertical line indicates the lowest 
transverse decay length allowed for selected events. 




Figure 5: Distribution of isolation variable 1^ for the reconstructed B° s candidates. The signal is shown 
as closed circles, background as opened circles. Distributions are normalised to 1. The vertical line 
indicates the lowest values of variable allowed for selected events. 
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0 0.1 0.2 0.3 0.4 0.5 



Figure 6: Distribution of pointing angle a for the reconstructed B® candidates. The signal is shown as 
closed circles, background as opened circles. Distributions is normalised to 1. The vertical line indicates 
the highest values of a allowed for selected events. 









a 






1 


-0.09 


0.04 


-0.03 


Inn 




1 


-0.07 


-0.03 


a 






1 


-0.17 










1 



Table 4: The linear correlation coefficients among the discriminating variables for background events. 
The statistical uncertainty is about ±0.05 for each coefficient. 
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efficiency is given for the cuts on the pointing angle and on the transverse decay length. As one can see 
the total rejection is largely overestimated if all cuts are treated separately, so the combined value is used 
to estimate a total yield of background events. The contribution from B° s — > K~ }XV and — > K~n + is 
found to be negligible comparing to the combinatorial background contribution. The errors quoted for 
the efficiencies are statistical only, so they represent only the size of the available Monte Carlo sample, 
but not the expected accuracy of the experiment where the initial number of background events will be 
much higher. More details on uncertainties will be given in Section 5. 

Figure 8 shows the dimuon mass distribution for signal and background events after all selection cuts 
have been applied. For the combinatorial background, the contribution for the left and right side of the 
signal region is estimated in the same way as for the signal region (Table 5). 
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Figure 7: The Monte Carlo di-muon mass distributions for signal B° — > i u + i u _ (histogram), combinato- 
rial background (closed circles) and non-combinatorial background (open circles and triangles) for (a) 
preselected events, (b) after cuts on transverse decay length, (c) pointing angle and (d) isolation. The 
number of events has been scaled to 10 fb _1 of integrated luminosity. 
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Figure 8: Dimuon mass distribution for surviving events after applying all three cuts. The signal is 
shown as histogram, combinatorial background by closed circles and non-combinatorial backgrounds 
by opened circles and triangles. The combinatorial background is estimated assuming a factorisation of 
applied cuts. Statistics are given for an integrated luminosity 10 fb -1 . 

Table 5: Selection efficiencies and number of signal and background events for integrated luminosity 
of 10 fb -1 . Preselection criteria used: 4 GeV < M(jUju) < 7.3 GeV , vertex fit % 2 < 10 , transverse 
decay length L xy < 20 mm. Numbers of expected events are computed according to the Standard Model 
expectation. In the left column for the bb — > }X}xX background the efficiencies are given separately for 
each cut, in the right column the combined efficiency is given for the cuts on the pointing angle and on 
the transverse decay length. 



Selection cut 


fi" — > ;U + jU efficiency 


bb —> jU + ;U X efficiency 


> 0.9 


0.24 


(2.6±0.3)-10- 2 


> 0.5mm 
a < 0.017 rad 


0.26 
0.23 


(1.4±0.1)-l(r z 
(8.5±0.2)-10- 3 


(1.0±0.7)-10- 3 


Mass in [-a, 2a] 


0.76 


0.( 


)79 


TOTAL 


0.04 


0.24 -10- 6 


(2.0±1.4)-10- 6 


Events yield 


5.7 




14^ 



5 Systematic uncertainties 

There are several sources of uncertainty in this analysis. Some of them are relevant only for the Monte 
Carlo study, whilst others should be taken into account with the real data analysis as well. 

In this presented analysis, the expected number of signal and background events is estimated by 
counting directly instead of the normalisation procedure described in Section 3, which is supposed to 
be used in a real experiment. Consequently the dimuon trigger and reconstruction efficiencies, and 
acceptance, are not canceled by those from the reference channel and must explicitly be taken into 
account. Using the methods developed by ATLAS [12] this systematic uncertainly is estimated to be of 
about few percent. 

There is a theoretical uncertainty of a factor of two in the b-production cross-section at LHC energies, 
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which clearly affects the Monte Carlo predictions. Consequently all numbers of events from Table 5 scale 
accordingly. 

The difference between the real and simulated kinematic properties of detected particles (e.g., due 
to not fully accounting for misalignment or material effects) can also introduce a bias in our predictions. 
However Figure 2 shows that the most of muons from B® — > have a pj in the region of the 

efficiency plateau so the possible deformation of pj (as well as r\) spectra could change the resulting 
efficiency by not more than a few percent. The uncertainty from the cuts factorisation hypothesis is 
assumed to be approximately 50%. 

Some uncertainties will, to a large extent, cancel if we use a normalisation channel B + — > J/yK + 
to estimate the 3S(B® — > jU + /x _ ) as the dimuon trigger conditions are similar for both channels. Without 
experimental data, it is difficult to estimate the uncertainty in the number of background events in a 
given mass range. In the DO analysis [17], the sideband extrapolation method is estimated to give an 
uncertainly of 20-30%. Indeed, this is the main source of uncertainty in the DO analysis of B° s — ► . 
Corrections should also be made for the contribution of B° d — > decay in the experimental sample. 

In total, the uncertainty from the systematic errors discussed in this Section is approximately ±25%. 
In addition, the procedure adopted to estimate backgrounds via cut factorisation has large uncertainties, 
which are estimated to be of the order of 50%, as discussed above. In addition, a 70% uncertainty arises 
from Monte Carlo statistics. Overall, we choose to combine these in quadrature to obtain an indicative 
overall uncertainty on the background of +90%/-75%. 

6 Start-up strategy 

Already at 1 fb _1 of integrated luminosity, ATLAS can have O(10 6 ) of dimuon events in a mass window 
4 GeV < M(jUju) < 7 GeV (after vertexing and quality cuts). It will allow tuning of the selection proce- 
dure either for the cut-based analysis or for the multivariate methods for the background discrimination. 
Events that survive background suppression will be used to estimate the background contribution to the 
signal search region. The contribution from combinatorial background will be estimated using the side- 
bands interpolation procedure. The contribution from exclusive backgrounds due to fake dimuons from 
hadronic two-body B decays or from B° s — > K~fx + V, will be determined on the basis of the study of the 
hadron/muon misidentification probability. The background estimation will be compared with the num- 
ber of events observed in the signal region. Following this information an upper limit on the number of 
signal events Nb corresponding to certain confidence level will be determined, using appropriate statisti- 
cal methods. Finally, the value of Nb will be used to extract the upper limit on the B® —>■ jl + ix~ branching 
fraction, &{B° S — > fx + jx~), using a reference channel B + — > J /\yK + . In this procedure a ratio of geomet- 
ric and kinematical acceptances of the signal and the reference channels will be determined from the 
Monte Carlo simulation. Trigger and offline reconstruction efficiencies largely cancel for dimuons in 
these channels. The reference channel B + — > J /yK + will also be used to check the Monte Carlo simu- 
lation. The efficiency of the final selection cuts on discriminating variables for the signal B® — > /x + 'pr 
will be determined using Monte Carlo simulation (validated with the reference channel). 

7 Conclusions 

We have presented the strategy for searching for the rare decay B° s — >• with the ATLAS detector. 

Whilst we do not expect to observe this decay during the early stages of the LHC, as more luminosity 
becomes available and our understanding of the backgrounds improves, it should be possible to identify 
a signal for the process. There are uncertainties due to the relatively unknown beauty production cross- 
section at the LHC, and also the limited Monte Carlo statistics available for this study. Within these 
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limitations, assuming the Standard Model, we expect a signal of 5.7 events with a background of 14+}q 
events for an integrated luminosity of 10 fb _1 . It is evident that the background uncertainties are large 
in this study. However, background estimates based on real data will be able to make use of much 
higher statistics and these will provide reduced uncertainties, as well as allowing the evaluation of more 
sophisticated methods of analysis. 
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Trigger and Analysis Strategies for B° s Oscillation 
Measurements in Hadronic Decay Channels 

Abstract 

The capabilities of measuring B ( s ' oscillations in proton-proton interactions with 
the ATLAS detector at the Large Hadron Collider are evaluated. B® candidates 
in the Djn + and Dja\ decay modes from semileptonic exclusive events are 
simulated and reconstructed using a detailed detector description and the AT- 
LAS software chain. For the measurement of the oscillation frequency a Am s 
sensitivity limit of 29.6 ps _1 and a five standard deviation measurement limit 
of 20.5 ps _1 are derived from unbinned maximum likelihood amplitude fits 
for an integrated luminosity of 10 fb _1 . The initial flavour of the B ( s ' meson is 
tagged exclusively with opposite-side leptons. Trigger strategies are proposed 
for scenarios of different instantaneous luminosities in order to maximise the 
signal channel trigger efficiencies. 

1 Introduction 

As tests of the Standard Model the CP-violation parameter sin(2j3) will be measured with high pre- 
cision (at the percent level) as well as properties of the B ( s '-meson system, like the mass difference of 
the two mass eigenstates Am„ the lifetime difference AT S /T S and the weak mixing phase (j> s induced 
by CP-violation, with <j> s ps 2 A 2 r\ in the Wolfenstein parametrisation. The different masses of the CP- 
eigenstates B^ (CP-even) and fif (CP-odd) give rise to B s mixing. The observed B ( s ' and 5*' particles 
are linear combinations of these eigenstates, where transitions are allowed due to non-conservation of 
flavour in weak-current interactions and will occur with a frequency proportional to Am s . B® oscillations 
have been observed at the Fermilab Tevatron collider by the CDF collaboration [1] measuring a value 
of Am s = (17.77 ±0.10(stat) ±0.07(sys))ps _1 and DO collaboration [2] reporting a two-sided bound 
on the B ( s ' oscillation frequency with a range of 17 ps _1 < Am s < 21 ps _1 . Both results are consistent 
with Standard Model expectations [3]. In ATLAS, the Am s measurement is an important baseline for the 
B -physics program and an essential ingredient for a precise determination of the phase (j> s . CP-violation 
in B®-B® mixing is a prime candidate for the discovery of non-standard-model physics. For the channel 
B® — > J which has a clean experimental signature, a very small CP- violating asymmetry is predicted 
in the Standard Model. The measurement of any sizeable effect of the weak-interaction-induced phase 
$s in the CKM matrix, which lies above the predicted value, would indicate that processes beyond the 
Standard Model are involved. Furthermore, the determination of important parameters in the B*' meson 
system will be valuable input for flavour dynamics in the Standard Model and its extensions. 

In this note an estimation of the sensitivity to measure the B®-B® oscillation frequency with the AT- 
LAS detector is presented. The signal channels considered are the hadronic decay channels B ( s ' — > Djn + 
and B® — > Djaf with D~ — > (j)7l~ followed by 0 — > K + K . In the case of B Q S — >• D~a\ the a| decays 
as a\ — > pn + with p — > % + %~ . Including the sub-decay D~ — > K*°K~ [4] would increase the event 
statistics by about 30%. However, for these sub-channels, which require an additional trigger signature, 
the increase of the overall trigger rate would be unacceptable. Detailed information of the signal and the 
exclusive background channels is given in Section 2. The high event rate at the Large Hadron Collider 
(LHC) imposes very selective requirements onto the B-physics trigger strategies, reducing the rate by 
about six orders of magnitude for recording events. Since an initial "low-luminosity" running period is 
scheduled with a luminosity starting at 10 31 cm _2 s _1 and rising to 2 • 10 33 cm~ 2 s _1 , followed later on by 
the design luminosity of the LHC of 10 34 cm _2 s _1 , the B-trigger must be flexible enough to cope with the 
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increasing luminosity conditions. The overall B-trigger strategy as well as the different strategies dealing 
with the luminosity scenarios in the initial running periods are discussed in Section 3. An important part 
of the mixing measurement is to identify the flavour at production, i.e., whether the observed B s meson 
initially contained a b or ah quark. A detailed description of an opposite-side lepton flavour tag and of 
the various sources of the wrong tag fractions is given in Section 4. The selection of B ( s ' candidates with 
kinematic cuts as well as mass resolutions of the B ( s ', are explained and shown in Section 5. A luminosity 
of 10 33 cm _2 s _1 and no pileup is considered for the detailed analysis of signal and background channels. 
Strategies for lower and higher luminosities are also discussed in the same section. The results of the 
signal-candidate selection are used as input to a toy Monte Carlo simulation generating a sample of B ( s ' 
candidates, which is used for the amplitude fit method [5] to obtain the Am s measurement limits. The 
construction of the likelihood function, the Monte Carlo sample and the extraction of the Am s sensitivity 
are discussed in Section 6. 

2 Simulated Data Samples 

Simulated &-quark pairs are generated using Pythia [6], with the &-quark required to decay to one of 
the specified signal channels. The &-quark decays semileptonically producing a muon with pj > 6 GeV 
within 1 77 1 < 2.5. Details on generation, simulation and reconstruction of the simulated data samples are 
given in the introduction of the B-chapter [7]. 

In addition to the simulated signal samples B® — > Dj((j)7l~)7l + and B® —> D~ (§n~)a\ , correspond- 
ing exclusive background channels that give an irreducible contribution to the selected B ( s ' signal were 
investigated. Two B° d decay channels, B° d — > D K + ja\ and B d — > Df%~/a[, and one B® channel, 
B° s — >• D*~n + jaf, were simulated for both hadronic decay channels. The dedicated trigger studies de- 
scribed in Section 3 require additional samples, such as the inclusive background channels bb — > jl6X, 
bb — ► fxAX and cc — > jJ,4X containing semileptonic b or c decays requiring one muon with a generated 
Pt > 4 GeV (or 6 GeV) and further decay products (X). Also, one particular signal sample (as a choice 
B® —>■ D~af) requiring one muon with a generated pj > 4 GeV (identified by (jll4)) is used for the trig- 
ger studies. A sample of minimum bias events is used for the determination of overall trigger rates. See 
Table 1 for the number of events generated and the cross-sections calculated from the values given by 
PYTHIA and the appropriate branching ratios [8]. Errors on the cross-sections include statistical errors 
and contributions from the uncertainties on the branching ratios. 

Effects of pileup and B-meson mixing were not included in the simulation of any of the samples. 

3 Trigger Strategies 

The trigger strategy used for the B® — »■ Djn + and B ( s ' — > D~af channels is to identify the Df decaying 
to <(>(—>■ K + K~) n, which is common to both decay channels. At level one (LVL1) a muon is required to 
enrich the content of the triggered data sample with B -events. The high level trigger (HLT) is split into 
level two (LVL2) and Event Filter (EF). A search for a Df is performed following one of two strategies. 
The first method, the FullScan approach, performs reconstruction of tracks within the entire Inner De- 
tector. It is an efficient method, but time consuming and its feasibility depends on the background event 
rate. The second method performs track reconstruction in a limited volume of the Inner Detector only, 
which is defined by a low-/? r jet region of interest (Rol) identified at LVL1. This Rol-based method 
is faster but there is a loss in efficiency due to the requirement of a LVL1 jet Rol in the event and the 
geometrical restriction to the Rol. 

The increase of the luminosity after LHC startup affects the trigger in two ways: the trigger rates for 
the jet and muon trigger will increase, seeding the HLT Df algorithm more frequently, and combinatorial 
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Table 1 : Number of events generated and calculated cross-sections for the different signal and exclusive 
background simulated data samples for the B® — > D~n + and B® —> D~a\ analysis and particular samples 
used for dedicated trigger studies. Branching ratios for particle decays into final states are included. 
*)The branching fraction has not been measured yet, only an upper limit exists. 



Channel 



Events 



Cross-section [pb] 



Signal 



+ D-a+ 



88 450 
98 450 



10.4 ± 3.5 
5.8 ±3.2 



Background B {) d -^D+n 
B°^D-n+ 
B°^D*-n + 

B^D-at 
B°^D*rai 



43 000 
41000 
40500 
50000 
50000 
100000 



0.2 ±0.1 
6.2 ± 1.1 
9.1 ±2.8 
< 8.9 *) 
3.7 ±2.1 
12.1 ±2.7 



Trigger B l s > Djaf (ju4) 

bb -► ii6X 
bb -► liAX 
cc — > \lAX 
minimum bias 



50000 
242 150 
98 450 
44 750 
2 623 060 



13.6 ±7.6 
(6.14 ±0.02) -10 6 
(19.08 ±0.30) TO 6 
(26.28 ±0.09) TO 6 
70 -10 9 



background from pileup affects the performance of the selection algorithm. Therefore, trigger menus 
corresponding to different LHC luminosities are discussed in Sections 3.4 to 3.6. 

The trigger efficiencies are presented for the B® — ► D~a\ channel. Results for the B® — >• Dj7l + 
channel are expected to be similar within a few percent (see Section 3.2). 

3.1 LVL1 Trigger Selection 

The ATLAS hardware allows three LVL1 low-pr muon trigger thresholds to be defined at once, which 
can only be adjusted between runs by reconfiguring the lookup tables implemented in the muon trigger 
firmware. In order to study more than three thresholds we investigated the two available, pre-defined 
menus (named A and B) with respect to the low-pr muon trigger thresholds 1 . 

The three implemented low-pr thresholds for trigger menu A are: 0 GeV 2 (named MU00), 5 GeV 
(MU05) and 6 GeV (MU06). For trigger menu B, the three low-p T thresholds are 6 GeV (MU06), 8 GeV 
(MU08), and 10 GeV (MU10). Figure 1 shows the efficiencies of the low-pr LVL1 muon trigger signatures 
as a function of the true pj of the muon with the highest pj in the event. 

The LVL1 trigger efficiency depends strongly on the threshold chosen for the transverse momentum 
of the muon as shown in Table 2. Note that there is a discrepancy between the MU06 efficiencies from both 
trigger menus, which will be taken as a systematic uncertainty of the current implementation. Although 
these dedicated trigger studies have been performed with the B Q S —> D~a\ sample, the LVL1 efficiencies 
for B° s — ► D~n + {jx6) have been checked and agree well with those in Table 2. 

The input to the LVL1 calorimeter trigger is a set of ~7 200 trigger towers with granularity A(j) x 
At] « 0.1 x 0.1 formed by the analogue summation of calorimeter cells. There are separate sets of 
trigger towers for the EM and hadronic calorimeters. The LVL1 jet algorithm employed here uses a 

'All presented trigger thresholds are meant to be inclusive, i.e. to include all events fulfilling a trigger signature with a pj 
threshold equal to or higher than the indicated one. 

2 This requires a coincidence between the muon chambers without an actual threshold applied. Due to the detector geometry 
this corresponds to an effective transverse momentum threshold of about 4 GeV. 
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°0 5 10 15 20 25 30 °0 5 10 15 20 25 30 

P T [GeV] p T [GeV] 



(a) Thresholds for menu A [9] (b) Thresholds for menu B 

Figure 1 : Muon trigger efficiency as a function of the true pr of the muon with the highest pr in the 
event for the B® —>■ D~a\(ix4) for (a) trigger menu A and (b) trigger menu B. 

Table 2: LVL1 muon trigger efficiencies for the signal datasets B° s —> D~a\(ll4) and B° s —> D~a\{jx€) 
and the exclusive background samples. The first three lines refer to trigger menu A [9], while the last 
three lines refer to trigger menu B. For bb — > /J.4X, bb — ► jl6X and cc —> /J.4X, the Monte Carlo data 
samples are only available using trigger menu A. 



Menu 


Threshold 


Efficiency [%] 








B'> -> D-a{ 


bb plAX 


bb -> pL6X 


cc — > ]i\X 






0*4) 


0*6) 










MUOO 


75.65 ±0.19 


86.77 ±0.15 


71.74±0.14 


86.60 ±0.07 


70.42 ±0.22 


A 


MU05 


68.41 ±0.21 


82.60 ±0.17 


63.51 ±0.15 


81.91 ±0.08 


62.05 ±0.23 




MU06 


58.93 ±0.22 


8 1.90 ±0.17 


52.28±0.16 


8 1.00 ±0.08 


50.44 ±0.24 




MU06 


61.15±0.22 


83.83±0.16 








B 


MU08 


44.78 ±0.22 


77.64 ±0.19 










MU10 


34.89±0.21 


65.47 ±0.21 









cluster of x At] of approximately 0.4 x 0.4 (corresponding to 4x4 trigger towers). The projections of 
the vectors of the energy depositions onto the plane perpendicular to the beam axis (transverse energy, 
Ej) are summed over both the electromagnetic and the hadronic layers. The jet algorithm moves the 
cluster template in steps of 0.2 across the (j) x r\ plane. An Rol is produced if the 4 x 4 cluster is a local 
Ej maximum (as defined in [10]) and the cluster Ej sum is greater than the required threshold. The 
jet Rol is usable if the average number of Rols per event (Rol multiplicity, see Fig. 2 and Table 3) is 
small, ideally about 1-2. Clearly, a compromise is required as an increased threshold will reduce the 
multiplicity, but will also give a reduced efficiency for finding the B jet in an event. 

For a transverse energy threshold of 4 GeV, which is implemented to initiate the LVL2 Df trigger in 
trigger menus A and B, the jet trigger has an acceptance of (98.36 ± 0.06) % based on all events in the 
5° Djaf sample. 
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Figure 2: Rol multiplicity distributions for the background samples (a) for bb — > /J.4X, (b) for bb — > \ldX 
and (c) for cc —> jlAX as a function of the jet Rol energy threshold [9]. Only Rols with X] < 2.4 have 
been taken into account. This corresponds to the requirement that the Rol is to be contained within the 
solid angle covered by the Inner Detector. 



Table 3: Mean and root mean square of the Rol multiplicity distributions (Figure 2) for the background 
samples as a function of the jet Rol transverse energy (Ej) threshold [9]. Only Rols with Y\ < 2.4 have 
been taken into account. A strong anticorrelation between the Ej threshold and the mean Rol multiplicity 
is observed. 



Threshold 


bb^r 


IX4X 


bb^r 


H6X 


cc — > 


IX4X 


[GeV] 


Mean 


RMS 


Mean 


RMS 


mean 


RMS 


4 


2.847 


1.746 


2.883 


1.754 


3.235 


1.759 


5 


1.301 


1.244 


1.441 


1.295 


1.643 


1.300 


6 


0.703 


0.952 


0.881 


1.046 


0.998 


1.048 


7 


0.454 


0.786 


0.634 


0.911 


0.703 


0.900 
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3.2 LVL2 Trigger Selection 

The first step of LVL2 is to confirm the LVL1 muon trigger decision using more precise muon momentum 
measurement. Secondly, information from Inner Detector and muon chambers are combined to give a 
further improvement in the momentum measurement. 

As described above, the LVL2 tracking can be run in either FullScan or Rol-guided modes. In both 
cases, the same algorithm (named DsPhiPi) is used to combine the reconstructed tracks and search first 
for a <j> and then for a Df. In the Rol-guided approach tracks are reconstructed in a region A0 x At] = 
1.5 x 1.5 around all jet Rols with Ej above a certain programmable threshold [11]. A pj cut of 1.4 GeV 
is applied to all reconstructed tracks. 

Opposite sign track pairs are considered as a 0 candidate if they pass the following cuts: |Az| < 3 mm, 
where z is the distance along the beam line of the track's point of closest approach to the centre of the 
detector, |A0| < 0.2 and |Atj| < 0.2. 

The tracks are combined using a K mass hypothesis and a cut around the 0 mass m^PDG) = 
1019.46 MeV [8] is applied. Track pairs passing the cut are then combined with all other tracks as- 
suming a % mass for the third track. An event is selected if the mass of the track triplet is close to the 
Df mass m Dj (PDG) = 1968.2 MeV. The mass cuts used are 1005 MeV < m KK < 1035 MeV for the <j) 
candidates and 1908 MeV < m KKn < 2028 MeV for the D s candidates. 

The LVL2 track fit masses are shown in Figure 3 and Table 4 for the Rol-guided approach and 
FullScan. The standard deviations obtained from the Gaussian fits show that the mass cuts used corre- 
spond to 3.0 standard deviations for the (j) mass distribution and 2.8 standard deviations for the Df mass 
distribution [9]. The results for the Rol-based approach and those for FullScan agree well. 




Figure 3: LVL2 track fit mass distributions of (a) 0 and (b) Df candidates (corresponding to a 0 or Df 
particle from the signal decay in the Monte Carlo truth information) for the FullScan- and Rol-based 
LVL2 trigger signatures from 5° — > D~af events fulfilling the respective LVL2 Df trigger signature and 
MU06 [9]. 

The acceptances of possible trigger strategies up to LVL2 are given in Table 5 for the signal samples 
and in Table 6 for the background datasets. The LVL2 trigger rates for the B® —>■ DjK + channel are 
expected to be lower by a few percent since the average pj of the B*' candidates and consequently the 
average pj of the D s candidates is smaller for the B® — ► D~n + channel than for the B® — > D~af channel 
due to different track selections (see Fig. 5 and Section 5.1). 
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Table 4: LVL2 track fit masses for the FullScan- and Rol-based LVL2 trigger signatures (only candidates 
corresponding to a 0 or Df particle from the signal decay in the Monte Carlo truth information) from 
B® — >• D~a~l events fulfilling the respective LVL2 Df trigger and MU06. The table shows the results of 
Gaussian fits within the trigger mass windows to the mass distributions from Fig. 3. The results for both 
trigger strategies agree within statistical errors. 





FullScan-based 
LVL2 trigger [9] 


Rol-based LVL2 
trigger 


m(0): mean [MeV ] 
m(0): std. dev. [MeV ] 


1019.55 ±0.05 
5.07 ±0.06 


1019.52 ±0.05 
5.04 ±0.05 


m(Df ): mean [MeV ] 
m(Df): std. dev. [MeV ] 


1966.9±0.3 
21.7±0.3 


1967.0±0.3 
21.5±0.3 



Table 5: Acceptances of LVL2 (Rol and FullScan, FS) for the B° — > D s af sample for trigger menus A 
andB. 



Menu A 


Menu B 


Trigger 


Passes (in %) 


Passes (in %) 


Trigger 


Passes (in %) 


Passes (in %) 


scenario 


(M6) 


(M) 


scenario 


(M6) 


(M) 


Events 


50000 


50000 




50000 


50000 


L2_mu0 


85.19±0.16 


72.05±0.20 


L2_mu6 


77.13±0.19 


35.03±0.21 


L2_mu5 


79.65±0.18 


49.39±0.22 


L2_mu8 


45.54±0.22 


18.96±0.18 


L2_mu6 


75.66±0.19 


34.41±0.21 


L2_mul0 


26.04±0.20 


10.91±0.14 


FS & L2jiiu0 


32.98±0.21 


22.93±0.19 


FS & L2jnu6 


29.99±0.21 


12.71±0.15 


FS & L2 jiu5 


30.79±0.21 


16.75±0.17 


FS & L2 jnu8 


19.14±0.18 


10.91±0.14 


FS & L2jnu6 


29.38±0.20 


12.49±0.15 


FS & L2 jnulO 


11.83±0.15 


4.92±0.10 


Rol & L2 jmO 


28.74±0.20 


19.14±0.18 


Rol & L2jiu6 


26.19±0.20 


11.10±0.14 


Rol & L2 jnu5 


26.88±0.20 


14.26±0.16 


Rol & L2 jnu8 


17.09±0.17 


7.05±0.12 


Rol & L2_mu6 


25.68±0.20 


10.91±0.14 


Rol & L2 _mul0 


10.80±0.14 


4.56±0.09 



Table 6: Acceptances of LVL2 (Rol and FullScan) for the background samples bb —> jlAX, bb — > jl6X 
and cc — > /J.4X (trigger menu A). 



Trigger scenario 


passes (in %) 


passes (in %) 


passes (in %) 




(bb H4X) 


(bb p6X) 


(cc -► n4X) 


events 


98 450 


242 150 


44750 


FS & L2jnu0 


2.00±0.05 


3.73±0.05 


2.71±0.08 


FS & L2 jiu5 


1.47±0.04 


3.48±0.05 


1.93±0.06 


FS & L2jmi6 


1.11 ±0.03 


3.32±0.05 


1.43±0.05 


Rol & L2_mu0 


1.73±0.04 


3.34±0.05 


2.34±0.07 


Rol & L2 jiu5 


1.30±0.04 


3.12±0.05 


1.73±0.06 


Rol & L2 jiu6 


1.01±0.03 


2.99±0.05 


1.32±0.05 



1173 



S-Physics - Trigger and Analysis Strategies for 5° Oscillation . 



3.3 Event Filter Selection 

The muon confirmation at the Event Filter (EF) employs a muon track reconstruction algorithm using 
muon detector data only, similar to the algorithm used for offline reconstruction. 

The EF Df selection is very similar to that at LVL2. The track reconstruction can be performed in 
FullScan or Rol-guided modes, which share a common EF signature. The search for 0 and Df particles 
currently uses the same mass cuts as at LVL2, even though better mass resolutions are expected for the 
EF than for LVL2. In the future the mass cuts might be tightened and additional selection cuts will be 
added as discussed in section 3.5. EF output rates, which are only available for the minimum bias sample, 
are discussed in the following subsections. 

3.4 Trigger Strategies for Early Running 

At the lowest luminosities (10 31 cm _2 s _1 and 10 32 cm _2 s _1 ), the trigger selection needs to be as efficient 
as possible, which means running a loose trigger. To estimate rates and perform timing studies a trigger 
menu with a different set of muon thresholds [12] is applied to the minimum bias sample. Table 7 
shows the expected trigger rates for muons at LVL1 and after confirmation at LVL2. The output rates 
of the DsPhiPi trigger at LVL2 and the EF are given in Table 8 for 10 31 cm~ 2 s _1 . For a luminosity 
of 10 33 cm _2 s _1 and higher, the rates are expected to increase because of event pile-up and cavern 
background events. 

Table 7: Muon rates based on 2.6 • 10 6 minimum bias events. Rates set in italics are based on an inter- 
polation using an exponential approximation of the rate dependence on the muon threshold concerned. 
Effects caused by event pile-up and cavern-background events are not included. 



Luminosity 


LVL1 


[cm-V 1 ] 


L1_MU00 




L1_MU06 




L1JVIU10 


10 31 


1.3 ±0.02 kHz 




480 ± 10 Hz 




266 ± 8 Hz 


10 32 


13.0 ±0.2 kHz 




4.8 ±0.1 kHz 




2.7 ±0.1 kHz 


1 • 10 33 


130 ± 2 kHz 




48 ± 1 kHz 




27 ± 1 kHz 


2-10 33 


260 ± 4 kHz 




96 ± 2 kHz 




54 ± 2 kHz 




LVL2 




L2_mu0 


L2_mu5 


L2_mu6 


L2_mu8 


L2_mul0 


10 31 


450 ± 11 Hz 


213 Hz 


120 ± 6 Hz 


50 Hz 


25 ± 3 Hz 


10 32 


4.5 ±0.1 kHz 


2.1 kHz 


1.20 ±0.06 kHz 


500 Hz 


250 ± 30 Hz 


1 • 10 33 


45.0 ± 1.1 kHz 


21 kHz 


12.0 ± 0.6 kHz 


5 kHz 


2.5 ± 0.3 kHz 


2-10 33 


90.0 ± 2.2 kHz 


42 kHz 


24.0 ± 1.2 kHz 


70 kHz 


5.0 ± 0.6 kHz 



In addition to the overall allowed output rate, the time constraints of the HLT system are limiting the 
DsPhiPi trigger. The maximum allowed average computing times are 40 ms at LVL2 and 1 s at the EF. 
Most of the time is taken in the tracking algorithms as can be seen in Table 9 which shows the average 
CPU time used by the tracking and hypothesis algorithms at LVL2 and EF. 

Table 10 summarises the LVL2 efficiencies and the expected numbers for B® — »■ D~af(fl4) events 
before and after the application of event selection cuts in the analysis as well as the estimated LVL2 and 
EF trigger rates for different luminosity scenarios and different trigger choices. The L2 and EF output 
rates shown in this table are deduced from the rate information given in Tables 7 and 8. The LVL2 
muon trigger efficiency estimates presented in Table 10 are based on the LVL2 results obtained with the 
B® —> D~a\(flA) sample shown in Table 5. 
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Table 8: Output rates for the DsPhiPi trigger based on 2.6 • 10 6 minimum bias events at 10 31 cm 2 s l . 
Rates set in italics are based on an interpolation using the results from Table 7. 



Muon input trigger 


LVL2 


EF 


Rol 


FullScan 


Rol 


FullScan 


L1_MU00 


23 ± 3 Hz 


31 ± 3 Hz 


14 ± 2 Hz 


19 ± 2 Hz 


L1_MU06 


11 ±2 Hz 


13 ± 2 Hz 


6.4 ± 1.3 Hz 


7.5 ± 1.4 Hz 


L2_mu0 


15 ± 2 Hz 


18 ± 2 Hz 


6.1 ± 1.3 Hz 


6.9 ± 1.4 Hz 


L2_mu5 


9.5 Hz 


9. 9 Hz 


4.5 Hz 


4.5 Hz 


L2_mu6 


6.7 ± 1.3 Hz 


6.4 ± 1.3 Hz 


3.5 ± 1.0 Hz 


3.2 ± 0.9 Hz 


L2_mu8 


J.9Hz 


3.3 Hz 


2.1 Hz 


i.7Hz 


L2_mul0 


2.5 Hz 


2.0 Hz 


1.2 Hz 


0.9 Hz 



Table 9: Average CPU times on an HLT computing node (Dual core Intel(R) Xeon(R) CPU 5160 @ 
3.00 GHz) using 900 bb -> /x6X events. 







Algorithm 


Time/Rol 


Time/event 


LVL2 


tracking 


Idscan_RoI 


15 ms 


23 ms 






Idscan_FullScan 




91 ms 




hypothesis 


DsPhiPi 




< 1 ms 


Event Filter 


tracking 


Rol 


130 ms 


208 ms 






FullScan 




470 ms 




hypothesis 


DsPhiPi 




< 1 ms 



At 10 31 cm~ 2 s _1 , once the improvements discussed in Section 3.5 have been applied to the EF 
algorithms, it should be possible to run the FullScan-based trigger at the LVL1 4 GeV muon rate and 
to remain within the constraints given by available trigger resources. As the luminosity is increased to 
10 32 cm~ 2 s _1 , we will need to raise the muon threshold for the FullScan-based trigger or to move to a 
Rol-based trigger. However, the muon threshold should be kept as low as possible in order to achieve 
the highest possible trigger efficiency and to allow for as many —>■ Dja^ events as possible to pass. 
Compared to earlier publications like [13] the bb cross-section as shown in [7] is at the upper limit of 
what is expected and therefore the muon rates are likely to be overestimated. 

3.5 Trigger Strategies for Running at 10 33 cm 2 s 1 

At 10 33 cm _2 s _1 the trigger needs to remain as efficient as possible while operating within the constraints 
of the trigger system's resources. The EF output rate is expected to be about 10-20 Hz for S-physics. 

The muon rates expected at LVL1 and LVL2 for different thresholds and luminosities are included 
in Table 7. The LVL2 muon rates are the input rates for the LVL2 tracking algorithms. Using the 
information on jet Rol multiplicities from Figure 2 and Table 3, the computing times from Table 9 and 
the muon rates in Table 7, trigger strategies are determined for different luminosities. 

For a luminosity of 10 33 cm~ 2 s _1 , a LVL1 trigger muon in combination with the Rol-based D s trigger 
will be used. It is planned to use thresholds of 6 GeV for the trigger muon and 5 GeV for the jet Rol 
trigger. It is clear from Table 8 that in order to run such a trigger the LVL2 and EF selections will have 
to be tightened. This may be achieved by introducing vertex fitting and by reconstructing the B ( s ' at the 
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Table 10: LVL2 and EF output rates for minimum bias events, LVL2 trigger efficiencies and numbers 
of expected B° — ► D~a\(}lA) events without or with selection cuts. Rates set in italics are estimates 
based on the interpolated and extrapolated rates given in Table 7. LVL2 output rates marked by ^ are 
downscaled by a factor two, the estimated rate reduction for a D s vertex requirement at LVL2. For EF 
output rates marked by *, an estimated rate reduction factor of 60 accounting for EF B ( s ' reconstruction is 
applied. (See section 3.5 for details.) The results in the columns "LVL2 eff." and /vf y£ ^ °^ tput (without 
and with selection cuts applied) are based on the B® — > D~a\ {jJ.4) Monte Carlo data sample. Num- 
bers marked by # are corrected for the estimated efficiency loss by a D s vertex requirement at LVL2. 
The integrated luminosities and expected event numbers correspond to one year running at the given 
instantaneous luminosity (10 7 seconds). 



J? 


fjfdt 




LVL2 eff. 




B^D;a+ 


LVL2 rate 


EF rate 




[pb]- 1 




[%] 




incl. sel. cuts 


[Hz] 


[Hz] 


10 J1 




L2muOFS 


22.93 ±0.19 


308 




18 ± 2 


6.9 ± 1.4 






L2mu5FS 


16.75±0.17 




47 




4.5 






L2mu6FS 


12.49±0.15 


168 


35 


6.4 ± 1.3 


3.2 ±0.9 


10^ 


1000 


L2mu6FS 


12.49±0.15 


1678 


351 


64 ± 13 


32 ±9 






L2mu5RoI 


14.26 ±0.16 


1916 


267 


95 


45 






L2mu6RoI 


10.91 ±0.14 


1466 


322 


67 ± 13 


35 ± 10 




10000 


L2mu5RoI 


12.01 ±0.13* 


16 134* 


3 582* 


475* 


7.5* 






L2mu6RoI 


9.19±0.12# 


12 344* 


2709* 


335±93 + 


5.8±2.0* 






L2mu8RoI 


5.94 ±0.10* 


7 976* 


1757* 


796"'" 


3.5* 






L2mulORoI 


3.84 ±0.08* 


5159* 


1 132* 


126* 


2.0* 


2-10 JJ 


20000 


L2mu6RoI 


9.19±0.12* 


24687* 


5 418* 


670 ±187"'" 


11.7 ±4.0* 






L2mu8RoI 


5.94 ±0.10* 


15 953* 


3 517* 


392* 


7.1* 






L2mulORoI 


3.84 ±0.08* 


10318* 


2 264* 


252* 


4.1* 



EF level. 

Preliminary studies at LVL2 show that a requirement for a vertex fit to the 3 tracks of the D s candidate 
can achieve a factor 2 rate reduction for a drop in efficiency from 38% to 32%. This estimate is applied 
to cells marked by # and * in Table 10. Also, it might be an option to further reduce the rate by tightening 
the acceptance windows for and mo, on LVL2, but the resulting rate reduction and the expected signal 
efficiency loss will need to be studied. 

A considerable rate reduction at the EF level may be achieved by reconstructing the B ( s '. A preliminary 
study using offline selection cuts (see Section 5.1), which have been relaxed to simulate wider mass 
window and vertexing requirements for the reconstructed particles, has been performed with the bb — > 
jlAX and cc — »■ jiAX samples. The resulting rate reduction factor, estimated to be approximately 60, is 
applied to cells marked by * in Table 10. According to this study, the overall trigger and reconstruction 
efficiency for the B Q S — >• Dja\ signal events will be reduced by about 55%. Although these estimates will 
need to be confirmed by an implementation of a simplified B° s reconstruction at the EF level, reasonable 
EF output rates are expected to be achievable. 

The numbers of expected B° s — > D~a\(lxA) events for 10 fb _1 of data for a luminosity of 10 33 cm _2 s _1 
are given in Table 10. It will be necessary to establish a muon trigger threshold as low as possible to 
maximise the signal event yield. 

3.6 Trigger Strategies for Higher Luminosities 

As luminosity increases, it is necessary to stay within the limits of the LVL2 trigger processing times and 
allowable output rates. As Table 10 shows, this will require increasing the muon threshold to 8 GeV or 
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10 GeV and to add additional trigger elements in the EF as discussed at the end of Section 3.5. Another 
option, however a less efficient one, is to prescale the 6 GeV rate before running the track reconstruction. 

4 Flavour Tagging 

The measurement of B ( s ' oscillations needs the knowledge of the flavour of the B® meson at production 
time and at decay time in order to classify events as mixed or not mixed. The tagging algorithm tries 
to determine the flavour at production time, whereas the decay particles of the signal B° determine the 
flavour at decay time. In this analysis soft muon tagging (see B-physics chapter of [4]) is used and 
the general application on the simulated data samples is shown in Section 4.2 without applying trigger 
conditions or any selection cut. Tagging results specific for the hadronic channels under investigation 
including trigger and selection cuts for B° candidates are given in Section 5.3. 

4.1 Soft Muon Tagging 

In proton-proton collisions b quarks are produced in pairs leaving the signal B ( s ' and the opposite side b 
hadron with the opposite flavour. In the case of a semilep tonic decay as shown in Fig. 4, the charge of 
the produced lepton is correlated with the flavour of the signal B° s meson at production time. The charge 
of the muon with the highest reconstructed pj is used for the determination of the flavour. Because of 
the muon trigger, in hadronic B® decay channels soft muon tagging has a high tagging efficiency E tag = 
N tag /N a u limited by the muon reconstruction efficiency. Details on the aspects of muon reconstruction 
and identification in ATLAS can be found in [14]. 




opposite side jet signal B jet 



Figure 4: In the case of a signal B° s , the associated opposite side b hadron decaying semileptonically 
produces a negatively charged lepton. 

The dilution factor is defined as D tag = jf^f- where N c is the number of events correctly tagged 
and N w is the number of events with a wrong tag. These wrong tags arise from mixing of the tagging 
b hadron, muons from decays b — > c —>■ jx, additional c pairs and various particles decaying into muons. 
The wrong tag fraction CO = N W /{N C +N W ) is the ratio of wrongly tagged events to all tagged events. As 
the generation of the simulated data does not include B meson oscillations, mixing of the tagging side 
hadron is introduced artificially using the integrated mixing probabilities % d and % s [8]: 

r(s° -> b°^ u+x) r(B° -► a? _^ u+x) 

Xd = r(fl°-V*) = °- 188±a003 Xs = rwZpJx) = °- 49924±0 - 00003 

4.2 Application to Signal Samples 

In Fig. 5 the transverse momentum of the signal B Q S mesons is compared for the two B ( s ' decay channels, 
the vertical lines show the mean values of the two distributions. This difference arises from the different 
kinematical configuration due to the condition on all charged final state particles p T > 0.5 GeV at Monte 
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Carlo generation. As B° — ► D~a\ has a total number of six final state particles, the mean transverse 
momentum of the signal B Q S is higher compared to B*' —> D~ n + with a total number of four final state 
particles. The difference in the B ( s ' transverse momentum spectrum is also expected at offline reconstruc- 
tion level due to the different pj selection cuts for the % and the a\ combinations (see Section 5.1). This 
leads in the case of the B® — >• Dj7l + sample to an overall wrong tag fraction of ft) = 20.29 ±0.14 % 
and in the case of the 5° — > Dja\ channel to a wrong tag fraction ft) = 21.05 ± 0.1 1 %, which is higher 
compared to the B® — > D~K + channel (see all events in Table 11 in Section 5.3). 



Figure 5: Normalised distributions of signal B { ] transverse momentum p? of the two channels B° s —> 
D~iz + and B° s — > D~a\. The vertical lines represent the mean values of the distributions, in the case of 
B° s -► D~n + the mean is 14.80±0.03 GeV, in the case of B° s -► Dja+ 17.68 ±0.03 GeV. The observed 
difference is due to the different particle selections. 

In Fig. 6 the wrong tag fractions and the sources of these wrong tags are compared for both B ( s ' decays 
channels. The wrong tag fraction is shown as a function of the tagging muon's transverse momentum 
Pt(Li), in Fig. 6(a) for B° D~n + and in Fig. 6(b) for B® D'af. In the regime pr(n) < 11 GeV the 
B® —> D~d\ wrong tag fraction is higher. As mentioned above this difference arises from the different 
track selections of the two decay channels. In both channels the two main sources of wrong tags are 
mixing of neutral B mesons on the tagging side and muons from cascade decays b —>■ c —> jJ,. As the 
overall wrong tag fraction is decreasing with the muon pj, also the part with a wrong tag due to the 
cascade b —>■ c —>■ pi is decreasing at the same rate. A further source of mistags are additional cc-pairs. 
The wrong tag fraction of this part stays about constant with increasing pr(^i)- A small part of the wrong 
tag fraction originates from J/x/f, 0, p, rj or x particles decaying into muons. Additional sources like 
muons from kaons and pions or hadrons misidentified as muons can be neglected [12]. 

A bb pair produced in proton proton collisions has a transverse momentum equal to zero at first order. 
Going through fragmentation and hadronisation, the p T of the signal B° meson and the opposite side b 
hadron are still correlated, and therefore a muon coming from a semileptonic decay of the b hadron also 
is correlated with the signal B ( s '. Hence a muon from a cascade b — ► c — >• jX is more likely to pass the 
LVL1 muon trigger when B Q S meson has a higher pj, leading to the increase in wrong tag fraction with 
Pt(B®). This behaviour is shown in the Fig. 6(c) and 6(d). 




P T (B S ) [GeV] 
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(c) fl ( > -^D-K+ (d) Bj -» D7 a+ 



Figure 6: Wrong tag fraction as functions of tagging muons transverse momentum pr(n) in (a) and (b) 
and wrong tag fractions as functions of signal Monte Carlos B ( s ' transverse momentum Pt(B®) in (c) and 
(d). The wrong tag fraction is shown with mixing of the tagging side b hadron and without mixing. 
Without mixing, the different sources of wrong tags are shown. The main contribution is coming from 
b — >• c — >• n followed by additional c pairs. Additional sources shown are muons coming from J/yr, 0, 
p, 17 and z. 



1179 



S-Physics - Trigger and Analysis Strategies for 5° Oscillation . 



4.3 Systematic Uncertainties of Soft Muon Tagging 

The calibration of the soft muon tagger will be done with events from the exclusive decay channel 
B + — > // }i~)K + . The high branching ratio and the simple event topology allows the measurement 
of this channel during the initial luminosity phase at the LHC. Without mixing on the signal side, these 
events can be used to estimate the systematic uncertainties of soft muon tagging. 

For an integrated luminosity of 1 fb -1 160000 events of the decay channel B + — > J }i~)K + are 
expected [15] at ATLAS. About 13.5 % events are estimated to have an additional third muon for flavour 
tagging. Requiring a minimum transverse momentum of 6 GeV for this additional muon, the number of 
events will be reduced by a factor of three. Assuming that the wrong tag fraction in this channel behaves 
like in the hadronic B° s decay channels, the expected statistical error of the wrong tag fraction would be 
of the order of 0.1 % for 1 fb -1 integrated luminosity. 

5 Event Selection 

For the following analysis selecting B s candidates the default trigger choices are to require MU06 and 
JT04 trigger elements at LVL1 and to perform a search for the D s — >• §{K + K~)k decay within a jet Rol 
at LVL2. Resulting event numbers and plots are given for 10 fb -1 unless indicated otherwise. 

5.1 Signal Event Reconstruction 

For the reconstruction of the B s vertex only tracks with a pseudo-rapidity \r\ \ < 2.5 are used proceeding 
via the following steps. The 0 decay vertex is first reconstructed by considering all pairs of oppositely- 
charged tracks with p? > 1.5 GeV for both tracks. Kinematic cuts on the angles between the two tracks, 
AtpKK < 10° and A6kk < 10°, are imposed, where <p denotes the azimuthal angle and 6 the polar angle. 
The two-track vertex is then fitted assigning the kaon mass to both tracks. Combinations passing a fit- 
probability cut [16] of 1% (~ # 2 /dof = 7/1) with the invariant mass within three standard deviations of 
the nominal (j) mass are selected as (j) candidates. The plots in Fig. 7 show the invariant mass distribution 
for all m KK combinations overlaid with the 0 candidates matching a generated 0 from the signal decay 
(grey filled area) fitted with a single Gaussian function. For the B® — > Djn + channel the mass resolution 
is Otf, = (4.30 ±0.03) MeV and for the B° -► Dja+ channel 0$ = (4.28 ±0.03) MeV This mass window 
for accepted (j) candidates is shown by the vertical lines. No trigger selections are applied for the mass 
plots shown in Fig. 7 to Fig. 9. 

From the remaining tracks, a third track with pj > 1.5 GeV is added to all accepted 0 candidates. 
The pion mass is assigned to the third track and a three-track vertex is fitted. Three-track vertex candi- 
dates which have a fit probability greater than 1% (~ # 2 /dof = 12/3) and an invariant mass within three 
standard deviations of the nominal D s mass are selected as D s candidates. The plots in Fig. 8 show 
the invariant mass distribution for all rnxxn combinations overlaid with the D s candidates matching a 
generated D s from the signal decay (grey filled area) fitted with a single Gaussian function. For the 
B® —> Djn + channel the mass resolution is o Ds = 17.81 ±0.13 MeV and for the B° s —> Dja^ channel 
Oq, = 17.92 ±0.13 MeV. The 3<7d s mass range for accepted D s candidates is shown by the vertical lines. 

For the B® — »• D~a^ channel a search is performed for af candidates using three-particle combina- 
tions of charged tracks for events with a reconstructed D s meson. In a first step p° mesons are recon- 
structed from all combinations of two tracks with opposite charges and with pj > 0.5 GeV, each particle 
in the combination being assigned a pion mass. A kinematic cut AR^ = a/A^|^+A?]|^ < 0.650 is 
used to reduce the combinatorial background. The two selected tracks are then fitted as originating from 
the same vertex; from the combinations passing a fit probability cut of 1% (~ # 2 /dof = 7/1), those with 
an invariant mass within 400 MeV of the nominal p° mass are selected as p° candidates. Next a third 
track with pj > 0.5 GeV from the remaining charged tracks is added to the p° candidate, assuming the 



1180 



S-Physics - Trigger and Analysis Strategies for 5° Oscillation . 



pion hypothesis for the extra track. A kinematic cut AR pjt < 0.585 is applied. The three tracks are fitted 
to a common vertex without any mass constraints. Combinations with a fit probability greater than 1% 
(~ # 2 /dof = 12/3) and with an invariant mass within 325 MeV of the nominal a\ mass are selected as af 
candidates. 

The B° candidates are reconstructed combining the Df candidates with af candidates with opposite 
charge and different tracks. A six-track vertex fit is performed with mass constraints for the tracks from 
0 and D s ; due to the large a\ natural width the three tracks from the a\ are not constrained to the a\ mass. 
The total momentum of the B ( ] vertex is required to point to the primary vertex and the momentum of the 
D s vertex to the B® vertex. Only six-track combinations with a vertex fit probability greater than 1% (~ 
X 2 /dof = 27/12) are considered as B® candidates. 

For the B® — > Djn + channel for each reconstructed D s meson a fourth track from the remaining 
tracks in the event is added. This track is required to have opposite charge with respect to the pion track 
from the D s and pj > 1 GeV. The four-track decay vertex is fitted including 0 and D s mass constraints, 
and requiring that the total momentum of the B Q S vertex points to the primary vertex and the momentum 
of D s vertex points to the B° s vertex. In order to be selected as B ( s ' candidates, the four- track combinations 
are required to have a vertex fit probability greater than 1% (~ # 2 /dof = 20/8). 

For both channels, B° — >• D~n + and — >• Djaf, the signed separation between the reconstructed 
B® vertex and the primary vertex is required to be positive (the momentum should not point backward to 
the parent vertex). To improve the purity of the sample, further cuts are imposed: the proper decay time 
of the B ( s ' has to be greater than 0.4 ps, the B () s impact parameter (shortest distance of the reconstructed 
B° s trajectory from the primary vertex in the transverse plane to the reconstructed B® decay vertex) is 
required to be smaller than 55 jJ,m and pj of the B ( ] must be larger than 10 GeV. The plot in Fig. 9(a) 
shows the invariant mass distribution for all mKKnn B s candidates matching a generated B s from the signal 
decay for the — ► D~ n + channel fitted with a single Gaussian function and giving a mass resolution of 
Ob s = 52. 80 ±0.68 MeV. For the B® — ► Djaf channel Fig. 9(b) shows the invariant mass distribution for 
all m K Kn%nn B s candidates matching a generated B s and giving a mass resolution of o Bs = 40.82 ±0.53 
MeV. The difference in the B° s mass resolutions is caused by the pj spectrum of the % in the B ( s ' — ► Djn + 
decay being harder than the pj spectra of the three pions in the B Q S — > D~a[ decay, as the pion momentum 
resolution is worse for higher p T . A final mass cut of two standard deviations on the candidates is 
applied for further analysis (see vertical lines in Fig. 9). For some events more than one candidate 
is reconstructed and in that case the candidate with the lowest # 2 /dof from the vertex fit is selected for 
further analysis. 

No relevant effects induced by the trigger selections on fit variables of the mass plots or the kinematic 
distributions of the B s candidates are found (discussed in Section 6.3). All differences are within the fit 
errors. 

5.2 Background Channels 

Two main sources of background are considered: irreducible background coming from a decay channel 
that closely mimics the B® signal and combinatorial background coming from random combination of 
tracks. 

5.2.1 Exclusive Background Channels 

The exclusive samples listed in Table 1 are used as irreducible background sources. See Table 12 in Sec- 
tion 6 for the numbers of reconstructed candidates after applying the same selection cuts as for the signal 
samples. The histograms in Fig. 10 show the invariant mass spectrum of reconstructed B ( s ' candidates for 
the B ( s ' — >• D~n + and B ( s ' — >• D~af channel respectively (no trigger selection cut applied). The different 
contributions are scaled with the cross-section given in Table 1 . 
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980 1000 1020 1040 1060 980 1000 1020 1040 1060 
m KK [MeV] m KK [MeV] 



(a) B® -> D- 7Z+ (b) B° s -> Djd£ 

Figure 7: Reconstructed mass iukk for all combinations within the signal sample (black line) and KK 
candidates corresponding to a 0 particle from the signal decay in the Monte Carlo truth information (grey 
filled). The standard deviation obtained from a fit within two standard deviations of a Gaussian function 
(dashed) to the distribution defines the three standard deviation cut range (vertical dashed). No trigger 
conditions are applied. 




1800 1900 2000 2100 1800 1900 2000 2100 

m KKl[ [MeV] m KKlt [MeV] 



(a) B° -> D-K+ (b) B° -> D~a+ 

Figure 8: Reconstructed mass rnxKn for all combinations within the signal sample (black line) and KK% 
candidates corresponding to a D s particle from the signal decay in the Monte Carlo truth information 
(grey filled). The standard deviation obtained from a fit within two standard deviations of a Gaussian 
function (dashed) to the distribution defines the three standard deviation cut range (vertical dashed). No 
trigger conditions are applied. 
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(a) B° s -> D- K+ (b) B° s -> D7«| 

Figure 9: niKKnn ( a ) an d ™,KKmrnn (b) reconstructed mass and fit of a Gaussian function to the distribu- 
tion. Each KKkk (KKnnnn) candidate displayed corresponds to a B s particle in the Monte Carlo truth 
information. The two standard deviation cut range is shown by the vertical dashed lines. No trigger chain 
applied. 




Figure 10: mKKnn (a) and niKKnnnn (b) reconstructed mass for signal and background channels. In (b) 
the upper limit for the branching fraction of the channel Bj —> D s a\ is used. 
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In the case of B d —>■ D+a^, there is no measurement of the branching fraction available. The current 
upper limit is therefore used as a conservative estimate. The similarity to the B d — > D+n~ channel 
indicates that the B d —> D^a^ cross section could be in the same order as the one of the B d — > Dfn~ 
channel and therefore the B d — > D+a^ contribution in Fig. 10 (b) could be much smaller. 

The B s — > D*n and B s —>■ D*a\ channels are treated as a background source, since the momentum 
and hence the lifetime estimation for this decay is flawed due to the missing photon from the decay of 
theD*. 

5.2.2 Combinatorial Background 

The limited statistics of the combinatorial background samples (e.g. 242 150 events for bb — > /x6X) do not 
allow us to give a reasonable estimate for the signal to background ratio. This ratio as well as kinematic 
properties of the combinatorial background, like the shape of the proper time distribution will be studied 
once early data are available. 

5.3 Tagging Results 

Soft muon tagging is applied on all available simulated data of the two hadronic signal channels B Q S — > 
D~n + and B® — > D~a\ . Table 11 shows the number of events, the tagging efficiency and the wrong tag 
fractions. The tagging efficiency corresponds to the fraction of events where muon candidates have been 
successfully reconstructed. The events with at least one muon are separated into events with a good tag 
and events with a wrong tag resulting in a wrong tag fraction. 

Comparing the results for all simulated events between the two signal channels, the difference in 
the wrong tag fraction is of the order 1 % due to the different kinematical topology. As the Rol trigger 
applies a pj cut of 1.4 GeV on all reconstructed tracks, low pj B ( ] mesons are rejected leading to an 
increased wrong tag fraction for the triggered events (see Fig. 6(c) and 6(d)). After event reconstruction 
overall wrong tag fractions of (O = % for the channel B° s D~n + and m = 23.31±[Jjf % for 

the channel B® — > Dja\ are observed. Effects of mixing are included as described in Section 4.1. 

Table 1 1 : Tagging efficiencies and wrong tag fractions for the signal channels B° s — > D~ n + and B® — > 
D~a\ are shown for three different stages: all simulated events, all triggered events passing the LVL1 
muon trigger and the LVL2 Rol trigger, and finally the numbers for the reconstructed events. The errors 
are statistical only. 



Process 


Type of 
Events 


Number of 
Events 


Tagging 
Efficiency [%] 


Wrong Tag 
Fraction [%] 


B° s ^D-7l+ 


all events 


88450 


96.08 ±0.07 


20.29 ±0.14 




triggered 


21613 


98.77 ±0.07 


22.96 ±0.29 




reconstructed 


5687 


98.79+°^ 


22.30+°;! 


B® -► D~af 


all events 


98450 


95.93 ±0.06 


21.05 ±0.13 




triggered 


27118 


98.55 ±0.07 


23.91 ±0.26 




reconstructed 


5757 


98.47+°^ 


11 o 1+0.56 
Zi - il -0.55 
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6 Determination of Am s 

6.1 Methods for the Determination of Am s and its Measurement Limits 

For the determination of the B ( s ' oscillation frequency the maximum likelihood method is used. The 
likelihood Jzf is a function of the proper time t and the mixing state }i, parametrised by Am s and AF S , 
applied to five classes of events simultaneously: mixed and unmixed 5°, mixed and unmixed B d , and 
background with lifetime but no mixing. The B ( s ' and B d classes have characteristic wrong tag fractions 
(0, which are determined on event-by-event basis as described previously. By maximising the likelihood 
for a given event sample one can then extract the model parameters. 

For obtaining the 5 o discovery and 95% exclusion measurement limits on Am s the amplitude fit 
method is used because the maximum likelihood method was found to have some disadvantages in that 
case [5]. The estimation of the maximum value of Am s measurable with the ATLAS detector is using 
B ( s ' candidates from the B® — > Djn + and B® — »■ Dja\ hadronic channels. The numbers of reconstructed 
events after applying the trigger selection (L1JV1U06 and LVL2 Rol) and the B Q S selection cuts as well 
as the expected numbers for an integrated luminosity of 10 fb^are given in Table 12 for all signal and 
background channels. The effective cross-sections for the various processes can be found in Table 1. 
Significant background comes from the B d —>■ Djn + ja\ and B Q S —> D*~K + ja\ channels, and from the 
combinatorial background. Due to limited sample size the estimation of the combinatorial background 
is very approximate. 

The relative fractions of the signal and the background contributions will be determined by a fit of 
mass shape templates to the reconstructed B ( s ' mass distribution employing a wider mass window than 
used here for the final extraction of Am s , similar to the method used by CDF [1]. The mass shape tem- 
plates will be determined from Monte Carlo mass distributions of the individual channels. Uncertainties 
in the knowledge of the shapes will be taken into account as part of the systematical uncertainty. 

Table 12: Signal and background samples used for the study of B^-B 0 , oscillations and number of events 
as obtained from the analysis as well as expected numbers for an integrated luminosity of 10 fb _1 . 



Process 


Simulated 


Rec. 


Rec. events 




events 


events 


for lOftr 1 


B° s ^D~7t + 


88 450 


5 687 


6 657 


B° d ^Dfn~ 


43 000 


1814 


99 


B d ^D-n+ 


41000 


23 


35 


B° S ^D*-K + 


40500 


495 


1 116 


B u s -► D-a\ 


98 450 


5 757 


3 368 


B d ^Dja- 


50000 


1385 


<2454 


B° d ^D-a+ 


50000 


49 


36 


B° -► D*-a+ 


100000 


870 


1052 



6.2 Construction of the Likelihood Function 

The probability density to observe an initial B® meson (j = d, s) decaying at time to after its creation as 
a B°j meson is given by 

Pj (to, juo) = Fj ~ e- r ^» (cosh + Aiocos(Am^o)) (D 



1185 



S-Physics - Trigger and Analysis Strategies for 5° Oscillation . 



where AI^ = T J H - T J L , Fj = (F J H + r[) /2 and /jq = - 1. 

For the unmixed case (an initial B°- meson decaying as a B® meson at time to), the probability density 
is obtained by setting Hq = +1 in Eq. 1. Here the small effects of CP violation are neglected. Unlike 
AFj, which can be safely set to zero, the width difference AF S in the B®-B® system could be as much as 
20% of the total width [17]. 

However, the above probability is modified by experimental effects. The probability as a function of 
Ho and the reconstructed proper time t is obtained as the convolution of pj(t 0 , /Xo) with the proper time 
resolution Res ; -(7 \t 0 ): 

9j(t,tki) = Pj(tQ,to) Resy(? | to) dt 0 (2) 

with the normalisation factor 

N = C (So P 'jV A*o) Res ; I 0 dt ') dt ■ ( 3 ) 

Here f m i n = 0.4 ps is the cut on the B ( s ' reconstructed proper decay time. Plots in Fig. 11 show the 
proper time resolutions, which are parametrised with the sum of two Gaussian functions around the 
same mean value. The widths from the fit are 0\ = (68.4±3.3) fs for the core fraction of 53.2% and 
a 2 = (157.2±5.7) fs for the rest of the tail part of the distribution for the B° -> D~K + channel. The 
values for the B° s -> D~af channel are G\ = (72.5±4.3) fs for the core fraction of (58.0±6.8) % and 
a 2 = (144.7±7.3) fs for the tail part. 

Assuming a fraction (Oj of wrong tags occurring at production and/or decay, the probability becomes 

qj(t,n) = (1 - (Oj) qj (t,n) + (Ojqj(t, -n) (4) 

where /Xo has been replaced by }X in order to indicate that now we are talking about the experimental 
observation of same or opposite flavour tags. For each signal channel, the background is composed of 
oscillating B () d mesons, with probability given by Eq. 4, and of non-oscillating combinatorial background, 
with probability given by Eq. 5, which results from Eq. 1 and Eq. 4 by setting Am = 0 and Ar = 0 : 

p cb (^)= ^ e - r ^[l +i u(l-2 Wcb )] (5) 

For a fraction fkj of the j component (J = s, d, and combinatorial background cb) in the total sample 
of type k, one obtains the probability density function 

pdf k (t,n)= £ fkjqj(t,fl). (6) 

j=s,d,cb 

The index k = 1 denotes the B° Djn + channel and k = 2 the £ ( s ' D~a^ channel. The likelihood of 
the total event sample is written as 

J2f(Am„AT,) = nnpdf ik (/ i ,Ai i ) (7) 

k=i (=i 

where n£ v is the total number of events of type k, and N C h = 2. Each pdf^ is properly normalised to unity. 

Figure 12 shows how the experimental effects, as parametrised in Eqs. 2, 4, and 6, modify the 
distribution of the proper time t of a Monte Carlo data sample. 
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tree " ^ [PS] t rec - t 0 [ P S] 

(a) B° -> D~7t + (b) B° -> D-a+ 



Figure 11: The resolution o t of the proper time of simulated Z?° fitted with two Gaussian functions 
(dashed lines). Both Gaussian functions use a common mean value. 




*0 IP 8 ] W [PS] t rec [ P S] 



Figure 12: A sequence of plots showing how a true B® oscillation signal with Amf en = 17.77 ps _1 (a) 
is diluted first by the effect of a finite proper time resolution (b) and then by adding background events 
and including the effect of wrong tags (c). The plots contain samples of events equivalent to 10 fb -1 of 
integrated luminosity. They were generated using the Monte Carlo method described in Section 6.3. Only 
the case of mixed events is shown. For illustration a ^ 2 -fit of the function Cexp(— t/r)(l —Dcos(Am s t)) 
is overlaid to the solid histogram in (c), where D can be interpreted as the combined dilution factor: 
D 0. 1 here. Note that this is different from the unbinned maximum likelihood fit to the total event 
sample of mixed and unmixed events which is actually used to derive results in this study. The dashed 
histogram in (c) describes the contribution from all background sources. 
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6.3 Monte Carlo Data Sample 

For the amplitude fit method a simplified Monte Carlo method is applied to generate a B ( s ' sample using 
the following input parameters: for each signal channel k the number of reconstructed signal events 
for an integrated luminosity of 10 fb _1 and the number of background events from B d s decays is 

given in Table 12. For the combinatorial background the ratio N^/N^ is taken to be 1. The wrong 
tag fraction is assumed to be the same for both B ( s ' and B d mesons in a specific signal channel (a> s = 0)4), 
however, the values are slightly different for the two signal channels (see Table 11). 

A Monte Carlo sample with A^ sig = signal events oscillating with a given frequency Am s 

(e.g. Am s = 100 ps -1 , which is far off the expected value for Am s ), together with N B o = 

background events oscillating with frequency Am^ . v and N c \, = + combinatorial events (no 
oscillations) is generated according to Eq. 1. 

The uncertainty on the measurement of the transverse decay length, Od xy (see Fig. 13), and the true 
value of the g-factor go (g := m/pj) as seen in Fig. 14(a), are generated randomly according to the 
distributions obtained from the simulated samples, fitted with appropriate combinations of Gaussian and 
exponential functions. For the — > Djd\ channel the true p\ distribution shown in Fig. 14(b) is fitted 
with a combination of a parabola function in the low p\ region and a sum of two exponential functions 
in the high p% region. The go values are obtained by converting generated p% values at random. 

From the computed true decay length, d%, = to/ go, the corresponding reconstructed decay length 
is generated as d xy = d^ y + <Jd xy • (Hdxy + S dxy £l). *o is the proper time of the generated B s . Sd xy is the 
width and jXdxy the mean value of the Gaussian shape of the pull of the transverse decay length ^lAsl 
shown in Fig. 15. The fitted values are S dxy = 1.099 ±0.011 and \i dxy = (8.76 ± 1.47) • 10~ 2 for the 
B° s -► D-% + channel respectively S dxy = 1.113 ±0.011 and \i dxy = (5.40 ± 1.48) • 10~ 2 for the fi ( s > -► 
D~a\ channel. The reconstructed g-factor is generated as g = g 0 + gojUg + goS g Qf. The distribution 
of the fractional g-factor as shown in Fig. 16 is fitted with a Gaussian resulting in a width of 
S g = (0.89 ±0.01) • 10~ 2 and a mean value of p. g = (0.27 ±0.12) • 10~ 3 for the B° s -► D~n+ channel 
respectively S g = (0.82 ±0.01) • 10~ 2 and p, g = (0.56 ±0.11) • 10~ 3 for the B° -> Dja\ channel. Both 
Q. and Q! are random numbers distributed according to the normal distribution. From the transverse 
decay length and g-factor, the reconstructed proper time is then computed as t = gd^. The probability 
for the event to be mixed or unmixed is determined from the to and Am s (or Ara^) values using the 
expression (1 - cos (Am/o)/ cosh (AT/o/2)) /2 which is left from Eq. 1 after the exponential part has 
been separated. 

For a fraction of the events, selected at random, the state is interchanged between mixed and unmixed, 
according to the wrong tag fraction o^g. Half of the combinatorial events are added to the mixed events 
and half to the unmixed events. 

For the exclusive B d s background channels as well as the combinatorial background, the recon- 
structed proper time is generated assuming that it has the same distribution as the one for signal B ( s ' mesons 
coming from the Djn + and D~a\ sample respectively, no mixing included. 

The Am s measurement limits are obtained applying the amplitude fit method [5] to the sample gen- 
erated as described in the previous section. According to this method a new parameter, the B® oscil- 
lation amplitude s/ , is introduced in the likelihood function by replacing the term 'jUocosAm^o' with 
'jUo-^cosAm^o' in the B® probability density function given by Eq. 1. The new likelihood function, 
similar to Eq. 7, again includes all experimental effects. For each value of Am s , this likelihood function 
is minimized with respect to si, keeping all other parameters fixed, and a value s/ ± <7^ at is obtained. 
One expects, within the estimated uncertainty, s/ = 1 for Am s close to its true value, and s/ = 0 for Am s 
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g o [ps/mm] p y [GeV] 



(a) B° s -> D- 7Z+ (b) B° s -> D^af 

Figure 14: The true value of the g-factor go = to/d% of simulated B® from the B® — > D~n + sample 
(a) fitted with the sum of three Gaussian functions (dashed lines) and the true transverse momentum 
distribution p\ of simulated B Q S from the B® — > D^aj 1- sample (b) including trigger selection. 
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(a) Amplitude vs. Am, 



(b) Significance vs. Am s 



Figure 17: The 5° oscillation amplitude (a) and the measurement significance (b) as a function of Am s 
for an integrated luminosity of 10 fb _1 for a specific Monte Carlo experiment with Amf en = 100 ps -1 . 



far from the true value. A five standard deviation measurement limit is defined as the value of Am s for 
which l/ov = 5, and a sensitivity at 95% CL. as the value of Am s for which 1/cv = 1.645. Limits are 
computed with the statistical uncertainty aj^ at . A detailed investigation on the systematic uncertainties 
a^ st , which affects the measurement of the B ( s ' oscillation, is presented in [18]. 

6.4 Extraction of the Am s Sensitivity 

For the nominal set of parameters (as defined in the previous sections), AT S = 0 and an integrated lu- 
minosity of 10 fb _1 , the amplitude ±lo^ at is plotted as a function of Am s in Fig. 17(a). The 95% CL. 
sensitivity to measure Am s is found to be 29.6 ps -1 . This value is given by the intersection of the dashed 
line, corresponding to 1.645 a^ at , with the horizontal line at stf = 1. 

From Fig. 17(b), which shows the significance of the measurement S(Am s ) = l/ov as a function of 
Am s , the 5a measurement limit is found to 20.5 ps -1 . 

The dependence of the Am s measurement limits on the integrated luminosity is shown in Fig. 18(a), 
with the numerical values given in Table 13. 



Table 13: The dependence of Am s measurement limits on the integrated luminosity ££ . 





5 a limit 


95% CL. sensitivity 


[fb- 1 ] 


[ps- 1 ] 


[ps- 1 ] 


3 


14.5 


25.0 


5 


17.0 


27.0 


10 


20.5 


29.6 


20 


23.7 


32.0 


30 


25.3 


33.2 


40 


26.4 


34.1 



The dependence of the Am s measurement limits on AT S /T S is determined for an integrated luminosity 
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Figure 18: The dependence of Am s measurement limits (a) on the integrated luminosity and (b) on 
AT S /T S for an integrated luminosity of 10 fb -1 . The dashed horizontal line in (a) denotes the CDF 
measurement. 



of 10 fb \ other parameters having their nominal value. The AT s /r s is used as a fixed parameter in the 
amplitude fit method. As shown in Fig. 18(b) no sizeable effect is seen up to AT S /T S ~ 30%. 



6.5 Extraction of the Am s Measurement Precision 

Whereas the Am s measurement limits are obtained by using the amplitude method (see previous section), 
in case of the presence of an oscillation signal in the data the value of the oscillation frequency Am s and 
its precision are determined by minimising the likelihood (given by Eq. 7) with respect to Am s . In this 
fit AI^ is fixed to 0, because a study has shown that the systematic uncertainties resulting from varying 
Ar v /r v in the range 0 to 0.2 (suggested by the present uncertainty) are practically negligible. 

An example of the likelihood function is given in Fig. 19(a), in which the Amf en in the Monte Carlo 
sample has been set to the value measured by CDF for illustration. From this type of graphs the precision 
of the measurement of Am s is extracted and plotted in Fig. 19(b) as a function of the integrated luminosity 
for three values of Amf en . 



6.6 Discussion of Results 

In this note it is shown that with an integrated luminosity of 10 fb -1 ATLAS is able to verify the CDF 
measurement of Am s = (17.77 ±0.10(stat) ± 0.07 (sys))ps _1 at the five standard deviation level. For 
these parameters the statistical error on Am s is calculated to be about 0.065 ps _1 . 

In a preceding study [18] it was found that over a wide range of values for Am s and integrated 
luminosity the systematic uncertainty on the measured value of Am s was smaller by at least a factor of 
10 compared to the statistical uncertainty. The list of contributions to that systematic error estimation 
included the wrong tag fraction with a relative error of 5% compared to 2.5% found in this study. For 
the reasons mentioned above, the evaluation of systematic effects has not been repeated here. The study 
of the effect of varying Ar s (as explained in the previous section) is new, but the contribution to the 
systematic uncertainty is also very small. 
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Figure 19: (a) The negative natural logarithm of the likelihood for a specific Monte Carlo data sample 
for an integrated luminosity of 10 fb -1 and a true value of Amf en = 17.77 ps -1 . The inset shows a 
zoom around the minimum, (b) The statistical error a staS (Am s ) as a function of the integrated luminosity 
for values of Amf en of 15, 17.77 and 20 ps -1 . For comparison: the CDF statistical error on their Am s 
measurement is 0.10 ps -1 . 



Systematic uncertainties on the overall trigger efficiencies mainly effect the statistics available for 
the analysis. However, an important systematic effect for the Am s measurement would be introduced in 
case different trigger efficiencies for positively and negatively charged muons are observed. In order to 
constrain this effect, dimuon events from a calibration channel like B + — > J/\ffK + with J/yr — > jJ. + jJ.^ 
which are triggered by a single muon trigger could be used. 

Clearly LHCb can measure Am s more precisely than ATLAS (0 sta t(Am s ) ~ 0.01 ps -1 with 2 fb -1 of 
data [19]), but the Am s measurement with ATLAS is needed for the simultaneous fit of all parameters of 
the weak sector of the B®-B® system (weak mixing phase 0 S , Am s , Y s and AY S ). This will be performed by 
a combined analysis of the channels described in this note and the B° s — >• 7 /yty channel [20]. The ATLAS 
measurement is an independent cross-check of the measurements performed by other experiments. 

7 Summary and Conclusions 

We have studied the capabilities of the ATLAS detector to measure B® oscillations in pp collisions 
at 14 TeV using the purely hadronic decay channels S° — >• D~{$k~)k + and — >• D~(<j>n~)a^. For 
an integrated luminosity of 10 fb -1 a Am s sensitivity limit of 29.6 ps -1 and a five standard deviation 
measurement limit of 20.5 ps -1 is obtained from a likelihood fit employing the amplitude fit method. 
This result depends only weakly on the lifetime difference AT,. The trigger is based on a single muon 
trigger with adjustable muon pj thresholds between 4 and 10 GeV on all trigger levels and an active 
search for D s ^(j>n decays by the High Level Trigger. For 10 3 1 cm~ 2 s~ 1 we will be able to afford a muon 
trigger with the loosest pj threshold combined with a FullScan D s — > (j)7l search. For 10 32 cm _2 s _1 , we 
will need to increase the muon pj threshold to 6 GeV and possibly employ the Rol-based LVL2 trigger. 
In both cases, the trigger rates can be kept at an acceptable level. For higher luminosities, we need to 
implement additional constraints in the HLT in order to reduce the event output rates further. 
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The offline event reconstruction searches for the hadronic decay of the B ( s ' and requires a muon with 
a minimum pj of 6 GeV for each event. While the flavour of the B ( s ' at decay time is determined from the 
charge of the D s particle, the identification of the initial B® flavour at production time is extracted from 
the charge of the soft muon in the event, taking effects of B { ] mixing into account. An overall tagging 
efficiency of 98.8 ±0.2% and 98.5 ±0.2% as well as average wrong tag fractions of 22.3 ±0.6% and 
23.3 ± 0.6% for the B ( ] -> Djx + and the B° s D~a\ channels, respectively, are obtained. 

About 100000 Monte Carlo events of each sample have been produced without BP S oscillations. 
Monte Carlo events for several exclusive B° s and B d background channels as well as for inclusive back- 
ground like bb — >• jiX and cc — >• jiX have been used. The hadronic decay of the signal side B° s is recon- 
structed constraining the masses of intermediate particles in the decay chain. A B Q S mass resolution of 
52.8 ±0.7 MeV and 40.8 ±0.5 MeV is obtained and after the application of all analysis cuts, 6657 and 
3 368 events are expected for an integrated luminosity of 10 fb~ ! for the B Q S — > D~n + and the B° — ► D~a\ 
decay channels, respectively. We have considered several exclusive background channels, contribut- 
ing to the background inside the B ( ] mass window. The B° d — > D~n + ja\ channels hardly contribute 
(percent level of the signal), but the B® —> D*~7l + ja\ make a considerable contribution of about 16% 
(B ( s ' -»■ Djn + ) and 31% (B ( s ' -> Djaf). While the B° d D~n + channel is expected to contribute with 
about 1.5% relative to the B° s — > Djn + signal, we can only estimate the B d — >• Dja\ contribution to be 
less than about 70% of the 5° —> Dja^ signal, given that this decay channel has not yet been observed. 

In future, the B® —> D*~7C + ja\ channels may be considered signal rather than background. An 
estimate of the combinatorial background is severely limited by the available Monte Carlo event statistics. 
We plan to use early data to obtain a realistic estimate. For an integrated luminosity of 100 pb -1 at 
10 32 cm _2 s _1 we only expect about 90 events in the B° D~n + and B° s D~a~l channels, while the 
B + — > J /y{nn)K + decay channel, which has a large branching ratio, may be used to calibrate the soft 
muon tagging with early data. On LVL2 the processing time for the D s — > <j>7l trigger may be reduced by 
restricting the track reconstruction to Regions of Interest (Rol), seeded by a LVL1 jet energy trigger. This 
typically leads to a reduction of the trigger efficiency by a few percent. At an instantaneous luminosity 
of 2 • 10 33 cm _2 s _1 several options will be considered to achieve acceptable Event Filter output rates. 
Besides further constraining the mass windows of the D s —xf>n trigger, other improvements are obtained 
by checking for a good reconstruction quality of the D s vertex or the implementation of a trigger element 
in the Event Filter which searches for the full B ( s ' decay chain. However, there is an uncertainty of a factor 
two in the overall bb cross-section at the centre-of-mass energy of 14 TeV and therefore the trigger rates 
may vary. 

Due to the achieved sensitivity limit to measure the B ( s ' oscillations we expect to be able to verify the 
CDF measurement of Am s = 17.77 ±0.10 (stat) ±0.07 (sys) ps" 1 at the five standard deviation level with 
a statistical error on Am s of about 0.065 ps -1 . This will provide a reasonable precision which allows us 
to combine the measurement described in this note with the analysis of the B ( s ' — > J/yfQ channel [20] in 
a simultaneous fit for all parameters of the weak sector of the B®-B® system. 
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Abstract 

The investigation of the dynamics responsible for electroweak symmetry break- 
ing is one of the prime tasks of experiments at present and future colliders. 
Experiments at the CERN Large Hadron Collider (LHC) will be able to dis- 
cover a Standard Model Higgs boson over the full mass range as well as Higgs 
bosons in extended models. In this introductory paper the Higgs boson produc- 
tion cross-section and decay branching ratios according to the Standard Model, 
and its Minimal Supersymmetric extension, are presented and discussed. 

1 Introduction 

The Large Hadron Collider at CERN, almost ready to start colliding proton beams at y/s=\4 TeV, will 
play an important role in the investigation of fundamental questions of particle physics. While the Stan- 
dard Model of electroweak [1] and strong [2] interactions is in excellent agreement with the numer- 
ous experimental measurements, the dynamics responsible for electroweak symmetry breaking are still 
unknown. Within the Standard Model, the Higgs mechanism [3] is invoked to break the electroweak 
symmetry. A doublet of complex scalar fields is introduced, of which a single neutral scalar particle, the 
Higgs boson, remains after symmetry breaking [4]. Many extensions of this minimal version of the Higgs 
sector have been proposed, mostly discussed a scenario with two complex Higgs doublets as realized in 
the Minimal Supersymmetric Standard Model (MSSM) [5]. 

Within the Standard Model, the Higgs boson is the only particle that has not been discovered so 
far. The direct search at the e + e~ collider LEP has led to a lower bound on its mass of 114.4 GeV [6]. 
Indirectly, high precision electroweak data constrain the mass of the Higgs boson via their sensitivity to 
loop corrections. Assuming the overall validity of the Standard Model, a global fit [7] to all electroweak 
data leads to the 95% C.L. m H < 144 GeV. The 95 % C.L. lower limit obtained from LEP is not used in 
the determination of this limit. Including it increases the limit to 182 GeV [7]. 

On the basis of the present theoretical knowledge, the Higgs sector in the Standard Model remains 
largely unconstrained. While there is no direct prediction for the mass of the Higgs boson, an upper limit 
of ~1 TeV can be inferred from unitarity arguments [8]. 

Further constraints can be derived under the assumption that the Standard Model is valid only up to 
a cutoff energy scale A, beyond which new physics becomes relevant. Requiring that the electroweak 
vacuum is stable and that the Standard Model remains perturbative allows to set upper and lower bounds 
on the Higgs boson mass [9, 10]. For a cutoff scale of the order of the Planck mass, the Higgs boson 
mass is required to be in the range 130 < M# < 180 GeV. If new physics appears at lower mass scales, 
the bound becomes weaker, e.g., for A = 1 TeV the Higgs boson mass is constrained to be in the range 
50 < M H < 800 GeV. 

Direct searches for the Standard Model Higgs boson at the Tevatron include looking for its production 
via gluon fusion and subsequent decay to WW'*' . The observed 95% C.L. limit by CDF (with an 
integrated luminosity of 2.4 fb" 1 analysed) is 0.85 pb for Mh= 160 GeV, which is about 1.6 times the 
Standard Model prediction [1 1]. Similarly the D0 Collaboration excludes at 95% C.L. the production of 
this boson with a cross-section about 2.4 times the one predicted by the Standard Model (o^). Searches 
at low mass are done studying Higgs bosons produced in association with the W and Z, and looking for 
H ^bb with leptonic IVanZ decays (e, CDF sets a 95% C.L. limit to 8.2 x cf^, while the one from 
D0 is 1 1 x a|^, for M#= 115 GeV. Preliminary results on the combination of the results from these two 
experiments lead to a 95% C.L. limit on the Higgs boson production cross-section to about 5. 1 x Osm for 
M H = 115 GeV, and 1.1 xo^ for M H = 160 GeV [11]. 
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The Minimal Supersymmetric Standard Model contains two complex Higgs doublets, leading to five 
physical Higgs bosons after electroweak symmetry breaking: three neutral (two CP-even h and H, and 
one CP-odd A) and a pair of charged Higgs bosons H ± . At tree level, the Higgs sector of the MSSM 
is fully specified by two parameters, generally chosen to be m A , the mass of the CP-odd Higgs boson, 
and tan/3, the ratio of the vacuum expectation values of the two Higgs doublets. Radiative corrections 
modify the tree-level relations significantly. This is of particular interest for the mass of the lightest 
CP-even Higgs boson, which at tree level is constrained to be below the mass of the Z boson. Loop 
corrections are sensitive to the mass of the top quark, to the mass of the scalar particles and in particular 
to mixing in the stop sector. The largest values for the mass of the Higgs boson h are reached for large 
mixing, characterized by large values of the mixing parameter X t := A t — cot /3 , where A t is the trilinear 
coupling and jx is the Higgs mass parameter. If the full one-loop and the dominant two-loop contributions 
are included [12, 13], the upper bound on the mass of the light Higgs boson h is expected to be around 
135 GeV (m/j-max scenario). While the light neutral Higgs boson may be difficult to distinguish from its 
Standard Model counterpart, the other heavier Higgs bosons are a distinctive signal of physics beyond 
the Standard Model. The masses of the heavier Higgs bosons H, A and are often almost degenerate. 

Direct searches at LEP have given lower bounds of 92.9 (93.3) GeV and 93.4 (93.3) GeV on the 
masses of the lightest CP-even Higgs boson h and the CP-odd Higgs boson A within the m/,-max (no- 
mixing) scenario. [14] In those scenarios, the mixing parameter in the stop sector is set to values of 
X t = 2 TeV and X t = 0, respectively. Given the LEP results, the tan/3 regions of 0.9 < tan/3 < 1.5 and 
0.4 < tan/3 < 5.6 are excluded at 95% confidence level for the m^-max and the no-mixing scenarios, 
respectively [14]. However, it should be noted that the exclusions in tan/3 depend critically on the exact 
value of the top-quark mass. In the LEP analysis m t = 179.3 GeV has been assumed. With decreasing 
top mass the theoretical upper bound on m\ t decreases and hence the exclusions in tan /3 increase, while 
for m t of about 183 GeV , or higher, the exclusions in tan/3 vanish. 

Direct searches at the Tevatron have been performed looking to the tau-pair and b-pair production. 
With an integrated luminosity of 1.8 fb" 1 no excess of events has been observed, and exclusion limits on 
production cross-section times branching fraction to tau pairs for a Higgs boson mass in the range from 
90 to 250 GeV have been set. The expected reach of this search, assuming MSSM Higgs production, 
extends below tan/3= 40 for Ma in the mass range Ma- 120 to 160 GeV [15]. 

The charged Higgs boson mass is related to niA via the tree-level relation = + m\ and it 
is less sensitive to radiative corrections [16]. Direct searches for charged Higgs bosons in the decay 
modes — > TV and — > cs have been carried out at LEP, yielding a lower bound of 78.6 GeV on 
m H ± independent of the — > TV branching ratio [17]. At the Tevatron, the CDF and D0 experiments 
have performed direct and indirect searches for the charged Higgs boson through the process pp — >• tt 
with at least one top quark decaying via t — >• H^b. These searches have excluded the small and large 
tan/3 regions for masses up to ~160 GeV [18]. Other experimental bounds on the charged Higgs 
boson mass can be derived using processes where the charged Higgs boson enters as a virtual particle. 
For example, the measurement of the b — > sy decay rate allows indirect limits to be set on the charged 
Higgs boson mass [19] which, however, are strongly model dependent [20]. 

The high collision energy of the LHC will allow the search for Higgs bosons to be extended into 
unexplored mass regions. The experiments have a large discovery potential for Higgs bosons in both the 
Standard Model and in the MSSM over the full parameter range. Should the Higgs boson be light, i.e. 
have a mass in the range favoured by the precision electroweak measurements, the experiments at the 
Tevatron might also get indications of the existence of a Higgs boson. 

In this chapter, the potential for Higgs boson searches at the Large Hadron Collider with the ATLAS 
experiment is reviewed, focussing on the investigation of the Higgs sectors in the Standard Model and in 
the MSSM. 
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Table 1: Set of parameters used in the evaluation of the Higgs boson cross-sections (see text). 



M uds = 190 MeV 


M c = 1.40 GeV 


Mi = 4.60 GeV 


M, = 172 GeV 


M z =91.187 GeV 


M w = 80.41 GeV 


G F = 1.16639 x 10- 5 GeV- 2 


N F = 5 


A QCD = 165 MeV 


A^g = 226 MeV 


a^°(M z ) = 0.130 


Qf £0 (M z ) = 0.118 



Table 2: Cross-sections for Higgs boson production via gluon fusion at LO and NLO in the mass range 
100 < Mh < 1000 GeV. The first column gives the Higgs boson mass, the second the LO order cross- 
section, the third and fourth column gives the NLO electroweak (EW) and QCD corrections respectively 
(see text), the last column presents the overall NLO cross-section. 
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2 Production cross-sections and branching fractions for a Standard Model 
Higgs boson 

2.1 Production cross-sections 

This section reports on the Standard Model Higgs boson production cross-section via gluon fusion, Vec- 
tor Boson Fusion (VBF), and the associated production with a Vector Boson (WH and ZH), to leading 
order (LO) and to next to leading order (NLO). The associated production with it is also discussed. 

The calculation is performed using the CTEQ6L1 and CTEQ6M Parton Distribution Functions (PDF) 
at LO and NLO respectively [21]. Single and two-loop calculations of the running strong coupling 
constant a s are used for the LO and NLO computation respectively. Table 1 shows the values of the 
Standard Model parameters used in the computations. 

The cross-section of the gluon fusion process is evaluated using the program HIGLU [22]. The 
calculation is performed with exact NLO matrix element. The renormalization and factorization scales 
are set to the Higgs boson mass. The values of a s are calculated according to the values of Aqcd given 
in Table 1 with 5 flavors. This approach may overestimate the LO cross-section; however this effect is 
expected to be negligible compared to the effect from the scale uncertainty. 

Table 2 reports the cross-sections for Higgs boson production via gluon fusion at LO and NLO in 
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Table 3: Cross-sections for Higgs boson production via Vector Boson Fusion at LO and NLO in the 
mass range 100 < M#< 1000,GeV. The first column gives the Higgs boson mass, the second the LO 
order cross-section, the third and fourth column gives respectively the NLO electroweak (EW) and QCD 
corrections (see text), the fifth column presents the overall NLO cross-section. 



0.04 -0.05 



3.173 0.05 -0.05 



2.427 0.04 -0.04 " 
1.355 O05 -0.04 



0.542 0.05 0.01 



0.105 0.03 0.27 



Table 4: Cross-sections for the associated Higgs boson production with W bosons at LO and NLO in the 
mass range 100 < M H < 200 GeV. 
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the mass ranges 100 < M#< 1000 GeV. The NLO cross-section is obtained from the LO one as follows: 
Onlo = Olo x ( 1 + Sqcd + 8 E w) ■ 

The cross-section of VBF Higgs boson production is estimated with the package VV2H F [22]. The 
results are reported in Table 3 as a function of the Higgs boson mass in the range 100 GeV< M#< 1000 
GeV. The renormalization and factorization scales are set to the Higgs boson mass. 

The cross-sections for WH, ZH and ttH production are one to two orders of magnitude below the 
gluon and vector boson fusion cross-sections. The values are given in Tables 4, 5, and 6 respectively. 

The cross-section for the Higgs boson produced in association with tt is estimated with the package 
HQQ [22]. The renormalization and factorization scales are set to (M#+ 2M t )/2. The QCD corrections to 
this process are known [23,24] and yield a K-factor of about 1.25. However it should be stressed that the 
main backgrounds in this analysis (ttbb and tt'))) are known to LO only. Table 6 reports the cross-section 
of Higgs boson production associated to tt in the mass range 100 GeV< M#< 200 GeV. 

The package V2HV is used to estimate the Higgs boson production cross-section in association with 
the W and Z bosons [22]. The renormalization and factorization scales are set to the sum of the invariant 
masses of the weak boson and of the Higgs boson. Tables 4 and 5 report the cross-sections of these 
processes in the mass range 100 GeV< M#< 200 GeV. 
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Table 5: cross-sections for the Higgs boson associated production with Z bosons at LO and NLO in the 
mass range 100 < M H < 200 GeV. 



l/„lG \ i a, (pb) 



0.25 -0.06 0.163 



Table 6: cross-sections for the associated Higgs boson production with tt to LO and NLO (courtesy of 
M. Spira) in the mass range 100< M H < 200 GeV. 



M H (GeV) c LO (pb) o NLO (pb) K Facte 



2.2 Decays branching ratios 

Higgs boson branching ratios are evaluated with the program HDECAY [25] . Here, the default settings 
defined by the authors are used (see Ref. [25]), except for the parameters specified in Table 1. 

Tables 7 and 8 report the most relevant Higgs boson branching ratios for 100 < M#< 1000 GeV: 
H ^bb,T + T;-,YY,ZZ(*\WWM and tt. 

Figure 1 (left) shows the branching fractions and the production cross-section (right) of the Standard 
Model Higgs boson as a function of its mass (Figures taken from Reference [26]). 

2.3 Standard Model Higgs boson search in ATLAS 

The Standard Model Higgs boson is searched for at the LHC in various decay channels, the choice of 
which depends by the signal rates and the signal to background ratios in the various mass regions. 
The Standard Model Higgs boson channels considered in this volume are: 

• pp -> H -> JY 

• pp^H^ ZZ& 41(1 = e,fl) 

• pp — > qqH — > qqT + T~ 

• pp^H^ W + W- lvlV,lVqq 
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Table 7: Relevant Higgs boson branching ratios for the mass range 100 < M H < 200 GeV. 



n ■ x x 

.07726 



Table 8: Relevant Higgs boson branching ratios for the mass range 250 < Mh< 1000 GeV. 



r H (GeV) H -> WW(*) 

4.046 ~ 0.7003 

8.505 ~ 0.6911 
15.60 0.6722 



(I.544(i 
0.5500 
0.5591 
0.5692 



0.5889 



0.5977 
0.6057 
0.6129 
0.6195 



0.2600 0.19030 



0.2652 
0.2711 
0.2773 



0.2982 0.09586 
0.3023 0.08458 
0.3059 0.07445 



0.6253 0.3092 0.06537 



• pp^ttH^ttbb 

• pp^ttH -> ttW + W~, and c 



3 Higgs Bosons in the Minimal Supersymmetric Extension to the Stan- 
dard Model 

As discussed in the Introduction, in the Minimal Supersymmetric Standard Model two Higgs doublets 
are required, resulting in three neutral and two charged observable Higgs bosons. The production of 
neutral Higgs bosons and their decays are different from those in the Standard Model. While decays into 
ZZ or WW are dominant in the Standard Model for Higgs boson masses above M#> 2 A%, for high 
values of tan j3 these decay modes are either suppressed in case of the h and H or even absent in the case 
of the A. Instead, the coupling of the Higgs bosons to third generation fermions are strongly enhanced 
for large regions of the MSSM parameter space. 
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Figure 1: Left: Branching ratios for the relevant decay modes of the Standard Model Higgs boson as a 
function of its mass. Right: cross-sections for the five production channels of the Standard Model Higgs 
boson at the LHC at 14 TeV. 



3.1 Production and decays of neutral Higgs bosons 

The Higgs boson production proceeds via two different mechanisms, the direct and the b quark a 
production, as described below. In the following 0 stands for either of the three neutral Higgs bosons: A, 
H, and h. Further details can be found in [27]. 



Direct Production: The diagram for this process is depicted in Fig. 2(a). It dominates in the range of 
low tan j6 and its rates are significantly larger than for the Standard Model. For the range of higher tan /3 it 
is still dominant for low m^. The cross-section for this process has been calculated at NLO accuracy [22] 
and the numerical values used here are listed under off^h in Table 9. 



Associated Production: Different approaches have been followed by theorists to calculate the cross- 
section for Higgs boson production in association with b quarks, each of them assuming one of the 
diagrams depicted in Fig. 2(b-d) as their leading order (LO) contribution. The implications connected 
with this choice are briefly discussed below. 

• gg^bb(j): 

The cross-section for this process has been calculated at NLO accuracy for the case of both b 
quarks at high transverse momentum [28, 29], where this calculation is considered to be reliable. 

a) b) c) d) e) 

9 b 

g gib iQQOjf » 



Figure 2: Feynman diagrams contributing to the MSSM Higgs boson production. Diagram a) is called 
'direct production', diagrams b) to e) contribute to the b quark associated production. In the above 
diagrams (j) represents either of the neutral Higgs bosons in the MSSM, h, H, or A. 
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For the cases where only one or zero b quarks with high pj are required in the final state, the 
cross-section has been calculated at NLO accuracy by integrating over the momentum of one or 
both b quarks, respectively [28, 30, 31]. In these cases the predictions are considered less reliable 
due to the occurrence of potentially large collinear logarithms of the form log(Q/ntb) from the 
gluon splitting process, where Q is the factorization scale which is expected to be much larger than 
m h . 

• bb-nfr: 

The potentially large collinear logarithms can be dealt with by absorbing them into the parton 
density function (PDF) of the b quark and thus resumming them to all orders of perturbation theory. 
The intrinsically still present b quarks from the gluon splitting process are given zero transverse 
momentum at leading order. At higher order they can acquire transverse momentum. E.g. the 
process bg — > b(j) is a NLO contribution leading to an observable b quark. The total cross-section 
for this process has been calculated at NLO [32,33] and NNLO accuracy [34] and is considered to 
be reliable when it is not required (but also not vetoed) to observe a b quark. 

• bg —>b$: 

This process is a mixture of the two processes discussed above in a sense that one b quark is 
coming from the matrix element description and the other from the b PDF. This process has been 
calculated at NLO accuracy [35] and is considered to be reliable if one observes only one b quark 
with high pr in the final state. In principle in this case one should also veto any additional b quark 
which is experimentally challenging due to the limited ^-tagging efficiency. 

• qq — ► bbtp: 

Compared to gg — > bb<$> this process only contributes at the 1%-level at LHC energies and is 
therefore only mentioned for completeness. 

A comparison of the predicted cross-sections for both the inclusive and exclusive approach is shown 
in Figure 3. The blue band corresponds to the NNLO calculation of bb — >• 0, where the width of the 
band corresponds to the residual scale uncertainties when varying the default renormalization (ju r = m H ) 
and factorization (jU/ = m#/4) scales. The red band shows the corresponding NLO calculation of gg —>■ 
bbij). The scale uncertainty for gg — »■ bbij) is of the order 20 to 30%, while the scale uncertainty for 

— > 0 is much smaller, in particular at high Higgs boson masses. This might be due to the remaining 
collinear logarithms in the gg —> bb§ calculation. However, contributions from e.g. PDF- uncertainties, 
in particular for the b PDF have not been included here. Both methods agree within uncertainties. At 
larger Higgs boson masses, the prediction from bb — > <j) is slightly higher than that from gg — »■ bb(j). One 
of the effects that might explain this is the inclusion of closed-top-loops in the gg — > bbij) calculation, 
absent in the bb —> <j> case. In conclusion, both predictions may be used to normalize an inclusive signal 
sample, where no cuts have been applied on the momenta of the b quarks at generator level. 

Due to the finite integrated luminosity assumed to be available for the analyses discussed in this note, 
the inclusive normalization is chosen. The cross-section were calculated using FeynHiggs-2.6.2 [36] 
yielding the cross-section for a Standard Model Higgs boson. The cross-sections in the MSSM were 
then obtained by scaling them by the ratio of partial widths into bb: 

rg SM (m A ,tan/3) 

a MSSMK,tan/3) = as M H) SM . (1) 

The production cross-sections for all Higgs boson masses considered here and their branching frac- 
tion into a T+T~ and }X + }X~ final states in the m^ ax scenario are summarized in Table 9 for tan/3 = 20. 



1205 



Higgs - Introduction on Higgs Boson Searches 




100 150 200 250 300 350 400 ' 100 150 200 250 300 350 400 450 500 



M h [GeV] M H [GeV] 

Figure 3: The inclusive cross-sections for the processes bb — > 0/ (blue hatched region) and gg — ► bb§ 
(red hatched region) are shown on the right-hand side; the exclusive cross-sections for bg — > b(j)(blue 
hatched region) and gg — > bb(j)(r:ed hatched region) are shown on the right-hand side. The width of the 
bands corresponds to the theoretical uncertainty due to the choice of renormalization and factorization 
scales. 



3.2 Production and decays of charged Higgs bosons 

The search strategies for charged Higgs bosons depend on the charged Higgs boson mass, which dictates 
both the production and the available decay modes. Below the top quark mass the main production 
mode is through top quark decays, t —> H + b, and in this range the H + — > T+v decay mode is dominant. 
Once above the top quark threshold, production mainly takes place through gb fusion (gb — > IH + ). 
For such high charged Higgs boson masses, the decay into a top and a b quark dominates, H + — > tb, 
but H + — > T + v can still be sizeable and offers a much cleaner signature. The process gg —>■ tbH + is 
important for charged Higgs boson production with m H + around the top mass. Since the LHC will be 
the first collider tt factory, "light" charged Higgs bosons may be copiously produced through the process 
qq, gg —*tt — > tbH + [37]. While this is the dominant production mode, there are other processes which 
also contribute to the light charged Higgs boson production, like single top events or diagrams with the 
same final state as mentioned above (tbH + ), but which do not proceed through tt production. H + events 
through single top production are not considered in this volume. 

The charged Higgs boson production cross-section is evaluated for two different MSSM scenarios. 
They are chosen such that in Scenario A the decay of H + into SUSY particlces is suppressed, and in 
Scenario B (also known as the "m/,-max" scenario) the mass of the lightest Higgs boson h° is maximised. 



Table 9: Mass, cross-section for direct production, cross section for ^-associated production, and branch- 
ing fractions into t + t~ and i u + i u _ final states for Higgs bosons in the m™ ax scenario and for tan/3 = 20. 
All values were obtained using FeynHiggs-2.6.2 and HIGLU. 

Mass / GeV °mmfR> CT A////A le 7 fb m(h/H/A^ z + %-)/% 3B(h/H/A — > 

A H h A H h AH h A H h AH h 

~Tl0 1291$ imO - - - 314810 7579 310707 8^86 9J1 

130 134.2 124.7 92517 93941 43545 189602 92897 99992 9.11 9.23 

160 160.8 128.0 32148 34706 44561 97480 93102 6650 9.42 9.46 

200 200.5 128.4 9847 11377 45957 45685 45095 2188 9.57 9.72 

300 300.4 128.6 955 1451 46986 10312 10253 979 8.22 9.51 

450 449.8 128.6 - 2019 2035 723 6.07 6.24 



8.88 0.031 

9.00 0.032 

8.40 0.033 

7.49 0.034 

6.27 0.029 

5.68 0.021 



0.032 0.031 

0.033 0.032 

0.033 0.030 

0.034 0.027 

0.034 0.022 

0.022 0.020 
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For the two scenarios the following set of parameters have been used: 

Scenario A: Scenario B ("m/,-max"): 

• m, = 175 GeV • m, = 170 GeV 

• Msusy = 500 GeV • M SU sy = 1000 GeV 

• A t = 1000 GeV • X t = 2000 GeV, where A t = X, + jU / tan /3 

• jU=200GeV • jU=200GeV 

• M 2 = 1000 GeV • M 2 = 200 GeV 

• M 3 = 1000 GeV • M 3 = 800 GeV 



^susy denotes the soft SUSY-breaking mass parameter in the sfermion sector, m t X t is the off- 
diagonal entry in the stop mass matrix, jj, the Higgsino mixing paramter, and M 2 and Mj the soft 
SUSY-breaking mass parameters in the SU(2) gaugino and the gluino sector, respectively. In the follow- 
ing, numerical NLO cross-sections for the low-mass H + region are calculated with FeynHiggs (version 
2.6.2 [36]). Heavy H + NLO cross-sections are obtained with Ref. [38] and corrected with the dominant 
supersymmetry loop corrections (reflecting the altered relation between the bottom quark mass and its 
Yukawa coupling, Am/,) as proposed in Ref. [39]. 

Figure 4 shows the results for the tbH + final state as a function of tan/3 for the MSSM scenarios A 
and B. The production cross-section has a minimum at tan/3 «7. This is caused by a mimimum in the 
H + tb Yukawa coupling and renders the so-called intermediate tan/3 region (4 < tan/3 < 10) which is 
experimentally hard to reach. 




Figure 4: Expected charged Higgs boson production cross-section in the MSSM for scenarios A and B 
for light [36] and heavy charged Higgs bosons [38]. 



Below the top quark mass, the charged Higgs boson predominantly decays into a % lepton and a 
neutrino, and for values of tan/3 >5 this branching ratio is close to 100%, as shown in Figures 5 and 6. 
Decay modes involving cs and Wh are also present, but depending on the value of tan /3 they are one or 
two orders of magnitude smaller than TV. The decay of the W boson originating from the associated top 
quark adds variety to the possible charged Higgs boson signatures, but it also provides handles for signal 
reconstruction and, even more important, for background rejection. 

Once above the top quark mass threshold, the H + — > tb decay mode shows a rapid growth and soon 
becomes an important decay mode as shown in Figure 5. Contrary to the light charged Higgs boson, 
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Figure 5: Charged Higgs boson branching ratios as a function of mass for the m/,-max scenario for 
tan/3 = 2 and tan/3 = 35 and three selected decay modes. 



for which the H + — > T + v decay mode is an almost exclusive decay mode, the heavy charged Higgs H + 
boson does not solely decay into tb, but a significant fraction is allowed to decay into other decay modes 
like T + v, W + h, cs or SUSY particles where kinematically allowed. 

The calculations are performed with FeynHiggs [36], as it allows to calculate both the BR(f — ► H + b) 
and the H + decay branching in a consistent way, and includes important corrections to the tree level 
values. Figure 6 shows the calculated branching ratios for two different charged Higgs boson masses, 
one light (130 GeV) and one heavy (600 GeV), as a function of tan/3 . 




Figure 6: Expected charged Higgs boson branching ratios in the MSSM for scenarios A and B for the 
example of a light {m H + = 130 GeV) and a heavy {m H + = 600 GeV) charged Higgs boson [36]. 
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Prospects for the Discovery of the Standard Model Higgs 
Boson Using the 77 Decay 

Abstract 

The discovery potential for the Standard Model Higgs boson through the H — *• 
yy decay in the ATLAS detector is reported. Various performance aspects 
of the Higgs boson mass reconstruction and the photon identification are dis- 
cussed. Trigger issues are also considered. The potential of an inclusive 
H — > 77 search and Higgs boson searches in association with one or two high 
Pt jets are evaluated. Studies of the associated WH, ZH and ttH produc- 
tion processes are also presented. Finally, the discovery potential is assessed 
using an unbinned multivariate maximum-likelihood fit. These studies are per- 
formed for experimental conditions expected for an instantaneous luminosity 
of » lO^cm-V 1 . 

1 Introduction 

In the mass range 1 10 < mn < 140 GeV the Higgs boson is expected to decay into two photons with a 
branching fraction large enough to render the search feasible at the LHC [1]. The inclusive search for the 
Higgs boson in the diphoton decay channel has been studied in ATLAS for many years and constitutes 
one of the benchmarks for the detector performance [2-4]. In this paper the sensitivity of ATLAS to this 
channel is re-evaluated with an updated detector description and software. The impact of higher order 
QCD and electroweak corrections on the discovery potential is also evaluated. 

The Higgs boson can be produced in association with hadronic jets of high transverse momentum, pj. 
Gluons from initial-state radiation in the gg — > H and qq^qqH Vector Boson Fusion (VBF) processes 
are the largest contributors to Higgs boson production in association with high pj hadronic jets. The 
search for a Higgs boson using the diphoton decay mode in association with one and two jets at the LHC 
has been suggested [5-7] and a previous analysis reported on feasibility studies for these final states 
using a fast detector simulation [8,9]. This paper presents an update of these studies based on a more 
complete description of the detector simulation. 

In addition to the diphoton invariant mass, other discriminating variables are incorporated into the 
analysis and combined by means of an unbinned maximum-likelihood fit. Photon reconstruction proper- 
ties and the event topology are used to separate the data sample into categories that are fit simultaneously. 

In the search for a Standard Model Higgs boson, the associated production with W, Z, or it can 
serve to complement the inclusive Higgs boson and Higgs boson + jets channels and help to determine 
the Higgs boson coupling to the Standard Model gauge bosons and the Yukawa coupling to the top 
quark [10]. These measurements would provide consistency checks of the Standard Model. They could 
also be interpreted in terms of Minimal Supersymmetric Standard Model couplings at high tan/3 (for 
niH ~ 120 GeV) and high m^. Deviations from Standard Model rates could imply new physics such 
as gauge-Higgs unification [11, 12] or the presence of resonances in models of little Higgs [13, 14], 
Left-Right symmetry [15] or technicolour [16]. The feasibility of diphoton searches in association with 
weak vector bosons is evaluated using a full detector simulation. This involves searches for diphotons in 
association with either just missing transverse energy, £"™ ss , or £"™ ss and a charged lepton (electron or 
muon), and updates earlier studies performed with a fast detector simulation [17, 18]. 
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2 Monte Carlo event generation 

The Monte Carlo (MC) event generation required for this paper is split into two main groups. The first 
group corresponds to MC samples produced with a full detector simulation based on GEANT4 [19]. 
These are used for the evaluation of detailed detector effects relevant to the analyses. The second group 
of MC samples are processed with a fast detector simulation [20] and are used primarily for the evaluation 
of the analysis sensitivity. For example, the Higgs boson mass resolution, photon identification efficiency 
and photon-jet rejection are evaluated with a full detector simulation. The resulting photon efficiency and 
photon-jet rejection are parameterised as a function of the photon pj and these parameterisations are then 
applied to the fast detector simulation. 

2.1 Signal processes 

Signal events are generated using PYTHIA [21]: this package implements Leading Order (LO) Matrix 
Element calculations for all the signal processes considered here. The gluon fusion process is also 
simulated with the MC@NLO [22,23] package. This package provides QCD Next-to-Leading-Order 
(NLO) Matrix Elements [24-27] in addition to a good description of multiple soft-gluon emission at next- 
to-next-to-leading logarithmic level (NNLL) [28-30]. This is relevant to evaluating the discriminating 
power of the diphoton pj and other relevant variables. All signal processes used here are processed 
through a full detector simulation. Signal events produced via the VBF mechanism are also modeled 
with HERWIG [31]. All the generated samples for signal processes used here are normalized to the NLO 
cross-sections [32] taking into account only QCD corrections. 

2.2 Background processes 

Background processes can be split into two main groups: backgrounds coming from the production of 
two isolated photons, which are usually referred to as irreducible, and reducible backgrounds coming 
from events with at least one fake photon. Fake photons are mostly due to the presence of a leading 71° 
resulting from the fragmentation of a quark or a gluon. Table 1 gives a summary of the MC packages 
and the cross-sections (in pb) for the irreducible and reducible backgrounds used here. 

In this paper QCD corrections to both signal and background are considered in the inclusive analysis. 
The irreducible background is mainly due to the qq,qg — > yyx processes at up to order a 2 a s (namely 
the Born and Bremsstrahlung contributions) and the gg — >• yy up to order a 2 a^ (referred to as the box 
contribution). Contributions from photons produced collinear to quarks are also taken into account at 
NLO. The DIPHOX [33] and ResBos [34-36] programs are used to assess the irreducible background 
computation. DIPHOX includes all the processes to order a 2 a s , including the Bremsstrahlung contribu- 
tion with the quasi collinear fragmentation of quarks and gluons which are computed at NLO. DIPHOX 
does not include resummation effects. ResBos includes the Born and box contributions at NLO as well 
as the bremsstrahlung contribution (but the fragmentation contribution is only at LO). ResBos includes 
resummation effects to NNLL. 

In these computations, a parton level isolation cut of 15 GeV in a cone of AR = 0.4 is used. This is 
checked against the real photon identification cuts using PYTHIA fully simulated events. DIPHOX and 
ResBos predictions for the total irreducible background agree to better than 10% [37]. 

Table 1 shows the cross-sections used to normalize the MC generation in the inclusive analysis. 
The cross-sections for qq, qg —> yjx and gg — »■ yy quoted in Table 1 were computed with ResBos for 
80 < m n < 150 GeV, where m rr is the invariant mass of the diphoton system, and pj y > 25 GeV, where 
p Ty is the transverse momentum of the photons. Photons are required to lie in the central region of the 
detector, \r\\ < 2.5. The factorization and renormalization scales are set dynamically as the invariant 
mass of the diphoton system. 
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Table 1 : Details of the cross-section calculation and event generation for the irreducible and re- 
ducible photonic backgrounds used in the H — > yy analysis. The first three columns display the 
cross-section calculators, kinematic cuts and resulting cross-sections (in pb). The last two columns 
show the MC packages used for event generation and the number of events generated with a full 
and fast detector simulations, respectively. 



Process 


0 calculator 


Cuts 


<7(pb) 


Full simulation 
# of events 


Fast simulation 
# of events 




ResBos/ 


80 < m y7 < 150 GeV 


20.9 


PYTHIA/ALPGEN 


ALPGEN 




DIPHOX 


p Tr > 25 GeV, 1 7i| <2.5 




200000/1300000 


1670000 


88^77 


ResBos 


80 < m 7y < 150 GeV 


8.0 


PYTHIA 


PYTHIA 






p T7 > 25 GeV,|Tj| <2.5 




200000 


850000 


7i 


JETPHOX 


p Ty > 25 GeV 


180-10 3 


PYTHIA 


ALPGEN 








3000000 


36700000 


a 


NLOJET++ 


p T > 25 GeV 


477 TO 6 


PYTHIA 


ALPGEN 










10000000 


37000000 



In order to evaluate irreducible backgrounds for signal-significance computations and to include the 
effect of high-px jets, the ALPGEN MC generator is used [38, 39]. 1 This includes 2 — > N tree-level 
Matrix Elements, where N = 2 — 5. The minimum parton pj and maximum pseudorapidity are set to 
20 GeV and |r/ 1 < 5, respectively. A prescription for the merging of matrix elements and parton showers 
is used [41]. The pj threshold for the merging of the parton shower and the Matrix Elements is set to 
20 GeV and AR = 0.7. 

The total cross-section of the ALPGEN samples of 77+jets events is normalized to the cross-section 
given in Table 1 . The diphoton invariant mass and transverse momentum spectra are re-weighted to the 
corresponding spectra obtained with ResBos for the Born and Bremsstrahlung contributions. This is mo- 
tivated by the fact that ResBos provides a NNLL description of the resummation effects that significantly 
affects the pj spectrum of the diphoton system up to about 40 or 50 GeV. 

The box contribution is simulated with the PYTHIA package using the leading order one loop Matrix 
Elements with full and fast detector simulations. The cross-section of this sample is normalized to the 
cross-section given in Table 1 . The diphoton invariant mass and momentum spectra are re- weighted to 
the corresponding spectra obtained with ResBos. 

The total inclusive cross-section for the yj process is obtained using the package JETPHOX [42]. 
This package simulates direct and fragmentation single photon production. The distribution of the photon 
pj that is obtained is compared with that for direct production predicted using the PYTHIA package. It 
is observed that the differential cross-section obtained with JETPHOX is a factor of 2.1 larger than that 
obtained with PYTHIA, with a weak pj dependence. Table 1 shows the cross-section for inclusive yj 
production for pj y > 25 GeV. 

The inclusive dijet cross-section is computed with the help of the NLOJET++ package [43,44], which 
takes into account QCD NLO corrections. The dijet cross-section obtained with this program is found to 
be a factor of 1.3 larger than that obtained with PYTHIA. Table 1 shows the cross-section for inclusive 
dijet production for pj > 25 GeV. 

For purposes of signal significance computation the ALPGEN MC package is used to generate sam- 

lr This generation does not include the box contribution nor the electroweak yyjj process. The latter is generated with the 
MadGraph package [40] and it is used in the Higgs boson searches in association with hadronic jets only (see Sections 5.2 
and 5.3). 
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Table 2: Event generators and cross-sections for backgrounds used in the analyses of associated 
production. Photons are required to be in |tj| < 2.7 and to have pj > 20 GeV. For the W ± (—> 
e ± v)7process the electron is required to pass the same cuts as the photons. The diphoton invariant 
mass should be in the interval 90 < m rr < 150 GeV, when appropriate. For the bbyy process the 
pj of the b quark is required to be greater than 20 GeV. 



Process 


Generator 


Cross-section 


Number of events 


Z(^U)yy 


MadGraph 


2.4 fb 


17500 


Z(-> vv)yy 


MadGraph 


4.9 fb 


17500 




MadGraph 


3.3 fb 


28200 


W {^£v)yy 


MadGraph 


3.1 fb 


31500 


W ± (^e ± v)y 


PYTHIA 


5.9 pb 


218350 


ccyy 


MadGraph 


257 fb 


5000 


bbyy 


MadGraph 


24.41 fb 


5000 


ttyy 


MadGraph 


1.97 fb 


4900 



pies of y+jets and multi-jets with a fast detector simulation. ALPGEN includes 2 —>■ N tree-level Matrix 
Elements, where N = 2 — 5. The minimum partem pj and pseudorapidity range are set to 20 GeV and 
\r]\ < 6, respectively. The same matching conditions as in the yy+jets sample are used. The samples 
generated with ALPGEN are normalized to cross-sections given in Table 1. 

A small background contribution is expected from Drell-Yan e + e~ faking a photon pair. For this 
purpose 420k events were generated using the PYTHIA package with a full detector simulation. The 
generator cross-section after requiring one lepton with pj > 10 GeV and \ rj \ < 2.7 is 1.23 nb. 

Specific backgrounds contributing to the Higgs boson plus lepton and Higgs boson plus missing 
transverse energy channels are also produced. W /Z + diphoton backgrounds can produce the same 
topology as the signal. To evaluate the contributions from these backgrounds, the MadGraph [40] gen- 
erator, which includes the Z and W bosons produced in association with two photons with tree-level 
diagrams has been used. The cross-sections for these processes, including the leptonic branching ratios, 
are shown in Table 2, requiring that the photons have pj > 20 GeV and |tj | < 2.7, and that the invariant 
mass of the two photons be between 90 and 150 GeV. 

The ccyy, bbyy and ttyy cross-sections shown in Table 2 do not include the branching ratios of 
leptonic decays. For the bbyy process the transverse momentum of the b quark is required to be greater 
than 20 GeV. In the ttyy events, also produced by MadGraph, the photons only come from the top 
quarks. The contribution from events in which photons arise from the decay products of the tt system 
was roughtly evaluated. Events were generated with PYTHIA using the Matrix Elements for the tt 
process. The production of two additional high pj photons coming from the final state radiation of the 
top quark decay products was evaluated. It was found that the ttyy background needs to be multiplied by 
a factor 7.5 to account for this effect: this factor was applied for the analysis of the associated channel. 
It is probably conservative because the photons radiated by the quarks should not be isolated and would 
be removed by the analysis cuts. 

There is a contribution to the background from events with a W boson and a photon, where the W 
boson decays into an electron and a neutrino. In some events, the electron can be mis-reconstructed as a 
photon. The PYTHIA generator is used for this background and the cross-section, including the leptonic 
branching ratio, is given in Table 2 after requiring pj > 20 GeV and \r\ \ < 2.7 for both the photon and 
the electron. For all the processes in Table 2 a full detector simulation was used and QED radiative 
corrections to the Z/W decays were treated with PHOTOS [45]. 
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3 Photon selection and 77 reconstruction 

3.1 Photon reconstruction and calibration 

Photons in the calorimeter are reconstructed from clusters of different sizes: in the barrel region (|tj| < 
1.37 ) a At]xA0 = 3x7 cluster (in units of middle sampling calorimeter cells) is used for converted 
photons to recover as much as possible the energy that might be lost due to the opening of the two 
electrons in the magnetic field. A 3x5 cluster is used for unconverted photons. In the endcap region 
(1 .52 < 1 77 1 < 2.37 ) a 5x5 cluster has been used for both converted and unconverted photons. The chosen 
cluster sizes are a compromise between maximal energy containment and minimization of the noise. In 
the simulation, all the effects that contribute to the calorimeter resolution constant term, which by design 
should be kept below 0.7%, are taken into account by smearing the reconstructed cells energies. The 
energy of the photons in the electromagnetic (EM) calorimeter has been reconstructed using appropriate 
weights for the presampler and back compartments of the calorimeter to correct for the energy lost in 
the material in front of the calorimeter, for the longitudinal leakage and for the energy losses outside the 
cluster. Different weights for unconverted photons and electrons are used, as derived from the nominal 
detector geometry (see Ref. [46] for details). Clusters are then corrected for a series of effects. The 
variations of the energy with respect to the impact point inside each cell are corrected for. These effects 
are due to the fact that the shower is not fully contained inside the cluster (the effect is larger for small 
cluster sizes) and to the calorimeter material structure in 0 . The photon candidate position is defined as 
the barycenter of the associated cluster: the fact that the cells have a finite granularity introduces a bias in 
the measured position which is corrected for as a function of the particle impact point within the central 
cell (more details in Ref. [46]). 

3.2 Photon identification 

Powerful photon identification is required to reduce the background from jets faking photons (from jj 
events and yj events) below the irreducible background. The photon identification relies on the fine 
segmentation of the electromagnetic calorimeter, especially the first layer, allowing an event-by-event 
rejection of "isolated" n°s, which are the main source of fake photons from jets. The details of the 
shower-shape variables and the cuts can be found in Ref. [47]. The shower-shape variables include the 
leakage in the first compartment of the hadronic calorimeter, variables characterizing the lateral size of 
the shower in the second layer of the EM calorimeter, and variables related to the transverse size in 
the first layer of the EM calorimeter, together with a search for a second maximum in 77 in the energy 
deposited in the strips of the first layer. The average efficiency of the calorimeter cuts for photons 
from Higgs boson decays with pj > 25 GeV has been found to be 83% when pile-up corresponding to 
10 33 cm~ 2 s -1 instantaneous luminosity is added. A track isolation cut is also applied to reduce further 
the fake background: the sum of the pi of tracks in a AR = 0.3 cone around the cluster position is 
computed for tracks with pj > 1 GeV and a cut at 4 GeV is applied. For tracks with AR < 0.1 of the 
cluster position additional cuts are applied to remove conversion tracks from the sum. The efficiency of 
this cut for photons fulfilling all other identification cuts is 98% [47]. The cuts have been optimized 
using photons from Higgs boson decays as a signal sample and an inclusive jet sample for the fake rate 
study. The optimisation of the cuts has been done mostly in the Ej range 25 — 35 GeV and the same 
cuts are applied to converted and unconverted photons. Table 3 shows the rejection factors for inclusive 
jets (normalized to jets found from the simulated particles without detector effects using a cone of size 
AR = 0.4), as measured from the inclusive jet sample. The rejection is also given separately for quark 
and gluon jets. The difference in rejection comes from the different fragmentation. After all cuts, the 
dominant background comes from "single" 7T°s. The uncertainty on the rejection (uncertainty on the 
fragmentation, modeling of the detector response) is close to a factor of two [47]. 
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Table 3: Jet rejections expected for the inclusive jet sample for Ej > 25 GeV. The results are 
shown before and after track isolation cuts for all jets and separately for quark and gluon jets. The 
errors are statistical only. 





All 


quark-jet 


gluon-jet 


Rejection (before isolation) 


5070±120 


1770±50 


15000±700 


Rejection (after isolation) 


8160±250 


2760±100 


27500±2000 



3.3 Conversion reconstruction 

Considering Higgs boson decays with photons within |tj| < 2.5, about 57% of the selected events have 
at least one true conversion with a radius smaller than 80 cm. Converted photons may start showering 
before the beginning of the calorimeter thereby degrading the energy resolution. In addition, the energy 
deposition in the calorimeter from a converted photon is geometrically broader in <j> than that from an 
unconverted photon due to the magnetic field in the Inner Detector cavity. On the other hand when a 
photon from a Higgs boson decay converts, the measurement of the conversion radius can be used to 
improve the accuracy on the measurement of the photon direction (see Section 3.4). 

Conversions are reconstructed by a vertexing algorithm using the reconstructed particle tracks, as 
explained in more detail in Ref . [48] . An electromagnetic cluster with an associated track which is one 
of the two tracks of a conversion, is classified as a double track conversion. When a photon converts and 
only one reconstructed track is associated to the corresponding cluster this photon may be mis-identified 
as an electron: in order to increase the conversion reconstruction efficiency when an electromagnetic 
cluster has an associated track with no B-layer hit and the associated track does not belong to a recon- 
structed double track conversion, this object is classified as a single track conversion. Including single 
track conversions in the analysis increases by ps 6% the signal efficiency while the overall background 
is increased by approximately the same factor. The conversion reconstruction efficiency as a function of 
the conversion radius is shown in Fig. 1: with the reconstruction software version used for this analysis, 
the overall efficiency is ps 66.4 % for conversions with a radius below 40 cm. The different contributions 
from single-track and double-track conversion reconstruction are also shown. 




Conversion radius [mm] 

Figure 1: Efficiency of single-track and double-track conversion reconstruction as a function of the 
conversion radius. 

For a given electromagnetic cluster, more than one associated reconstructed conversion can be found: 
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multiple conversions are due to secondary conversions or fake conversions. Fake conversions arise from 
two tracks that do not come from a conversion or with only one of the two tracks coming from a con- 
version. In the case of multiple conversions, the conversion with the smallest radius is selected: the 
reconstructed conversion is then correctly associated to a converted photon from a Higgs boson decay in 
97.6% of simulated signal cases. 

The jet contribution to the total background can be evaluated from the analysis of converted photons 
(see Ref. [48] for more details and updated results). The presence of tracks associated to the conversion 
provides a measurement of the transverse momentum of the converted photon in the tracker and conse- 
quently an evaluation of the ratio pj/Ej, where Ej is the transverse energy of the calorimeter cluster. 
This variable should be distributed between 0 and 1. As shown in Fig. 2, converted photons have val- 
ues of Pt/Ej which populate the region around 1 with a large non Gaussian tail to lower values due to 
bremsstrahlung of the electrons. In contrast, the distribution expected for a converted photon from a 7T° 
with the same transverse energy peaks at much lower values since the cluster collects the full energy of 
both photons from 71° decay. It is particularly instructive to note that Fig. 2 also shows that the expected 
distributions for pj/Ej for jets passing all the photon identification cuts is very similar to that obtained 
from single 7T°'s with the same transverse energy: this indicates that the residual background from jets is 
dominated by single 7T°'s. The shapes in Fig. 2 for converted photons coming from a jet and converted 
photons from the direct process can be parametrized and used to discriminate between the two compo- 
nents in a data sample of conversion events, allowing an evaluation of the yj and jj percentage in the 
background. 



Figure 2: p^jEj ratio for conversions where both tracks were reconstructed in different event samples. 



3.4 Primary vertex reconstruction 

Among the reconstructed photons passing the identification cuts, the two with highest pj are assumed 
to come from the Higgs boson decay. The azimuthal angle 0 is determined by the cluster barycenter in 
the second layer of the EM calorimeter. The pseudorapidity r\ relies on the knowledge of the position z.h 
at which the Higgs boson is produced and decays. 

Exploiting the multi-layer structure of the EM calorimeter, an estimate of the direction of each photon 
can be achieved by fitting a straight line in the (R,z) plane through the cluster barycenters detected in the 
presampler and in the first and second samplings of the EM calorimeter. The intercept of these lines with 
the ATLAS beam axis provides two independent measurements of the hard scattering vertex, Zyi and zyi 
with their uncertainties. They are combined in a weighted average with the nominal interaction vertex 



0.24r 
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at zo = 0 which has a spread of ±56 mm to yield a calorimeter estimate, Zjf°, of the primary vertex. 
The performance expected for H — > 77 events is displayed in Table 4 and Fig. 3 (left plot). The accuracy 
(RMS) of the primary vertex position reconstruction is 13 mm, 17 mm, 41 mm, when both photons are in 
the barrel, one in the barrel and one in the endcap, and both in the endcap. In the simulation of the events 
used for this analysis a 4mm shift of the electromagnetic calorimeter along the z-axis was introduced 
(see Ref. [49] for more details) with respect to the nominal position. This effect has been taken into 
account at the reconstruction level, recalculating the pseudorapidity of the cells in each calorimeter layer 
with respect to the interaction point taking as a radial position reference the geometrical center (in the 
radial direction) of the cell. To compute the photon direction, the pointing algorithm makes use of the 
radial position of the barycenter of the shower instead of the geometrical center of the cell biasing the 
distribution of the reconstructed primary vertex position obtained from calorimeter pointing with respect 
to the true position as shown in Fig. 3, left plot (see Ref. [50] for more details). When a conversion is 
reconstructed, its coordinates are added as an extra point to the straight line fit, thus improving the Higgs 
boson vertex position accuracy; the distribution exhibits large tails with an RMS of 8 mm and a narrow 
Gaussian core with a width of 0.15 mm. 

By adding the reconstructed primary vertex to the linear fit, the best Higgs boson position accuracy 
is achieved, with a Gaussian width of 0.07 mm (see Table 4 and Fig. 3, plot on the right). In the case of 
pile-up, more than one primary vertex may be reconstructed by the inner detectors. The discrimination 
of the hard-scattering vertex from pile-up vertices is done using a likelihood that is a combination of the 
calorimeter information and the sum of the squares of the pr & of the tracks originating from the vertex 
(named Pj in the following). So 

££ = ££ja^££ cal0 (1) 

where 

^ P 2=Ph(p 2 t)/pmb(p 2 t ) (2) 

J?caio = e-^ xe~* °L / e -^=e~~ 2 <€l" (3) 

The first component of the likelihood, J^pti, is, for a certain value of Pj, the probability for the 
Higgs boson vertex to have this Pj divided by the same probability for a minimum bias vertex. The 
second component is the product of the probabilities that a measured vertex is that of the Higgs boson 
vertex divided by the probability that the measured vertex position is that of a minimum bias vertex. The 
vertex misidentification due to pile-up is displayed by the columns labeled 'tail' in Table 4. When the 
vertex associated to the Higgs boson production is correctly identified among the others, the impact of 
the direction measurement on the invariant mass resolution becomes negligible with respect to that of the 
energy measurement. 

3.5 Invariant mass and signal acceptances 

The invariant mass of diphoton pairs has been reconstructed for different signal samples using the tools 
described in the previous Sections: the 2gl7i trigger selection (see Section 4 for more details) is applied 
and two identified photons are required to be reconstructed with pj > 40 GeV and pj > 25 GeV within 
the fiducial region of 0 < \r\\ < 1.37 and 1.52 < |tj| < 2.37. Fig. 4 presents the invariant mass distribu- 
tions of 77 pairs from Higgs boson decay with m# = 120 GeV. The shaded histograms correspond to 
events with at least one true converted photon with a conversion radius smaller than 80 cm. The con- 
verted photons are currently calibrated at the electron scale and the unconverted ones at the photon scale: 
in the case of the geometry with additional material in front of the calorimeter, the observed difference 
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Table 4: Performance of the Higgs boson longitudinal vertex position reconstruction, using 
calorimeter pointing only (z#'°) and the reconstructed primary vertex (z# lo+vtx ): averages (()) 
and RMS are displayed in mm. 'tail' shows the percentage of events which are outside the his- 
togramme window (±100 mm and ±1 mm for z#'° and z ^ lo+vtx respectively). 



Luminosity 


(} RMS 


(mm) 
tail(%) 


calo+vtx tru 
Z H ~ Z H 
() RMS 


(mm) 
tail(%) 


No pileup 


2.3 17.3 


0.09 


-0.008 0.10 


1.2 


1Q 33 


3.3 17.4 


0.09 


-0.010 0.10 


13.0 


2-10 33 


2.4 17.1 


0.18 


-0.007 0.10 


18.3 




900-80 -to~7o'-2o''o'2o'46'~60 80 100 
z rec -z true [mm] 



^0.14r 
^0.12 L 



o.06r 

0.04 r 

0.02r 



- No pile-up 
-10 3 Ws 1 
2*10 33 cm 2 s 1 



-1 -0.8-0.6-0.4-0.2 0 0.2 0.4 0.6 0.8 1 
z rec -z true [mm] 



Figure 3: Difference between the reconstructed primary vertex position and the true position obtained 
from calorimetric pointing and conversion track information (when available) without/with the recon- 
structed primary vertex (left/right plot), for events without pile-up (black plots) and with pile-up eval- 
uated for 10 33 and 2 • 10 33 cm _2 s _1 (red, green plots). The narrow peak on top of the broader one is 
due to events in which at least one photon has a reconstructed conversion vertex. In the right plots, the 
non-Gaussian shape is due to the overlap of barrel-barrel, barrel-endcap and endcap-endcap topologies, 
which have different resolutions. 

between the peak of the distribution of events with no conversions and the one with at least one recon- 
structed conversion is around 1 %. In the future a specific calibration will be implemented for converted 
photons using the same prescription as that used for electrons and unconverted photons. 

To be consistent with previous studies ( [4], [3]) the mass resolution obtained for the photon pairs 
has been determined from an asymmetric Gaussian fit ([-2 a , + 3 a]) of the invariant mass peak. The 
asymmetric window is used to reduce the impact of the residual low energy tails in the reported width. 
Table 5 shows the results for different Higgs boson masses with and without pileup at 10 33 cm -2 s _1 in 
the case of simulations with additional dead material in front of the calorimeter (see Ref. [49] for more 
details). Since the energy calibration coefficients have been calculated using the nominal geometry, the 
presence of additional dead material in front of the calorimeter affects the determination of the value 
of the invariant mass, moving the peak mean down by a few per mille. In addition it increases the 
amount of low energy tails: the percentage of events with a reconstructed invariant mass more than 3a 
from the mean increases from 3.5 % for the nominal geometry to 6.8 % for the distorted geometry. 
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M,.[Cc>V] " (/,. |C.eV| 

Figure 4: Invariant mass distributions for photons pairs from Higgs boson decays with rriH = 120 GeV 
after trigger and identification cuts; on the left the invariant mass distribution obtained using the nominal 
geometry simulation is reported while on the right plot the same invariant mass distribution is reported 
when additional dead material is included in the simulation. The shaded histograms correspond to events 
with at least one converted photon. 

The acceptances after trigger selection (see Section 4 for more details) and analysis cuts together with 
the mass resolutions for the different values of m# are reported in Table 5. The mass window for the 
evaluation of the signal significance (denoted mass bin) is defined as ± 1 .4 o around the central value. 
As can be seen in Table 5 the fraction of signal events in the mass bin is 26 % for m# = 120 GeV slightly 
increasing with the Higgs boson mass. The relative mass resolution a m /m is close to 1.2 % degrading 
by a few percent relative when the pileup is added. 

Table 5: Efficiencies after trigger, identification and inclusive analysis cuts (see Section 5.1). 
Reconstructed invariant mass peak positions and resolutions for different Higgs boson masses 
with and without 10 33 cm _2 s _1 pileup are also reported. Distorted geometry has been used in all 
cases. 





m H = 120 GeV 


m H = 130 GeV 


m H = 140 GeV 




No pileup 


Pileup 


No pileup 


pileup 


No pileup 


Pileup 


LI 


0.66 


0.64 


0.69 


0.65 


0.68 


0.66 


L2 


0.54 


0.52 


0.55 


0.52 


0.56 


0.53 


EF 


0.50 


0.47 


0.52 


0.49 


0.52 


0.49 


Analysis cuts 


0.36 


0.32 


0.38 


0.35 


0.39 


0.36 


Mass bin 


0.26 


0.24 


0.28 


0.26 


0.29 


0.27 


Mass Fitted (m) 


119.46 


119.47 


129.47 


129.41 


139.41 


139.41 


Cm, GeV 


1.46 


1.52 


1.54 


1.62 


1.66 


1.69 



3.6 Background analysis on fully simulated samples 

Large samples of yy and yj events as well as Drell-Yan Z — »• e + e~ were generated with PYTHIA and 
fully simulated. The differential cross-section at LO as a function of the diphoton invariant mass for yj 
and e + e~ events after the analysis procedure described in previous sections is reported in Fig. 5. For the 
jj background contribution evaluation not enough statistic was available in full simulation and it has been 
estimated only from fast simulated events using photon efficiency and the jet rejection parametrizations 
obtained from full simulation. 
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Figure 5: Diphoton candidates invariant mass distribution for yj (left) and Drell-Yan e + e (right) events 
after photon identification and analysis cuts with and without trigger selection. 

The goodness of the photon efficiency and jet rejection parametrizations used in Section 5 on fast 
simulated events to estimate the background contributions has been tested on the yj sample in full sim- 
ulation: the distribution of the cross-section as a function of the diphoton candidates invariant mass has 
found to be in agreement within 15 % in the 1 10-150 GeV mass range with the distribution from the fast 
simulated sample after photon efficiency and jet rejection parametrization used in Section 5. 

3.7 Jet tagging in simulation 

Section 5 presents two analyses which rely on tagging hadronic jets. Jet tagging is particularly relevant 
to Higgs boson searches in association with two high pj jets since such an analysis is intended to isolate 
Higgs boson production via VBF. 

In the VBF production process, it is expected that the two quark-initiated jets are observed in opposite 
hemispheres and with a large separation in pseudorapidity. Tagging these jets further suppresses the 
background processes. Furthermore, since there is no colour exchange between the two quarks, the 
Higgs boson should be observed in a large rapidity gap, where additional activity from QCD jets is 
small. A central jet veto (CJV) which suppresses the background processes is therefore used in the event 
selection, 

The relative efficiencies for jet tagging are summarized in Table 6. The following four conditions are 
applied: 

• two simulated quarks with pi > 40 GeV and pj > 20 GeV (|t]| < 5) and in opposite hemispheres, 

• two reconstructed jets with p T > 40 GeV and p T > 20 GeV (|t]| < 5), 

• the two reconstructed jets are in opposite hemispheres, 

• differences in tj between the simulated quarks and the reconstructed jets are smaller than 0.4 (this 
is referred to as matching). 

The present tagging method selects correct tagging jets in 75.7% of the events. Under a pileup 
condition of 10 33 cm _2 s _1 , a degradation of less than 5% is observed. The difference between HERWIG 
and PYTHIA is about 6%. 



4 Performance of the photon trigger on H — > 77 events. 

There are two trigger selections which are foreseen for the H — > yy analyses: 2gl7i and g55. The 2gl7i 
trigger selects events with at least two isolated photons and is efficient for photon pj above 20 GeV. The 
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Table 6: Relative efficiencies of the tagging method for VBF H — > 77 with mn = 120 GeV (see 
text). 



Selection 


HERWIG (no pileup) 


HERWIG (Pileup 10 33 ) 


PYTHIA (no pileup) 


step 1 (quark level) 


0.618 ±0.002 


0.613 ±0.003 


0.632 ±0.002 


step 2 (rec level) 


0.914 ±0.002 


0.911 ±0.002 


0.943 ±0.001 


step 3 (rec level) 


0.801 ±0.002 


0.774 ±0.003 


0.771 ±0.002 


step 4 (matching) 


0.757 ±0.003 


0.726 ±0.003 


0.713 ±0.003 



g55 trigger selects events with at least one photon and is efficient for photon p T above 60 GeV. No 
isolation is required in the g55 trigger. In the analyses presented in Section 5 only the 2gl7i menu has 
been considered to avoid possible biases in the invariant mass reconstruction: a more detailed description 
of the photon selection at the High Level Trigger is available [51]. The photon reconstruction at the 
trigger level relies only on calorimetric information. To calculate Ej and shower shapes variables the 
High Level Trigger uses algorithms similar to those used for offline analysis using information from the 
first and second sampling of the calorimeter. For photon trigger menus the isolation is only applied at LI 
trigger. 

The trigger efficiency has been evaluated for Higgs boson events having two offline reconstructed 
photons passing the following kinematic cuts: two photons with p Ty > 25 GeV and p Ty > 40 GeV re- 
spectively both in the region 0< |tj| <1.37 and 1.52< |tj| <2.37, passing the identification cuts described 
in Section 3.2. 

Table 7: Efficiency for the 2gl7i menu item to trigger on H —> 77 events with m# = 120 GeV, 
normalized with respect to the offline selections. 



Trigger Level 


2gl7i Trigger efficiency 


LI 


96.3±0.3 


L2 Calo 


95.0±0.4 


EF Calo 


93.6±0.4 



Table 7 shows in the first column the efficiency of the 2gl7i menu item to select H —>■ yy events after 
each trigger level. 

The 2gl7i trigger menu item is expected to be ~94% efficient for triggering on Higgs boson decays 
with two reconstructed photons. The efficiency loss is mainly due to the calorimeter isolation at LI 
which is not applied in the offline photon selection. 

Trigger rates for 2gl7i are studied in the context of the ATLAS trigger development. This trigger is 
found to be usable (unprescaled) up to luminosities of 10 33 cm~ 2 s _1 [51]. 

5 Event selection 

In this Section the analysis details of the various event selections are described. The event selection for 
the inclusive analysis is given in Section 5.1 and for the Higgs boson search in association with jets are 
given in Sections 5.2 and 5.3. Finally, the event selections for diphoton searches in association with 
missing transverse momentum and charged leptons are given in Sections 5.4 and 5.5. Here the various 
event selections are presented as disjoint analyses. In Section 7 the statistical power of these channels is 
combined and their impact on the Higgs boson discovery potential is evaluated. 
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5.1 Inclusive analysis 

The inclusive analysis refers to the search for a resonance in events with two photons that pass certain 
quality criteria. The analysis reported here follows closely the event selection of past studies [3,4]. The 
detector performance and optimization studies succinctly presented in Sections 3 and 4 are geared toward 
maximizing the discovery potential of the inclusive analysis. 
The following cuts are applied: 

la At least two photon candidates (see Section 3.2) in the central detector region defined as | tj | < 2.37 
excluding the transition region between barrel and endcap calorimeters, 1 .37 < | T] | < 1 .52 (crack in 
the following). At this level it is required that the event passes the trigger selection (see Section 4). 

Ib Transverse momentum cuts of 40, 25 GeV on the leading and sub-leading photon candidates, re- 
spectively. 

The fiducial cuts in la are motivated by the quality of the off-line photon identification and the 
fake photon rate (see Section 3.2). The values of the cuts on the transverse momentum of the photon 
candidates (cut Ib) are not varied and are obtained from previous optimization studies [3]. 




110 115 120 125 130 135 140 145 150 

M yy [GeV] 

Figure 6: Diphoton invariant mass spectrum after the application of cuts of the inclusive analysis. Results 
are presented in terms of the cross-sections in fb. The contribution from various signal and background 
processes are presented in stacked histograms (see text). 

Figure 6 shows the expected diphoton mass spectrum after the application of cuts la and Ib. The 
hashed histogram in the bottom corresponds to the contributions from events with one and two fake 
photons. The second hashed histogram corresponds to the irreducible backgrounds (see Section 2.2). The 
background contributions are obtained with MC samples with a fast detector simulation normalized to 
the cross-sections specified in Section 2.2. The fast detector simulation is corrected in order to reproduce 
the aspects of the detector performance critical to the analysis, which are obtained with a full detector 
simulation (see Sections 3 and 4). The expected contribution from a Higgs boson signal for mn = 
120 GeV, obtained with a full detector simulation, is also shown in Fig. 6. 
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Table 8: Expected cross-sections (in fb) for different signal (m# = 120 GeV) and background 
processes within a mass window of m rr ± 1 .4 of the mass resolution in the no pileup case reported 
in Table 5. Cuts la and lb were applied. 



Signal Process 


Cross-section (fb) 


Background Process 


Cross-section (fb) 


88 


21 


77 


562 


VBFH 


2.7 


Reducible yj 


318 


ttH 


0.35 


Reducible jj 


49 


VH 


1.3 


Z -► e+e- 


18 



Table 9: Summary of the relative systematic uncertainties on the yy and yj processes. 



Potential sources 


77 


77 


Scale dependence 


14% 


20% 


Fragmentation 


5% 


1% 


PDF 


6% 


7% 


Total 


16% 


21% 



Table 8 shows the expected cross-sections (in fb) for background and signal in a mass window of 
± 1 .4 of the mass resolution in the no pileup case reported in Table 5 around 120 GeV after the application 
of cuts la and lb. Table 8 indicates that the relative contribution from events with at least one fake photon 
constitutes 39% of the total background, about a factor of two larger than evaluated in Ref. [4]. This 
increase is mostly attributed to three factors. Firstly, a different method for the parametrization of the 
fake photon background is used here. Secondly, the budget of inactive material in front of the first layer 
of the calorimeter has increased with respect to the one used in previous studies. Finally, the contribution 
from fragmentation in the yj process (see Section 2.2) is allowed for here for the first time. 

5.1.1 Theoretical uncertainties on background prediction 

In this Section the theoretical uncertainties of the predictions for prompt single and double photon pro- 
duction used in the inclusive analysis are evaluated. 

The irreducible background rate is evaluated using ResBos [34-36], which implements a full matrix 
element calculation at NLO and the resummed formalism. It thus yields an accurate description of the 
low pTyy region, as corroborated by recent Tevatron results [52] . In the high diphoton transverse momen- 
tum, ptjj, domain the precision of its prediction needs to be assessed. To account for the incompleteness 
of the fixed order calculation two approaches are used: both the renormalisation and factorisation scales 
{jl R f ) are varied, from 0.5 x m rr to 2 x m yy , first assuming }X R = }X F then independently. 

ResBos does not provide the most accurate description of the fragmentation of partons into pho- 
tons. In particular, it implements the single-photon fragmentation only at LO. An improved estimation 
of this contribution is given by the DIPHOX program [33] which implements single and double-photon 
fragmentation at NLO. The predictions of the fragmentation and direct contributions of the two afore- 
mentioned calculations agree to within 6%. 

The systematic uncertainty related to the partem distribution functions is studied in Ref. [53] and esti- 
mated to be of the order of 6%. Systematic uncertainties related to the irreducible background evaluation 
are summarised in Table 9 they amount to an overall relative uncertainty of 16%. 
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A similar study has been performed with JETPHOX in order to evaluate theoretical uncertainties of 
the yj process. The second column of Table 9 reports the results from this study. 2 

5.2 Higgs boson plus one jet analysis 

In this Section and in Section 5.3 two event selections are presented that take into account the presence 
and properties of high pj hadronic jets in association with the photon pair. This analysis follows earlier 
studies in ATLAS using a fast detector simulation [8] . 

Parton level studies have indicated that searches for the Higgs boson in association with at least one 
high pj jet may have a strong discovery potential [5]. This analysis exploits mainly the fact that the 
gluon radiation pattern of the two leading Higgs boson production mechanisms differs strongly from the 
one expected for the reducible and irreducible backgrounds. The leading jet in the gg —>■ Hj and VBF 
mechanisms tends to be harder and be more separated from the diphoton system than in background 
events. The invariant mass of the two photons and the jet system discriminates well the signal from the 
background. 

The following event selection is chosen after the application of cut la: 

Ha Transverse momentum cuts of 45 and 25 GeV on the leading and sub-leading photon candidates, 
respectively. 

lib Presence of at least one hadronic jet with p T > 20 GeV in |t] | < 5. 

lie A cut on the invariant mass of the diphoton and the leading jet, m rr; - > 350 GeV. 

The lower bound on the jet pj is dictated by the ability to calibrate hadronic jets in ATLAS [54]. 
The large hadronic activity due to the underlying events and multiple proton-proton interactions at the 
LHC, in conjunction to the significant amount of inactive material before the calorimeter, may make it 
difficult to lower the pj threshold. The variable m rr y is the main discriminator used here to improve the 
signal-to-background ratio. Other discriminating variables could be used to further enhance the analysis 
sensitivity [5]. 

Figure 7 displays the resulting diphoton invariant mass spectrum after the application of cuts la and 
Ila-IIc. 

Table 10: Expected cross-sections (in fb) for the Higgs boson plus one jet Analysis. Results are 
given after the application of cuts la and Ila-IIc (see Section 5.2). In the last row the expected 
cross-sections within a mass window of m rr of ±2 GeV around 120 GeV are given. 



Cut 


gg 


VBFH 


VH 


ttH 


Total 




C7(fb) 


a(fb) 


a(fb) 


a(fb) 


a(fb) 


Ia-IIa 


28 


3.6 


1.7 


0.49 


34 


lib 


13 


3.5 


1.5 


0.49 


19 


lie 


3.2 


1.9 


0.22 


0.17 


5.5 


Mass Window 


2.3 


1.4 


0.17 


0.13 


4.0 



Tables 10 and 11 display the expected cross-sections for signal and background events in the range 
110 < rriyy < 150 GeV after the application of cuts la and Ila-IIc. Table 10 illustrates that the leading 

2 It is important to note that the uncertainty of the contribution of the reducible background is dominated by the uncertainty 
in the determination of the fake photon rejection. This uncertainty may be significantly larger than the uncertainties quoted in 
Table 9. 
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Figure 7: Diphoton invariant mass spectrum in fb obtained with the Higgs boson plus one jet analysis 
(see Section 5.2). The same procedure as in Fig. 6 in Section 5.1 is used to obtain the histograms in 
Fig. 7. The same codes for signal and backgrounds are used as in Fig. 6. 



Table 1 1 : Expected cross-sections (in fb) of background for the Higgs boson plus one jet Analysis. 
Results are given after the application of cuts la and Ila-IIc (see Section 5.2). In the last row the 
expected cross-sections within a mass window of m n of ±2 GeV around 120 GeV are given. 



Cut 


77 


Reducible yj 


Reducible jj 


EW yyjj 


Total 




a(fb) 


a(fb) 


a(fb) 


a (fb) 


a(fb) 


Ia-IIa 


9698 


8498 


937 


99 


19233 


lib 


4786 


4438 


444 


99 


9768 


lie 


501 


824 


89 


71 


1485 


Mass Window 


28 


17 


2.0 


1.5 


49 



Higgs boson production mechanism after the application of cuts remains the gg — > Hj process, closely 
followed by the VBF mechanism. It is important to note that the gg —> Hj process has been evaluated at 
LO ignoring the large QCD NLO corrections. 

5.3 Higgs boson plus two jets analysis 

This Section considers an event selection comprising two photons in association with two high pj jets, 
or tagging jets. In this analysis the tagging jets are defined as the two leading jets in the event. The VBF 
Higgs boson process at LO produces two high pj and relatively forward jets in opposite hemispheres 
(backward-forward). The pseudorapidity gap and invariant mass of these jets tend to be significantly 
larger than those expected for background processes. The NLO description of the VBF process does not 
significantly distort this picture. 3 

3 About 10% of the VBF events display the feature that a radiated gluon coming from one of the quark lines happens to 
become a tagging jet. In this class of events the pseudorapidity gap and the invariant mass of the tagging jets appears similar to 
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A number of variables are chosen that are sensitive to the different kinematics displayed by the signal 
and background processes [9]. The following is the optimized event selection after the application of cut 
la: 

Ilia Transverse momentum cuts of 50 and 25 GeV on the leading and sub-leading photon candidates, 
respectively. 

Illb Presence of at least two hadronic jets in |tj| < 5 with pj > 40,20 GeV for the leading and sub- 
leading jet, respectively. The tagging jets must be in opposite hemispheres, r\j\ • r\j2 < 0, where rj ; i 
and 7]j2 correspond to the pseudorapidity of the leading and sub-leading jets, respectively. Finally, 
it is required that the pseudorapidity gap between the tagging jets be large, Atj ; - ; > 3.6. 

IIIc Photons are required to have pseudorapidity between those of the tagging jets. 

Hid Invariant mass of the tagging jets, mjj > 500 GeV. 

Hie Veto on events with a third jet with pj > 20 GeV and \rj\ < 3.2 




110 115 120 125 130 135 140 145 150 

M n [GeV] 



Figure 8: Diphoton invariant mass spectrum obtained with the Higgs boson plus two jet analysis (see 
Section 5.3). 

Figure 8 displays the resulting diphoton invariant mass spectrum after the application of cuts la and 
Illa-IIIe. 

Tables 12 and 13 display the expected cross-sections for a Higgs boson signal with m# = 120 GeV 
and background events in the mass range ±2 GeV around 120 GeV after the application of cuts la and 
Illa-IIIe. Table 12 shows that the dominant Higgs boson production mechanism surviving the events 
selection is the VBF mechanism. Unfortunately, the QCD NLO corrections to the main backgrounds 
included in Table 13 are not known and therefore these results suffer from large theoretical uncertainties. 

The event selections presented in this and the previous Sections have a certain degree of overlap. 
This is particularly relevant for the VBF Higgs boson production mechanism. In Section 7 the signal 
significance of a combined analysis is presented that takes into account the event overlap. 

that displayed by a typical QCD background process. This effect is well reproduced by the HERWIG generator. 
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Table 12: Expected cross-sections (in fb) for the Higgs boson plus two jet analysis. Results are 
given after the application of cuts la and Illa-IIIe (see Section 5.3). In the last row the expected 
cross-sections within a mass window of m yy of ±2 GeV around 120 GeV are given. 



Cut 


gg^H 


VBFH 


VH 


ttH 


Total 




a(fb) 


a(fb) 


a(fb) 


(fb) 


a(fb) 


Ia-IIIa 


26.40 


3.53 


1.68 


0.50 


32.11 


Illb 


0.63 


1.44 


0.02 


0.01 


2.10 


IIIc 


0.55 


1.39 


0.01 


0.01 


1.96 


Hid 


0.32 


1.16 


0.01 


0.00 


1.49 


Hie 


0.25 


1.03 


0.00 


0.00 


1.28 


Mass Window 


0.18 


0.79 


0.00 


0.00 


0.97 



Table 13: Expected cross-sections (in fb) of background for the Higgs boson plus two jet analysis 
for niH = 120 GeV. Results are given after the application of cuts la and Illa-IIIe (see Sec- 
tion 5.3). In the last row the expected cross-sections within a mass window of m 77 of ±2 GeV 
around 120 GeV are given. 



Cut 


77 


Reducible yj 


Reducible jj 


EW yyjj 


Total 




a(fb) 


a(fb) 


a(fb) 


a(fb) 


a(fb) 


Ia-IIIa 


7417 


6355 


710 


92 


14574 


Illb 


94 


97 


13 


45 


249 


IIIc 


70 


69 


9.9 


41 


189 


Hid 


33 


34 


5.6 


38 


111 


Hie 


17 


17 


2.5 


26 


63 


Mass Window 


0.86 


0.42 


0.06 


0.59 


1.95 



5.4 Higgs boson plus missing transverse energy and isolated leptons 

The main signal production mechanism contributing to a category of events with two photons, missing 
transverse energy and isolated leptons will be from WH — »■ tvyy and ttH. The basic selection requires the 
presence of two energetic photons, missing transverse momentum, and one high energetic lepton. The 
main backgrounds for this channel are ttyy, Wyy where the W decays to £v and Wy^evy where the 
other photon is radiated by the electron or is a fake photon from an additional jet. 4 This latter background 
is multiplied by a factor 2 to include the Wy — »■ jJ,vy contribution. Another important background turns 
out to be 77 and yj, when fake electrons or muons are reconstructed. ALPGEN was used to produce 
the diphoton background including a full simulation of the detector. The cross section was multiplied by 
a factor 1.4 to account approximately for the box diagram (40% of the contribution, see Table 1). For 
reducible backgrounds (68% contribution, see Table 8), the cross-section of the diphoton background is 
scaled by a factor 1.68. It is difficult to evaluate the accuracy of this approximation but this diphoton 
process is not the leading background. A multivariate analysis based on various kinematical variables 
would require very large Monte Carlo samples for the backgrounds. Here, a simple set of basic event 
selection criteria is chosen: 

IVa The first selection criterion sets minima on the transverse momenta of the two reconstructed pho- 

4 Note that there is some double counting when both Wy and Wyy are included. 
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tons. The cut requires the pj of the most energetic photon to be higher than 60 GeV and the pj 
of the second more energetic photon to be above 30 GeV. Events where one of the two photons 
is reconstructed in the crack region are rejected. The cross-sections in the first line of the table 14 
are given in the mass range between 110 and 150 GeV. 

IVb Requiring a cut that the transverse momentum of the most energetic isolated lepton (electron or 
muon) be higher than 30 GeV suppresses efficiently the diphoton and Wy backgrounds. Only the 
electrons passing a tight electron identification cut are selected. 

IVc The diphoton background can further be strongly suppressed by requiring missing transverse en- 
ergy higher than 30 GeV. 

IVd When an electron is reconstructed, events are removed in with either of the invariant masses of 
the electron and each of the photons is close to the Z mass (between 80 and 100 GeV): this cut 
removes the events coming from the (Z/y*) + y—> e + e~y when an electron is reconstructed as a 
photon. 



Table 14: Expected cross-sections (in fb) for the Higgs boson with missing transverse energy and 
lepton analysis for m H = 120 GeV. Results are given after the application of cuts IVa-IVc (see 
Section 5.4) in the mass range 110 < m rr < 150 GeV. In the last row the expected cross-sections 
within a mass window of m rr of ±2 GeV around 120 GeV are given. 



Cut 


ivyy 

<T(fb) 


ttH^ 
xyy 

<7(fb) 


W ± yy-^ 
ivyy 

<T(fb) 


ttyy 
<r(fb) 


bbyy 
<r(fb) 


W ± y^ 
evy 
a(fb) 


(Z/y*) + y^ 
e+e-y 
(fb) 


77 
a(fb) 


IVa 


0.328 


0.467 


0.509 


2.53 


3.78 


18.0 


10.28 


2558 


IVb 


0.122 


0.103 


0.149 


0.582 


0.0098 


0.406 


2.60 


0.644 


IVc 


0.091 


0.086 


0.097 


0.474 


0 


0.263 


0.076 


0.091 


IVd 


0.084 


0.077 


0.090 


0.419 


0 


0.143 


0 


0.091 


Mass Win. 


0.064 


0.062 


0.0092 


0.042 


0 


0.014 


0 


0.010 



Table 14 summarizes the expected cross-sections for the principal signal and backgrounds after the vari- 
ous cuts used in the analysis. The uncertainty in the background level, due to Monte Carlo statistics only, 
is estimated to be 10%. However, the background contribution in this analysis can have larger sources of 
systematic uncertainties. The resulting Higgs boson mass peak and background are shown in Fig. 9 after 
all cuts have been applied. The reconstructed mass resolution is 1 .47 GeV. 

5.5 Higgs boson plus missing transverse energy 

This analysis is intended to select principally the ZH —> wyy events. In order to have an analysis 
independent from that of Section 5.4, only events which do not have a reconstructed lepton with pj > 
30 GeV have been considered here. It should be noted that cases where the lepton is of low pj or did not 
pass the tight selection criterion are included. Thus the WH signal contributes significantly. The main 
features are the presence of two energetic photons from the Higgs boson and a large transverse missing 
energy. The dominant backgrounds for this channel are the ttyy, Zyy and Wy — > evy channels where, in 
the latter case, the electron can be mis-reconstructed as a photon or the second photon is either radiated 
by the electron or is a fake photon. Here, the diphoton background is multiplied by the same factors as 
in Section 5.4 and the ttyy by the factor 7.5 as explained in Section 2.2. 
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> 0.08 
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Figure 9: Expected distribution of the invariant mass of the two photons for the signals and main back- 
grounds after applying the analysis cuts for events having one lepton reconstructed in the final state. 
Due to a lack of MC statistics for the diphoton and the Wy backgrounds, their expected distribution is 
approximated by showing an average of the number of events passing the analysis cuts in the m n mass 
range shown. 



Va As in Section 5.4, a cut on the transverse momentum of the most energetic photon above 60 GeV 
and a cut on the second more energetic photon pj of 30 GeV are applied to suppress the diphoton 
background. Events where one of the two photons is reconstructed in the crack region are then 
removed. 

Vb The selection is then based mostly on the requirement of high missing transverse momentum. A 
cut of Ej 1Sii > 80 GeV suppresses almost completely the yy background while reducing the Wy 
background by a factor 20 and the ZH — > wyy signal by a factor 2. 

Vc In order to further suppress the Wy background, where the electron is often reconstructed as a 
converted photon, events where either of the photons appears to have converted are rejected. 

Vd At this point, because of potentially significant background from QCD events, difficult to evaluate, 
a cut requiring that the scalar sum of the pj of the jets in the event be larger than 150 GeV is 
imposed. It suppresses the contribution from the ttyy and bbyy backgrounds, as well as of the ttH 
signal. 

Table 15 summarizes the expected cross-sections after the different cuts applied for this analysis for 
signal and backgrounds. The expected mass distributions of diphotons from the associated W/Z plus 
Higgs boson and from the backgrounds are shown in Fig. 10, after the application of all cuts. To account 
for the Wy — ► jivy, the Wy — ► evy background has been multiplied by two in the figure although some 
double counting is introduced. The uncertainty in the background level, due to Monte Carlo statistics 
only, is estimated to be 15%. The reconstructed mass resolution is 1.31 GeV. This result is expected to 
be sensitive to uncertainties in the simulation and reconstruction of £"™ ss tails. 
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Table 15: Expected cross-sections (in fb) for the Higgs boson with missing transverse energy 
analysis. Results are given after the application of cuts Va-Vd (see Section 5.5) in the mass range 
110 < niyy < 150 GeV. In the last row the expected cross-sections within a mass window of m rr 
of ± 1 . 8 GeV around 1 20 GeV are given. 



Cut 


ZH -> 
vvyy 
a(fb) 


WH 
(vyy 
a(fb) 


ttH -» 
xyy 

C7(fb) 


Zyy^ 
vvyy 
a(fb) 


W ± yy^ 
Cvyy 

CT(fb) 


ttyy 
<r(fb) 


bbyy 
<r(fb) 


W ± y^ 
evy 

(T(fb) 


77 
a(fb) 


Va 


0.115 


0.207 


0.364 


0.325 


0.360 


1.95 


3.77 


17.55 


2558 


Vb 


0.058 


0.062 


0.080 


0.126 


0.071 


0.461 


0.010 


0.789 


0.211 


Vc 


0.046 


0.049 


0.064 


0.096 


0.056 


0.377 


0.010 


0.191 


0.141 


Vd 


0.042 


0.042 


0.006 


0.093 


0.050 


0.021 


0.005 


0.120 


0.073 


Mass Win. 


0.034 


0.033 


0.0056 


0.009 


0.006 


0.002 


0.0005 


0.012 


0.007 





Signal 

ATLAS |§| Irreducible bkg \ 
j | Reducible bkg 





PlO 115 120 125 130 135 140 145 150 
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Figure 10: Expected distribution of the invariant mass of the two photons for the signals and main 
backgrounds after applying all the diphoton and £™ ss analysis cuts. Due to a lack of statistics for the 
diphoton and the Wy backgrounds, their expected distribution is approximated by showing an average of 
the number of events passing the analysis cuts in the m n mass range shown. 



6 Maximum-likelihood fit 

An unbinned extended multivariate maximum-likelihood fit to extract the H — > yy signal and background 
event yields is performed. With respect to the cut analysis presented in Section 5, the fit takes advantage 
of further discrimination information from the kinematic and topological properties of H — > yy decays. 
In addition to the diphoton invariant mass, m rr , the transverse momentum of the Higgs boson, Pt,h, 
and the magnitude of the photon decay angle in the Higgs boson rest frame with respect to the Higgs 
boson lab flight direction, |cost?*|, are included. To reduce the model dependence, the most relevant 
parameters describing the probability density functions (PDF) for the dominant background are freely 
varied and determined simultaneously with the event yields by the fit. Only the parametric shapes of the 
PDFs are obtained from Monte Carlo simulation. Sufficient background statistics in the sidebands must 
be retained to ensure that the floating shape parameters do not increase the statistical errors on the signal 
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yield. The m Yr resolution depends on the diphotons' pseudorapidities, with core Gaussian widths varying 
between 1.2 and 3 GeV. Disjoint pseudorapidity regions (denoted categories), are defined such that the 
m Y y resolution is similar for events belonging to the same category, and maximally different for events 
of different categories. The m 7r resolution also depends on whether or not photons have converted at an 
early stage in the material in front of the calorimeter (see Section 3.3). H —>■ yy events can have zero, 
one or two reconstructed conversions, which are split into corresponding categories. Following Section 5 
disjoint categories are also introduced for events accompanied by zero, one and two or more jets, where 
the last category contains predominantly VBF events. The use of categories separates sub-populations 
of events with different properties and hence improves the accuracy of the likelihood model. 

The signal and background samples used to build the likelihood model are selected with the criteria 
described in Section 5. The full detector simulation is used for signal processes, while the background 
processes are generated with the fast simulation. The trigger and photon identification efficiencies, and 
the misidentification in background events are parametrised using the full detector simulation. The prob- 
ability of a photon to convert into an e + e~ pair with one or two tracks being reconstructed, and the energy 
resolution of the converted photon are parametrised using the full detector simulation. 

6.1 Likelihood model 

The probability density function, Pf, for an event i in category c is the weighted sum of the probability 
densities of all components, namely Pf = Nuf^Pfj ,• + Ly=i ^.Pf}. where Nh is the number of H — >• yy 
signal events determined by the fit, 5 ffj is the fraction of signal events in category c (c = 1, . . . ,n cat , 
note that £ c /# = 1) taken from simulation, and Ng. is the number of background events of type j 
(7 = 1,..., «bkg) found in category c, which is determined from the fit. The signal and background PDFs 
Pfj with U = H,Bj, are the products Pfj i = T(k=\Pu( x k,i) °f trie PDFs p c u (xkj) of the discriminating 
variables x^j, k = 1, . . .,n var , used in the fit. The extended likelihood over all categories and events is 
given by = n"=i lid Pf> where N c (N c ) is the total number of events expected (observed) in 
category c with c = 1, . . . ,n cat . The PDFs Pfj i depend on parameters (coefficients) that may be freely 
varying in the fit. The parametrisations and hence the adjustable parameters differ between categories in 
general, but can also span over several categories. 

6.2 Fit variables 

The H — >• yy distribution of m n forms a Gaussian peak with tails to lower values from photon energy 
losses before the calorimeter. It is well modeled by a Crystal Ball (CB) function [55] 



where t = (m rr — mH — 8 mH )/o(m rr ), N is a normalisation parameter, m# is the Higgs boson mass, 8 mH 
is a category dependent offset, a represents the diphoton invariant mass resolution, and where n and a 
parametrise the non-Gaussian tail. To catch outliers an additional broad tail Gaussian is added to Equa- 
tion 4, which is, however, only relevant for events falling into the "bad" m n category (see Section 6.3). 
Within a sufficiently narrow mass window, the background distribution of m rr forms an exponential tail 
described by a single slope parameter £ . 

Because of the large available statistics for simulated signal samples, shape uncertainties due to defi- 
ciencies in the functional description are negligible. Systematic effects are due to simulation inaccuracies 
and are discussed in Section 7. A high-fidelity description of the background PDFs is mandatory because 

throughout this section the subscript S is idenfitied with the H — » yy signal. 




for? > -a, 
otherwise , 



(4) 
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Figure 11: Signal (left) and background (right) distributions of the Higgs boson decay angle, |cos0*| 
(top), and the diphoton transverse momentum (bottom) for events with zero jets (full dots), one jet (open 
circles) and VBF topology (full triangles, not shown for background because of a too low relative cross- 
section). The corresponding PDF parametrisations are overlaid (see text). 

the large majority of the events entering the fit are of that type. It is achieved by flexible parametrisations 
with a sufficient number of parameters determined by the fit, ensuring a stable fit result with respect to 
shape redefinitions. 

The distribution of |cos0*| for a scalar Higgs boson is uniform. However, acceptance effects, pri- 
marily from the minimum pj requirements for the photons (See Section 5.1), suppress |cos0*| values 
towards one, where the photons are collinear with the Higgs boson lab frame momentum. The empirical 
signal PDF is interpolated by a double Gaussian function. The phase space for background events from 
^-channel graphs and quark or gluon fragmentation at NLO is enhanced for photons collinear with the 
diphoton lab momentum, so that the background |cos0*| distribution exhibits some clustering towards 
large values. Acceptance suppression competes, however, with this enhancement thus reducing the dis- 
crimination power of the variable. It is found that the | cos0*| distributions differ significantly between 
the 77, yj and jj backgrounds, with stronger enhancements at large | cos(9*| values for the backgrounds 
originating from jet misidentification. The inclusive shape of these backgrounds is parametrised by the 
sum of a positively defined third order polynomial and two Gaussian functions. The inclusive signal and 
background distributions of | cos0*| for events with and without jets are shown in Fig. 11. 

The Higgs boson transverse momentum exhibits a strong rise at low values and a long exponential 
tail beyond the maximum. The distribution is fitted by a sum of two bifurcated Gaussian functions 
(distributions where below and above the center half Gaussian distributions with different widths are 
used) and one symmetric Gaussian. The diphoton transverse momentum distribution for background is 
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Figure 12: Regions of photon pseudorapidities with different invariant mass resolutions for unconverted 
photons (left) and at least one converted photon (right). The text per box denotes the region number, the 
percentage of events occurring in the region and the RMS of the diphoton invariant mass for H — ► 77 
events. To simplify the likelihood model, events with photons in regions (1) and (8) are merged and 
represent category "good" (signal fraction 24%), events in regions (2), (3), (4) and (6) correspond to 
category "medium" (60%), and regions (5) and (7) are "bad" (15%). 

softer than that for signal and can be described by the sum of three exponential polynomials (see Fig. 11). 

6.3 Fit categories 

Eight photon pseudorapidity regions with different diphoton invariant mass resolution for H — ► 77 events 
are identified. They are illustrated in Fig. 12 for unconverted photon pairs (left plot) and events with 
at least one photon conversion (right plot). The regions are chosen to be symmetric with respect to an 
interchange of the photons. The crack region is excluded from the photon selection. The m Y y RMS 
values vary between 1.3 GeV in the centre and large-T] regions to 3.1 GeV in the regions closely beyond 
the crack. To simplify the likelihood model, the eight regions are merged into three categories that are 
distinguished in the fit (see Fig. 12). 

Two categories are introduced to separate events without and with at least one photon conversion 
to take account of the worse resolution of the latter events. Photon conversions reconstructed with one 
or two tracks are not explicitly distinguished. Additional categories are introduced for Higgs boson 
production with zero, one, and two or more accompanying jets, using the requirements described in 
Sections 5.2 and 5.3. No separate categories are introduced in the present analysis to distinguish Higgs 
boson production in association with a W and Z boson or a tt quark pair; these are included in the 
previous selections. Separating them would add some power, but would increase the complexity. 

6.4 Correlations 

The likelihood product used to derive the event PDF P{j ignores correlations between the discriminating 
variables x^. The classification in categories of events with distinct properties improves the accuracy 
of this assumption. The remaining (positive) linear correlation coefficients is lower than 7% (10%) for 
signal (background) events among the three fit variables tolerable. 
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Table 16: H — > 77 discovery potential for a Higgs boson mass of 120 GeV, various likelihood 
setups and a simulated integrated luminosity of 10fb _1 . The left column gives the discriminating 
variables used in the fit, the second column the categories, the third (fifth) column the average 
AlnJz? and its statistical uncertainty as derived from the toy MC samples, and the fourth (sixth) 
column quotes the estimated Gaussian signal significance in units of one standard deviation. Due 
to the background dominance, the variance of the significance in the toy experiments is approxi- 
mately one. The fits have been performed with fixed Higgs boson mass (Higgs boson mass floating 
within [112, 128] GeV). The significance for floating Higgs boson mass has been obtained with 
toy MC simulation (the AlnJz? cannot be directly interpreted in terms of significance and is given 
for completeness only). Due to the large number of required toy MC fits it has not been computed 
for the most involved fit with highest expected significance. 



. , , „ . Higgs boson mass fixed Higgs boson mass floating 

Fit variables Categories „. ._ r , /A1 „. ._ r , 

fe (AlnJz?) Significance [a] (A In J?) Significance [a] 

m yy - 2.67±0.04 2.31 ±0.02 3.54± 0.05 1.44 ±0.02 

m n tj 3.18 ±0.05 2.52 ±0.02 

m yy tj, Conversions 3.32±0.05 2.58±0.02 

myy 77, Conversions, Jets 5.99 ±0.07 3.46 ±0.02 6.66 ±0.07 2.64 ±0.02 

m y y, |cos0*| J], Conversions, Jets 7.33±0.08 3.83±0.02 

myy,P T ,H tj, Conversions, Jets 7.03 ±0.08 3.75 ±0.02 

m^P^H, |cos0*| tj, Conversions, Jets 8.49±0.08 4.12±0.02 9.25±0.09 



6.5 Fit performance 

Studies are performed involving large samples of toy MC simulation to assess the discovery potential for 
fits using likelihood models of increasing complexity. 6 The abundance of signal and background events 
used in these fits is tuned to the NLO expectation for lOftT 1 of integrated luminosity, taking into account 
the trigger and reconstruction acceptance. 

The results for fits with fixed and floating Higgs boson mass (the latter only done for the m rr -only 
and the full fits) are summarised in Table 16. For each toy MC sample we perform two fits, one with 
floating signal yield and another with zero signal to test the background-only hypothesis. The log- 
likelihood difference, AlnJz?, found in these fits, estimates the false discovery probability (p-value). The 
(AlnJz?) values given in Table 16 are obtained from Gaussian fits to well-behaved pull distributions. 
For fixed Higgs boson mass, the signal significance in terms of a can be approximated by the quantity 
\J — 2AlnJz?. For floating Higgs boson mass, the extra degree of freedom in the fit yields a higher value 
of (AlnJz?). However in this case the p-value and significance must be evaluated with toy MC simulation 
of background-only samples. As expected, the obtained significances are lower than in the fixed mass 
case, in spite of the higher (AlnJz?) values. 



7 Discovery potential 

This Section reports on the potential for the discovery of a Higgs boson in the mass range 120 < m H < 
140 GeV using the event counting computation and the maximum likelihood fit formalism (see Sec- 

6 Although required for real data the evaluation of goodness-of-fit estimators is not discussed here, because the generated 
toy data is intrinsically consistent with the underlying model. 
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tion 6). The signal significance is evaluated for each of the analyses presented in Section 5. The discov- 
ery reach is given for a combined analysis including the utilization of additional discriminating variables. 
The last signifies the maximum discovery potential for a Standard Model Higgs boson in the mass range 
specified above. 

Table 17: Expected cross-sections (in fb) of signal (S) and background (B) for the different anal- 
yses presented in Section 5 within a mass window of ±1.4a rr as a function of the Higgs boson 
mass (in GeV). 



m H 


Inclusive 
S B 


ff + ljet 
S B 


# + 2jets 
S B 


H+E™ ss +e 
S B 


S B 


120 
130 
140 


25.4 947 
24.1 755 
19.3 610 


4.0 49 
4.3 47 
3.9 46 


0.97 1.95 
0.96 1.72 
0.81 1.72 


0.134 0.077 
0.112 0.076 
0.079 0.076 


0.075 0.037 
0.063 0.037 
0.045 0.036 



Table 17 shows the signal and background effective cross-sections for the different analyses presented 
in Section 5 as a function of the Higgs boson mass. 7 Table 18 displays the corresponding expected signal 
significances for 10fb _1 of integrated luminosity. The first sub-column under each analysis shows the 
signal significance based on event counting, o(S,B), where S and B correspond to the number of signal 
and background events in a mass window of ± 1 .4 o rr around m#, respectively, and where o n is the mass 
resolution (in the no pileup case reported in Table 5). 8 The values of o(S,B) reported in Table 18 can be 
compared with earlier studies performed by the ATLAS collaboration [2-A, 8,9]. The inclusion of pileup 
decreases the event counting signal significance by at most 10%. 

The signal significances reported in the second and third sub-column under each analysis, o{g and 
(j(l° at , are obtained by means of a one dimensional fit using the diphoton invariant mass as a dis- 
criminating variable by letting the Higgs boson mass fixed and floated, respectively. 9 Each analysis 
is treated independently from each other. In these fits the Higgs boson mass is let free in the range 
1 10 < m H < 140 GeV (except for m H = 140 GeV for which the range was set to 120 <m H < 150 GeV) 
to take into account for the coverage of the analysis. 10 The results in Table 18 do not take into account the 
event overlap among the three analyses. The last column reports the event counting signal significance 
of the three analyses combined, taking into account the event overlap. 

Table 1 8 illustrates that the inclusive search for a diphoton resonance is the most sensitive one for a 
search of a Standard Model Higgs boson. As reported in Section 2, the inclusive analysis is evaluated 
using QCD corrections for both signal and background processes. This is not the case for the H + ljet 
and other analyses. The discovery potential of H + ljet analysis could be further enhanced if QCD NLO 
corrections were applied on both signal and reducible backgrounds [56,57]. 

Table 19 shows two fit-based signal significances compared to o{g and a[^" at reported in Table 18. 
The values of o^ D and OcIZd correspond to the signal significance computed by means of a three di- 
mensional fit, including m rr ,PT,H and | cos0*| (see Section 6.2) by means of fixing and floating the mass, 
respectively. At this stage the event classification according to \r]\ regions is used (see Section 6.3). 11 

7 The contribution from Drell-Yan is computed for the inclusive analysis only. 

8 The event counting significance for the inclusive and H + ljet analyses are approximated by S/VB. For the rest of the 
analyses a Poisson-based computation is used due to the small expected number of background events for 1 0 fb~ 1 of integrated 
luminosity. 

9 For the sake of simplicity, the fit-based signal significance does not include the Drell-Yan contribution. 
10 The range of the fit to the background is set to 1 10 < m H < 150 GeV. 

n For the final results presented in this section, a simplified fitting model as well as a more conservative classification are 
used with respect to those considered for Table 16. In particular the diphoton category with two jets is not split into r\ categories 
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Table 18: Signal significances (expressed in terms of Gaussian sigmas) for a Standard Model 
Higgs boson as a function of the mass (in GeV) using the different analyses reported in Sec- 
tions 5.1-5.3 for 10fb -1 of integrated luminosity. Results are reported in terms of the signal 
significance based on event counting, a(S,B), and a fit-based signal significance, o[ D x and o(^' at 
(see text). 



m H 


Inclusive (with K-factors) 

o(S,B) c{S °& oat 


#+ljet 
cr(5,B) 


(no K-factors) 
n Fix n Float 


# + 2jet (no K-factors) 

a(S,B) o{g <J&° at 


Combined 

o{S,B) 


120 


2.6 2.4 1.5 


1.8 


1.8 1.3 


1.9 2.0 1.1 


3.3 


130 


2.8 2.7 1.8 


2.0 


2.1 1.6 


2.1 2.1 1.2 


3.5 


140 


2.5 2.2 1.3 


1.8 


1.7 1.2 


1.7 2.0 1.0 


3.0 



Table 19: Signal significances (expressed in terms of Gaussian as) for a Standard Model Higgs 
boson as a function of the mass for 10fb _1 of integrated luminosity. Different fit-based approaches 
are used. The significances, <j[ D x , OqI d and a c\lo correspond to a one dimensional fit, to a 
three dimensional fit using the first type of event classification and to a three dimensional fit (see 
Section 6.2) using all classifications considered in Section 6.3, respectively using a fix Higgs 
boson mass (see text). The significances o(^ ,at , o^d an( ^ CT c 2 3D ' correspond to fit based results 
obtained with a floating Higgs boson mass. 



m„[GeV] 


°w 


a tk>at 


a C| ! 3D 


°C,3D 


a C ? ! 3D 


-.Float 
°( \3D 


120 


2.4 


1.5 


3.1 


2.1 


3.6 


2.8 


130 


2.7 


1.8 


3.4 


2.4 


4.2 


3.4 


140 


2.2 


1.3 


3.2 


2.2 


4.0 


3.2 



The values of Oq3d m ^ G cl?>D correspond to the maximum achievable sensitivity of all the analyses 
reported in Section 5 combined. In addition to the procedure followed to obtain o^ D and o^^, a 
classification of events according to the presence of hadronic jets is used. 

Figure 13 displays a summary of the expected signal significance for the inclusive and final combined 
analysis for 10fb _1 of integrated luminosity as a function of the Higgs boson mass. The solid circles 
correspond to the sensitivity of the inclusive analysis reported in Section 5.1 using event counting with 
background and signal rates assumed. The solid triangles linked with solid and dashed lines correspond 
to the sensitivity of the inclusive analysis by means of one dimensional fits, with a fixed and floating 
Higgs boson mass, respectively. The solid squares linked with solid and dashed lines correspond to the 
values of <Jq\ d and Oy,''^"' given in Table 19, respectively. 

The stability of the fits are checked against changes in the composition of the background. The values 
of a[ D x for the inclusive analysis are recomputed by increasing and decreasing the reducible background 
by a factor of two. The fits are redone with the same functional forms as in the nominal analysis and the 
results are consistent with the expectations obtained by using S/\J~B. 

The degradation of the Higgs boson discovery sensitivity due to systematic uncertainties is consid- 
ered. Various sources of systematic errors are evaluated. The Higgs boson mass resolution has significant 
impact on the sensitivity. A large degradation of the Higgs boson mass resolution is chosen by the addi- 
tion of a 1 % constant term in the photon energy resolution. The impact of Higgs boson mass resolution 
is evaluated for the inclusive analysis by means of one-dimensional fits with a fixed Higgs boson mass. 

and the conversion categories are not used. 
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Figure 13: Expected signal significance for a Higgs boson using the H — > 77 decay for lOfb^ 1 of inte- 
grated luminosity as a function of the mass. The solid circles correspond to the sensitivity of the inclusive 
analysis reported in Section 5. 1 using event counting. The open circles display the event counting signifi- 
cance when the Higgs boson plus jet analyses (see Sections 5.2 and 5.3) are included. The solid triangles 
linked with solid and dashed lines correspond to the sensitivity of the inclusive analysis by means of 
one dimensional fits, with a fixed and floating Higgs boson mass, respectively. The solid squares linked 
with solid and dashed lines correspond to the maximum sensitivity that can be attained with a combined 
analysis (see text and Table 19). 



If the resolution is increased both in toy Monte Carlo experiments and in the fitting function a decrease 
of 8% in the signal sensitivity is observed. If, however, the resolution is only degraded in toy Monte 
Carlo experiments, so that the fitting model does not accommodate the Higgs boson mass resolution 
degradation, then the effect is a 12% reduction in sensitivity. 

The systematic uncertainty due to Pjjj has been estimated by using PYTHIA signal events, reweighted 
to NLO using ResBos, and fitted with the nominal model based on MC@NLO event simulation. This 
inconsistency between generation and fit reduces the significance of the full fit (including all categories 
and the variables m y7 , Pt,h, and | cos0*|, see Table 16) by 5%. 

The sensitivity of the associated production channels has been studied separately for the diphoton + 
£miss + i e pt on anc j f or the diphoton + £"™ ss analyses (see Sections 5.4-5.5). With 30 fb _1 these channels 
have the potential to contribute to the overall discovery signal of a SM Higgs boson, but we choose 
not to report the value of the signal significance because the systematic uncertainties on the background 
normalization are large given the present status of simulations. 

8 Conclusions 

The feasibility of the search for a Standard Model Higgs boson in the H — > 77 decay with the ATLAS 
detector at the LHC has been presented. The detector performance issues relevant to the search have 
been evaluated using a full detector simulation. Triggering effects and the impact of pile-up are also 
reported. A signal significance based on event counting of 2.6 (4.6) can be obtained with 10 (30) fb -1 of 
integrated luminosity for m# = 120 GeV in the case of inclusive analysis. Despite the slight degradation 
with respect to previous studies, the feasibility of the search for a Standard Model Higgs boson in the 
H — > 77 decay is confirmed. In addition to the inclusive analysis the search for diphotons in association 
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with jets is considered. The addition of these channels enhances the event counting signal significance 
to 3.3 (5.7) with 10 (30) fb _1 of integrated luminosity for m# = 120 GeV. The expected sensitivity can 
be enhanced by means of an unbinned maximum-likelihood fit dividing the data sample into categories 
depending on the event topology and exploiting a number of discriminating variables: the increase in 
the discovery potential increases up to 3.6 (2.8) in the case of fixed (floating) mass fit for 10 fb _1 of 
integrated luminosity. 

The search for H — > yy in association with Z/W or tt has been addressed, indicating that a good signal- 
to-background ratio can be achieved. However, the uncertainty on the background contribution in these 
analyses is large given the present status of simulations. 
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Search for the Standard Model H -> ZZ* -> 4/ 

Abstract 

The Standard Model Higgs boson discovery potential through its observation 
in the 4-lepton (electron and muon) final state using the ATLAS detector is 
investigated. Fully simulated signal and background samples were produced 
with the latest simulation of the ATLAS detector. The samples were subse- 
quently digitized and reconstructed using the ATLAS offline software. The 
analysis performance dependence on kinematic, lepton reconstruction and iso- 
lation cuts is studied for Higgs boson masses ranging from 120 to 600 GeV. 
The statistical and systematic uncertainties on the background estimation are 
evaluated and their impact on the Higgs boson discovery potential and exclu- 
sion limits is discussed. 

1 Introduction 

The search for the Standard Model Higgs boson is a major goal of the Large Hadron Collider (LHC). 
The first proton-proton LHC data at 14 TeV center of mass energy are expected in 2009. The Higgs 
boson mass is a free parameter in the Standard Model, however there is strong expectation motivated by 
precision electroweak data [1] and direct searches [2] that a low mass Higgs boson (114.4 — 199 GeV, 
95% confidence level) should be discovered at the LHC. The experimentally cleanest signature for the 
discovery of the Higgs boson is its "golden" decay to four leptons (electrons and muons): H ZZ —>■ 
41. The excellent energy resolution and linearity of the reconstructed electrons and muons leads to a 
narrow 4-lepton invariant mass peak on top of a smooth background. The expected signal to background 
ratio after all experimental cuts depends on the Higgs boson mass itself. The major component of the 
background consists of irreducible ZZ — »■ 41 decays. The most challenging mass region is between 120- 
150 GeV where one of the Z bosons is off-shell giving low transverse momentum leptons. In this region 
backgrounds from Zbb —> 4£ and tt — > 4£ are important and require tight lepton isolation cuts to keep 
their contribution well below the ZZ continuum. 

2 Detector simulation, Monte- Carlo samples, trigger and event recon- 
struction 

In this section a brief summary of the detector simulation is presented and the Monte-Carlo (MC) sam- 
ples used in the analysis are described. This includes an outline of the various cross-sections and the 
corresponding theoretical uncertainties. Finally a brief description of the electron and muon trigger and 
offline reconstruction is presented. 

2.1 Detector simulation and Monte- Carlo samples 

The ATLAS detector is simulated by the GEANT4 [3] software. Simulation, digitization and reconstruc- 
tion are all performed within the ATLAS software framework Athena. The set of H^ 41 samples 
used in this analysis covers the mass range from 120 to 600 GeV. Simulation of pileup, cavern back- 
ground and minimum bias events is performed by mixing them with the Higgs boson signal at digiti- 
zation level [4], [5]. An instantaneous constant luminosity of 10 33 cm~ 2 s _1 is assumed. The cavern 
background consists of thermalized slow neutrons and low energy photons escaping the calorimeters [6]. 
The expected level of cavern background is increased by an overall "safety factor": in this analysis a 



1243 



Higgs - Search for the Standard Model H 



-► ZZ* -► 4/ 



safety factor of 5 is used. 

The 41 analysis is sensitive to uncertainties in the knowledge of the material distribution in ATLAS, 
to distortions of the magnetic fields, and to the accuracy of the Inner Detector (ID) and of the Muon 
Spectrometer (MS) alignment. Some of these uncertainties have been taken into account by using geo- 
metrical layouts including extra material. The layout used in this analysis includes additional material in 
the ID, and between the barrel presampler and strips and barrel cryostat upstream and downstream the 
calorimeter. The calibration of the LAr Electromagnetic Calorimeter (LAr EMC) is based on the nom- 
inal geometry without this extra material and its inclusion in our study provides a realistic systematic 
effect in the analysis. A detailed description of the layout used in simulation and reconstruction can be 
found in [7]. In the analysis presented in this note, uncertainties due to misalignment corrections are not 
included: the same misaligned layout is used both in simulation and reconstruction. 

The Higgs boson signal samples were generated exclusively by PYTHIA [8] (version 6.3 for mu = 
130 GeV and version 6.4 for the rest of the masses), while for the background samples various event 
generators were used. For the signal, PYTHIA calculates the cross-sections in leading order (LO) taking 
into account both gluon and vector boson fusion (VBF) diagrams. Next-to-leading order (NLO) effects 
are considered by scaling the total PYTHIA cross-sections [9]. During generation, a 4-lepton filter was 
applied to the samples, requiring 4 true leptons with p T > 5 GeV/c within |t] | < 2.7. The filter acceptance 
and the cross-section as a function of the Higgs boson mass is given in Table 1. The number of available 
MC events are shown in the last column. 



Process 


a w ■ BR \f b] 


o NW -BR[fb] 


Filter acc. 


Events 


#[120] 41 


1.68 


2.81 


0.584 


40K 


#[130] -► 41 


3.76 


6.25 


0.633 


40K 


if [140] -► 41 


5.81 


9.72 


0.662 


40K 


H [150] -► 41 


6.37 


10.56 


0.685 


10K 


H [160] -»4Z 


2.99 


4.94 


0.704 


40K 


H [165] -► 41 


1.38 


2.29 


0.712 


40K 


H [180] -»4Z 


3.25 


5.38 


0.733 


40K 


H [200] -► 41 


12.39 


20.53 


0.753 


50K 


H [300] -► 41 


7.65 


13.32 


0.782 


10K 


H [400] -► 41 


6.07 


10.78 


0.814 


40K 


H [500] -► 41 


2.98 


5.12 


0.842 


40K 


H [600] -► 41 


1.53 


2.53 


0.853 


40K 



Table 1: Monte Carlo signal data samples, 4-lepton (e,fl) filter acceptance, LO and NLO cross-sections, 
and number of events used in the analysis as a function of the Higgs boson mass in GeV (reported in 
square brackets). The cross-sections in the table include the branching ratio of the Higgs boson to ZZ* 
andZ->ll, l=e,/i. 



2.1.1 Corrections to leading order cross-sections for background processes 

The backgrounds considered in this analysis, together with their cross-sections and K-factors, defined as 
the ratio between the NLO and the LO cross-sections, are listed in Table 2. For the generation of these 
samples several MC generators are employed. The it background was generated using MC@NLO [10]. 
The QCD ZZ was generated with PYTHIA6.3, the Zbb background with AcerMC3.1 [11], and the WZ 
background with HERWIG 6.5 [12] interfaced to Jimmy [13] for simulation of the underlying event. The 
cross-sections listed in Table 2 are all at LO (except for the it which is at NLO), and they do not include 
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the lepton filter efficiency. The following corrections are applied to the cross-section of each process to 
correct for effects of subprocesses not originally included in the generators: 

• an additional 30% is applied to the QCD ZZ background cross-section to account for the missing 
quark box diagram in PYTHIA; 

• 8.5 pb are added to the Zbb cross-section to account for the qq -*Zbb diagrams that are not included 
in the generation. 

The last column in Table 2 shows the events available for each sample. Each Z boson in the QCD 
qq — > ZZ irreducible background is forced to decay into lepton pairs (including all three flavours). 

The Zbb reducible background was generated using ACERMC3.1 [11] with Parton Density Func- 
tion (PDF) set CTEQ6L, QCD scales jj, R = jj, F = m z , massive ft-quark, interfaced to PYTHIA 6.3 for 
showering and hadronization. The Z — > II (I = e,/J,) decay is forced at generator level. The full Z/y* 
interference is taken into account, with a cut on the resonance mass at 30 GeV. The AcerMC dataset 
includes only the gg —>■ Zbb process. The qq — > (Z/y*)bb contribution, where q is a light quark, is not 
included; its contribution to the total LO Zbb cross-section is less than 15%. The AcerMC LO cross- 
section, including the Z — > // branching ratio (BR), evaluated with the above parameter choice, amounts 
to 52.03 ± 0.03 pb for the gg -> (Z/f)bb and 8.64 ± 0.01 pb for the qq -> (Z/f)bb. 



Process 


Generator 


a- BR [fb] 


Corrections 


FA 


Evts [k] 


qq^ZZ^ 4£ 


PYTHIA6.3 


158.8 


+47.64 


[4£]0.219 


100 


gg -► Zbb -► 2£bb 


AcerMC/PYTHIA6.3 


52030 


+8640 (qq -► Zbb) 


[41] 0.00942 


430 


gg -► Zbb -► llbb 


AcerMC/PYTHIA6.3 


52030 


+8640 (qq -► Zbb) 


[31] 0.147 


200 


gg,qq^tt 


MC@NLO/Jimmy 


833000 




[41] 0.00728 


400 


qq^WZ 


Jimmy 


26500 




[3£] 0.0143 


70 


qq ^Z inclusive 


PYTHIA6.3 


1.510 6 




[lfl0.89 


500 



Table 2: Background samples, generators used, acceptance of the multi-lepton filter (FA), LO cross- 
section (except for tt, which is NLO) and corrections applied. The number of events is given in the last 
column. For ZZ, l = e,)j,,z while for the rest l = e,\x. The relative errors on the filter acceptances (FA) 
are smaller than 0.4%. 



2.1.2 Next-to-Leading-Order cross-sections for the background 

The it sample, generated by MC@NLO, is the only sample in Table 2 that already includes NLO pro- 
cesses. To evaluate the NLO cross-sections for the other background processes, the program MCFM [14] 
is used. The overall conditions for all MCFM NLO calculations are the following: CTEQ6M, \Ir = \If = 
m z , m b = 0 full (Z/y*) interference. Any two final state partons are merged in a single jet if their sepa- 
ration AR(jj) is smaller than 0.7. 

In the case of the Zbb sample, the following additional selections are applied: mz > 30 GeV; pjib) > 
10 GeV, \r\(b)\ < 2.5. The NLO cross-section obtained from MCFM is: 

270.4^0 : 6( jUr) +5.4 (jUf) ± 12 . 5( PDF) ± l.O(stat) pb 

where the first two uncertainties come from the QCD renormalization and factorization scales, varying 
independently the energy scale of the process from 0.5m z to 2m z , while the last one quotes the PDF 
uncertainty, calculated by making use of 40 sets of PDFs for CTEQ6M (20 plus and 20 minus). This 
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m z/z , > 12 GeV 



m zz (GeV) 


K-factor 


[115,125] 


1.15 


[125,135] 


1.21 


[145,155] 


1.25 


[155,165] 


1.34 


[175,185] 


1.31 


[195,205] 


1.32 


[295,305] 


1.40 


[395,405] 


1.52 


[495,505] 


1.84 


[595,605] 


1.81 



Table 3: K-factors for the ZZ background. The error on the K-f actors is dominated by the systematics 
(from PDF and renormalization and factorization scales) and amounts to 3.3%. 

result has proved to be quite stable with respect to the variation of the cuts both on the minimum jet pj 
and the minimum jet separation AR(jj). The LO cross-section evaluated with MCFM on the same phase 
space is 189.9 ±0.2 pb, resulting in a K-factor of 1.42. The effective cross-section can be evaluated as 
follows 

a e ff = a L0 BR(Z^ee,nn) FA K = Sl2A fb, 

where a BR(Z —> ee,flfl) is the AcerMC cross-section listed in Table 2 rescaled to include the qq 
contribution, and FA is the acceptance of the generator filter, as reported in Table 2. The statistical error 
on the filter efficiency is 0.2% for this dataset. 

The NLO cross-section for the qq — > ZZ* process is calculated with MCFM, applying the same 
kinematic selection on the Z boson masses as in PYTHIA (m z (*) > 12 GeV): 

o NW = 22At° 0 i(HR = fi F ) ±0.1 {PDF) pb 

where the statistical error is much smaller than the systematic ones. The effective cross-section used in 
the analysis can be evaluated according to 

o eff = o w ■ [BR(Z -► ll)} 2 -FA ■ (K + 0.3) = 34.82 • (K(m zz ) +0.3) fb, 

where Olo • [BR{Z — >• ll)] 2 is the LO cross-section, that includes the Z branching ratio to leptons from 
PYTHIA and amounts to 159±0.05 (stat) fb. The additional 30% accounts for the correction coming 
from the gg — >• ZZ* due to the quark box, as listed in Table 2. The relative statistical error on the filter 
acceptance FA is 0.3% for this process. The mass-dependent ^-factors used in this analysis are shown 
in table 3. 

The NLO cross-section for the WZ background is evaluated by applying the same cut on the boson 
masses (m z * j w * > 20 GeV) as in the HERWIG-Jimmy generator used for the LO process: 

W~ : a NL0 = 21.7±°|(Ai« = Hf) ±0.9(PDF) pb 

W + : a NL0 = 34.8+ 1 ) |( J u R = Mf ) ± l.O(PDF) pb 



1246 



Higgs - Search for the Standard Model H 



-► ZZ* -► 4/ 



where the QCD scale uncertainties are included. The statistical error is negligible. The effective cross- 
section can then be evaluated as follows: 

c eff = o NL0 {W + Z + W~Z) ■ FA = 807 fb 

where Onlo(W + Z + W~Z) refers to the sum of the two contributions. The relative statistical error on the 
filter efficiency for this process is 0.4%. 

2.2 Trigger 

The simulation of the full ATLAS trigger chain allows to evaluate the impact of the on-line selection 
on the Higgs boson search. Level-one (LVL1) trigger objects (Region-of-interest, ROI), available in the 
simulation, correspond to small At] x A0 regions where a lepton that satisfies the online selection criteria 
is found. LVL1 muon thresholds are programmable in the pj range from 4 GeV/c to about 40 GeV/c. 
Similarly to the LVL1 muon Trigger, electron/photon ROIs can be of eight types, depending on the high- 
est Ej threshold satisfied. The settings in this case are also programmable. In this analysis, two types of 
electron/photon Ej thresholds are considered, Ej thres =\5 and 22 GeV. For both thresholds, cuts on iso- 
lation and on leakage in the hadronic calorimeter are also applied at LVL1. The LVL1 electrons/photons 
and muons are subsequently confirmed by the High Level Trigger (HLT). The first step is the Level-2 
trigger (LVL2) where fast algorithms are used to validate the selected lepton. Events with leptons of 
a given quality and with energy above certain fixed thresholds are retained for a more accurate recon- 
struction and selection by the Event Filter (EF), where algorithms based on the offline reconstruction 
software are used for the final selection. The choice of HLT selection thresholds is optimized for physics 
performance, assuring that trigger rates satisfy the system latency. The main aspects of the electron and 
muon trigger systems are described in [15] and [16]. 

The acceptance of the muon trigger as a function of the generated pj for the threshold p T thres =20 
GeV/c, is shown in Fig. 1 . The efficiency above threshold is explained by the geometrical coverage of 
the muon LVL1 trigger detectors; in the barrel, the space occupied by detector feet, supports, services, 
etc., limits the geometrical acceptance to about 80%, while the efficiency of the trigger algorithm itself 
is very close to 100%. The trigger efficiencies of electrons for a selection threshold of Ej thres -22 GeV, 
including electron identification and isolation cuts, is shown in Fig. 2. 




P T [GeV] E T [GeV] 



Figure 1 : Muon trigger: selection efficiencies of the 
three trigger levels as a function of the true muon 
transverse momentum. The selection threshold is 
p T thre S=20 Ge v/ C . 



Figure 2: Electron trigger: selection efficiencies of 
the three trigger levels as a function of the true elec- 
tron transverse energy. The selection threshold is 
E T thres =22 GeV. 



In this paper, several trigger menus foreseen for the LHC running at luminosity L=10 33 cm 2 s 1 , are 
considered. The trigger efficiency for single and double lepton triggers, for a signal sample with a Higgs 
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Unbiased sample 


After event selection 


Trigger Menu 


4e | 4jU 2e2^i 


4e | 4jU | 2e2\l 


1/120 


0.1 


95.3 


71.3 


0.4 


98.2 


72.7 


le22i 


94.7 


0.4 


68.6 


99.8 


0.1 


78.1 


2e\5i 


76.3 


<0.2 


33.2 


98.9 


<0.2 


60.2 


lAi20or le22i 


94.7 


95.3 


95.7 


99.8 


98.2 


98.9 


2^10 or 2el5ior ljiilOand lel5i 


76.4 


93.3 


87.8 


98.9 


97.6 


96.9 



Table 4: Trigger selection efficiencies (in %) for various trigger menus, computed for H — > ZZ* — > 
4/ (m#=130 GeV), for the full trigger chain LVL1+HLT. The first three columns show the efficiencies 
computed on the full event sample. The last three columns show the efficiencies for events selected by 
the H —> ZZ* —> 41 analysis. The absolute errors on the efficiencies are 0.4% for the unbiased samples, 
and 0.2% for events passing the offline selection. 



boson mass of 130 GeV, is shown in Table 4. The results are shown for two classes of events: those 
filtered at generator level, requiring 4 leptons within pr> 5 GeV and \ri\ <2.7 (unbiased sample), and 
those passing the signal selection of reconstructed events, described later in this note. A double-lepton 
trigger with 10 GeV threshold for the muons and 15 GeV threshold for the electrons (isolation required), 
selects Higgs boson to four lepton decays with an efficiency higher than 97 %. In the following, the 
single-lepton menu requiring l/x20 or le22i will be applied as trigger selection. 

2.3 Electron and muon reconstruction 

The leptons used for the efficiency studies are required to satisfy the generator level kinematic cuts 
|tj| < 2.5,and p T > 5 GeV. The lepton efficiency is defined as the ratio of reconstructed to generated 
leptons originating from Z decays. The reconstructed leptons include leptons not originating from Z 
decays (non-prompt leptons) and fakes, so it is important to study these candidates and to estimate the 
fraction of leptons coming from non-Z decays and fakes. The non-Z lepton fraction is defined as the 
number of reconstructed leptons matched to true leptons not originating from Z decays, divided by the 
total number of reconstructed leptons. The fraction of fakes is defined as the number of reconstructed 
leptons not matched to a true lepton, divided by the total number of reconstructed leptons. 

2.3.1 Electron reconstruction 

The details of electron reconstruction are described in [17] and [18]. Here the electron definitions used 
in this analysis are briefly summarized. An electron is selected using the offline algorithm requiring: 

1. a cluster in the barrel and endcap LAr EMC, reconstructed by the ATLAS offline software; 

2. an inner detector track associated with the cluster; 

3. cluster containment in the LAr EMC; 

4. consistency of the lateral shower shape of the cluster with an electron isolated from hadronic 
activity; 

5. the lateral shower shape to be inconsistent with a n° — »■ 77 decay, using the strip section of the LAr 
EMC; 
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6. track quality requirements: a certain number of hits on the Pixel and SCT detector is required, and 
a transverse impact parameter smaller than 0. 1 cm. 

The containment and isolation requirements (2,3,4) are satisfied using the so-called LooseElectron defi- 
nition. This definition uses shower shape variables calculated with the middle sampling of the LAr EMC. 
The addition of requirements 5 and 6 corresponds to the MediumElectron definition. In this analysis the 
MediumElectron definition with the addition of calorimetric isolation using all cells (EM and hadronic) 
inside a AR = \f &r\ 2 +A0 2 = 0.2 cone (MediumElectron+CaloIso) has been used. The isolation cuts 
are tj -dependent, as described in detail in [17]. 

True electrons not originating from a Z boson decay represent a background to the H — ► ZZ* — > 
41 search. Typically they are not isolated and in most cases these electrons originate from heavy quark 
decays. It should be stressed that leptons from B-mesons are more isolated than those coming from 
D-meson decays, and therefore are more difficult to reject. The isolation also depends on the event 
topology. The non-Z electrons are a small fraction (less than 1%) of the total number of electrons in the 
H — ► ZZ* — ► 41 events. The electron efficiency as a function of the pseudorapidity tj and the transverse 
momentum p T for the Loose, MediumElectron and MediumElectron+Calolso definitions is shown in 
Fig. 4. A significant drop in efficiency is observed at low pj. This is due to the loss of discrimination 
power of the shower shape cuts. A summary of electron efficiencies and fraction of fakes and non-Z 




Figure 3: Electron reconstruction efficiency as a Figure 4: Electron reconstruction efficiency as a 
function of tj. The electron-id criteria are described function of pj. The electron-id criteria are described 
in the text. in the text. 

electrons is presented in [17]. The total fraction of fakes depends on the isolation cut and is particularly 
high for lower p T . For low momenta, p T < 15GeV, the fraction of these electrons is about 8% of the 
total reconstructed electrons in this pj range. The contribution from electrons which do not come from 
Z decays is a fraction of the fake electrons dominated by heavy flavour (c,b) decays. The b-originated 
electrons are a factor of 1.5-2 more than the c-originated electrons. 



2.3.2 Muon reconstruction 

The muon identification in ATLAS relies on the Muon Spectrometer (MS) for standalone reconstruction 
as well as on the ID and Calorimeters for combined muon reconstruction. In order to combine the muon 
tracks reconstructed in the ID and the MS, the ATLAS offline muon identification packages have been 
developed. The purpose of these packages is to associate segments and tracks found in the MS with the 
corresponding ID track in order to identify muons at their production vertex with optimum parameter 
resolution. Details on the muon system design and performance can be found in [19]. Details on the 
algorithms of the muon identification described here can be found in [20]. 
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The standalone muon reconstruction algorithm measures the five parameters of the reconstructed 
tracks with their associated 5x5 covariance matrix at the entrance of the Muon Spectrometer. The muon 
momentum is corrected using an energy loss parameterization. Overall, the reconstructed track param- 
eters with their full covariance matrices are provided at three locations: 1) at the entrance to the muon 
spectrometer, 2) at the entrance to the calorimeters, 3) at the perigee of the track. Standalone tracks 
reconstructed in the MS are combined with tracks reconstructed in the ID in the region of T7 <2.5. This 
combination improves the momentum resolution for tracks with momenta up to 100 GeV and suppresses 
the rate of fake muons. A statistical combination of the two sets of track param eters is made, weighted 
by their corresponding covariance matrices. Given a certain AR = a/Atj 2 +A0 2 cone, all tracks from the 
ID are combined with the ones from the MS and the pair giving the best % 2 is kept. The same procedure 
is repeated until no more combinations are possible. 

An algorithm has been developed in order to recover muons which fail to be reconstructed in the MS 
(either because they are low pj muons or because the number of stations is insufficient). The principle 
of the algorithm is based on the extrapolation of ID tracks to the inner stations of the MS and their 
matching to a segment reconstructed in these stations that was not yet associated to a combined track. 
The extrapolation is also performed to medium stations, in the regions of 77 between 1 < 77 < 1 .4, where 
there is a type of inner stations missing, resulting in a drop of the reconstruction efficiency. 

The reconstruction efficiency of muons from a 130 GeV Higgs boson sample as a function of their 
transverse momentum and 77 , is shown in Figs. 5 and 6 respectively. 




Figure 5: Muon reconstruction efficiency as a func- 
tion of pt- Empty (filled) markers show the ef- 
ficiency of the combined (combined+extrapolated 
from the ID) algorithm. Reconstructed muons of a 
Higgs boson sample of 130 GeV mass decaying into 
four muons are used. 



Figure 6: Muon reconstruction efficiency as a func- 
tion of r\ . Empty (filled) markers show the efficiency 
of the combined (combined+extrapolated from the 
ID) package. Reconstructed muons of a Higgs boson 
sample of 130 GeV mass decaying into four muons 
are used. 



3 Background Rejection 

The large Zbb and tt background cross-sections compared to the Standard Model Higgs boson cross- 
section, require further reduction of these processes by applying additional lepton identification criteria. 
A reduction well below the irreducible ZZ* background yield is a safeguard against large uncertainties 
on the production cross-sections of these final states. One can exploit the fact that leptons originating 
from the Z boson decays are expected to be significantly more isolated than the ones originating from 
heavy quark leptonic decays. These leptons are also expected to originate from the main interaction 
point, while &,ooriginating leptons should come from secondary displaced vertices. 
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In this section, a set of isolation and impact parameter cuts for background rejection is described. In 
this analysis the cuts have been chosen so that the expected rate for Zbb to 4-leptons (4e, 4/j, and 2e2jj.) is 
no more than one third of the ZZ rate. For Higgs boson masses above 160GeV, the reducible background 
contribution is expected to be less than 10% of the irreducible ZZ. 

3.1 Muon Isolation 

For the muon final state both calorimetric and track isolation criteria have been considered. 

• The calorimetric isolation discriminant is defined as the sum of the transverse energy deposited in 
the calorimeter inside a cone of a given radius AR = a/Atj 2 + A0 2 , around the muon. The energy 
deposition of the muon itself is subtracted from the isolation energy. The cone energy of the least 
isolated muon is used as a discriminant. 

• The track isolation discriminant is defined as the sum of the transverse momenta of the inner 
detector tracks in a cone of radius AR around the muon. The inner detector muon track is excluded 
from the sum. The least isolated track of all muons in the event, is used as discriminant. 

In Figs. 7 and 8 the rejection of the Zbb background as a function of the Ajx signal efficiency (m#=130 
GeV) is presented, for different calorimetric and track isolation cones, respectively. A cone size of 0.2 is 
adopted for both calorimetric and track isolation as a conservative choice. Moreover, as shown in Figs. 9 
and 10, the background rejection improves when the isolation quantities are normalized to the transverse 
momentum of the muon, thus the normalized isolation discriminants are used in the analysis. 




Figure 7: Zbb rejection versus H — >• Ajx efficiency, Figure 8: Zbb rejection versus H — ► \\x efficiency, 
for m H = \30GeV, for various calorimetric isolation for m H = \30GeV, for various track isolation cone 
cone sizes. sizes. 

The distributions of the normalized calorimetric and track isolation variables calculated using the 
cones chosen in this analysis for the signal and the main backgrounds are presented in Figs. 11 and 12, 
respectively. The cuts on the calorimetric and tracker isolation are chosen so that the efficiency for the 
signal, after application of the two cuts, is close to 90%. The selection cuts are placed at 0.23 and 0.15 
for the calorimetric and the tracker isolation, respectively. 

3.2 Electron Isolation 

For the electron final state both calorimetric and track isolation in the inner detector are considered. 
Although partial calorimetric isolation along the tj direction is already part of the electron-id require- 
ments, an extra calorimetric isolation in a cone of AR = 0.2 is applied. The definition of the electron 
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Figure 9: Zi>Z^ rejection versus H — > A\i efficiency, 
for ma = \30GeV, for standard and normalized 
calorimetric isolation calculated in a A/?=0.2 cone 
around the muon track. 



Figure 10: Zbb rejection versus H — > A\i efficiency, 
for inn = \30GeV, for standard and normalized 
track isolation calculated in a A/?=0.2 cone around 
the muon track. 




Figure 11: Normalized calorimetric isolation Figure 12: Normalized track isolation (A/?=0.2) 
(A/?=0.2) for the signal (m H = 130), the Zbb and tt for the signal, the Zbb and tt backgrounds for the 
backgrounds for the 4/j, channel. 4/j, channel. 



track isolation is the same as for the muons: it is the sum of the transverse momenta of inner detector 
tracks in a cone with radius AR around the direction of the electron track. The p T of the electron track 
is excluded from the sum. In an attempt to remove from the sum tracks originating from conversions of 
bremsstrahlung photons, only tracks which have at least one hit in the B -layer (the innermost layer of 
the Pixel detector) are considered in the sum. The track isolation is normalized to the electron pj. The 
distribution of the electron track isolation is shown in Fig. 13, for a cone size AR=0.2. For the analysis 
all leptons must satisfy a Upr/pr < 0.15 tracking isolation cut. 



3.3 Impact parameter analysis 

Leptons from tt and Zbb backgrounds are most likely to originate from displaced vertices. Further 
rejection of these backgrounds can be achieved by placing a cut on the transverse impact parameter sig- 
nificance (defined as dO/Odo, where dO is the distance of closest approach in the transverse plane) of 
the tracks associated to the leptons. For electrons, bremsstrahlung smears the impact parameter distribu- 
tion, hence reducing the discriminating power of this cut with respect to muons. The impact parameter 
is calculated with respect to the event vertex fitted using a set of tracks reconstructed in the ID. This 
allows to remove the effect of the spread of the vertex position, which at LHC is 15 ;Um along each of 
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Figure 13: Signal and background distributions for electron track isolation, normalized to the electron 
track p T , in a cone AR <0.2. 



the transverse x and y axes. In Figs. 14 and 15, the transverse impact parameter significance for muons 
and electrons is shown. Tracks with dO significance greater than 3 are not included in the primary vertex 
fit, and this causes the shoulder visible in the distributions. The discriminating variable used in the event 
selection for Zbb and tt background rejection, is the maximum lepton impact parameter in the event. 
This is shown in Figs. 16 and 17 for 4/x and 4e events, respectively. For electron tracks, the maximum 
impact parameter normalized to its error is required to be less than 6, while for muons less than 3.5. 




do significance dO significance 



Figure 14: Transverse impact parameter significance 
for muons from signal and reducible background 
events. 



Figure 15: Transverse impact parameter significance 
for electrons from signal and reducible background 
events. 



4 Event selection 

In this section the full set of cuts performed in this analysis are summarized and the event kinematic 
reconstruction is described. These cuts including the isolation and vertexing cuts covered in the previous 
section are summarized in Table 5. 

4.1 Event preselection 

Events that pass the trigger selection are required to further satisfy certain lepton preselection criteria. 
An electron must satisfy the LooseElectron requirement described in Section 2.3.1, and have an Ej>5 
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dO significance dO significance 



Figure 16: Maximum impact parameter significance Figure 17: Maximum impact parameter significance 
in 4-muon events, for signal and reducible back- in 4-electron events, for signal and reducible back- 
grounds, grounds. 



Event Preselection 


Four leptons: LooseElectrons or muons. 

p T >l GeVand |tj| < 2.5, at least two with p T >20 GeV 


Event Selection 


Kinematic Cuts 


Lepton quality: 2 pairs of same flavour opposite charge leptons. 

Electrons must be MediumElectrons satisfying the Calolso criterion. 

For H masses of 200 GeV and higher, four LooseElectrons are required instead. 

Z, Z* and Higgs boson reconstruction: single quadruplet with 

\mn -m z \ < Amn GeV, m ll2 > m 34 . 


Isolation and 
vertexing cuts 


Muon Calorimetric isolation (LEj / Pt < 0.23). 
Lepton Inner detector track isolation (Lpt / Pt < 0.15). 
Cut on maximum lepton impact parameter 
(<f () /cj 0 < 3.5 for muons, do/<Jd 0 < 6.0 for electrons). 



Table 5: Summary of the analysis cuts for the 4£ analysis. The two lepton pairs are denoted as mm 
and mm. The values of the mass window Ami 2 and of the cut m 34 are defined in Table 6. 



GeV and |t]| < 2.5. Muons are selected by requiring pr>5 GeV and |t]| < 2.5 (see section 2.3.2). The 
final stage of event preselection requires at least four leptons with p T >7 GeVand \r}\ < 2.5, with at least 
of two these leptons having pr>20 GeV. 

4.2 Event selection and kinematic reconstruction 

Following Table 5, in this section we review the kinematic cut part of the event selection. These cuts 
have been optimized separately for each Higgs boson mass considered in this note. 

Lepton quality requirements: 

Events used in this analysis are required to have at least four leptons (e, }x) which can be coupled in 
pairs of opposite charge and same flavour. These leptons are required to satisfy the requirements: 

• Electrons: for Higgs boson masses below 200 GeV, electrons are required to satisfy the Medium- 
Electron quality requirement, and the calorimetric isolation in a AR = 0.2 cone (Calolsolation). 
For masses above or equal to 200 GeV, due to the higher momentum of the decay electrons, the 
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MediumElectron requirement is relaxed and these electrons are required to satisfy the LooseElec- 
trons quality (see Section 2.3.1). This is a conservative choice, due to the fact that complete studies 
on Z+jets background rejection were limited by the number of simulated events available. 

• Muons are required to be either reconstructed by the combined reconstruction algorithm, or by 
the tagging algorithm which is matching tracks in the inner detector to hit patterns in the muon 
spectrometer (see Section 2.3.2). 

Z, ZM and Higgs boson mass reconstruction 

The 4-lepton Higgs boson candidate mass reconstruction proceeds after selecting one single lepton 
quadruplet in an event. When more than one quadruplet is found, the one with a dilepton mass closest to 
the nominal Z mass, and with the highest ^rleptons associated to the second Z, is chosen. The resolution 
of the dilepton mass can be improved by applying a Z-mass constraint to the pair with a mass closest 
to the Z invariant mass. When both Z's are on-shell (for Higgs boson masses of 200 GeV and above), 
the Z-mass constraint can be applied to both lepton pairs. The constraint itself is a convolution between 
the nominal Z Breit-Wigner distribution, and a gaussian distribution centered at the measured Z value 
with o equal to the experimental resolution. The distribution of the Higgs boson mass reconstructed in 
the case of a 130 GeV Higgs boson is shown in Figs. 18, 19 and 20 for the 4e, 4\i and 2e2}X channels 
respectively. Only the gaussian region of the distribution is considered in the fit in the case of the 4e 
channel: the fraction of events falling within ±2a from the mean value is 81.7% in this case. The tail of 
the distribution is due to electron bremsstrahlung losses upstream the calorimeters. 




nw [GeV] 



Figure 18: Reconstructed H(130 GeV)^ 4e mass 
after application of the Z-mass constraint fit. 




Figure 19: Reconstructed H(130 GeV)^ 4]X mass 
after application of the Z-mass constraint fit. 



The Higgs boson mass resolution for masses for which the Higgs boson has a negligible intrinsic 
width, is shown in Figs. 21 and 22. In these figures it is shown that the Z-mass constraint improves the 
mass resolution by 10% to 17%. The 4-lepton mass shifts after the Z-mass constraint are shown in Fig. 
23 for each of the three decay channels. Independently of the fit, the 4e mass is biased to lower values due 
to material effects (see Section 2.3.1). For this reason the electron energy has already been corrected by 
+1% based on the difference of the reconstructed Z mass and the PDG Z mass. The Z-mass constraint is 
only slightly correcting for these effects. In the 4\i channel, the bias introduced by the Z-mass constraint 
is negligible. 

Finally, the set of kinematic cuts applied to the reconstructed Z invariant masses is shown in Table 
6. The cuts have been optimized using the expected distributions for signal and backgrounds, and the 
expected dilepton resolution. In this table Z\ is the dilepton with a mass closest to the nominal Z mass, 
while Z2 is the lower mass dilepton pair. To estimate the significance, the events in a ±2a mass window 
are selected. 
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Figure 20: Reconstructed H(130 GeV)^ 2e2 i u mass after application of the Z-mass constraint fit. 



150 160 170 180 
Higgs Mass [GeV] 



Higgs Mass [GeV] 



Figure 21: Higgs^ Ae mass resolution as a func- 
tion of the Higgs boson mass. Open circles denote 
the resolution obtained when no Z-mass constraint 
is applied, while full circles show the resolution in 
the case of the Z-mass constraint. 

4.3 Event selection results 



Figure 22: Higgs^ 4/x mass resolution as a func- 
tion of the Higgs boson mass. Open circles denote 
the resolution obtained when no Z-mass constraint 
is applied, while full circles show the resolution in 
the case of the Z-mass constraint. 



The cut flow for the selection of a 130 GeV Higgs boson is shown in Tables 7 and 8, for signal and 
backgrounds respectively. The same is shown for the tt background in Table 9. In this case, the 
available MC statistics is not sufficient to determine the number of expected events, and only upper 
limits at 90% CL are set. The final selection efficiencies are shown in Figs. 24 and 25 The distributions 
of the reconstructed 4-lepton mass, obtained after all cuts, are shown in Figs. 26, 27, 28 for three of the 
low-mass values (130, 150, and 180 GeV), and in Figs. 29, 30, and 31 for three of the high-mass ones 
(300, 400, and 600 GeV). 

The number of expected events shown for the three decay channels combined is computed using NLO 
cross-sections (Section 2.1), for a luminosity of 30 fb _1 . The NLO cross-sections after the full event 
selection, are shown in Table 10 for the three decay channels separately and combined. In this table, 
signal events are selected within a m# ±2o mH mass window, and systematic errors are not yet taken into 
account in the significance calculation. Here o mtl is the experimental 4-lepton mass resolution, shown in 
Table 6. The significances for the each of the three channels, and their combination, are summarized in 
Fig. 32. 
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Figure 23: Shift of the mean 4-lepton mass for each of the three decay channels, with and without the 
Z-mass constraint on the dilepton mass. 



H Mass 


Zi mass window 


Z2 mass cut 


Hma 


5S resolution (GeV) 


(GeV) 


(GeV) 


(GeV) 


4e 




2e2/x 


120 


±15 


>15 


2.0 


1.8 


1.9 


130 


±15 


>20 


2.2 


1.8 


1.9 


140 


±15 


>30 


2.2 


2.0 


2.1 


150 


±15 


>30 


2.3 


2.1 


2.2 


160 


±15 


>30 


2.4 


2.2 


2.3 


165 


±15 


>35 


2.5 


2.4 


2.4 


180 


±12 


>40 


2.8 


2.7 


2.8 


200 


±12 


>60 


3.9 


3.7 


3.8 


300 


±12 


±12 


8.4 


8.4 


8.4 


400 


±12 


±12 


16.5 


17.3 


17.2 


500 


±12 


±12 


33.8 


34.4 


32.8 


600 


±12 


±12 


52.2 


57.2 


53.2 



Table 6: Cuts applied to the reconstructed leading and sub-leading Z masses, and the Higgs boson mass 
resolution values used to define the signal region. 



4.4 Estimates of WZ and Z+Jets backgrounds 

The contribution of other potentially dangerous backgrounds have also been estimated. Examples are the 
WZ^ 3£ and the inclusive Z+X where the Z decays leptonically. In the WZ^ 3£ background pileup and 
cavern background are included. WZ is found to give a negligible contribution to the total background: 
the upper limit on the expected number of events is lower than the limit on tt events. The Z+jets process 
provides one of the most serious backgrounds to the 4e-channel at low masses. With the available MC 
statistics (500K events, see Table 2), no event survives the lepton quality and pj selection cuts. An 
estimate based on cut factorization leads to an evaluation of the expected number of events after the full 
event selection, that would correspond to one event after the initial lepton quality and p T cuts. This is at 
the level of about twice the Zbb background, i.e. below the ZZ* continuum. The available MC statistics 
is therefore not sufficient to set stringent limits on the Z+jets background. However, the rejection based 
on the lepton quality and p T cuts should guarantee the complete removal of this background. 
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Selection cut 


Selection step 


Signal 






4e 


All 


2e2jU 


Trigger selection 


1 


94.7 


95.3 


95.7 


Lepton preselection 


2 


57.0 


73.8 


66.8 


Lepton quality and pj 


3 


24.7 


60.5 


39.7 


Z's mass cuts 


4 


17.1 


42.9 


27.6 


Calo Isolation 


5 


17.1 


39.5 


25.4 


Tracker Isolation 


6 


16.5 


38.1 


24.7 


IP cut 


7 


15.1 


36.5 


23.2 


H Mass cut 


8 


12.5±0.3 


31.4±0.5 


19.2±0.4 



Table 7: Fraction of events (in %) selected after each event selection cut, for each of the three decay 
channels, and for a 130 GeV Higgs boson. The efficiencies of each selection are calculated with respect 
to the fraction of events in which the Higgs boson decays into the corresponding channel, and that pass 
the generator filter described in Section 2. 1. 



Selection cut 




4e 


ZZ 

All 


2e2jU 


4e 


Zbb 

Ail 


2e2jU 


Trigger 


1 


96.6 


96.6 


96.6 


91.4 


91.4 


91.4 


Preselection 


2 


13.8 


17.6 


31.4 


2.6 


9.4 


12.0 


Lepton quality and p T 


3 


7.3 


16.0 


21.9 


1.110 1 


2.1 


1.7 


Z mass cuts 


4 


6.9 


14.8 


20.2 


4.7-10~ 2 


1.1 


8.4-10- 1 


Calo Isolation 


5 


6.9 


13.9 


19.5 


4.7-10- 2 


8.510 2 


1.2-10- 1 


Track Isolation 


6 


6.8 


13.6 


19.2 


1.310 2 


3.3-10- 2 


4.4- 10~ 2 


IP cut 


7 


6.2 


13.0 


17.8 


5.6-10" 3 


1.1-10 2 


1.8-10- 2 


H Mass window 


8 


5.2-10- 2 


11.310 2 


12.0-10" 2 


1.6-10" 3 


1.2-10" 3 


3.0-10- 3 



Table 8: Fraction of events (in %) selected after each event selection cut for the background processes. 
The 130 GeV Higgs boson mass selection cuts are applied. 



4.5 Effects of pile-up and cavern background 

The effects of pile-up and cavern background in the analysis are studied for the analysis for the Higgs 
boson mass of 130 GeV . The signal selection efficiencies are shown in Fig. 33, for the Aji and Ae channel, 
as a function of the selection cuts listed in Table 7. Table 12 summarizes the results for the three decay 
channels. The effect of pileup is to decrease the signal selection efficiency by about 10%, for all three 
decay channels. This decrease is due to a slight decrease in the trigger efficiency, and of the calorimetric 
and tracker isolation cut efficiencies. Part of the loss can be recovered by reoptimizing these cuts. 

5 Systematic uncertainties 

Central to the H^ At analysis is the estimate of the background in a candidate signal region. In this sec- 
tion the systematic uncertainties on quantities associated with the background estimation and the signal 
efficiency are discussed. First the theoretical uncertainties are presented. Subsequently, the impact of 
experimental systematic uncertainties on the event selection is discussed. Some of the systematics, like 
the theoretical uncertainties and systematics on the signal efficiency, affect the estimate of the expected 
sensitivity. Signal significance extraction from real data is affected by systematics on the knowledge of 
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Selection cut 




4e 


4jU 


2€l\x 


Trigger 


1 


75.1 


75.1 


75.1 


Preselection 


2 


1.0 


4.7 


10.1 


Lepton quality and 


3 


6.810 3 


7.3-10- 1 


5.810 1 


Z mass cuts 


4 


1.610 3 


2.0-10- 1 


l.o-io- 1 


Calo Isolation 


5 


1.610 3 


1.610 3 


5.4-10~ 3 


Track Isolation 


6 


2.6-10- 4 


2.5-10- 4 


l.o-io- 3 


IP cut 


7 


2.6-10- 4 


<6-10~ 4 


2.6-10- 4 


H Mass window 


8 


<6-10- 4 


<6-10- 4 


<6-10- 4 



Table 9: Fraction of events (in %) selected after each event selection cut for the tt background. For small 
available statistics 90% CL limits are considered. 
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Figure 24: Selection efficiency as a function of the 
Higgs boson mass, for each of the three decay chan- 
nels, for the case of only one on-shell Z. 



Figure 25: Selection efficiency for as a function of 
the Higgs boson mass, for each of the three decay 
channels, for the case of two on-shell Z's. 



lepton energy scale and resolution, as determined for example from inclusive Z studies, lepton recon- 
struction efficiency, and reducible background knowledge from control samples. 

5.1 Theoretical uncertainties 

The major theoretical uncertainties in the prediction of the inclusive background cross-sections are the 
PDF uncertainties and uncertainties related to the QCD renormalization and factorization scales. Scale 
uncertainties reflect theoretical uncertainties due to the omission of higher order diagrams. PDF and 
scale uncertainties have already been discussed and evaluated for the main backgrounds and are only 
recalled here (see Section 2.1). In summary, for the calculation of the NLO inclusive cross-sections, the 
QCD scales have been independently varied in the range (0.5-2) x the energy scale of the process. The 
PDF uncertainty has been evaluated by making use of 40 sets of PDF's for CTEQ6M (20 plus and 20 
minus). 

5.2 Experimental uncertainties 

Systematic effects on the H^ 41 analysis arise from experimental uncertainties related to the lepton re- 
construction. The major contributions in the total systematic uncertainty in the H^ 4£ yield come from 
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Figure 26: Reconstructed 4-lepton mass for signal Figure 27: Reconstructed 4-lepton mass for signal 
and background processes, in the case of a 130 GeV and background processes, in the case of a 150 GeV 
Higgs boson, normalized to a luminosity of 30 fb~ 1 . Higgs boson, normalized to a luminosity of 30 fb~ 1 . 





200 210 220 230 240 &0 220 240 : 

[GeV] 



Figure 28: Reconstructed 4-lepton mass for signal Figure 29: Reconstructed 4-lepton mass for signal 
and background processes, in the case of a 180 GeV and background processes, in the case of a 300 GeV 
Higgs boson, normalized to a luminosity of 30 fb~ 1 . Higgs boson, normalized to a luminosity of 30 fb~ 1 . 



uncertainties in lepton energy scale, reconstruction and identification efficiency. The impact of these un- 
certainties on the analysis is studied by applying variations to offline reconstructed variables. The level 
of these variations has been provided by the performance groups. 
Uncertainties in lepton energy scale 

Uncertainties on the energy scale of electrons arise from the EM calibration. These are considered by 
varying by ±0.5% the Ej of the reconstructed electrons. Energy scale uncertainties for muons arise due 
to the imperfect knowledge of the magnetic field. Here the recostructed muon pj is varied by ±1%. 
These values are assumed on the basis of the foreseen in-situ determination of the detector performance. 

Uncertainties in lepton energy resolution 

The level of knowledge of the material distributions in ATLAS affects the lepton energy reconstruction. 
To properly evaluate the impact of this contribution on the analysis, the reconstructed electron energies 
are smeared with a Gauss function using a Oe t = 0.0073-£y. This extra smearing deteriorates the trans- 
verse energy resolution of 50 GeV electrons by a relative 10%. In the muon system, an additional term 
can be added to this smearing, to take into account misalignment uncertainties. The total muon smearing 
is G\ / PT = 0.01 1 /pj® 0.00017 (with pj in GeV). In the pr range of interest for Higgs boson searches, 
the second term is negligible. The values of the corrections described above have been chosen so that the 
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Figure 30: Reconstructed 4-lepton mass for signal 
and background processes, in the case of a 400 GeV 
Higgs boson, normalized to a luminosity of 30 fb _1 . 



Figure 31: Reconstructed 4-lepton mass for signal 
and background processes, in the case of a 600 GeV 
Higgs boson, normalized to a luminosity of 30 fb~ 1 . 




Figure 32: Expected signal significances computed Figure 33: Fraction of selected events with and with- 
using Poisson statistics, for each of the three decay out pile-up and cavern background, for the cuts in 
channels, and their combination. Tables 7 and 8 (130 GeV H^4e and 4/i analyses). 

nominal resolution on muon pj of 3% is increased to 3.3% after applying the extra smearing. 
Uncertainties in lepton reconstruction efficiency 

The impact of uncertainties in the lepton reconstruction efficiency can be estimated by discarding a fixed 
fraction of leptons before the analysis. The level of uncertainties considered here is 0.2% for electrons 
and 1% for muons, motivated by performance group studies. 

Material effects in electron efficiency 

Uncertainties in electron efficiency receive a large contribution from uncertainties in the knowledge of 
material upstream the LAr EMC. Systematic effects influence shower shape discriminants included in the 
electron identification criteria. Examples of such discriminants are the mean energy fraction in a core of 
3x7 middle sampling cells normalized to a window of 7x7 cells, and the mean energy fraction outside 
a 3-strip core and inside a 7-strip window. As discussed in [17], the presence of extra material shifts 
and changes the shapes of these distributions, hence reducing the discrimination power of these cuts. 
The integrated effect in electron efficiency is rather small (less than 2%). However the true systematic 
uncertainty in the efficiency due to the knowledge of the material depends on how well the material and 
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Mass (GeV) 




120 


130 


140 


150 


160 


165 


180 


Selection 




















Signal 


0.043 


0.124 


0.239 


0.297 


0.162 


0.078 


0.205 




zz*/r 


0.028 


0.027 


0.020 


0.017 


0.033 


0.044 


0.196 


4e 


Zbb 


0.006 


0.013 


0.004 


0.004 


< 0.004 


0.006 


0.002 




« 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 




Significance (30 ft -1 ) 


0.9 


2.4 


4.8 


6.1 


3.3 


1.4 


2.1 




Signal 


0.108 


0.311 


0.563 


0.707 


0.381 


0.177 


0.476 




zz*/f 


0.052 


0.059 


0.061 


0.017 


0.073 


0.080 


0.258 




Zbb 


0.019 


0.009 


0.009 


0.008 


0.006 


0.004 


0.008 






<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 




Significance (30 ftr 1 ) 


1.7 


4.4 


7.0 


8.4 


5.1 


2.6 


4.1 




Signal 


0.130 


0.381 


0.709 


0.932 


0.485 


0.229 


0.642 




zz*/r 


0.063 


0.063 


0.081 


0.074 


0.102 


0.116 


0.483 


2e2jU 


Zbb 


0.030 


0.025 


0.013 


0.009 


0.009 


0.004 


0.004 




it 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 




Significance (30 ft" 1 ) 


1.8 


4.8 


7.7 


9.8 


5.6 


2.8 


4.2 




Signal 


0.281 


0.816 


1.511 


1.94 


1.03 


0.484 


1.32 




zz*/r 


0.143 


0.150 


0.163 


0.151 


0.208 


0.240 


0.938 


All 


Zbb 


0.055 


0.047 


0.026 


0.021 


0.015 


0.013 


0.013 




it 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 


<0.04 




Significance (30 ft" 1 ) 


2.8 


7.1 


11.5 


14.2 


8.2 


4.2 


6.2 



Table 10: Cross-sections (in ft) after the full event selection for the signal and irreducible and reducible 
backgrounds. The cross-sections are given for each of three channels Ae ,A\i and 2el\i , and for their 
combination. When no event is passing the event selection, 90% C.L. limits on the cross-section are set. 
For each channel and for their combination, the expected significance is given for 30 ft~ 1 . It is assumed 
that the background in the signal region is known with negligible uncertainty. The background is 
assumed not to contribute to the signfificance, 



the shower shapes can be measured using data. 

5.3 Summary of the systematic uncertainties 

The impact of the various lepton systematic uncertainties on the Higgs boson signal yield and background 
rejection is summarized in Table 13 for the various 4£ final states. Variations have been applied to both 
signal and background samples, and the analysis algorithm ran with the hypothesis of mu = 130 GeV. 
A 3% uncertainty on the luminosity has been taken into account in the total systematic error calculation. 
The total systematic error on the signal efficiency has been included in the calculation of the exclusion 
limits described in the following. 

6 Background extraction from data and significance estimation 

In Section 4 the significance has been obtained assuming that the background is known with a negligible 
uncertainty. In this section various methods to extract the background from data, evaluate the background 
uncertainties, and include them in the significance calculation, are presented. 
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Mass (GeV) 




200 


300 


400 


500 


600 


Selection 














4e 


Signal 


1.41 


0.917 


0.737 


0.370 


0.174 




XL* If 


0.689 


0.342 


0.225 


0.228 


0.179 




Significance (30 fb" 1 ) 


7.4 


6.6 


6.3 


3.4 


1.9 


All 


Signal 


1.94 


1.16 


0.918 


0.463 


0.206 




72* If 


0.867 


0.468 


0.379 


0.346 


0.296 




Significance (30 fb _1 ) 


9.0 


7.2 


6.3 


3.6 


1.8 


2e2\l 


Signal 


3.33 


2.13 


1.69 


0.825 


0.377 




zz*/r 


1.53 


0.836 


0.610 


0.570 


0.438 




Significance (30 fb -1 ) 


11.7 


9.8 


9.0 


5.0 


2.7 


All 


Signal 


6.68 


4.21 


3.34 


1.66 


0.76 




72* If 


3.09 


1.65 


1.21 


1.14 


0.914 




Significance (30 fb" 1 ) 


16.5 


13.8 


12.7 


7.1 


3.8 



Table 1 1 : Cross-sections (in fb) after the full event selection for signal and irreducible background. For 
Higgs boson masses above 180 GeV the contribution of the reducible backgrounds to the total back- 
ground cross-section is negligible. The cross-sections are given for each of three channels 4e, 4\i and 
2e2\i , and for their combination. When no event is passing the event selection, 90% C.L. limits on the 
cross-section are set. For each channel and for their combination, the expected significance is given for 
30 fb -1 . It is assumed that the background in the signal region is known with negligible uncertainty. 





4e 


4yc 


2e2jU 


No Pileup and CB 
With Pileup and CB 


12.5±0.3 
12.1±0.3 


31.4±0.5 
28.4±0.5 


19.2±0.4 
17.0±0.4 



Table 12: Selection efficiencies in %, after all cuts, for the signal at 130 GeV, and for each of the three 
decay channels. The efficiencies are shown for the two cases: with and without the addition of minimum 
bias low-luminosity pileup and cavern background with safety factor 5. 



The main challenge in measuring the background for the 4£ channel over a very wide mass range 
(from mu 120 GeVto m 4 c 600 GeV), comes from the fact that signal and background shapes and cross- 
sections vary considerably. The dominant background comes from the ZZ**) continuum. While we 
expect to measure the ZZ in the high mass region (M^ > 1 80 GeV) where the reducible backgrounds 
become negligible, in the low mass region the significant presence of Zbb and tt requires their knowledge. 
Measurements of these backgrounds with early data will provide upper bounds in their expectation after 
all Higgs boson analysis cuts. 

6.1 Significance determination 

Most of the systematic uncertainties discussed in the previous section do not contribute to the signif- 
icance determination if the data distributions are fitted, and signal and background are extracted from 
the fit. The resulting uncertainty in the knowledge of the background reduces the confidence level for 
claiming a discovery. In this section a fit-based approach for background and significance extraction is 
presented. A fit of the selected 4-lepton invariant mass, using the signal hypothesis at a fixed mass and 
applying the profile likelihood ratio method to extract the significance is considered. The fit method, and 
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Zbb 


ZZ 


H 


Zbb 


ZZ 




Z&Z? 


ZZ 


H 




4e 


Ail 


2e2jU 


Scale +0.5% (+1%) 


+1.5 


+0.1 


+0.9 


+2.4 


+0.4 


+1.3 


+1.9 


+0.1 


+0.9 


Scale -0.5% (-1%) 


-1.1 


-0.2 


-0.5 


-2.3 


-0.3 


-2.5 


-1.7 


-0.2 


-1.4 


Resolution 


-0.5 


-0.1 


-0.4 


+0.1 


-0.1 


-2.6 


-0.2 


-0.1 


-0.5 


Rec. efficiency 


-1.0 


-0.7 


-0.5 


-3.8 


-4.0 


-3.8 


-2.0 


-2.1 


-1.7 


Luminosity 


3 


3 


3 


Total 


3.6 


3.1 


3.2 


5.4 


5.0 


6.0 


4.1 


3.7 


3.8 



Table 13: Impact, in %, of the systematic uncertainties on the overall selection efficiency, as obtained for 
am H = 130 GeVin the 4e,4^, and 2e2fx final states. 



the results obtained after varying the background shape, the fit range and even including background- 
only fits without the signal hypothesis, are the main subject of this section. An alternative approach with 
a global two-dimensional (2D) fit on the {mz*,mu) plane is also briefly discussed. 

The baseline method used as input to the combination of all ATLAS Standard Model Higgs boson 
searches, is based on a fit of the 4-lepton invariant mass distribution over the full range from 110 to 
700 GeV. The method, whose details are described in [21], is summarized below. 

The 4-lepton reconstructed invariant mass after the full event selection (except the final cut on the 
4-lepton reconstructed mass) is used as a discriminating variable to construct a likelihood function. The 
likelihood is calculated on the basis of parametric forms of signal and background probability density 
functions (pdf) determined from the MC. For a given set of data, the likelihood is a function of the 
pdf parameters p and of an additional parameter }X defined as the ratio of the signal cross-section to 
the Standard Model expectation (i.e. fx = 0 means no signal, and }X = 1 corresponds to the signal rate 
expected for the Standard Model). To test a hypothesized value of }X the following likelihood ratio is 
constructed: 

L(n,p) 

where p is the set of pdf parameters that maximize the likelihood L for the analysed dataset and for 
a fixed value of jx (conditional Maximum Likelihood Estimators), and (fl,p) are the values of jl and 
p that maximise the likelihood function for the same dataset (Maximum Likelihood Estimators). The 
profile likelihood ratio is used to reject the background only hypothesis (ii = 0) in the case of discovery, 
and the signal+background hypothesis in the case of exclusion. The test statistic used is = — 21nA( i u), 
and the median discovery significance and limits are approximated using expected signal and background 
distributions, for different m#, luminosities and signal strength jj,. The MC distributions, with the content 
and error of each bin reweighted to a given luminosity (in the following referred to as "Asimov data", 
see [21]), are fitted to derive the pdf parameters: in the fit, m# is fixed to its true value, while Oh is allowed 
to float in a ±20% range around the value obtained from the signal MC distributions. All parameters 
describing the background shape are floating within sensible ranges. The irreducible background has 
been modelled using a combination of Fermi functions which are suitable to describe both the plateau 
in the low mass region and the broad peak corresponding to the second Z coming on shell. The chosen 
model is described by the following function: 

f(mzz) = p6- mzz P ° + P i- mzz Pl g=^ - ( 2 ) 
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The first plateau, in the region where only one of the two Z bosons is on shell, is modelled by the first 
term, and its suppression, needed for a correct description at higher masses, is controlled by the pS and 
p9 parameters. The second term in the above formula accounts for the shape of the broad peak and the 
tail at high masses. This function can describe with a negligible bias the ZZ background shape with good 
accuracy over the full mass range. The Zbb contribution is relevant to the background shape only when 
searching for very light Higgs boson (in this study, only at m# = 120 GeV). In this case, an additional 
term is added to the ZZ continuum, with a functional form similar to the second part of equation 2. For 
the signal modelling a simple gaussian shape has been used for m# < 300 GeV, while a relativistic Breit- 
Wigner formula is needed at higher values of the Higgs boson mass. In Figs. 34 and 35 two examples of 
pseudo-experiments with the resulting fit functions for signal and background are shown. 



— 30; 




100 150 200 250 300 350 400 450 500 



m 4l [GeV] 

Figure 34: A pseudo-experiment corresponding to 
30 fb -1 of data for a Higgs boson mass of 130 GeV. 
The functions fitting the signal and the background 
are shown. 




m 4l [GeV] 



Figure 35: A pseudo-experiment corresponding to 
30 fb -1 of data for a Higgs boson mass of 180 GeV. 
The functions fitting the signal and the background 
are shown. 



The results presented in the following approximate the significance from the test statistics as \J — 21nA(jU). 
In order for the results of the method to be valid, the test statistic = — 2ZnA(ju) should be distributed 
as a x 2 with one degree of freedom. The results obtained with the strategy described above must thus 
be validated using toy MC. Such validation tests show a good agreement of the test statistic with the 
expected x 2 distribution, as discussed in detail in [21]. This allows to approximate the significance from 
the test statistic as yf — 21nA (ju). The significances obtained as the square root of the median profile 
likelihood ratios for discovery, — 21nA(/i = 0) are shown in Table 14 for all m# values considered in this 
paper, and for various luminosities. In Fig. 36, the significance obtained from the profile likelihood ra- 
tio, after the fit of signal+background is shown. The significance is compared to the Poisson significance 
shown in Section 4. The slightly reduced discovery potential is due to the fact that several background 
shape and normalization parameters are derived from the data-like sample. 

Concerning exclusion, the median profile likelihood ratios are calculated under the background only 
hypothesis, and the integrated luminosity needed to exclude the signal at 95% C.L. is the one correspond- 
ing to V— 2/wA=1.64. The integrated luminosity needed for exclusion is shown in Fig. 37. 

The median significance estimation with Asimov data can be validated using toy MC pseudo-exp- 
eriments. For each mass point, 3000 background-only pseudo-experiments are generated. For each 
experiment, the profile likelihood ratio method is used to find which /x value can be excluded at 95% 
CL. The resulting distributions are then analysed to find the median and ±la and ±2a intervals. The 
outcome of this test is summarized in Fig. 38, where the 95% CL exclusion }X obtained from single fits 
on the full MC datasets is plotted as well. As shown, the agreement is good over the full mass range. 
Fitting the 4-lepton mass distribution with all parameters left free in function 2 allows the fit to absorb 
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L 


m# (GeV) 


(fb- 1 ) 


120 


130 


140 


150 


160 


165 


180 


200 


300 


400 


500 


600 


1 


0.5 


1.1 


2.0 


2.3 


1.3 


0.7 


0.9 


2.6 


2.3 


1.9 


0.9 


0.6 


2 


0.7 


1.6 


2.8 


3.3 


1.8 


1.0 


1.3 


3.7 


3.2 


2.8 


1.3 


0.8 


5 


1.0 


2.4 


4.5 


5.2 


2.9 


1.6 


2.1 


5.9 


5.1 


4.2 


2.1 


1.2 


10 


1.5 


3.5 


6.3 


7.3 


4.1 


2.2 


2.9 


8.3 


7.2 


6.0 


2.9 


1.8 


30 


2.6 


6.0 


10.9 


12.7 


7.0 


3.8 


5.1 


14.4 


12.7 


10.4 


5.3 


3.3 



Table 14: The significances obtained from the median profile likelihood ratios for discovery -21nA( i u = 
0), for all Higgs boson masses considered and for various luminosities. 





Figure 36: Significance obtained from the profile Figure 37: The luminosity needed for exclusion 

likelihood ratio, as a function of the Higgs boson the Standard Model Higgs boson with the H 

mass. The result is compared with the one shown ZZ* — > 41 channel alone, as a function of the Hig 

in Section 4 where systematic errors on signal and boson mass, 
background have not been included, and the signifi- 
cance has been calculated using Poisson statistics. 



possible systematics. Inclusion of systematic effects on mass scale and resolution (see Section 5), and on 
the knowledge of the reducible background, have a total effect smaller than 4% on the significance from 
the fit. 

In addition to the baseline method described above, different fit assumptions have been studied and 
are discussed below for completeness. 

A background-only fit can be performed using pseudo-experiments corresponding to 30 fb -1 of 
data, including both signal and backgrounds; the function described in formula 2 is used to describe the 
background, and the fit is performed over the full range 1 10-700 GeV, but in this case all the parameters 
apart from the overall normalization are fixed to the values obtained from a fit of the full MC statistics 
available. The signal mass window, defined for each value of the Higgs boson mass as described in 
Section 4, is excluded from the fit (sideband-fit), and the expected background in the signal region is 
obtained as the integral of the background function in the signal window. The pseudo-experiments fits 
provide an estimate of background statistical fluctuations and of possible biases introduced by the fit, 
that must be taken into account in the significance determination. The statistical uncertainty on the 
background resulting from the fit is no more than 6% over the full mass range. 

Another method based on the sidebands consists of calculating, using the full MC statistics, the ratio 
T = between the background events in the signal mass window (Bmw) an d those in the sidebands 
(Bsb)- In the pseudo-experiments, the background level in the signal mass window is estimated as: 
Bmw(i) = f x Ngg s (i) and the signal as: Smw(i) = N^{i) —Bmw(i). The resulting uncertainty on the 
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Figure 38: Validation of the median significance estimation with toy Monte-Carlo experiments. The jJ, 
value corresponding to 95% CL exclusion obtained from Asimov data is compared to the one obtained 
using the median profile likelihood ratio from the toy MC pseudo-experiments. 



background is no more than 5% over the full mass range. The significance is then obtained, in both 
cases, using the profile likelihood ratio method applied to the case of a counting experiment, assuming 
the background mean, and statistical error on the background as from the fit. The results for 130, 150 
and 180 GeV Higgs boson mass are summarized in Table 15. The sideband fit and the T-ratio method 



Method 



Background 130 
error GeV 



150 



from T-ratio 6.7 



Table 15: The signal significance for various Higgs boson masses, obtained including the background 
uncertainties from the sideband fit and the T-ratio methods. 

can be used to test the impact of some systematic effects, like the mass scale and resolution in the ZZ 
threshold region at 1 80 GeV, and the uncertainties on the reducible Zbb background. The impact of a 
±1% variation of the energy scale on the significance from the fit with all parameters fixed apart from the 
normalization is of 21% for a Higgs boson mass of 180 GeV, the most critical region. A ±20% variation 
of the mass resolution has a 7% effect on the significance. Systematics connected to the knowledge 
of the reducible Zbb background have been tested removing this background from the samples used to 
calculate the fit parameters or the T-ratio, and have been found to be at the level of 8%. 

The presence of uncertainties in the shape of the m 4 ( distribution at low masses (m^ < 180 GeV) 
and the potential difficulty of predicting the shape in this region using the distribution at higher masses 
m\i > 180 GeV, motivate studies where shape information is extracted from restricted fits in this low 
mass region. Examples are fits in the region from 110 to 170 GeV, where the background is expected 
to have a simple shape. In this range we perform both background-only fits excluding the signal region, 
and fits including the signal hypothesis, using the profile likelihood method but a simpler background 
parametrization (1st order polynomial). For the background-only fits we extract the significance cal- 
culating the p-value as defined in [21], using the method described in [22]. The extracted significance 
has also been calculated throwing large numbers of toy-MC experiments that were used to calculate the 
p-value, in order to validate the method of [22]. The background-only fits in the 1 10 to 170 GeV region 
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give a significance of 5.6a for a 130 GeV Higgs boson. When the signal hypothesis is included using the 
profile likelihood method, a significance of 5.8a is found. The small increase in the significance is due 
to the inclusion of the signal hypothesis which renders the statistical test more powerful. These results 
for the 130 GeV Higgs boson are close to those obtained from the full mass range fit baseline method 
(Table 14). Independent measurements (e.g. Zbb background) are expected to provide constraints in the 
knowledge of the background thus improving the discovery potential. 

The fitting approach discussed so far involves one-dimensional fits of the mu distribution. This 
approach can be generalized to a global 2D fit on the (m z »,m 4f ) plane. Such 2D fits, in contrast to the 
baseline approach, exploit correlations between mz« and m\t, after a single set of cuts independent of 
the Higgs boson mass. Having a single set of cuts allows the extraction of the signal through both fixed 
and floating Higgs boson mass fits. The 2D models on the (mz«,m«) plane, have been obtained for the 
ZZ and Zbb backgrounds. All available signal samples (for masses from 120 to 600 GeV) have been 
used to develop a one -parameter family of surfaces (the parameter being mu) that adequately models 
the signal for any intermediate value of the true Higgs boson mass. Table 16 summarizes the expected 
significance as a function of m#, for both fixed and floating Higgs boson mass window fits, and an 
integrated luminosity of 30 fb -1 . These results are obtained using toy MC pseudo-experiments using a 
median likelihood ratio for the fixed mass case, and a p-value for the floating mass case, as described 
in [21]. The significances shown in the first row of Table 16 are consistent with the corresponding results 
of the baseline analysis shown in Table 14. 



Mass (GeV) 


120 


130 


140 


150 


160 


2D fit Fixed Higgs boson Mass 
2D fit Floating Higgs boson Mass 


2.0 
1.1 


6.3 
5.6 


11.6 
10.9 


14.0 
13.2 


8.3 
7.5 



Table 16: Median significance from a 2D global fit on the {mz*,mu) plane for both fixed and floating 
Higgs boson mass window and an integrated luminosity of 30 fb -1 . 



7 Summary and conclusion 

The potential of ATLAS in observing the Higgs boson in its H — > ZZ* — > 4£ decay mode was presented. 
For an integrated luminosity of 30 fb -1 , ATLAS will discover the Higgs boson in the M channel-alone 
in the mass range from 130 — 500 GeV, with the exception of the region around 160 GeV of the WW 
turn on, where significances of about 4a were found. In the very low mass region of 120 GeV, just 
above the LEP limit, a significance close to 3a is expected: this is a strong contribution to the combined 
significance of the various Standard Model decay modes [21]. The H — > ZZ — > 4£ channel is highly 
sensitive in the high mass region (400 GeV > > 200 GeV), and in the 150 GeV region, where the 
Higgs boson should be discovered with 5 fb -1 . 

The results obtained in this work include studies of the effect of systematic uncertainties on the 
signal extraction. In parallel to these studies, several attempts to improve the sensitivity of the channel 
by increasing the signal yield and keeping the same level of background were performed. Examples are 
the relaxing of the lep ton-id criteria for one of the four leptons, the use of calorimeter for muon tagging, 
and application of multivariate techniques. Although the first results are encouraging, these studies are 
beyond the scope of this note since they are still in their very early stages. 
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Search for the Standard Model Higgs Boson via Vector Boson 
Fusion Production Process in the Di-Tau Channels 

Abstract 

We outline a search for the Standard Model Higgs boson decaying into a t- 
pair in association with two jets, which is produced dominantly by the Vector 
Boson Fusion (VBF) process. The results indicate significant potential for a 
discovery in the low mass range. We consider fully leptonic, semi-leptonic, 
and, for the first time, fully hadronic tau decays. Mass reconstruction, central- 
jet veto, and jet tagging are discussed, and we present an approach to estimate 
the background from the data. Additional emphasis has been given to trigger 
issues and the impact of pileup. The results are based on an improved detector 
description, including misalignments, the most recent reconstruction software, 
and modern Monte Carlo event generators, including a revised prediction of 
the underlying event activity. 

1 Introduction 

The search for the Higgs boson and the source of electroweak symmetry breaking is a primary task of the 
Large Hadron Collider (LHC). It has been shown [1] that the ATLAS detector is capable of discovering 
the Standard Model Higgs boson with masses ranging from the LEP limit of 114 GeV [2] to about 
1 TeV. The low-mass region is preferred from electroweak precision measurements and in this region 
(rriH < 130 GeV) the searches for Higgs bosons decaying to taus and photons are the most promising 
for discovery [3-5]. Searches for the Higgs boson produced in Vector Boson Fusion (VBF) tend to have 
reasonably high signal-to-background ratios, making them more robust to systematic uncertainties. 

Within the Standard Model, the ability to observe the Higgs boson in multiple production and decay 
configurations makes it possible to measure the Higgs boson coupling to fermions and vector bosons [6]. 
Furthermore, the VBF processes provide a tool for measuring the Higgs boson spin and CP properties [7, 
8]. In the context of the Minimal Supersymetric Standard Model, (MSSM), the branching ratio of a 
Higgs boson decaying to photons is generally suppressed, which makes the search for Higgs boson 
decaying to taus very important. The complementarity of the coupling of the light and heavy CP-even, 
neutral Higgs bosons of the MSSM to taus makes it possible to cover most or all of the — tan/3 plane 
by reinterpreting the results for a Standard Model Higgs boson decaying into taus in the context of the 
MSSM [9,10]. 

A previous ATLAS analysis outlined the sensitivity to a low mass Higgs boson including the first 
estimates for the VBF channels [4]. These results were primarily based on a fast simulation that parame- 
terized the results of key detector performance studies performed with a full GEANT simulation. In this 
note we have considered three decay modes: the lepton-lepton (//-channel), lepton-hadron (//z-channel) 
and the hadron-hadron (M-channel) from VBF H — ► t + t~ signature. The analysis has been done using 
state-of-the art Monte Carlo generators, full GEANT-based simulation of the ATLAS detector with re- 
alistic misalignments and distortions applied to the expected material in the detector, utilization of our 
current reconstruction algorithms, and, where possible, incorporation of pileup interactions. 

This analysis requires excellent performance from every ATLAS detector subsystem; the presence 
of T decays implies final states with electrons, muons, hadronic tau decays, and missing transverse 
momentum, while the Vector Boson Fusion production process introduces jets that tend to be quite 
forward in the detector. Due to the small rate of signal production and large backgrounds, particle 
identification must be excellent and optimized specifically for this channel. Furthermore, triggering relies 
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on the lowest energy lepton triggers or exceptionally challenging tau trigger signatures. The detector 
performance aspects so important to this analysis are described in Refs. [11-19]. 

1.1 Monte Carlo samples 

Estimating the sensitivity of ATLAS to this channel requires the state of the art in Monte Carlo tools. 
The most challenging aspect of the theoretical calculations is the description of jet activity, an area in 
which the tools have evolved substantially since ATLAS ' first publication on the sensitivity to the VBF 
processes. Details of the Monte Carlo samples are outlined in Ref. [20]. The signal samples were pro- 
duced with HERWIG [21] and PYTHIA [22]. The QCD Z+jets and W+jets samples were produced 
with ALPGEN [23], which employs the MLM matching [24] between the hard process (calculated with 
a leading-order matrix element for up to 5 jets) and the parton shower of HERWIG. The electroweak 
(ELWK) Z+jets background was simulated with SHERPA [25]. The «+jets and diboson background 
samples were generated with MC@NLO [26]. In all processes with taus, the tau decay was simulated 
using TAUOLA [27]. Additional photon radiation from charged leptons was simulated with PHO- 
TOS [28]. The production cross-section for the signal is based on the next-to-leading order (NLO) 
computation and the k-factor (the ratio of the cross-section to that predicted by the lowest order calcu- 
lation) is around 5% in the target mass range of 100-150 GeV. Note that the k-factor only involves the 
QCD corrections. 

Because the GEANT-based detector simulation is computationally intensive, an event filter was ap- 
plied to each sample after the parton shower and hadronization. Most processes were required to have 
at least one lepton in the final state. For background processes a VBF filter was used to remove events 
that would fail jet-related requirements. The filter bias has been studied and well- validated, but it affects 
our ability to estimate background rates early in the analysis cut flow. Furthermore, a significant Monte 
Carlo sample was produced with the ATLAS fast simulation, ATLFAST [29], without any event filter. 
These ATLFAST samples are used for systematic studies and to aid in the estimation of background 
rates (see Section 3.4). 

The effect of in-time pileup (i.e. other soft p-p collisions in the same bunch crossing), out-of- 
time pileup (i.e. p-p collisions in neighboring bunch crossings), and the underlying event (i.e. multi- 
parton scattering and soft activity in the p-p collision of interest) are all important to this analysis. 
The underlying event has substantial theoretical uncertainty, and different models' predictions for the 
underlying event activity vary by large factors when extrapolating to the LHC energy range. Fortunately, 
the underlying event activity will be one of the first measurements at the LHC and will be well measured 
by the time the analysis described in this Note is performed. The pileup interactions are incorporated 
early in the simulation chain, at the time when the detector readout is simulated. 

2 Event selection 
2.1 Triggering 

While the ATLAS trigger system provides several possibilities for triggering that take advantage of 
the signal's complex final state, we restrict ourselves here to simple robust trigger signatures that are 
expected to have a low rate and an acceptable selection efficiency [12, 13]. For the Ih and // final states 
the events are selected by an isolated electron with pj > 22 GeV (e22i) or an isolated muon with pj > 
20 GeV (mw20). The entire trigger chain has been simulated with the use of our current trigger algorithms 
and trigger menus; however, the dilepton triggers composed of isolated muons with pj > 10 GeV and 
isolated electrons with pj > 15 GeV considered in Ref. [4] were not used in this study. The trigger 
efficiency for VBF H —>■ tt (with m# = 120 GeV) is 9.0% for events selected by the electron trigger and 
9.9% in the case of muons. The trigger efficiencies include detector acceptance and are normalized with 
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Table 1: The product of efficiency and acceptance for the signal from the e22i, mu20, 
andL 1 _TAU30 _xE40 _sof tHLT triggers . 



Trigger menu 


Efficiency x Acceptance(%) 


e22i 


9.08 ± 0.03 


mu20 


9.88 ± 0.04 


Ll_TAU30_xE40_softHLT 


3.67±0.02 



respect to the production cross-section for VBF H — > zz. Additional triggers for the Ih and // final states, 
for instance the combined z + e or z+jJ, triggers and triggers which take advantage of the tagging jets of 
the VBF process, are under study. 

The all hadronic mode, or M-channel, utilizes a different triggering strategy. Unlike the clean sig- 
nature of the electron and muon triggers, the single tau trigger is expected to be exposed the large QCD 
jets background. Therefore only tau trigger in combination with other signatures, like missing Ej or 
another tau in the event, can be considered. We use Ll_TAU30_xE40_sof tHLT as the primary trigger 
menu for the /z/z-channel in this study. It should be noted that unlike the high pj single lepton triggers, 
both the hadronic tau and £"™ 1SS triggers are based on requirements from the first level of the trigger 
system with only a loose selection in the high-level trigger [30]. The expected trigger acceptance of 
Ll_TAU30_xE40_sof tHLT is listed in Table 1 as well as those from e22i and mu20 menus. The trigger 
efficiency for the signal events (for m H = 120 GeV) is 3.7% for Ll_TAU30_xE40_sof tHLT. The disad- 
vantage of the missing Ej trigger is the relatively low efficiency on signal; therefore, alternative menus 
like double tau menus are now being developed. 

2.2 Electron and muon reconstruction and identification 

Electron candidates are formed from a cluster of cells in the electromagnetic calorimeter together with a 
matched track. The electron identification includes information from the shape of the shower, tracking 
information, and the consistency of the track and cluster. ATLAS provides multiple working points 
that trade electron efficiency for improved rejection of fakes. In this analysis we use the Medium class 
electron as it provides sufficient fake rejection and provides a higher signal efficiency. In addition to 
the standard electron identification, we require that the energy in an isolation cone of radius AR = 0.2 
around the electron contains less than 10% of the electron's Ej 1 . The isolation cut is imposed to reject 
the contamination from hadronic jets. 2 The reconstruction and identification efficiency is fairly flat after 
pj > 15 GeV, where it achieves 69.4 ± 0.2% efficiency while keeping the fake electron contamination 
at the order of 0.1%. In the VBF H — ► zz — ► jJ,jJ, +4v signal sample, the probability to reconstruct a fake 
electron was found to be slightly higher, 0.25±0.03%, reflecting some level of process-dependence. 

In ATLAS, muon candidates can be seeded from either tracks in the inner detector or in the stand- 
alone muon spectrometer. In this analysis we required the highest quality muon candidates, which are 
formed by extrapolating the track in the muon spectrometer to the interaction point, finding a matching 
inner detector track, and forming a combined track if the two tracks satisfy various quality require- 
ments [15]. The muon identification is composed of requirements on track quality and hit multiplicity 
in several muon stations. Similarly to the electrons, we require an isolation condition that the summed 
Et within a radius AR of 0.2 is less than 10% of the muon pj to reject the contamination from jets. The 
reconstruction and identification efficiency is fairly flat after pj > 10 GeV, and it achieves 91.9 ± 0.1% 
while keeps the fake muon rejection under 0.005%. 

'Due to a problem with the reconstruction, a correction to the isolation energy in the Tile gap scintillator w as required. 
2 A track-based isolation requirement was also studied and shown to have similar performance. 



1273 



Higgs - Search for the Standard Model Higgs Boson via Vector Boson Fusion . 



The electron and muon identification criteria are summarized in Table 2. The pj thresholds for 
electron and muon identification are chosen to provide stable identification efficiency and sufficient fake 
rejection. In addition, we require that the pr of the offline reconstructed lepton must satisfy the pj 
thresholds of the corresponding trigger, which is not strictly enforced due to subtle differences between 
the offline reconstruction and the trigger algorithms. 

Table 2: Summary of the identification requirements for electrons and muons. 

Lepton identification 
Electron ID: Medium 

Isolation E T (AR = 0.2) /p T < 0.1 

p T > 25 GeV for trigger electron (e22i) 

pr > 15 GeV for other electrons 
Muon ID: Combined muon 

Isolation E T (AR = 0.2)1 p T < 0.1 

Pt > 20 GeV for trigger muon (mu20i) 

Pt > 10 GeV for other muons 



2.3 Hadronic-tau reconstruction and identification 

Approximately 65% of tau leptons decays produce hadrons. The majority of hadronic tau decays are 
composed of single-prong candidates with one charged pion, which provides a track and a hadronic 
shower, and potentially associated neutral pions that provide an additional electromagnetic sub-cluster. 
In addition, three-prong tau decays are also reconstructed, but with a higher rate of fakes from QCD 
jets. Due to the high momentum of the taus produced in this process, the decay products are collimated 
into a narrow region. ATLAS currently employs two hadronic tau reconstruction algorithms [31]; both 
require a calorimeter cluster matching a track; however, one algorithm is seeded by calorimeter clusters 
and the other is seeded by the track. The two algorithms' efficiencies are complementary in different pr 
regimes, and provide rejection strategies for their energy measurement and rejection against jets. The 
calorimeter-seeded algorithm was used for this analysis. 

The calorimeter-seeded algorithm provides a log-likelihood ratio that distills discriminating power 
from a variety of track quality and shower shape information to discriminate between taus and jets [14]. 
The discriminating variable is designed to maintain a high tau efficiency while rejecting fake tau candi- 
dates from jets, leaving the precise working point to be optimized in the context of a specific analysis. 
The cuts on the discriminating variable and p T of the tau candidates were optimized with respect to a 
simple s/\Js + b performance measure. The background sample included Z+jets, W+jets, and tt+jets, 
which comprises a background sample with a representative mixture of real and fake taus. Our model- 
ing of the jet fragmentation indicates that quark-initiated jets are more collimated and have a 6-8 times 
higher fake rate than gluon-initiated jets. The relative abundance of real and fake tau candidates depends 
on the kinematic requirements imposed on the sample, thus the optimization should be performed after 
the final kinematic requirements described in Sections 2.8 and 2.9. However, the limited size of Monte 
Carlo samples requires that only a subset of the criteria used in the final event selection are applied dur- 
ing the optimization. Several subsets of the final event selection criterion were evaluated, and the final 
optimization was found to be reasonably stable and nearly independent of pj. After the optimization, the 
calorimeter-seeded algorithm's log-likelihood ratio was required to be greater than 4, corresponding to 
an identification efficiency of 50.0±0.2% and a fake jet selection efficiency of ~1% for gluon-initiated 
jets and ~2.5% for quark-initiated jets. 
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Figure 1: Reconstruction and identification efficiency of the hadronic tau (a) and the jet-fake rejection 
efficiency (b) as a function of pj, respectively. 

In addition to rejection against jets, an electron-veto was used to reject tau candidates which arise 
from electrons that have failed the electron identification. This electron- veto was performed by requiring 
that the tau candidate have at least 0.2% of its energy in the first sampling of the hadronic calorimeter and 
that the ratio of high-threshold (HT) to low-threshold (LT) hits in the transition radiation tracker (TRT) 
be less than 20% in the range \ r\ T \ < 1.7. This electron- veto procedure suppresses the electron fake rate 
by 82.5% while retaining 90% of the hadronic tau candidates selected without the veto. 

Finally, we present the hadronic tau reconstruction and identification performance in Fig. 1 (a) and 
the fake-jet tagging rate (b) as a function of pj, respectively. The selection criteria for the hadronic tau 
identification is summarized in Table 3. 

Table 3: Selection criteria for the hadronic tau identification from the calorimeter-seeded recon- 
struction algorithm. 

Hadronic tau identification 
Tau ID: Calorimeter-seeded 
p T > 30 GeV 

Track multiplicity : 1 or 3 tracks 
\charge\ = 1 

Log Likelihood Ratio > 4 
Electron Veto: 

minimum TRT //77L7X0.2 if |%|<1.7 and LT>10 
E^ AD /p T >0.002 in matched electron object 
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2.4 Jet reconstruction 
2.4. 1 Forward tagging jets 

The jet activity of the vector boson fusion process is unique in several ways, providing many handles 
to suppress backgrounds and isolate a sample of signal events with high purity. The most important 
features of the VBF process are the presence of two high-pr quark-initiated "tagging jets", which tend 
to be relatively forward and well separated in rapidity. Furthermore, due to color coherence in this elec- 
troweak process, additional QCD radiation between the tagging jets tends to be suppressed and motivates 
a Central Jet Veto (CJV) [32]. This section outlines the choice of jet algorithms and their performance, 
the definition of the tag jets, and several issues related to the CJV. 

Figure 2 shows the T7 spectra of the highest and second highest p T jets in signal and various back- 
ground samples. Because the VBF jets can be very forward, the jet finding efficiency in this region is 
important in the analysis. Furthermore, the forward calorimeters (3.1< |t?| <4.9) do not have a pro- 
jective geometry, which leads to different challenges for jet reconstruction. ATLAS currently provides 
collections of jets based on two algorithms (a seeded cone algorithm with split-merge and a kj algo- 
rithm), each with two sets of parameters (the cone size and the k T cutoff scale), applied to two different 
input representations of the energy deposits in the calorimeter (towers merged to avoid negative energy 
fluctuations from electronic noise and clusters based on the ATLAS TopoCluster algorithm) [18]. These 
different jet algorithms and the different calorimeter pre-clustering result in different performances for 
jets, especially at low pj and high \r\\. 

Jet identification efficiency and purity are defined to give a quantitative measure of the jet identifica- 
tion. The efficiency and purity were calculated with respect to generator-level jets obtained by running 
the same jet algorithm on the stable interacting particles after hadronization and before GEANT simula- 
tion. To ensure that only hadronic jets are considered, we only use dimuon events, where both taus decay 
into a muon and neutrinos, or ZIW bosons directly decay into muons. This avoids any bias in the jet 
reconstruction produced by the presence of electrons. A reconstructed jet is considered to be matched if 
the corresponding generator-level jet is within AR < 0.15 for jets with a cone size of 0.4. The matching 
cone size was chosen to avoid a single generator-level jet being matched to more than one reconstructed 
jet; with the given parameters this effect is at the order of 10~ 3 . 

The jet reconstruction efficiency in different \r\ \ regions and two different clustering algorithms is 
shown in Fig. 3 as a function of the generator-level jet pr and T7. The reconstruction efficiency rises 
over 95% for jets with p T above 50 GeV. On the other hand, the efficiency drops at |tj| ~ 1.5 and |tj| 
~ 3.2 for jets in the range of 20-30 GeV of p T . This drop in efficiency is due to the crack region in the 
calorimeter or large amounts of dead material in the corresponding 77 region. The jet collections based 
on calorimeter towers show a drop in efficiencies in the forward region due to a higher seed threshold, 
while the jet collections based on TopoClusters do not show this loss of efficiency. For this reason, jets 
based on TopoClusters have been chosen for this analysis. 

Correctly identifying the quark-initiated tagging jets from the VBF process is very important for 
the measurement of Higgs boson spin and CP properties and for making precise correspondence with 
theoretical calculations [8]. Typically, the tagging jets are found in opposite hemispheres, but there are 
two approaches to incorporating this requirement in the analysis. One option is to define the tagging jets 
as the two highest pj jets in the event, and reject the event from the signal candidates if they are in the 
same hemisphere (e.g. require r\j\ x 77^2 < 0). A second option is to define the first tagging jet to be the 
highest pj jet in the event and the second tagging jet to be the highest pj jet in the opposite hemisphere. 
In this second approach it is not required that the second tagging jet is the second highest pj jet in the 
event. These two strategies were compared, and it was found that the first method more reliably matched 
the quark-initiated tagging jets from the hard process. 

The generator-level jets match the hard-scattered quarks nearly 100% of the time above a certain 
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Figure 2: Pseudorapidity of the highest pj (a) and the second highest pj (b) jets for the Cone jet al- 
gorithm based on TopoClusters with R = 0.4 in VBF H —> tt — > fljl {m.H=l20 GeV) and background 
events. Only pj cuts were applied to jets. Solid (black) histogram is for signal, dashed (red) histogram 
is for tt — > WW — > (jUju), and dotted (blue) histogram is for Z-^ jUjU+n jets. 
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Figure 3: Jet reconstruction efficiency for the Cone jet algorithm with R = 0.4 as a function of the 
generator-level jet pj for the jets based on TopoClusters (a) and r\ for Tower- and TopoCluster-based jets 
(b). 
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Pt threshold. To estimate the purity of the tagging jets, we define the efficiency with respect to the 
generator-level jets. The reconstructed tag jets have a high purity over the entire pj and 77 range and 
do not show a strong dependence on the jet algorithms. Integrated efficiencies and purities for jets with 
pj > 20 GeV indicate that the TopoCluster-based algorithm has better performance for this analysis. 
Because additional jets often lie in the central detector region, where we wish to employ a central jet 
veto, jets with smaller cones are favored for the selection. Furthermore, calorimeter noise (including 
effects from pileup of minimum-bias events) increases with jet cone radius. Thus, we use the cone jet 
algorithm with R = 0.4 running on TopoClusters as the primary jet algorithm in this analysis. 

Having converged on a specific calorimeter pre-clustering and jet algorithm, we now present the 
kinematic properties of the jets that discriminate between the signal and backgrounds. The pr cuts on 
the tagging jets are effective at reducing several backgrounds and Fig. 2 shows that the pseudorapidity 
distributions are substantially different. Instead of relying directly on the pseudorapidity of the tagging 
jets, Fig. 4 shows that the pseudorapidity gap (a) and invariant mass of the two tagging jets (b) provide 
substantial background rejection. 
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Figure 4: Pseudorapidity gap between tag jets (a) and invariant-mass distributions of tag jets (b) in VBF 
H — > tt — > jXji. events (m#=120 GeV). A requirement T71 x 772 <0 is used in addition to the cuts on jet 
pj. Solid (black) histogram is for signal, dashed (red) histogram is for tt — > WW — > (jU/i), and dotted 
(blue) histogram is for jX}X+n jets. 



2.4.2 Central jet veto 

As mentioned above, the color coherence in the VBF Higgs boson production leads to a suppression of 
QCD radiation between the tagging jets. This color coherence is also found in the electroweak Z+jets 
background. In contrast, most of the other backgrounds have a much larger probability for additional 
QCD radiation in the central region. This is the physical motivation for a central jet veto (CJV). Figure 5 
shows the jet multiplicity distribution for the signal and backgrounds after requiring two tagged jets (with 
Pt > 20 GeV) in opposite hemispheres. The fraction of signal events with three or more jets is small. 

The experimental challenge for the CJV is to provide a cut that is robust against additional minimum 
bias events (in-time pileup events). The optimization for the central jet veto has been studied in terms 
of pj and T7. The probability to have at least one reconstructed jet with pt >20 GeV within \r\\ < 3.2 
is 1.6% from a single minimum-bias event. In Fig. 6, we present the trade-off of background rejection 
versus signal efficiency from varying the pt threshold on the third highest pt jet (markers indicate 
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thresholds of 20 and 30 GeV). A veto based on a fixed r\ -window was compared to a dynamic r\- 
window defined by the Y\ of the two tagging jets. We maintain the previous central jet veto requirement: 
no jets in | T] | < 3.2 with pj > 20 GeV. Figure 7 shows the efficiency of the central jet veto for signal, 
irreducible, and reducible backgrounds at varying levels of pileup. 

The CJV poses significant theoretical challenges as well. At the parton-level, the CJV efficiency is 
expected to be known quite well with little theoretical uncertainty. However, the current tools that allow 
for the full parton-shower and hadronization (prerequisite for an analysis based on a GEANT-based 
detector simulation) show significant uncertainties. We have observed significant differences between 
the central jet activity in signal events generated with PYTHIA and those generated with HERWIG. 
Knowledge of the uncertainty on the CJV is needed for setting limits on the Higgs boson cross-section 
and for making coupling measurements; however, it is not needed directly in establishing a deviation 
from the background-only expectation (see Section 5.3). 

In future studies we will also include a veto procedure using track information; in particular using 
vertexing information to reduce the impact of jets from in-time pileup. Furthermore, a track-based veto 
and the use of timing information in the calorimeter will also be studied to reduce the impact of out-of- 
time pileup. 




Figure 5: Jet multiplicity distribution for the sig- 
nal, Z+jets, and tt background after requiring the 
cuts up to the N jets > 2 level in the list of cuts for 
the // channel (see Table 5). 



Figure 6: Background rejection versus signal sen- 
sitivity for the central jet veto with and without 
pileup. Also shown is the case for ??-only back- 
ground. 



2.4.3 6-jet veto 

In the //-channel, the largest background contribution comes from tt(+jets) —> lvblvb(+jets). By 
introducing a veto on Z?-tagged forward jets it is possible to reduce this background [19]. Because the 
tagging jets are fairly forward in the detector, the ^-tagging requirement is rather loose, i.e. efficient, 
and the tt background can be reduced by a factor 2~3. Figure 8 demonstrates the efficiency of the ft-jet 
veto as a function of the forward jet pj for the signal and tt background. The cut on the b-tag weight 
was optimized to achieve 65.1% reconstruction efficiency for Z?-quark jets, while 9.4% mis-identification 
efficiency for the light flavor jets is retained. Note that the ft-jet veto is only used in the //-channel. 
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Figure 7: Central jet veto performance in the pres- 
ence of varying levels of pileup for signal and back- 
ground samples. 



Figure 8: Efficiency of the b-jet veto as a func- 
tion of the forward jet pj in for the signal and tt 
background. 



2.5 Missing transverse energy 

Significant missing transverse energy (£"™ ss ) is present in H — > t + t~ events because neutrinos are 
always associated with the T decays. The performance of the £™ ss algorithm plays a vital role in this 
analysis because E™ 1SS is used in the mass reconstruction of the tau pair. Ultimately, the Ej lss resolution 
is what limits the m rr resolution. Furthermore, the absolute scale of the E™ 1SS must be well calibrated 
to correctly reconstruct the Higgs boson mass. In addition to the standard E™ 1SS algorithm [17], we 
have made a dedicated correction in the presence of hadronic tau decays. The correction is based on the 
calibrated tau energy instead of the default treatment of the object that uses a jet calibration. This removes 
a ~ 1 GeV bias in the £"™ ss distribution for the Z/z-channel. The sensitivity of the signal efficiency to the 
absolute energy scale is presented in Section 5.2. By requiring a large £"™ ss , it is possible to improve 
the m xx resolution and reject many backgrounds that do not contain neutrinos (e.g. Z —>■ 11). We require 
£miss -> 20 GeV for the //j-channel and £"™ ss > 40 GeV for the 11- and M-channels. 

2.6 Mass reconstruction 

Although there are several neutrinos in the event, it is possible to reconstruct the T + T~ invariant mass 
by making the approximation that the decay products of the T are collinear with the z in the laboratory 
frame. This is a good approximation since m#/2 » m x and hence the taus are highly boosted. This 
leaves two unknown quantities and two equations: the fraction of each t's momentum carried away by 
neutrinos and the constraints from the two components of £"™ ss . For notational simplicity, consider 
the //j-channel and let / represent the momentum vector for the leptonic visible decay product and h 
represent the momentum vector for the hadronic visible decay products. By neglecting the T rest mass 
and imposing the collinear approximation, we can write 

m TT = v / 2(£/, + £v/ 1 )(£/+£w)(l -cose,/,) . (1) 
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By introducing the variables Xh and xi, the fraction of the t's momentum carried away by the visible 
decay products, we can re-write the invariant mass as 

m TT = ^L iovx Lh >0 . (2) 



One can easily solve for the x z variables by requiring that the vector sum of the neutrinos coincides with 
the two measured components of £"™ ss : 



= E h = h x l y -h y l x = N_ 

Xh E h + E vh h x l y + Ef s n y -h y l x -Ef s % D h 

and 

El k X ly-hyl X _N 

Xl ~ Ei+ E vl ~ h x l y - Ef^hy - h y l x + Ef^h x ~ A ' () 

where we have introduced N,Dh, and D/ for convenience. If the two ts are back-to-back, then these 
equations are linearly-dependent and one cannot solve for the x T s. For this reason, we require that 
cosA0 TT > —0.9. Typically the Higgs boson has significant pj due to the tagging jets. Events that 
come from the process X — > TT with no other sources of missing energy should have 0 < x r < 1, though 
resolution effects in £"™ ss may lead to unphysical solutions with either x t < 0 or x t > 1 . Equation 2 
shows explicitly that cuts on x T will impose constraints on the reconstructed mass for a given event, viz. 
m TT > m//j, which results in an asymmetric distribution for m TT . 

The sensitivity of m xx to a mis-measurement of £"™ 1SS depends on the orientation of the ts. This sen- 
sitivity can be summarized by a Jacobian factor, /. Neglecting correlation between the mis-measurement 
of the x- and y-components of £"™ ss , one can define the Jacobian as follows 



(dm*} 2 


( dm, z \ 


\dE™™ J 





Thus, we arrive at 



/ = i!^^-^f + (^-^) 2 . (6) 

The final mass measurement is a result of a fit to the m TT distribution, and it is important to incorporate 
both the asymmetry and the fact that the width of the m TT distribution is not common for all events. The 
modeling of the asymmetry and Jacobian scaling of the m TT distribution is described in Section 4.2. 



2.7 Summary of the event selection for //-channel 

The event selection for //-channel is summarized below, including some kinematic requirements specific 
to the //-channel. 

• Trigger: electron trigger e22i or muon trigger mu20. 

• Trigger lepton: at least one lepton must have a reconstructed p T greater or equal to the correspond- 
ing trigger requirement. 

• Dilepton: exactly two identified leptons with opposite charge. 

• Missing E T : £™ ss > 40 GeV. 
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• Collinear approximation: 0 < x\\,n < 0.75 and cosA0// > —0.9. 

The tighter cut on x x < 0.75 has been found to provide a better background rejection in the //- 
channel. The shape of the control samples used to estimate the signal sensitivity are obtained after 
these cuts; additional details of the data-driven background estimation are given in Section 3. In 
addition to the cuts above, the signal candidates are also required to satisfy the following cuts. 

• Jet multiplicity: at least one jet with p T > 40 GeV and at least one additional jet with 
p T > 20 GeV. 

• Forward jets: in opposite hemispheres T]yi x 7]j2 < 0, with tau centrality min{T|/i, 77,7} < T]i epi 2 < 
max{?]ji,7] 7 -2} for the two highest pj jets. 

• &-jet veto: the event is rejected if either tag jet has b-tag weight greater than 1. 

• Jet kinematics: Atj^- > 4.4 and dijet mass mjj > 700 GeV for two forward jets. 

• Central jet veto: the event is rejected if there are any additional jets with pj > 20 GeVin|Tj| < 3.2. 

• Mass window: mn — 15 GeV < m TT < m# + 15 GeV around the test mass nin- 

Table 4 summarizes the cross-section for signal events after each of the cuts described above. 



Table 4: Signal cross-section (fb) for the //-channel for various Higgs boson masses. 



Mass (GeV) 


105 


110 


115 


120 


125 


130 


135 


140 


Cross section (fb) 


394.7 


372.0 


341.8 


309.1 


266.8 


225.4 


180.1 


135.8 


Trigger 


65.6(3) 


65.1(2) 


61.1(2) 


57.2(1) 


51.5(2) 


44.7(1) 


36.5(1) 


28.3(1) 


Trigger lepton 


56.4(3) 


56.2(2) 


53.2(2) 


49.5(1) 


44.7(2) 


38.9(1) 


31.8(1) 


24.7(1) 


Dilepton 


5.73(7) 


5.86(6) 


5.80(6) 


5.46(3) 


4.94(5) 


4.30(4) 


3.61(4) 


2.88(4) 


£™ ss > 40 GeV 


3.41(5) 


3.49(5) 


3.45(5) 


3.17(3) 


2.94(4) 


2.56(4) 


2.17(3) 


1.78(4) 


Collinear Approx. 


2.34(5) 


2.38(4) 


2.33(4) 


2.15(2) 


1.95(4) 


1.69(3) 


1.46(2) 


1.16(3) 


Njets > 2 


1.96(4) 


1.97(4) 


1.95(4) 


1.77(2) 


1.61(3) 


1.41(3) 


1.20(2) 


0.95(3) 


Forward jet 


1.48(4) 


1.49(4) 


1.48(3) 


1.34(2) 


1.21(3) 


1.08(3) 


0.91(2) 


0.73(3) 


i-jet veto 


1.26(3) 


1.30(3) 


1.25(3) 


1.16(2) 


1.04(3) 


0.94(2) 


0.77(2) 


0.64(2) 


Jet kinematics 


0.70(3) 


0.69(2) 


0.70(2) 


0.63(1) 


0.58(2) 


0.52(2) 


0.43(1) 


0.37(2) 


Central jet veto 


0.61(2) 


0.60(2) 


0.62(2) 


0.56(1) 


0.50(2) 


0.45(2) 


0.38(1) 


0.32(2) 


Mass window 


0.52(2) 


0.50(2) 


0.51(2) 


(i 45. 1 1 


0.39(2) 


0.34(1) 


0.29(1) 


0.23(1) 



2.8 Summary of the event selection for //z-channel 

The event selection for //z-channel is summarized below, including some kinematic requirements specific 
to the //i-channel. 

• Trigger: electron trigger e22i or muon trigger mu20. 

• Trigger lepton: at least one lepton must have a reconstructed p T greater or equal to the correspond- 
ing trigger requirement. 

• Dilepton veto: exactly one identified lepton (ensures this sample is disjoint from the //-channel). 

• Hadronic v. exactly one identified hadronic % with opposite charge of the lepton. 

• Missing E T : £"^ iss > 30 GeV. 
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• Collinear approximation: 0 < x\ < 0.75, 0 < x/, < 1, andcosA^//, > —0.9. 

The asymmetric treatment of x/, and x/ provides background rejection in the Z/j-channel. 

• Transverse mass: in order to further suppress the W + jets and tt backgrounds, a cut on the trans- 
verse mass of the lepton and £"™ ss 

m T = ^2 p l T P £™ iss -(l- cos A0) < 30 GeV (7) 

is required, where p l j P is the transverse momentum of the lepton in the //i-channel and A0 is the 
angle between that lepton and £"™ ss in the transverse plane. The shape of the control samples used 
to estimate the signal sensitivity are obtained after these cuts; additional details of the data-driven 
background estimation are given in Section 3. In addition to the cuts above, the signal candidates 
are also required to satisfy the following cuts. 

• Jet multiplicity: At least one jet with p T > 40 GeV and at least one additional jet with 
p T > 20 GeV. 

• Forward jets: in opposite hemispheres r)ji x tj^ < 0, with tau centrality min{Tjyi, T^} < ^Up.x < 
max{T7 ; i, rij 2 } for the two highest p T jets. 

• Jet kinematics: At] jj > 4.4 and dijet mass mjj > 700 GeV for two forward jets. 

• Central jet veto: the event is rejected if there are any additional jets with pj > 20 GeVin|T]| < 3.2. 

• Mass window: mu — 15 GeV < m %% < mu + 15 GeV around the test mass mu- 

Table 5 summarizes the cross-section for signal events after each of the cuts described above. With 
30 fb" 1 integrated luminosity, about 20 signal events are expected in the mass window. 

Table 5: Signal cross-sections (fb) for the Z/i-channel for various Higgs boson masses. 



Mass (GeV) 


105 


110 


115 


120 


125 


130 


135 


140 


Cross section (fb) 


394.7 


372.0 


341.8 


309.1 


266.8 


225.4 


180.1 


135.8 


Trigger 


65.6(3) 


65.1(2) 


61.1(2) 


57.2(1) 


51.5(2) 


44.7(1) 


36.5(1) 


28.3(1) 


Trigger lepton 


56.4(3) 


56.2(2) 


53.2(2) 


49.5(1) 


44.7(2) 


38.9(1) 


31.8(1) 


24.7(1) 


Dilepton veto 


50.0(3) 


49.6(2) 


46.7(2) 


43.4(1) 


38.9(2) 


34.0(1) 


27.6(1) 


21.3(1) 


Hadronic % 


7.7(1) 


8.1(1) 


8.1(1) 


8.02(7) 


7.4(1) 


6.68(8) 


5.72(7) 


4.53(9) 


£™ ss > 30 GeV 


4.8(1) 


5.1(1) 


5.08(9) 


4.96(5) 


4.63(8) 


4.16(7) 


3.51(6) 


2.82(8) 


Collinear Approx. 


3.19(9) 


3.50(8) 


3.51(8) 


3.34(5) 


3.14(7) 


2.77(6) 


2.37(5) 


1.91(6) 


Transverse mass 


2.53(8) 


2.70(7) 


2.67(7) 


2.46(4) 


2.26(6) 


1.98(5) 


1.64(4) 


1.29(5) 


Njets>2 


2.12(7) 


2.22(7) 


2.21(6) 


2.02(4) 


1.80(5) 


1.60(4) 


1.32(4) 


1.00(5) 


Forward jet 


1.61(7) 


1.66(6) 


1.73(5) 


1.52(3) 


1.41(5) 


1.20(4) 


1.03(3) 


0.78(4) 


Jet kinematics 


0.88(5) 


0.86(4) 


0.92(4) 


0.82(2) 


0.73(3) 


0.65(3) 


0.56(2) 


0.42(3) 


Central jet veto 


0.77(5) 


0.77(4) 


0.81(4) 


0.72(2) 


0.63(3) 


0.55(2) 


0.50(2) 


0.38(3) 


Mass window 


0.68(4) 


0.68(4) 


0.70(3) 


0.61(2) 


0.52(3) 


0.44(2) 


0.40(2) 


0.30(3) 



2.9 Summary of the event selection for M-channel 

The event selection for /2/2-channel is summarized below, including some kinematic requirements specific 
to the /i/i-channel. 

• Trigger: a combination of the hadronic tau and missing Ej trigger Ll_TAU30_xE40_sof tHLT. 
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• Hadronic taus: two identified hadronic taus are required pj above 35 GeV and 30 GeV with 
opposite charge. 

• Missing E T : Ef ss > 40 GeV. 

• Collinear approximation: 0.2 < x/ji /,2 < 1, andcosA^ > —0.9. 

• Di-tau transverse mass: in order to further suppress fake-T candidates from W + jets and tt back- 
grounds, a cut on the di-tau transverse mass 

mf = ^2 #£™ ss -(l-cosA0) < 80 GeV (8) 

is required, where p^ 1 is the transverse momentum of the two hadronic tau system and A(j) repre- 
sents the azimuthal angle between p' T h and £"™ ss . This variable has been identified as potentially 
useful for the analysis. The optimal value of this cut depends heavily on the relative amount of the 
W+jets, tt and QCD backgrounds, therefore, the requirement is kept fairly loose. 

• Jet multiplicity, forward jets, angular cuts, and central jet veto in the case of the Z/z-channel. 

• Total p T : to reject events with many jets like tt, a cut on the total p T is applied: 

I \PT M + PT hl + PT jl + PT j2 + Ef™ 1 1 < 60 GeV . (9) 

• Jet kinematics: Atj^- > 4 and dijet mass rrijj > 700 GeV for two forward jets. 

• Central jet veto: the event is rejected if there are any additional jets with pj > 20 GeV in with 
hi < 3.2. 

• Mass window: m H - 15 GeV < m %% <m H + 20 GeV around the test mass m#. 

Table 6 summarizes the cross-section for signal events after each of the cuts described above. The 
events used in the analysis have been generated applying a filter that requires two hadronic taus with 
p T > 12 GeV in the final state, produced in | tj | <2.7 and with A0 < 2.9. 



Table 6: Signal cross-sections (fb) for the M-channel for various Higgs boson masses. The 
events used in the analysis had a filter applied at generation that required two hadronic taus with 
p T >12 GeV in the final state, produced in |tj | <2.7 and with A0 < 2.9. 



Mass (GeV) 


105 


110 


115 


120 


125 


130 


135 


Cross section (fb) 


394.7 


372.0 


341.8 


309.1 


266.8 


225.4 


180.1 


Trigger tau & MET 


12.4(2) 


12.1(2) 


12.0(2) 


11.4(1) 


10.4(2) 


9.2(1) 


7.93(1) 


2 Hadronic ts 


1.73(8) 


1.80(8) 


1.93(8) 


1.83(4) 


1.67(7) 


1.52(5) 


1.29(5) 


Ef ,ss > 40 GeV 


1.34(7) 


1.39(7) 


1.50(7) 


1.43(3) 


1.32(6) 


1.17(5) 


0.99(4) 


Collinear Approx. 


0.91(6) 


1.02(6) 


1.13(6) 


1.03(3) 


1.00(5) 


0.85(4) 


0.72(3) 


Di-tau Transverse mass 


0.91(6) 


1.02(6) 


1.13(6) 


1.03(3) 


1.00(5) 


0.85(4) 


0.72(3) 


Njets > 2 


0.77(5) 


0.88(6) 


0.94(5) 


0.86(3) 


0.84(5) 


0.72(4) 


0.61(3) 


Total p T 


0.72(5) 


0.84(5) 


0.91(5) 


0.83(3) 


0.80(5) 


0.69(4) 


0.58(3) 


Forward jet 


0.62(5) 


0.73(5) 


0.75(5) 


0.72(2) 


0.68(4) 


0.58(3) 


0.50(3) 


Jet kinematics 


0.37(4) 


0.43(4) 


0.41(4) 


0.45(2) 


0.41(3) 


0.36(3) 


0.28(2) 


Central jet veto 


0.34(3) 


0.38(4) 


0.36(3) 


0.39(2) 


0.35(3) 


0.32(3) 


0.24(2) 


Mass window 


0.25(3) 


0.35(4) 


0.33(3) 


0.34(2) 


0.29(3) 


0.27(2) 


0.20(2) 
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3 Background estimation 

3.1 Overview 

Despite the advances in theoretical tools and extraordinarily detailed simulation of the ATLAS detector, 
it is preferable to estimate backgrounds from data instead of relying entirely on Monte Carlo estimates. 
Below we describe data-driven background estimation techniques for each of the major backgrounds 
and estimate the systematic uncertainty of the estimates. Each technique has been developed to address 
the aspects of the background estimation which are most relevant for the analysis: the shape of the m xx 
tail from the irreducible Z — > TT, the fake tau contribution in the Z/z-channel, and the normalization of 
the QCD backgrounds. Section 4 describes how these techniques are incorporated into the final signal 
extraction, significance calculation, and mass measurement. 

While it is possible to use Monte Carlo to estimate the systematics associated with the data-driven 
background estimation techniques, we must wait for data until we can employ these methods to produce 
reliable estimates of the backgrounds. Thus, in a feasibility study such as this one we face the additional 
challenge of predicting the expected backgrounds with Monte Carlo. The challenge of predicting our 
background and the associated uncertainties are distinct from the ones that we will face once we have 
collected ~30 fb _1 of data. 

The major challenge in background prediction is related to the limited size of our Monte Carlo 
samples. Backgrounds from mis-identified leptons are difficult to estimate due to the large rejection 
factors of the identification algorithms. Even the irreducible Z — ► TT backgrounds are suppressed by 
several orders of magnitude due to the kinematic requirements. The tt background requires particularly 
large sample sizes because it is suppressed by both identification and kinematic requirements. Table 7 
summarizes the size of the Monte Carlo samples used in this study and their corresponding luminosity. 
Despite the large computing investment and generator-level filters 3 , many background samples were not 
sufficiently large to estimate rates deep in the analysis where rejection is at the order of 10 6 - 10 8 . Thus, 
the "full" GEANT simulation was augmented with a fast simulation sample ~100 times larger and a 
cut factorization method was used to predict the final background rates. While these procedures have 
large uncertainties, they are only relevant to our ability to estimate our sensitivity and will not plague the 
analysis once we have data. 

3.2 Z+jets 

While there is some theoretical uncertainty in the Z — > TT+jets background [33], the most serious danger 
of this background comes from the high-side tail in the m TT distribution, where we would expect to see 
the signal. This tail is dominated by instrumental effects; particularly mis-measurement of E™ 1SS , which 
is correlated to instrumental effects related to jet energy mis-measurement. Thus, we have developed a 
data-driven background estimation technique, which begins with a signal-free Z — > /x/x+jets sample and 
transfers the dominant instrumental effects to the Z — > TT+jets sample. This is achieved by replacing 
the muons with an equivalent tau, and carefully treating the decay of the tau. This technique is justified 
because the Z — »■ /x/x+jets events have identical jet activity and kinematics as Z — > TT+jets (before the tau 
decays) and because the relevant features of tau decays are well understood. We restrict the technique 
to the Z — >• jUjU+jets control sample because muons lose only a small fraction of their energy in the 
calorimeter, and the effect on £"™ ss is easier to estimate. After creating the emulated Z — > TT+jets 
control sample, the full event selection is applied. The normalization of the Z — > TT background does not 

3 VBFCut : N e j ll > 1 or 2, or N r > 1 with p T > 10 GeV, 1 7] | <2.7 for electron, muon and tau from W and Z, respectively. For 
the hadron level jets with cone size 0.4, Nj e , >2 with p\ >20 GeV for the highest p T jet, p\ >15 GeV for the second highest 
p T jet, and 1 7] | <5, mjj >300 GeV, Arjjj >2. 
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Table 7: Details of the background samples used in this study. The VBFCut filter is applied for 
Z/W+jets events at generator level. The corresponding luminosities for Z/W+jets and di-boson 
events are estimated for 2 jets events for Z and 3 jets events for W, and WW events for di-boson 
process respectively. In ti samples, the actual physics events are approximately 70% to the total 
number of simulated events according to the treatment of the negative weighted events. 



Process 


cross-section (pb) 


Simulated events 


Luminosity (fb 1 ) 


Z -> ZZ+jets 


35.1 


714,500 


63.1 


W — ► Zv+jets 


346.3 


765,000 


3.43 


«+jets (full) 


450.0 


1,012,941 


1.65 


«+jets (fast) 


833.0 


96,250,000 


84.3 


ti (2-muon, full) 


32.6 


904,000 


20.4 


WW /WZ /ZZ+jets 


174.2 


258,094 


0.57 


QCD di-jets (full) 


~1.4xl0 9 


1,503,250 


~10~ 6 


QCD di-jets (fast) 


~1.4xl0 9 


80,000,000 


~5 xl0~ 5 



require this emulation because it can be estimated directly from the height of the Z-peak in m xx spectrum 
obtained with the signal candidates. 

The first task is to obtain a signal-free dimuon control sample. Both loose and tight control samples 
have been used in this data-driven estimation technique. The loose control sample is used to estimate 
the ditau background not only from Z — ► TT+jets events but also from ti or di-boson backgrounds, while 
the tight control sample can be used for the Z — > TT background estimation by obtaining relatively pure 
Z — > /x/x+jets events. The loose control sample requires only a minimum sets of cuts from the ZZ-channel, 
hence ~ 10% of this control sample includes other processes such as it, diboson or even Z —> TT. The 
tight control sample, in contrast, selects pure Z — > jX}X events with less than 1% contamination from the 
other processes. A tighter event selection is used to define this control sample, it is identical to the final 
event selection in the ZZ-channel but excludes of the £"™ ss , collinear approximation, and mass window 
cuts. To improve the purity of Z events, a Z mass window cut is used: > mz — 10 GeV. Due 

to a strong correlation between the m TT and distributions before the ji — > T conversion, the m TT 
distribution is biased by a cut on m m . The lower bound on m m > m z — 10 GeV has little influence 
of the shape in the signal region; however, an upper mass cut causes a large bias. Therefore, no upper 
bound is placed on m^. After obtaining the dimuon events, the reconstructed muons are replaced by 
the Monte Carlo tau, then decayed and simulated in the ATLAS detector simulation and reconstruction 
software. 

Two different techniques for replacing the muon with the tau decay products were evaluated in this 
study. One uses the TAUOLA decay package [27] and the other uses a simple re-scaling of the mo- 
mentum and efficiency of taus, since the technical complexity of re-simulation by TAUOLA are rather 
difficult in the full detector simulation and reconstruction. A comparison of the two different methods 
provides a quantitative validation of this procedure. Several comparisons of the Z —> TT+jets control 
sample emulated from Z — ► fiji and the true Z —> TT background were performed after imposing the 
full event selection criteria for the 11- and Z/i-channels. Figure 9 shows the pj of the leptons and the 
£"™ ss distributions of the emulated sample compared to the true Z —>■ TT —>■ ZZ + 4v process. Figure 10 
(a) depicts the reconstructed visible mass for the true and emulated samples in the Z/z-channel and (b) 
shows the bin-by-bin ratio of these distributions. Excellent agreement between the true and emulated 
distributions are also observed in each of the ZZ-, lh-, and M-channels in the region of interest. The gray 
horizontal band in the figure represents ±10% around a ratio of 1, which is used to reflect the uncertainty 
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in the shape from the tau modeling and sensitivity to the analysis cuts. With this method we are able to 
accurately model both the shape and normalization of the Z — > TT backgrounds for all tau decays. 



100 120 140 160 180 200 
Lepton p T (GeV) 




Figure 9: Transverse momentum of the leptons (a) and missing transverse energy (b) for the two pro- 
cesses rescaled Z —>■ jiji and true Z — > TT — > 11 -\- 4v. 



- Full-sim True Z-yc-c 

- Full-sim Emulated Z-^x 



50 100 150 
(a) 



■■+++■+ 



100 150 200 250 



Figure 10: Reconstructed invariant mass distribution (a) and its bin-by-bin ratio (b) generated from the 
true and emulated Z — > TT —>■ Ih + 3v events. The gray band represents ±10% around a ratio of 1. 



3.3 QCD background in //z-channel 

The method described above estimates the contribution of taus for all processes, including tt, but does 
not estimate the contribution of the fake taus. The fake rate of leptons is much smaller as compared to 
the contribution from real leptons from W and T decays in the //-channel. In contrast, in the //z-channel 
roughly half of the background come from fake hadronic-taus. In addition, the W+jets background is 
comparable to tt in the //j-channel; therefore, estimating the QCD fake contribution to the //z-channel re- 
quires a dedicated procedure. Estimation of the QCD fake rate is determined from a data-driven method. 
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The technique used here exploits the track multiplicity in a cone around the tau candidate. Real 
taus typically have one or three tracks, with some spread due to tracking efficiency or the presence of 
spurious tracks. Electrons have dominantly a single track, while jets have a broad distribution with a 
higher average multiplicity. Figure 1 1 shows the track multiplicity distribution for taus, electrons, and 
jets in a cone of radius 0.7 after removing outlying tracks. It is clear that the track multiplicity can be 
used to constrain the relative abundance of the three components to the distribution. While Fig. 1 1 was 
created from Monte Carlo, the electron and jet track multiplicity distributions can easily be obtained 
from data. 

Given a sample of tau candidates, the relative abundance of taus, electrons, and jets can be found by 
fitting the track multiplicity distribution with the extended likelihood function 



where n'f x ' p is the total number of events estimated by the fit, r tau (tqcd) is the fraction of the tau (jet) 
contribution with respect to the estimated total number of events, Ai ep = 10% is the relative uncertainty 
on lepton measurement, and /' is the normalized probability for the i th bin of the track multiplicity 
distribution. The second term constrains the normalization, and the third term is an additional constraint 
term for the lepton contribution estimated by an independent analysis. The fit is performed to find the 
maximum likelihood estimate with MINUIT [34]. 

The track multiplicity distribution for the QCD jets is modeled from samples of QCD dijets that 
produce tau candidates with pj in the range of 17- 280 GeV. No event level selections are applied at this 
stage. Similarly, the multiplicity distribution of the tau signal and lepton background are modeled with 
Drell-Yan Monte Carlo samples; however, all analysis requirements up to transverse mass cut are applied. 
Pseudo-datasets were generated for various luminosities based on the corresponding cross-sections and 
multiplicity distributions. The highly uncertain QCD multi-jet background was scaled to be five times 
larger than the estimated rates of it and W+jets after event selection. A fit was performed for each of the 
2000 pseudo-datasets and the results were used to estimate quantify the performance of the method. The 
expected error on the fraction r tau is presented in Fig. 12 as a function of luminosity. 

The fraction r tau in the signal candidates remaining after the transverse mass cut can be measured to 
within 5% accuracy with 1 fb _1 integrated luminosity. The largest uncertainty in this method comes from 
the dependence of the track multiplicity on the jet p T . The systematics were estimated by dividing the 
jet into two samples; those with pj <70 GeV and those with pj >70 GeV. Additionally, the presence 
of E™ ss in the pure QCD processes is strongly correlated to the event kinematics. Thus, we assign an 
additional systematic associated with the variation observed when repeating the method with modified 
track multiplicity distributions with and without the requirements on £"™ 1SS . The systematics associated 
with the QCD shape contribute about 2% to r tau measurement. 

3.4 Cut factorization method 

The analysis cuts described in Section 2 have rejections against backgrounds of the order of 10 s . Only a 
few tens of events are expected with 30 fb _1 of data, and the background Monte Carlo samples generally 
correspond to 5 fb _1 or less. The lack of sufficiently large Monte Carlo samples requires an approxi- 
mate procedure to predict the background rate at the end of the analysis. We utilize a cut factorization 
method in which the analysis cuts are divided into three categories that are roughly uncorrelated so that 
the rejection can be factorized. The first category are related to the tau decays from the Higgs boson 



L t rack{r qcd, r tau ) = Ui ' Pois(rig p x {r tau f tau + r QCD f) et + {l-r tau - r QCD )f\ ep ) |JVjJ 




(10) 



xGaus{N^ d \n' e %{\ - r tau - r QCD )Au p n^ p i\ - r tau - r QCD )) 
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ATLAS 

Likelihood-seeded TauRecJ 



ATLAS 

Likelihood-seeded TauRec 
-•- QCD(tf+W) x 2 
-a- QCD(tf+W) x 5 



Figure 1 1 : Track multiplicity distribution for QCD 
fake events and electron-fake events as well as the 
T signal. 



Figure 12: Expected errors of the fraction r tau as a 
function of luminosity. The QCD events are scaled 
to x2 and x5. 



candidate (trigger, lepton ID, hadronic tau ID, E™ 1SS , the collinear approximation, and the transverse 
mass) and the rejection is dominated by detector performance issues. The second category of cuts are 
related to the tagging jets (forward jets, jet separation, and dijet mass) and the rejection is dominated 
by the kinematic properties of the events. The third category consists of those cuts which are strongly 
correlated to both the forward tagging jets and the tau decay products (centrality, central jet veto, and 
mass window cut). The method itself is simple; the background rejection rate is determined for each of 
the categories individually, and the product of these rejection rates is used to estimate the total rejection 
rate. Two variations on this cut factorization procedure were considered. In the first approach, the rejec- 
tion of the jet-related cuts were calculated without any tau-related cuts. The second approach differed 
in that it included the lepton and tau identification in order to avoid bias effects from the contribution of 
fake leptons and taus. The residual correlations between the categories of cuts contributes to an uncer- 
tainty in this technique. The uncertainties were estimated with the use of larger samples produced with 
fast simulation and fully simulated samples with generator-level filters, which enrich the backgrounds in 
the signal-like region. The factorization process was used for all background processes except for the 
irreducible Z — > tt background. 

For the W — > Zv+jets and Z —> ZZ+jets backgrounds, ALPGEN samples with up to 5-jets correspond- 
ing to 4 fb _1 integrated luminosity were used. Table 8 shows the average of the jet rejection from the two 
approaches, which is used as our final prediction. The Z events in ZZ-channel indicate that the jet-related 
cuts are not strongly correlated with the tau-related cuts, since the rejection rate is the same in the electron 
and muon modes. The correlation between the categories of cuts was investigated with an ALPGEN Z 
— > TT + n jets sample enriched with the VBFCut filter. The production kinematics are the same, but the 
enhanced £™ ss due to the x decays lets a sufficient number of events survive all the analysis cuts. The 
background predictions with and without cut factorization are consistent within the 20% statistical error. 
Thus, we assign a 20% systematic uncertainty for this evaluation method on this process. 

The tt background is the most complicated process in that it includes both irreducible and reducible 
contributions in all channels. We use 10 6 tt events produced by the MC@NLO generator. The events do 
not include the process in which both Ws decay hadronically, thus the sample corresponds to 1.6fb _1 of 
integrated luminosity. While the rejection of the tau-related cuts can be reliably estimated, the rejection of 
the jet-related cuts suffers from the limited sample sizes. The rejection of the jet-related cuts is expected 
to be very high (xlO 4 for tt), resulting in a ~30% statistical error in the final background predictions. 
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Table 8 summarizes the rejection of the jet-related cuts subdivided into events with a real or fake hadronic 
tau for the //i-channel and by lepton flavor for the //-channel. Again, the average of the two approaches is 
used for the final background prediction. Table 8 shows that there is a considerable contribution of fake- 
tau events for the //z-channel. In the //-channel, the contribution from semi-leptonic decays of B-hadrons 
was found to be very small. An estimate of the uncertainty in the method was found by comparing the 
prediction from cut-factorization to the direct estimate from an enriched sample filtered to require at least 
two muons in the event. Furthermore, the cut-factorization results were compared to a sample produced 
with the fast simulation that was approximately 100 times larger. Based on these comparisons, we assign 
50% uncertainty in the tt background prediction. 

The diboson background contribution was predicted with the same cut-factorization procedure. Even 
after using the cut-factorization procedure, the statistical uncertainty in the prediction is very large. For- 
tunately, the diboson production cross-section is much smaller than the tt processes, so the effect of this 
background is very small. The results of the cut factorization prediction are presented in Table 8, and we 
assign a large uncertainty of 50% for diboson background rate. While a number of tests were performed 
to validate the method, it is clear that this is an approximate procedure and that the limited size of the 
Monte Carlo samples fundamentally limits our ability to predict this background. 

Table 8: Rejection rates of the jet-related cuts for Z/lV+jets, tt and diboson events in //- and 
//z-channels, respectively. 



Acceptance 

(%) 


Z-ee/jU/U+jets 


W — ► ev/,uv+jets 


//-channel 

tt(ee/nn/ ef l) 


ww/wz/zz 


Jet kinematics 
Central jet veto 
Mass window 


2.32(2) / 2.43(2) 
1.00(1)/ 1.02(1) 
0.230(5) / 0.202(4) 


2.4(3)/ 1.5(2) 
1.1(3)/ 0.8(2) 
0.1(1) / 0.1(1) 


0.72(8) / 0.52(5) / 0.60(4) 
0.1 1(3)/ 0.04(1)/ 0.10(1) 
0.019(7)/ 0.002(1)/ 0.010(3) 


0.43(5)/ 0.56(3)/ 0.33(3) 
0.26(5)/ 0.26(4)/ 0.10(2) 
- /- / 0.03(1) 




j (%) 


Z - ee/nn 




//i-channel 

rr (tau/non-tau) 


ww/wz/zz 


Jet kinematics 
Central jet veto 
Mass window 


2.60(5)/ 2.8(1) 
1.14(3)/ 1.30(9) 
0.22(1)/ 0.1 1(2) 


2.7(1)/ 2.6(1) 
1.7(1)/ 1.3(1) 
0.10(3)/ 0.05(2) 


0.93(6)/ 1.11(6) 
0.07(3)/ 0.12(3) 
0.015(8)/- 


0.2(2)/ 0.6(1)/ 0.50(3) 
0.1(1) / 0.3(1)/- 
- / 0.03(3)/- 



3.5 Background for the M-channel 

In addition to the Z+jets, W+jets and //backgrounds, the M-channel also has a background from the pure 
QCD multi-jet process. The estimation of QCD multi-jets must be made from data. A few handles exist 
for estimating the pure QCD background. First, one can utilize a sample of same-sign tau candidates to 
estimate the fake tau contribution since the sign of the tau candidate from QCD is approximately random. 
Potentially, one can utilize constraints from the track multiplicity distribution as described in Section 3.3. 
Furthermore, one can loosen the identification requirements on the tau candidates to obtain a sample 
dominated by QCD fakes, and then extrapolate this background into the signal region using knowledge 
of the fake tau's likelihood distribution obtained with data. The efficiency of these techniques must also 
be established with data, but here we assume that the same-sign sample will be able to estimate the QCD 
background with an uncertainty given by two components. The first component of this uncertainty is 
the statistical error in the control sample, which scales like 1 / y/Nss, where Nss is the size of same sign 
sample coming from QCD backgrounds. The second component is a systematic uncertainty associated 
with using the same sign sample to estimate the opposite sign sample. Experience from the Tevatron 
shows that charge correlations can be of order 13% with an uncertainty of order 3% [35,36]. Given that 
the final state requires two additional jets, which can alter the contribution of quark and gluon-initiated 
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jets, we assume a 10% systematic error associated with the charge correlation. In addition, the same sign 
sample will also include a contribution from W —> TV+jets, where the Tevatron experiments observed a 
charge correlation that was much higher with a 40% uncertainty [35]. The experience from the Tevatron 
provides some insight into this approach, but the results are not directly relevant because the LHC is not 
a p-p machine. 

As in the // and //z-channels we do not have sufficiently large Monte Carlo samples to predict the 
background for the M-channel. We employ the same cut factorization method for the Z+jets, W+jets 
and tt backgrounds as described above. The prediction of the pure QCD multi-jet background from 
Monte Carlo is hopeless without factorizing the analysis even further. A sample of 80 million QCD 
dijet events (including cc and bb processes) were simulated with the fast detector simulation. The tau 
fake rate was parameterized from full simulation as a function of Tj and pj and used to re-weight the 
dijet sample. In order for a pure QCD process to satisfy the event selection, there must be at least 
four high-/?r jets. Previous studies have shown that the parton shower underestimates the tagging jet 
requirement by a factor of 2-3 [4]. Therefore, we multiply the prediction from QCD dijets after the 
forward jet requirement by a factor of 5 to include the underestimate from the parton shower and an 
additional safety factor. Table 1 1 shows that the analysis cuts are extremely effective at rejecting QCD 
backgrounds, but a realistic estimation of the remaining background requires data. 

3.6 Summary of the background predictions 

Tables 9, 10, and 1 1 summarize the background predictions for the //-, lh-, and M-channels, respectively. 
The tables also indicate the statistical uncertainty on the estimates from the limited size of the Monte 
Carlo samples. Note that the mass window cut is set with respect to the test Higgs mass of 120 GeV. 
The effective cross-sections estimated with the cut-factorization method are marked with an asterisk. 
Furthermore, the predictions from the tt sample simulated with the fast simulation are shown, where the 
effective cross-section was normalized to the fully simulated sample after the collinear approximation 
for the //-channel and after the transverse mass cut for the //j-channel. 

Table 9: Summary of the backgrounds for the //-channel. An asterisk is used to indicate cross- 
sections estimated from the cut factorization method. Note that at least one of taus are decayed 
leptonically in QCD Z — > T + T~+jets process. 





Z-> T+T-+jets(>l) 






Z-»/+r+njets 


W -> /v+njets 


diboson 




QCD 


ELWK 


Full 


Fast 


(n> 1) 


(n> 1) 


ww/zz/wz 


Cross section (fb) 


168.4xl0 3 


1693 


833x 


10 3 


768.6xl0 3 


8649xl0 3 


174.1 xlO 3 


Trigger 


51.5(l)xl0 3 


230(1) 


209.8(2 


x!0 3 


633.8(4)xl0 3 


441 1(9) xlO 3 


32.0(l)xl0 3 


Trigger lepton 


42.7(l)xl0 3 


190(1) 


179.1(2)xl0 3 


588.0(4) xlO 3 


38 15(9) xlO 3 


28.0(l)xl0 3 


Dilepton 


4.25(5)xl0 3 


19.2(4) 


21.7(1) 


xlO 3 


369.9(5) xlO 3 


2.5(2) xlO 3 


3.95(6)xl0 3 


£™ ss > 40 GeV 


744(18) 


9.9(3) 


16847( 99) 


2683( 67) 


1148(176) 


1744( 49) 


Collinear Approx. 


454(14) 


6.2(2) 


1817(33) 


Atlfast 


104( 12) 


46(21) 


73(9) 


Njets > 2 


262( 8) 


5.8(2) 


1722( 32) 


1699(4) 


73(8) 


14( 6) 


51(8) 


Forward Jet 


39( 2) 


2.0(1) 


294( 13) 


324(1) 


10(3) 


> 1.2(2)* 


8(3) 


6-jet veto 


30( 2) 


1.5(1) 


89( 7) 


90.3(9) 


9( 3) 


> 1.0(2)* 


5(2) 


Jet kinematics 


2.71(5) 


0.57(5) 


11.8(3)* 


26.7(5) 


0.66(3)* 


0.19(4)* 


0.33(5)* 


Central jet veto 


1.24(3) 


0.43(4) 


1.9(1)* 


2.6(1) 


0.27(1)* 


0.10(2)* 


0.18(4)* 


Mass window 


0.23(1) 


0.04(1) 


0.10(2)* 


0.06(2) 


0.058(3)* 


0.01(1)* 


0.002(1)* 
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Table 10: Summary of the backgrounds for the Z/i-channel. An asterisk is used to indicate cross- 
sections estimated from the cut factorization method. Note that at least one of taus are decayed 
leptonically in QCD Z — > T + T~+jets process. 





Z^T+T-+jds( 1) 






Z^/+/-+njets 


W-^/v+njets 


diboson 




QCD 


ELWK 


Full 


Fast 


(n> 1) 


(n> 1) 


ww/zz/wz 


Cross section (fb) 


168.4xl0 3 


1693 


833x 


10 3 


768.6xl0 3 


8649 xlO 3 


174.1 xlO 3 


Trigger 


51.5(l)xl0 3 


230(1) 


209.8(2 


xlO 3 


633.8(4) xlO 3 


4411(9)xl0 3 


32.0(l)xl0 3 


Trigger lepton 


42.7(l)xl0 3 


190(1) 


179.1(2 


xlO 3 


588.0(4) xlO 3 


3815(9)xl0 3 


28.0(l)xl0 3 


Dilepton veto 


38.4(l)xl0 3 


171(1) 


156.4(2 


xlO 3 


216.5(4)xl0 3 


381 1(9) xlO 3 


23.7(l)xl0 3 


Hadronic % 


3062( 42) 


19.3(4) 


5224( 56) 


20250(156) 


32537(1012) 


704( 30) 


Ef- ss > 30 GeV 


850( 20) 


12.1(3) 


4251(50) 


468(26) 


21001(801) 


474( 26) 


Collinear Approx. 


514( 15) 


7.8(2) 


606( 19) 


17(3) 


324( 46) 


32( 6) 


Transverse mass 


415( 13) 


6.5(2) 


176( 10) 


Atlfast 


11(2) 


67( 18) 


14( 3) 


Njets > 2 


235( 7) 


6.0(2) 


162(9) 


167(1) 


8( 1) 


49( 11) 


7(1) 


Forward jet 


40(3) 


2.3(1) 


32(4) 


26.1(4) 


1.3(6) 


>2.9(3)* 


3(1) 


Jet kinematics 


2.7(1) 


0.72(6) 


1.8(1)* 


3.6(1) 


0.10(1)* 


0.7(1)* 


0.06(1)* 


Central jet veto 


1.2(1) 


0.49(5) 


0.25(4)* 


0.43(5) 


0.047(6)* 


0.43(6)* 


0.02(1)* 


Mass window 


0.11(2) 


0.04(1) 


0.012(5)* 


0.03(1) 


0.008(1)* 


0.020(6)* 


0.001(1)* 



Table 11: Summary of the backgrounds for the M-channel. An asterisk is used to indicate cross- 
sections estimated from the cut factorization method and/or an additional safety factor. Note that 
both taus are decayed hadronically in QCD Z — > T + T~+jets and W — > TV+jets processes. The tt 
sample is required to have at least one lepton in the top decay. 





Z — ► X X~ 

QCD 


fyrt.v(> 1) 
ELWK 




W — > zv + n jets 
(n>\) 


QCD di-jet 

(x 5) 


Cross section (fb) 


40.3xl0 3 


1693 


833 xlO 3 


922x 10 3 


19.1 10 12 


Trigger tau & MET 


1756(15) 


126(1) 


78177(232) 


39600(400) 




2 Hadronic ts 


161(4) 


4.9(2) 


373(16) 


317(33) 


2.756(3) 10 6 * 


£™ ss > 40 GeV 


108(4) 


3.7 (2) 


335(15) 


243(29) 


0.97(3) 10 3 * 


Collinear Approx. 


72(3) 


2.3 (1) 


43(5) 


20(7) 


1.7(2) 10 2 * 


Di-tau Transverse mass 


72(3) 


2.3(1) 


39(5) 


18(7) 


1.6(2) 10 2 * 


Njets > 2 


46(2)* 


2.1(1) 


34(5)* 


8(3)* 


0.86(4) 10 2 * 


Total p T 


40(2)* 


1.9(1) 


24(4)* 


8(3)* 


0.75(3) 10 2 * 


l-'orw aid jcl 


17(1)* 


1.1(1) 


9(2)* 


3(1)* 


23(3)* 


Jet kinematics 


1.4(1)* 


0.43(6) 


0.6(2)* 


0.5(4)* 


8(3)* 


Central jet veto 


0.7(1)* 


0.36(6) 


0.16(9)* 


0.3(3)* 


4(1)* 


Mass window 


0.08(3)* 


0.03(1) 


0.03(3)* 


0.1(1)* 


1(1)* 
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4 Signal sensitivity 

4.1 Overview 

In this section we outline the method for extracting the signal significance from the data and measuring 
the Higgs boson mass. In addition to a simple method based on counting the number of events in a 
mass window, we present a method based on fitting the m TT spectrum. Particular care has been given to 
the incorporation of uncertainty in both the rate and shape for the signal and backgrounds. The fitting 
strategy constrains the shape of the Z — > TT background from the data-driven techniques described in 
Section 3.2. The data-driven estimates for the it and W+jets background are not finalized, thus we rely 
on Monte Carlo to constrain the shape of those backgrounds and allow for large variation in the shape. In 
the //j-channel, the normalization of the fake-tau contribution is constrained from the track multiplicity 
measurement outlined in Section 3.3. Sections 4.2 and 4.3 describe the parameterization of the signal and 
backgrounds, while Sections 4.4 and 4.5 describe how the control samples are incorporated to constrain 
the normalization and shape uncertainties. The expected signal sensitivity is estimated by considering 
a hypothetical data set given by the median of the signal-plus-background estimate. The normalization 
of the background predictions are described in Section 3.6. Due to the limited size of the Monte Carlo 
samples, the shape of the m TT spectrum for the it and W+jets background was obtained from an earlier 
point in the event selection, just after the collinear approximation requirement in the //-channel and after 
the transverse mass cut in the //j-channel (see Sections 2.7 and 2.8). Similarly, the shape of the Z — > TT 
background in the control sample, which can be estimated by the data-driven techniques described in 
Section 3.2, was obtained from a Z —>■ TT Monte Carlo sample just after the same points in the event 
selection. Depending on how the analysis evolves, the shape of the m TT spectrum from an earlier point 
in the event selection - which is dominated by backgrounds - may also provide a useful control sample 
to validate the data-driven background estimation techniques since it is fairly stable in the later stages of 
the event selection. 

For the first time ATLAS has investigated the /z/z-channel. While the signal efficiency and m xx mass 
resolution are roughly comparable to the //- and //i-channels, a reliable estimate of the QCD background 
can only be provided with data. Therefore, we do not report on an estimated sensitivity for the M-channel 
below. 

4.2 Shape parameterization for H — > tt and Z — > tt 

The shape of the m TT distribution for signal and Z — > TT events is dictated by the resolution of £"™ 1SS 
and the kinematics of the collinear approximation. As discussed in Section 2.6, the width of the m TT 
distributions is given by the Ej lss resolution scaled by a Jacobian factor and the cuts on the x T variables 
introduce an asymmetry in the m xx distribution. A full solution would include an event-by-event Jacobian 
scaling of the width and truncation at m TT > m V!S / ■ s /xf a x% a . An approximate parameterization can be 
constructed with these features in mind. First, we consider three sub-samples of events with J < 2, 
2 < J < 5, and J > 5. Let, (/),- and TV,- denote the mean of the Jacobian and the number of events 
in the i th subsample, respectively. We account for the Jacobian scaling of the width by using a triple 
Gaussian with identical mean, normalizations according to the ratios of the iVj and widths according 
to the ratios of (/),-. To account for the asymmetry introduced by the x % cuts, we modulate the triple 
Gaussian by an efficiency envelope derived from the m VIi spectrum. The efficiency envelope reflects 
the probability that m TT is greater than m v j s and is parameterized as: 1 /2+ 1 /2erf{[m TT — (m v i s )] / V2(J V i s } , 
where (m V i S ) and (7 ViS are the mean and standard deviation of the spectrum for those events which fail 
the x x cuts. The parameter {m vis ) depends on the Higgs boson mass and was linearly parameterized by 
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Figure 13: Figures (a) and (b) show the result of a fit to a pure Monte Carlo samples of Z — > xx and 
signal (inn = 120 GeV) in the //i-channel, respectively. The dashed lines represent the three components 
of the model and the dotted curve represents the erf() efficiency envelope. These samples do not include 
pileup. 

0.576 m// + 60 GeV; the width o V j S was fixed at 10 GeV. We parametrize the m TT distribution as 

L H/Z (m, r \m, a H/z ) = JT [1 + 1 erf ^ m ^ m "") ^ j x Jjf. G aus(m rr \m, a H/z (/),-), (11) 

where the resulting function's normalization constant JV was found by numerical integration with the 
ROOFlT package [37]. 

The Z — > xx control sample described in Section 3.2 is used to constrain the mean, mz, and the overall 
width of the distribution, Oz, which are the only free parameters in the Z — > xx background model. The 
error bars in the control sample were scaled by 10% to account for the 10% shape uncertainty in the 
jl^x rescaling method and the extrapolation from the control region to the final signal region. Figure 13 
shows the result of the fit to the Z — > xx and Higgs boson signal in the //i-channel. 

An alternate parameterization was also considered. This parameterization was also composed of 
three components, but the asymmetry in the shape was modeled with a "bifurcated" or "dimidated" 
Gaussian [38], in which the width of the lower half was smaller than the width of the upper half of the 
distribution. This alternate parameterization did not fix the ratio of the widths and normalizations for the 
three components based on the distribution of the Jacobian; instead, these parameters were themselves 
parameterized as a piece- wise linear function of m//. Each of the three components required four param- 
eters to model the widths, together with two parameters for normalization constants, and two parameters 
for the linearity in the Higgs boson mass. In total the signal model was represented by 16 parameters. 
Results from these two parameterizations were in good agreement. 

4.3 Shape parameterization for tt and W+jets 

In contrast to the irreducible Z — > TT+jets background, the W+jets background is dominated by situations 
in which one of the tau decay products comes from a W decay and the second tau decay product is a fake 
from a jet. The tt background is even more complicated because the decay products from top contain a 
real tau contribution as well as a fake tau contribution. It is difficult to estimate this background using 
Monte Carlo because one must understand in detail both the jet kinematics for as well as the lepton or x 
fake rate as a function of pj and Vj . Instead, it is desirable to estimate this background with data. Since the 
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data-driven strategy for the IV+jets background is under development, we simply rely on Monte Carlo to 
provide a control sample for this background. Figure 14(a) shows the shape of the fully simulated VK+jets 
background and the fast simulation tt background sample after the transverse mass cut. The shapes are 
consistent within statistical errors. While the shapes from those backgrounds remain stable through the 
final stages of the event selection, a conservative 50% error is applied to each bin in the combined tt and 
W+jets control sample to reflect uncertainty in how this shape changes as the remainder of the analysis 
cuts are applied. 

The shape of the QCD background was parameterized with the following equation 

/ x Y 2 

LQCD(m„\ai, 02,03) = jV m" 3 t . (12) 

\m xx +ayj 

The form is motivated by a competition between the parton distribution functions and the matrix element. 
In the //i-channel, the normalization of the backgrounds with fake taus can be constrained by using 
the track multiplicity constraint described in Section 3.3; however, there is an additional uncertainty 
associated with how well the fake fraction can be extrapolated from the control sample to the signal 
like region. We apply a conservative 50% systematic on this fraction associated with the extrapolation. 
Figure 14(b) and (c) show the result of the simultaneous fit to the fake tau background described in 
the Section 4.5 with (solid) and without (dashed) the signal contribution for the //- and //z-channel, 
respectively. The variation reflects the magnitude of the shape uncertainty. 




(a) (b) (c) 



Figure 14: Figure (a) shows that the shapes are similar for these backgrounds and that the shape is stable 
in the final stages of the cut flow. The m xx spectrum for it and W+jets backgrounds after all cuts for 
the //-channel (b) and //i-channel (c) with a fit to the spectrum. The solid and dashed curves show the 
result of the simultaneous fit to the control sample and signal candidates with and without the signal 
contribution, respectively. 



4.4 Signal significance neglecting shape uncertainty 

For a given hypothesized Higgs boson mass, the mass window has been defined as m# — 15 GeV < 
m T i < mn + 15 GeV. A simple approach to estimating the expected significance of the signal is to count 
events in this range and calculate the probability for at least this many events from the background-only 
prediction. An alternate approach is to fit the m xx spectrum and use the resulting signal yield as a test 
statistic. Table 12 shows the significance obtained from number counting assuming a 10% background 
uncertainty as was done in Ref. [4] and the result from the fitted signal yield. The next subsection 
provides a final result indicating a more realistic treatment of both normalization and shape uncertanties. 
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Table 12: Expected signal significance for several masses based on number counting in a mass 
window with 30 fb~ 1 of data. Results are shown neglecting uncertainty in the background rate 
and incorporating it with two methods (see text). These results do not include the impact of 
pileup, which is discussed in Section 4.7. 



m H 


//-channel 
Counting Fitted Yield 


//i-channel 
Counting Fitted Yield 


combined 
Counting Fitted Yield 


105 


2.20 


2.43 


2.85 


3.46 


3.80 


4.17 


110 


2.46 


2.88 


3.45 


4.19 


4.46 


5.06 


115 


2.86 


3.26 


4.18 


4.96 


5.32 


5.96 


120 


2.80 


3.17 


4.23 


4.73 


5.36 


5.72 


125 


2.67 


2.96 


3.97 


4.32 


5.08 


5.28 


130 


2.42 


2.73 


3.54 


3.88 


4.62 


4.77 


135 


2.17 


2.37 


3.38 


3.60 


4.35 


4.25 


140 


1.74 


2.00 


2.66 


2.83 


3.55 


3.35 



4.5 Incorporating control samples and shape uncertainty 

By fitting the m TT spectrum to a model that accurately describes the signal and various backgrounds 
it is possible to directly incorporate uncertainty in the background shape and take advantage of the 
shape of the signal within the mass window. In order to constrain the background rate and shape, we 
simultaneously fit the signal candidates and the background control samples outlined in Section 3. The fit 
is performed twice, once letting the signal parameters float (the maximum likelihood estimates denoted 
with a single * ) and once constraining the signal normalization to be zero (the conditional maximum 
likelihood estimates denoted with a double'). The ratio of these likelihoods is referred to as the profile 
likelihood ratio, A, 

x ^ =0) = ^ata\^) : m) 
^ ' L(data\fi,b,v) 
where }X represents the signal strength in units of the Standard Model expectation and v represents the 
nuisance parameters needed to describe the shape. If the Higgs boson mass is specified, the distribution of 
—2 log A ratio asymptotically approaches a % 2 distribution with the number of degrees of freedom given 
by the number of parameters of interest 4 . The motivation for jx is that it enforces the relationship of 
the Standard Model branching ratios when combining the individual channels, maintaining the property 
that the distribution of -2 log A is % 2 with one degree of freedom. This improves the power compared 
to a method which lets the signal in each channel vary independently. If the Higgs boson mass is not 
fixed, then one must take into account the "look-elsewhere" effect, which is discussed in more detail in 
Section 6. The likelihood function used in the simultaneous fit is simply a product of the likelihoods 
from the individual measurements: 

L(data\n,m H ,v) = L track (track multiplicity! r QCD ) (14) 

x Lz (Z + j ets control \mz,Oz) 

x Lqcd(QCD control|ai,a2,a3) 

x L s+b (signal candidates | iu.,m H ,a H ,mz,az, tqcd , a \ , a 2 , a 3 ) , 

where the a,- are the parameters used to parameterize the fake-tau background and v represents all nui- 
sance parameters of the model: c#,mz,Oz,rgcD, 01,02,03- When using the alternate parameterization 



4 When constraining jj. > 0, the distribution for the background-only hypothesis is modified such that — 2 log A (/i = 0) ~ 
1 /2S(0) + 1 /2#j , and this is taken into account in computing the p- value. 
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100 120 140 160 180 
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Figure 15: Example fits to a data sample with the signal-plus-background (a,c) and background only 
(b,d) models for the Ih- and //-channels at m# = 120 GeV with 30 fb _1 of data. Not shown are the 
control samples that were fit simultaneously to constrain the background shape. The fits are performed 
to the signal and background expectation (histograms), while the overlaid data with error bars are only 
indicative of a possible data set. These samples do not include pileup. 
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of the signal, the exact form of Equation 14 is modified to coincide with parameters of that model. 

Figure 15 shows the fit to the signal candidates for ra# = 120 GeV with (a,c) and without (b,d) 
the signal contribution. It can be seen that the background shapes and normalizations are trying to 
accommodate the excess near m xx = 120 GeV, but the control samples are constraining the variation. 
Table 13 shows the significance calculated from the profile likelihood ratio for the //-channel, the Ih- 
channel, and the combined fit for various Higgs boson masses with 30 fb _1 of data. Finally, we present 
the expected significance as a function of Higgs boson mass in Fig. 16. 



Table 13: Expected signal significance for sev- 
eral masses based on fitting the m xx spectrum with 
30 fb _1 of data. Background uncertainties are in- 
corporated by utilizing the profile likelihood ratio. 
These results do not include the impact of pileup, 
which is discussed in Section 4.7. 



^10 



m H 


//-channel 


Ih -channel 


combined 


105 


1.95 


2.41 


3.10 


110 


2.44 


3.35 


4.15 


115 


2.98 


4.07 


5.04 


120 


2.92 


3.87 


4.85 


125 


2.75 


3.75 


4.65 


130 


2.46 


3.38 


4.18 


135 


2.21 


3.32 


3.99 


140 


1.80 


2.70 


3.24 



ATLAS 

\[s = 14 TeV, 30 fb 1 
ll-channel 
Ih-channel 
combined 



Figure 16: Expected signal significance for several 
masses based on fitting the m xx spectrum. Back- 
ground uncertainties are incorporated by utilizing 
the profile likelihood ratio. These results do not in- 
clude the impact of pileup. 



4.6 Mass determination 

The mass parameter m# and its error can be determined from the fits described above; however, the 
parameter in the model may not be the best estimate of the physical Higgs boson mass. Similarly, the 
error on the mass parameter from the fit should be validated with a large number of pseudo-experiments. 
Figure 17 (a) shows the relationship of the input Higgs boson mass and the reconstructed Higgs boson 
mass (i.e. the parameter mn in Equation 11) obtained with 2000 pseudo-experiments per input mass 
point. Figure 17 (b) shows the m xx resolution for the signal as a function of the input Higgs boson mass. 
When scaling the deviation by the MINOS errors, the pull distribution of mu was found to be consistent 
with the normal distribution N(0, 1). The mass resolution is found to be in the range of 8~10 GeV. A 
similar mass resolution was found in the M-channel when analyzing signal Monte Carlo samples. 

4.7 Influence of pileup 

The presence of pileup has three major effects on the analysis. First, additional p-p interactions can 
produce hadronic activity in the central region which causes events to fail the central jet veto. Secondly, 
the presence of pileup generally degrades the £"™ ss resolution, which, in turn, reduces the efficiency of 
the collinear approximation cuts and degrades the m xx resolution. Thirdly, pileup degrades the hadronic 
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Figure 17: The linearity of the fitted mass versus the input mass (a) and the mass resolution versus the 
input mass (b). These results do not include the impact of pileup, which is discussed in Section 4.7. 

tau identification. Fortunately, the electron and muon identification have been shown to be quite robust 
against pileup [15, 16]. While the jet performance is affected by pileup, the analysis is fairly robust 
against those effects. 

The simulation of pileup is technically very challenging since it is performed at a very low level 
in the detector simulation. Limited samples with pileup were available at the time of writing, and the 
£"™ ss and hadronic tau identification algorithms were not re-tuned in this context. The distribution of 
the log likelihood ratio discriminant for the calorimeter-based hadronic tau identification algorithm is 
shifted to lower values for both real taus and jet fakes. By simply adjusting the cut on the log likelihood 
ratio to 0, the same signal efficiency can be maintained with approximately a 50% drop in jet rejection. 
By re-tuning the discriminant in the context of pileup, improved jet rejection should be possible. In all 
three channels, the mass resolution is degraded from ~9.5 to ~ 1 1.5 GeV for ra# = 120 GeV due to the 
degradation of the E™ lss resolution. Figure 8 shows that the central jet veto survival probability drops 
from ~88% to ~75% at 10 33 cm^s" 1 and ~65% at 2 x 10 33 cm^s" 1 . Studies indicate that the use of 
tracking and calorimeter timing information can be used to mitigate this loss in signal efficiency. 

Given the lack of background samples simulated with pileup and the need to re-optimize the recon- 
struction and analysis in that context, we do not report signal significance estimates. 



5 Systematic uncertainties 

5.1 Overview 

The data-driven background estimation methods described above have been developed so that uncer- 
tainty in the background shape and normalization are included directly into the significance calculation. 
Because the discovery criterion is simply testing the presence or absence of the signal, it is not sensitive 
to some of the sources of systematic uncertainty. In contrast, measurement of the Higgs boson mass is 
sensitive to the energy scale of electrons, muons, hadronic taus and E™ 1SS . Furthermore, measurement 
and exclusion of o(pp — ► qqH) x BR{H — > tt) are sensitive to the uncertainty on the signal selection 
efficiency. Below we discuss the impact of these systematics on the analysis. 
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5.2 Systematic mis-measurement of the signal 

First, we consider the purely experimental sources of systematics. The approach here is to assume that 
once we have data, the estimates of the energy scales, resolutions, and efficiencies might be systemat- 
ically biased. Estimates of the systematic uncertainty from various sources are given in Table 14. The 
uncertainty estimates are common for all this series of notes, except for the uncertainties on the central 
jet veto and forward jet tagging efficiency. There is not yet a dedicated study of the expected uncertainty 
on these efficiencies from data, thus we assume uncertainty on the reconstruction efficiency to be half the 
tau identification efficiency and point out that most effects relevant to the jets have already been included 
in the jet energy scale uncertainty. We use the nominal detector performance as the central value and 
then manipulate the Monte Carlo signal to reflect these changes. For instance, in the case of the electron 
energy scale uncertainty, we coherently change all electrons to have 0.5% higher Ej, modify the E™ 1SS 
vector accordingly, and recalculate the signal efficiency. This is done individually for each source of 
systematic and upward and downward fluctuations are treated separately. In the case of the jet energy 
scale, only some elements of the uncertainty are relevant for £"™ ss . A study of £"™ ss projected onto 
the direction of the reconstructed Z in Z — ► ZZ+jets with a subset of the analysis cuts indicated that the 
£"™ ss scale can be measured within 5%; thus we only manipulate the £"™ ss vector according to a 5% jet 
energy scale shift. In the case of systematic uncertainty on resolution, we only considered a degradation 
in the resolution by the tabulated amount. Finally, for systematics on reconstruction and identification 
efficiency we assume a 1-to-l transfer to the uncertainty on the signal efficiency and include a factor 
of two when the signal efficiency scales as the square (e.g. the electron efficiency in the //-channel). 
Table 14 summarizes the effect of systematic mis-measurement on the signal efficiency. 



Table 14: Estimated scale of systematic mis-measurements and their effect on the signal efficiency, 
t When varying the jet energy scale, only a 5% mis-measurement of the jet energy was used in 
manipulating the £"™ 1SS vector. See text for details. 



Source 


Relative uncertainty 


Effect on signal efficiency 


luminosity 


±3% 


±3% 


muon energy scale 


± 1% 


± 1% 


muon energy resolution 


(T(pr)©0.011pr©l-7 10 _4 /4 


± 0.5% 


muon ID efficiency 


±1 % 


±2% 


electron energy scale 


± 0.5% 


± 0.4 % 


electron energy resolution 


a(£ r )©7.3 10- 3 £ r 


± 0.3 % 


electron ID efficiency 


± 0.2% 


± 0.4% 


tau energy scale 


±5% 


± 4.9% 


tau energy resolution 


a(E) 00.45^ 


± 1.5% 


tau ID efficiency 


±5% 


±5% 




±7%(|tj| <3.2) 




jet energy scale 1 " 


± 15%(|tj| >3.2) 


+ 16% / 

/-20% 




±5% (on Ef ss ) 




jet energy resolution 


o{E) © 0.45^ (\n | < 3.2) 






a(E) © 0.67^ ( J] > 3.2) 


± 1% 


b-tagging efficiency 


±5% 


±5% 


forward tagging efficiency 


±2% 


±2% 


central jet reconstruction efficiency 


±2% 


±2% 


total summed in quadrature 




±20% 
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5.3 Theoretical uncertainty 

In addition to the effect of systematic mis-measurement on the signal efficiency, theoretical uncertain- 
ties also limit our ability to estimate the signal efficiency. Next-to-leading order QCD calculations are 
now available for the vector boson fusion process. A dedicated study [39] investigated the overall renor- 
malization and factorization scale dependence (2%) as well as the parton distribution function (PDF) 
uncertainties (3.5%). Next-to-leading order electroweak corrections are also quite large for the vector 
boson fusion process, giving a 3% uncertainty for the full next-to-leading order calculation [40]. Re- 
cently, the dominant next-to-leading order QCD corrections to the Higgs boson plus three jets have been 
calculated for vector boson fusion, providing a scale uncertainty on the parton-level central jet veto sur- 
vival probability of 1% [41]. 

While the parton-level theoretical uncertainties are under very good control and below the level of 
both the statistical error and measurement-related systematics, the same is not true for the theoretical 
uncertainty related to the parton-shower and underlying-event. We rely on Monte Carlo simulations that 
model the parton-shower, hadronization, and underlying event to simulate the detector response. The 
uncertainty in these calculations is not comparable to the accuracy of the parton-level predictions. The 
central jet veto efficiency was studied with the signal process generated with PYTHIA (with various 
tunings), HERWIG and SHERPA and the fast detector simulation. After the analysis cuts, the different 
generators differ by 41%. Studies focusing specifically on the matrix element-parton shower matching 
indicate a substantially smaller uncertainty [33,42]. We will measure the underlying event [43,44] and 
tune the parton shower and hadronization with data, but it is likely that this contribution of the uncertainty 
will remain significant. Currently there is no estimate of the expected uncertainty related to the parton- 
shower, hadronization, and underlying event tuning. Clearly, this is an area that deserves attention as 
such a large uncertainty will hinder exclusions if a Higgs boson does not exist in this mass range and 
cross-section and coupling measurements if one does. After discussions with the authors of PYTHIA, 
HERWIG and SHERPA we feel that the residual uncertainty in the parton shower after tuning to the 
data will be less than the 18% uncertainty quoted for the jet energy scale. Thus, the uncertainty in the 
signal efficiency will be dominated by the jet energy / E™ 1SS scale uncertainty and the precise uncertainty 
in the parton shower is not relevant. Table 15 summarizes the theoretical uncertainties for the signal 
production. 



Table 15: Theoretical uncertainties which affect the estimation of the signal efficiency. 



Source 


Relative uncertainty 


Effect on signal efficiency 


PDF uncertanties 


±3.5% 


±3.5% 


scale dependence on cross-section 


±3% 


±3% 


scale dependence CJV efficiency 


± 1% 


± 1% 


parton-shower and underlying event 


±<10% 


± <10% 


total summed in quadrature 




± < 10% 



6 Discussion 

The expected signal significances in Table 13 are qualitatively consistent with the results found in 
Ref. [4]; however, the predicted cross-sections in Tables 9 and 10 are significantly different. In particular, 
the initial cross-section of the Z+jets background is smaller by nearly a factor of four and the tt back- 
ground in the //-channel is larger by nearly a factor of 2. Much of this difference reflects the evolution 
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Figure 18: The ratio of expected p- values for the 
floating and fixed mass fits as a function of the 
Higgs boson mass. This plot summarizes the im- 
pact of the "look-elsewhere" effect in this analysis. 
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Figure 19: Expected 95% exclusion of the signal 
rate in units of the Standard Model expectation, ju, 
as a function of the Higgs boson mass for the // and 
Ih -channels with 10 fb -1 of data. The exclusion 
takes into account the uncertainty on the signal 
efficiency described in Section 5. 



in the Monte Carlo generators for these challenging backgrounds. In the case of the Z+jets background, 
differences in the choice of renormalization and factorization scales and parton density functions are the 
source for part of the discrepancy; however, approximately a factor of two comes from the treatment of 
soft and collinear divergences. After substantial investigation, we concluded that the Z+jets background 
estimate used in Ref. [4] was conservative. In the case of ??+jets, the necessary matching between ma- 
trix elements and parton showers had not been developed when Ref. [4] was written. The recipe used to 
merge the ??+0,l,2 jet samples at that time and the more realistic ^-tagging performance limits our ability 
to understand in detail the source of the differences. We are confident that MC@NLO and our current 
detector simulation and offline reconstruction provide a superior prediction of this background. 

For the first time, ATLAS has investigated the potential of the M-channel. Much of this work has 
been devoted to the development and study of tau and missing Ej triggers. It now appears that the trigger 
is feasible, and the reconstruction of the signal maintains an efficiency and m xx mass resolution compa- 
rable to the //- and Z/z-channels. The open question for this channel is the size of the QCD background - 
a question that can only be answered with data. 

The results shown in Section 4 are based on a fixed mass hypothesis. If one leaves the Higgs boson 
mass a free parameter in the likelihood fits, then one must take into account the "look-elsewhere" ef- 
fect. Naively, one would expect the magnitude of the effect to be rather small for this channel given the 
~10 GeV mass resolution and the ~ 30 GeV mass range of interest. Detailed study shows that it is a mix- 
ture of two effects. First, the distribution of — 2 log X(}X = 0) is not ^-distributed under the background- 
only hypothesis; it has a longer tail which raises the p-value. Secondly, the median of — 21ogA(jU = 0) 
under the signal-plus-background hypothesis is systematically larger in the floating mass case because 
the model can adapt to fluctuations in the signal mean. Figure 1 8 summarizes the impact of these two 
effects. 
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Future work for this channel includes an extensive study of performance in the context of pileup, 
including optimization of the hadronic tau identification and E™ 1SS reconstruction. The data-driven 
background estimation for the tt and W+jets backgrounds must be further developed. In order to measure 
o{pp — >• qqH) x BR(H — > tt) at the desired level, the systematics associated with the signal efficiency 
must be reduced. This requires a technique to measure the central jet veto and forward jet tagging 
efficiency in data. Additionally, the use of the Ej lss projection method should be further developed to 
mitigate the impact of the jet energy scale uncertainty. 

While the results shown in Section 4 only include the contribution of Higgs bosons produced via 
vector boson fusion, an additional 10% contribution from the gluon-fusion production process could 
be expected at ma = 120 GeV. Finally, the theoretical uncertainty associated with the parton shower 
and underlying event, which currently gives the largest uncertainty on the signal efficiency, must be 
addressed. The tuning of the parton shower, hadronization, and underlying event model will be among 
the earliest measurements at the LHC. If the residual theoretical uncertainty is not reduced sufficiently, a 
different strategy may need to be found for the central jet veto. The expected exclusion power based on 
the uncertainty in the signal efficiency is shown in Fig. 19. 

7 Summary and conclusion 

The sensitivity of the ATLAS detector to a Standard Model Higgs boson produced via vector boson 
fusion with subsequent decay into taus has been investigated with state-of-the-art Monte Carlo genera- 
tors, a full GEANT-based simulation of the ATLAS detector and our current trigger and reconstruction 
algorithms. Particular emphasis has been placed on data-driven background estimation strategies and the 
estimation of the associated uncertainties in normalization and shape. The impact of pileup has not been 
fully addressed; however, results without pileup indicate that a ~ 5a significance can be achieved for a 
Higgs boson mass in the range 1 15 - 125 GeV after collecting 30 fb _1 of data and combining the 11- and 
Z/z-channels. The Higgs boson mass resolution is approximately 10 GeV, leading to approximately 3.5% 
mass measurement. The M-channel has also been investigated and gives similar results for signal and 
non-QCD backgrounds as the other channels; however due to the challenge of predicting the QCD back- 
ground, we do not report on an estimated sensitivity for that channel. Currently, measurement-related 
systematics and large theoretical uncertainties limit the ability for a measurement of the product of cross- 
section and branching ratio. Future work is needed to constrain these uncertainties in order to measure 
the Higgs boson couplings, spin and CP properties. 
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Higgs Boson Searches in Gluon Fusion and Vector Boson 
Fusion using the H —> WW Decay Mode 

Abstract 

The prospects for Higgs searches in the H + 0 j (H —>■ WW —> ev\l v), H + 2j 
(H — > WW — > eV|iV), and // + 2 ; (H->WW-> ivqq) channels at ATLAS are 
presented, including realistic effects such as trigger efficiencies and detector 
misalignment, with an emphasis on practical methods to estimate the back- 
grounds using control samples in data. With 10 fb _1 of integrated luminosity, 
one would expect to be able to discover a Standard Model Higgs boson in the 
mass range 135 < m# < 190 GeV and to be able to measure its mass with a 
precision of about 7 GeV if its true mass is 130 GeV or about 2 GeV if its true 
mass is 160 GeV. 

1 Introduction 

The nature of the Electroweak symmetry breaking sector is arguably the most important unknown in 
particle physics today. An introduction to the Higgs mechanism and an overview of the possible search 
channels for a (standard model) Higgs boson at the LHC can be found in Ref. [1]. This note studies 
the sensitivity of Higgs boson searches in the decay mode H — > WW to a Standard Model Higgs boson. 
Special emphasis is placed on in-situ control samples that can be used to estimate the background con- 
tributions using data and on the systematic uncertainties associated with these background determination 
methods. 

The note is organised as follows: Section 2 discusses the production mechanisms and final states 
under study, the relevant backgrounds for each process and the Monte Carlo generators used to model 
them. In Section 3, the performance of the reconstruction for the various final-state particles is described. 
In Section 4, an overview of statistical issues common to most of the analyses is presented. Section 5 
describes the event selection and control samples for a search for H — > WW — »• ev/J,v in events where no 
hard jets are reconstructed in the detector. In Section 6, the analysis of the H + 2j, H — > WW — > eVjXV 
channel is presented, and in Section 7, an analysis of events where one of the W bosons decays to two 
jets is discussed. Finally, in Section 8, the combined sensitivity of the ATLAS detector to the presence 
of neutral scalar resonances in the W pair final state is presented. 

2 Physics Processes and Monte Carlo 

In the Standard Model, there are two dominant production modes of the Higgs boson in the kinematic 
region of interest for the H — > WW decay mode: gluon fusion and Vector Boson Fusion (VBF). In 
studies of the former production mode, one must consider background processes that yield two leptons 
and significant missing transverse energy in the final state; in the latter, the final state also includes two 
hard jets (from the struck quarks) which tend to be well-separated in pseudorapidity. This note considers 
the following signal and background processes: 

• Higgs boson production via gluon fusion: This is the dominant production mechanism for the 
signal considered in the analysis of Section 5; it is modeled with 48500 events generated by 
MC@NLO [2,3]. A Higgs boson decay matrix element similar to the one used in Ref. [4] was 
used to reweight this Monte Carlo sample to include the complete spin correlations for the Higgs 
decay. 
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• Higgs boson production via Vector Boson Fusion: This is the dominant production mechanism 
for the Higgs boson searches of Sections 6 and 7. This process is modeled with the generators 
provided in PYTHIA [5] (14500 events), HERWIG [6] (18050 events), and SHERPA [7] (19950 
events). 

• pp — > WW production. This is the dominant background contribution to the analysis studied in 
Section 5; in that analysis, it is modeled with 178450 events generated by MC @NLO. It is impor- 
tant to note that MC@NLO only calculates the 0(a®) and 0(a}) contributions to this process; 
it does not compute the O(aj) contributions. Because of the large gluon luminosity expected at 
LHC it is important to include at least the gluon-initiated component (where the W pair production 
proceeds via a quark box) for the analysis of the fully jet-vetoed final state. This contribution is 
modeled with 176300 events produced using the generator in Ref. [8]. 

In the Vector Boson Fusion searches, the contributions from processes like qq — »• WW qq (mediated 
by the exchange of a weak boson or a gluon) become important. Therefore, a sample of 171250 
events generated with ALPGEN [9, 10] is used instead of MC@NLO to model the pp — > WW jj 
background in the analysis of Section 6. 

• tt production. The two top quarks decay into a pair of W bosons and two b jets. The cross-section 
is expected to be dominated by doubly resonant tt production in the Vector Boson Fusion searches 
of Sections 6 and 7; however, for the search of Section 5, one must also take care to handle the 
single-top background correctly. The absolute cross-section for the contribution from single top 
production is presently only known with leading-order uncertainties, but since the search presented 
in Section 5 normalises the top background using data, it is only necessary to consider its potential 
impact on the extrapolation from the b-tagged to the b-vetoed region. In the estimation of the 
sensitivity of all channels under study, a sample of 538300 events generated with MC@NLO is 
used to model this background. Two MadEvent [11-13] samples are used to estimate the impor- 
tance of the contribution from single-top production in the H + 0 j channel: one contains all matrix 
elements for pp — > WWbb; the other contains only the doubly-resonant contribution. 

• Z — ► 11 production. In channels with two electrons or two muons, the direct decays of Z —>■ ee and 
Z —>■ jijx dominate, but this background can also contribute to e\l final states when the Z decays to 
a pair of % leptons which both decay leptonically. This background is modeled with a sample of 
163200 events generated with PYTHIA in the H + Oj analysis of Section 5 and with a sample of 
38150 events produced using ALPGEN in the H + 2j analysis of Section 6. 

• W+n jets production, with n < 5. This is the dominant background for the lepton-hadron channel; 
however, it can also play a role in the dilepton channels as a source of fakes. This background is 
modeled using a sample of 411424 events generated with ALPGEN. 

A special subset of this background is W+c+n jets, with n < 4. The charm production in associa- 
tion with a W boson through processes like gs —>■ Wc is enhanced, due to the large partem density 
function of strange sea quarks and the large CKM matrix element |V W |. Decays of charm mesons 
are a potential source of leptons. Lepton isolation cuts are tuned on this kind of background, to 
reach the necessary rejection against leptons from semileptonic charm- or b- decays. [14]. 

• bb, cc and QCD multi-jet. Due to the large cross section of these processes they could be a further 
source of background. The large amount of CPU power needed makes a Monte Carlo simulation of 
these backgrounds currently unrealistic. For the dilepton channels the requirement of two leptons 
and missing transverse energy should reduce these backgrounds to a negligible level. 

Table 1 gives an overview of the cross sections of the signal and background samples used in this 
note. Detailed tables of the Higgs boson cross sections and branching ratios can be found in Ref. [1]. 



1307 



Higgs - Higgs Boson Searches in Gluon Fusion and Vector Boson Fusion using . 



Process 



Generator 



Cross-section(pb) 



gg -»■ H -»■ WW (M H = 170 GeV) 
VBF H -»■ WW (M H = 170 GeV) 
VBF H -» WW (M# = 300 GeV) 

Z — > TT+jets 
W+jets 



MC@NLO 
PYTHIA/Sherpa 
HERWIG 
MC @ NLO/Alpgen 
GG2WW 
MC@NLO 
PYTHIA/ALPGEN 
ALPGEN 



19.418 
2.853 
0.936 
111.6 

5.26 

833 
2015 
20510 



Table 1: Overview of the Monte Carlo generators and cross sections for the Higgs boson signal and 
background processes used in this note. The W+jets cross-section listed is the cross-section per lepton 
flavor. 



3 Reconstruction 

The final states considered here all include leptons. Section 3.1 briefly describes the lepton selection 
criteria and other central aspects of the reconstruction used in this note. Then, in Section 3.2, a few 
important details about jets, E™ lss , and pile-up are briefly discussed. Note that the results presented in this 
note were made using version 12 of the ATLAS software, whereas the results on detector performance 
in Refs. [15] and [16] are based on release 13. 

3.1 Leptons 

The electron selection consists of the following criteria : 

• Electrons are identified using the standard ATLAS criteria described in Ref. [15]. In the H + 0j 
(H —> WW —> ev/xv) and H + 2j (H —> WW —> Ivqq) channels, tight electrons are required. In 
the case of the H + 2j, H — > WW — > ev/iv analysis of Section 6, a sufficiently strong rejection 
against the W+jets background can be achieved by applying cuts on the VBF tagging jets that it is 
possible to use a medium electrons instead. 

• A track match is required. The transverse impact parameter significance of the matched track, 
do/G(k> * s required to be less than 10. 

• Candidates are required to be well isolated. Calorimeter isolation in a AR cone of 0.2 is required 
to be less than 5 GeV, and Lpj of tracks in a cone of AR < 0.4, excluding tracks from other lepton 
candidates and tracks with pj < 1 GeV, is required to be less than 5 GeV. 

• Kinematic acceptance: pj > 15 GeV and \r\\ < 2.5 (The pj threshold is larger for the lepton- 
hadron channel). Electron candidates in the crack region, 1.37 < |tj | < 1.52, are excluded. 

It has been shown that similar isolation criteria can effectively suppress the background from W+c+jet 
events, where a second lepton comes from charm decay, to negligible levels. 

W+jets is one of the main sources of fake backgrounds for the dilepton channels; it is crucial to 
achieve a good rejection against this background. The probability for a jet to be misreconstructed as 
an electron was derived from a W(— > ;UV)+jets sample which contains no true high pj electrons. The 
average efficiency to reconstruct an electron candidate without isolation cuts in the gg^H —>■ WW signal 
events is 60.3±0.5%; the additional isolation criteria reduce this to 50.0±0.5% efficiency. With all cuts, 
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the average fake rate as determined from W — »• jltv+jets events is (1.7 ±0.2) x 10 4 before isolation and 
(6.7+} J) x 10~ 5 after isolation. 

The muon selection consists of a similar set of cuts : 

• Combined muons and best match to an inner detector track [16]. 

• Calorimeter Isolation based on the etconelQ variable [16] (etconelQ < 2.5 GeV) and track isola- 
tion (in a cone of AT? < 0.4) as for electrons, but with a different pj threshold, Lpr < 3 GeV. 

• Transverse impact parameter significance less than 10. 

• pr > 15 GeV and \r) \ < 2.5 (The pj threshold is larger for the lepton-hadron channel) 

As for the electrons, similar isolation criteria were shown to be effective against the W+c+jet back- 
ground. The performance for muons is better than for electrons. The average efficiency to reconstruct a 
muon candidate in gg —> H — > WW signal events is 94.2 ±0.1% before isolation cuts and 77.1 ±0.2% 
after applying cuts. The corresponding fake rate in W — > <?v+jets events is (1.7+g'j) x 10~ 5 after isolation 
cuts are applied. 

3.2 Jets, Missing p T , and Pile-up 

In the H + 2 j searches of Sections 6 and 7, the presence of at least two jets with pj > 20 GeV and a large 
difference in pseudorapidity are required. It is frequently the case that at least one of these jets will land 
in the forward region of the detector. To minimise the effect of calorimeter noise and maximise efficiency 
for the jets of interest to this study, jets reconstructed from topological clusters are used, rather than jets 
reconstructed from calorimeter towers [17]. For VBF Higgs boson signal processes the efficiency for jets 
with a cone size of AR = 0.4 from topological clusters is 94.3% in the central detector region and 95.3% 
in the forward region. For jets from calorimeter towers the efficiency is significantly lower with 92.4% 
in the central detector region and 84. 1% in the forward region. 

All the searches considered in this note make use of analysis techniques that either select or give 
a large weight to events without additional jets; in either case, events are effectively rejected if they 
contain extra jets beyond those jets (if any) that are explicitly required by the analysis. Even in the first 
years of "low-luminosity" data taking at the LHC, analyses will be affected by the pile-up of multiple 
collisions per beam crossing. Low pj QCD interactions (minimum-bias) are present simultaneously to 
a hard interaction. Such pile-up events will give rise to jet activity that can sometimes cause interesting 
signal events to be erroneously rejected; in selecting a jet algorithm, it is necessary to consider this 
effect. Single minimum-bias events have a three times lower probability to be discarded by the central 
jet veto cut when using jets reconstructed with a cone size of AR = 0.4 instead of jets with a cone size 
of AR = 0.7. Furthermore, in the presence of pile-up the jet calibration has to correct for the additional 
energy that is not from the hard interaction. Due to the well defined jet area, this is much easier done 
with a cone jet algorithm than with a k, jet algorithm. 

For the reasons mentioned above, the analyses in this note have used jets reconstructed from topo- 
logical clusters with a cone algorithm of AR = 0.4 [17]. Jet efficiencies increase with Ej and are rather 
stable with |tj|. However it should be noted that the efficiency drops at \r\ \ > 4.8 near the boundary of 
the forward calorimeter, as expected. Therefore this note uses only jets with \ r\ \ < 4.8. Jets within a cone 
of AR < 0.4 of any electron candidate are ignored. 

There are other ways to minimise the impact of pile-up; the tracking system of the ATLAS detector 
is designed to separate the vertices of the different inelastic collisions in an event. Jets built of tracks 
emerging from the same vertex as the leptons from the Higgs boson decay should be insensitive to pile-up 
if the vertex reconstruction is working as expected. 
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Figure 1: Jet pj distributions (|t]| < 2.5) for the H —> WW signal sample with (red line) and without 
(gray filled area) pile-up. Left: Standard jets. Right: Track jets. 



While the H — > WW searches presented in this note use standard calorimeter jets for the jet veto, it 
is important to illustrate the potential of track jets. The H + 2j, H —> WW — > evjiv analysis has been 
studied as an example. 

Track jets are reconstructed from only those tracks that have pj > 0.5 GeV and with at least 7 hits 
in the pixel and semiconductor tracker. The Higgs boson production vertex is taken to be the vertex for 
which the sum of the transverse energies of the emerging tracks is largest. All tracks that emerge from 
this vertex and are not associated with an isolated lepton are fed into the standard ATLAS cone AR = 0.4 
jet algorithm. Leptons are considered isolated if the sum of the transverse energies of all tracks from the 
same vertex as the lepton in a cone of 0.01 < AR < 0.2 around the lepton is less than 5 GeV. 

The independence of the track jets to activity from pile-up is reflected in the jet pj distributions in 
the H — ► WW signal sample without and with pile-up (pile-up was generated with an average of 2.3 
additional events) in Figure 1. The additional pp collisions in presence of pile-up created additional 
standard jets at low transverse energies while the multiplicities of track jets are unaffected by pile-up. 

A central jet veto cut is constructed using track jets in the acceptance region of the inner detector 
of 1 77 1 < 2.5 and standard jets at larger pseudorapidities (up to \f]\ < 3.2). The cut on the transverse 
momentum of the track jets is chosen such that the same rejection efficiency is obtained for H — > WW 
events without pile-up as for the jet veto based on standard jets. This requires a lowering of the pj cut 
from 20 GeV for standard jets to 12.3 GeV for track jets. Table 2 shows the fraction of events passing 





H^WW 




tt 




no pile-up 


with pile-up 


no pile-up 


with pile-up 


stdjets (|tj| <2.5) 


72.0 ±1.0 


63.0 ±1.2 


28.6 ±3.4 


19.7±3.3 


track jets 


72.0 ±1.0 


73.5 ± 1.1 


28.6 ±3.4 


25.9 ±3.6 


stdjets < 3.2) 


65 .4 ±1.0 


57.0 ±1.2 


24.0 ±3.2 


16.3 ±3.0 


combination 


65.8 ±1.0 


65.9±1.1 


24.0 ±3.2 


23.1 ±3.5 



Table 2: Fraction of events (%) passing the central jet veto 



the different options for the central jet veto. The central jet veto based on standard jets is sensitive to 
pile-up while the track jets show only small sensitivity to pile-up. The combination of the track jet veto 
in 1 77 1 < 2.5 and the standard jet veto in \r\ | > 2.5 shows robustness against pile-up. 
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Figure 2: Left: The log Likelihood Ratio distributions for toy Monte Carlo corresponding to the 
H — > WW — »• ivqq analysis at 10 fb~ ! for background-only outcomes (red) and signal-plus background 
outcomes with several values of the Higgs boson mass. Right: The same plot, but for the H + 2 j, 
H — ► WW — > ev\iv analysis. 



The performance of the missing transverse momentum reconstruction is strongly affected by the 
presence of pile-up. In this study, the cell based reconstruction algorithm described in Ref. [18] has been 
used. 

Fortunately, the H —>■ WW —>■ iviv searches are not very sensitive to the resolution of E™' ss . Since 
the Higgs boson mass cannot be directly reconstructed in the 2i2v final state, a transverse mass is used; 
although this does have a peak centred around the true value of the Higgs boson mass, the distribution is 
very broad (several tens of GeV). Therefore a degraded Ej lss resolution is unlikely to significantly alter 
the transverse mass shape unless the degradation is rather severe. The H + 2j (H — > WW — > ivlv) anal- 
ysis of Section 6 makes use only of this transverse mass and observables related to the leptons and the jet 
activity in the event; the Ef lss cut in that analysis serves mostly to reject backgrounds like Z —> eej ' 
which have no intrinsic Ef' ss . TheH + Oj analysis of Section 5 does make use of the transverse momen- 
tum of the Higgs boson candidate, which is obviously very sensitive to the Ej lss resolution; however, that 
analysis involves a fitting algorithm developed with exactly this concern in mind and includes checks that 
demonstrate that the algorithm still works reliably in the presence of a degraded Ef" s measurement. 

In the case of the H —> WW —> ivqq analysis of Section 7, the E™ lss is used in the reconstruction of the 
Higgs boson candidate's invariant mass. In principle, then, a degraded Ej lss resolution does propagate 
into the Higgs boson mass resolution, and so problems with the E™' ss reconstruction are more harmful to 
the H —¥ WW — > Ivqq channel than to the dilepton channels. However, for a Standard Model-like Higgs 
boson with a mass above w 300 GeV, the natural width of the Higgs boson tends to become rather large, 
and the E™ lss resolution is again not the primary concern of the analysis. 

Mismeasurements of jets and detector inefficiencies can lead to instrumental Ej lss . It is very hard to 
predict such effects without real data and no cuts to reduce instrumental Ef lss are currently foreseen. 

4 Statistical Formalism 

The sensitivity estimates presented in this note use a fit-based hypothesis testing procedure. Unless 
stated otherwise, selection cuts are always independent of the Higgs boson mass hypothesis, no matter 
how broad the range of masses where the search is sensitive. To take advantage of the discriminating 
power contained in variables for which the distribution depends significantly on the Higgs boson mass 
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Selection 


Selection cuts 


gg^H 


tt 


WW 


Z^TT 


W + jets 




Lepton Selection+M// 


166.4 


6501 


718.12 


4171 


209.1 


pre- 


pf ss > 30 GeV 


147.7 


5617 


505.25 


526.3 


181.6 


selection 


Z — > tt Rej. 


145.8 


5215 


485.12 


164.2 


150.4 




Jet Veto 


61.80 


14.84 


238.35 


31.91 


76.12 




b-veto 


61.56 


6.85 


237.87 


30.76 


76.12 




A<j),i < 1.575, 












signal region 


M T < 600 GeV 


50.6±2.5 


2.3±1.6 


85.4±2.7 


<1.7 


38±38 




A(j),i > 1.575, 












control region 


M T < 600 GeV 


10.9±1.1 


4.6±2.3 


151.9±3.6 


30.8±4.2 


38±38 


b-tagged 














signal region 


A(j) U < 1.575 




1.14±1.14 








b-tagged 
control region 


Aft/ > 1.575 


5.71±2.55 



Table 3: Cut flows (in fb) for M H = 170 GeV in the H + 0 j, H -> WW -> evptv channel. A '-' indicates 
that the corresponding contribution is ignored in the fit. The WW background contains the two processes 
qq -> WW and gg -> WW. 



hypothesis or is not precisely predicted by theory, a maximum-Likelihood fit to those variables is used. 
To perform the hypothesis test itself, a Likelihood Ratio is used, A = L s+ b/Lb g -oniy The numerator and 
denominator of this ratio are obtained from separate fits to the data, with the number of signal floating as 
a free parameter (subject to the constraint N s > 0) in the fit for L s+ b, and with the number of signal fixed 
to zero in the fit for Lb g - 0 niy The sampling distributions of A at a given luminosity (in the presence of 
signal and in its absence) are studied by generating pseudo-experimental outcomes and performing the 
full fit to each outcome. 

Throughout this note, when a fit is performed, the Higgs boson mass m# is allowed to float as a free 
parameter in the fit. The resulting significance estimates therefore represent the probability that, in the 
absence of signal, a background fluctuation anywhere in the allowed mass range would produce an excess 
at least as significant as the observed excess. This is slightly different from the convention adopted in 
some other studies, where the significance represents the probability that a background fluctuation would 
produce an excess that is consistent with a specified mass and at least as significant as the observed 
excess. 

Figure 2 shows two examples of the Likelihood Ratio probability distributions. The left plot shows 
the sampling distribution of the Likelihood Ratio for the H — > WW —> ivqq fit discussed in Section 7 
(assuming a negligible background from fakes), and the right plot shows the corresponding distributions 
for the analysis of Section 6.2. A fit to the real data would yield one value of A; to obtain a p-value, one 
would integrate the red distribution in the acceptance region, i.e. from the observed value of A to infinity. 
In the absence of data, the expected significance is computed by taking the "observed" value of A to 
be the median of the signal-plus-background distribution for a given mass. The expected significance Z 
reported here is one-sided, i.e., Z = \/2erfc _1 (2/?). When the number of pseudo-experimental outcomes 
in the acceptance region is too small, the background-only Likelihood Ratio distribution is extrapolated 
with an exponential decay. 



1312 



Higgs - Higgs Boson Searches in Gluon Fusion and Vector Boson Fusion using . 




Figure 3: Left: transverse opening angle A<f>u of the two leptons after preselection cuts. Middle : trans- 
verse momentum pj W of the WW system after preselection cuts. Right: transverse mass Mj for events 
with Atpn < 1.575 and p^ w > 20 GeV, in a fitted toy Monte Carlo outcome containing a Standard Model 
Higgs boson with M# = 170 GeV, after 10 fb _1 of integrated luminosity. The Likelihood Ratio in this 
outcome is 30.69, which is typical for this value of Mh and this luminosity. 



5 Leptonic W Pair Production with No Hard Jets 

The H + 0j, H — ► WW —>■ £v£v channel has been shown to have a strong discovery potential. [19-22] 
The basic event selection consists of only a few simple cuts: 

• Require that the event has exactly two isolated, opposite-sign leptons (electron or muon) with 
p T > 15 GeV. 

• To suppress backgrounds from single-top production, backgrounds from dileptonic decays of bb 
and cc resonances, and lepton pairs from b —> c cascade decays, require that the invariant mass win 
of the leptons is between 12 GeV and 300 GeV. 

• Require that the event has E™ 1SS > 30 GeV 

• To suppress backgrounds from Z — ► tt, reconstruct the invariant mass of a hypothetical % pair 
using the collinear approximation [23]. If the energy fractions x\ and x\ are both positive and the 
invariant mass of the z pair M%% is in the range \M ZX — Mz\ < 25 GeV, the event is rejected. 

• To suppress backgrounds from top quark decays, reject events that contain any hard jets with 
p T >20 GeV and \rjj\ < 4.8. 

• To further suppress the top background, reject events with any jets with pj > 15 GeV and a b- 
tagging weight greater than 4. 

Table 3 shows the cross-sections in the e\l channel for signal and background after these cuts. 

The trigger efficiency for signal is sufficiently large for this analysis to be viable. Events are required 
to pass at least one of the ATLAS single-lepton or double-lepton triggers. The level- 1 trigger menus 
used here are 2EM15I, EM25I, EM60I, MU20, and MU40. For the Level 2 trigger and the event filter, 
events are required to pass the e25i, 2el5i, e60, or mu20i triggers. Details on the trigger menus can be 
found in Ref. [24]. For the e\l channel considered here the trigger efficiency for LI is 99.0%; for L2 
it is 96.7%, and for the EF it is 95.2% The efficiency is quite high, and the trigger efficiency does not 
distort the shapes of the kinematic variables of interest in the signal in a significant way. Table 3 does 
not include the trigger efficiency, but it is taken into account as an overall scale factor for the signal and 
background when generating toy Monte Carlo to test the fitting algorithm described in the next section 
and in the calculation of the expected statistical significance for this channel. 
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After the basic event selection (all cuts from Table 3 including the cut on A<f>u), the signal to back- 
ground ratio is only «1:4, but there is still a good deal of discriminating power in the final state. This 
analysis focuses on three variables: 

• the transverse opening angle A<j>n; a cut on this variable exploits differences in the spin correlations 
in the WW system in the Higgs signal and the WW background, 

• the transverse mass Mj, as defined in Ref. [25], 

• the transverse momentum of the WW system, pj W , which tends to be slightly larger for signal than 
for WW background because gluon-initiated processes tend to have more initial-state radiation 
than quark-initiated processes. 

The distributions of these variables are shown in Figure 3. To make use of the discriminating power of 
these variables, a maximum-likelihood fit is performed. 

5.1 Fitting Algorithm 

The fit is a 2-dimensional fit of transverse mass and p^ w in two bins of the dilepton opening angle A<f>u 
in the transverse plane. [26] The present study considers a fit to only those events with one electron 
and one muon, since an in-situ background extraction procedure for the Z — > eej\i\i background has 
not yet been studied. After the preselection cuts and the additional requirement that Mj < 600 GeV is 
applied, the remaining events are separated into two subsamples, one with A(j>u < 1.575 and the other 
with A<f>u > 1.575. Table 3 shows the cross-sections for the signal and various backgrounds in these two 
regions. The region with large A0» (control region) is enriched in background and the region with small 
A(j)u (signal region) is enriched in signal. 

The top background is estimated with the help of b-tagged control samples with the same kinematic 
cuts as the signal-enriched and background-enriched regions. Table 3 includes estimates of the top cross- 
section in the b-tagged control regions. In the present study, the b-tagging efficiency is assumed to be 
well-known (i.e., the ratio of cross-sections in the b-tagged and b- vetoed regions is taken from Monte 
Carlo), and the contamination from processes with only light jets in the b-tagged regions is ignored. The 
it cross-section estimates in Table 3 are based on Monte Carlo samples that use MC@NLO to model 
the top background; even if the b-tagging is well-understood, the ratio of the top cross-section in the b- 
tagged and b- vetoed regions is sensitive to the treatment of the single-top contribution to the background. 
Moreover, studies based on fast simulation samples have shown that there are also differences in the 
shape of the distributions of Mj and p^ w between top background models based on doubly -resonant top 
production and models that include the singly-resonant and nonresonant processes. (For example, the 
single-top contribution leads to a larger number of events with large Mj and small pj W .) Nevertheless, 
in the present study, standalone fits are performed on the b-tagged control samples before the fit to the 
b-vetoed regions begins, and the top background in the b-vetoed regions is estimated by extrapolating 
both the shape and the normalization from the b-tagged region to the b-vetoed region based on ratios 
obtained from MC@NLO. 

The Z — > xx background is normalized and its shape is determined by studying a sample of Z — > 
events taken from real data, where the reconstructed muons are replaced by simulated taus [23]. Two- 
muon events with a dimuon invariant mass between 82 and 98 GeV are selected, and the same jet veto 
that is used in the rest of this analysis is applied to the selected events. The effective cross-section after 
these cuts is roughly 360 pb. The reconstructed muons in the event are replaced with simulated tau 
leptons, and the remaining event selection cuts are applied. The efficiency of those cuts is about 0.07%, 
leaving an effective cross-section of roughly 250 fb. To be conservative, the present study assumes that 
efficency factors will lower this figure to about 200 fb. A standalone fit to these "data-Monte-Carlo" 
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events is performed to determine the shape and normalization of the Z — >• tt background; in the final 
fit, the shape parameters are fixed and the normalization is rescaled by a factor that is assumed in this 
study to be well-predicted. The parameters obtained from the fit to the Z — > TT data-Monte-Carlo control 
sample are not allowed to float in the main fit. 

At the time of this writing, an in-situ determination of fake backgrounds from processes such as 
W+jets has not yet been studied. For this study, a fixed probability distribution has been used to represent 
the fake backgrounds, assuming for the moment that the shape and normalization of fake backgrounds 
are well-predicted. Because of the limited size of the available W+jets Monte Carlo sample, there are not 
enough Monte Carlo events remaining after cuts to provide a prediction of the shape of the transverse 
mass vs. pj W distribution for VK+jets; the shape of this distribution is therefore taken from a set of 
events with loosened isolation and shower shape cuts. This treatment ignores any potential systematic 
uncertainty on the W+jets background; future studies will attempt to address this in more detail. 

Once the fits to the control samples are completed, a simultaneous fit of the two A(j)u bins in the 
b-vetoed region is performed. A few of the parameters that describe the shape of the transverse mass 
and pj W distributions of the WW background are allowed to float in the fit. The p^ w distributions for 
the WW background in the two regions are taken to be the same up to a parabolic distortion factor. The 
normalizations of the WW background are free to float independently; however, we add a penalty term 
of the form (R fn — R trU e) 2 / c| , where R is the ratio of the best-fit number of WW background events in 
the small-A0// region over the number in the large-A0// region, R true is the Monte Carlo prediction of the 
ratio taken from the central-value calculation, and Or is the uncertainty in the prediction of R trU e, taken 
to be 10%. This value of Or is chosen to be larger than the actual variation of R due to changes in the Q 2 
scale definition (about 5%) so that the constraint term does not cause a large bias in the observed value 
of R; such a bias could be expected to lead to a degradation in the sensitivity of the hypothesis test. The 
value of Or has not been optimized; such a study may be performed in the future. 

In order to demonstrate the robustness of the fit against systematic uncertainties, toy Monte Carlo has 
been used to compute the sampling distributions for the Likelihood Ratio in several scenarios where the 
"true" probability distribution has been distorted to model various sources of systematic error. Seven dis- 
torted scenarios are considered: four altered Q 2 scale choices (factorization and renormalization scales 
raised and lowered by factors of 8), two alternative top background models (based on leading-order 
pp — > WWbb and leading-order pp — > tt — > WWbb), and one alternative model of all irreducible back- 
grounds where the x and y components of E™ lss have been independently smeared by 5 GeV each. These 
alternative models have been derived using fast simulation because large-statistics Monte Carlo samples 
are needed in order to be sure any change in the shape of the observables is actually due to the sys- 
tematic uncertainty under study and not merely a statistical fluctuation. In these systematic scenarios, 
the background contribution from fakes is ignored. Figure 4 shows the Likelihood Ratio distribution for 
background-only outcomes (upper left) and the distribution of pulls of the fitted Higgs boson mass (upper 
right) for signal-plus-background outcomes with a true Higgs boson mass of 170 GeV. Both distributions 
are nearly independent of the systematic distortions. 

The lower left plot in Figure 4 shows the linearity of the mass determination as a function of the true 
Higgs mass. The line shows the mean of a Gaussian fit to the region around the peak of the distribution 
of best-fit Higgs masses in the toy Monte Carlo sample for the case of nominal detector performance. 
The green band shows the width of the Gaussian and is a direct measure of the variability of the mass 
estimate on repetition of the experiment; the error bars show the median fit error. The typical variability 
of the mass determination at 10 fb" 1 ranges from 5.2 GeV at M H =130 GeV to 1.6 GeV at M H =160 GeV 
to 4.2 GeV at M H =190 GeV. 

Figure 4 also shows the expected significance for an integrated luminosity of 10 fb -1 . This channel 
is most promising for Higgs boson masses near the WW threshold. 
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Figure 4: Upper Left: The log Likelihood Ratio distributions for background-only toy Monte Carlo 
outcomes corresponding to 10 fb -1 in the H + 0j, H — > WW — > Ivlv analysis. Upper Right: The cor- 
responding pull distributions for M#=170 GeV. Lower Left: The linearity of the mass determination. 
Lower Right: the expected significance for an integrated luminosity of 10 fb -1 . 



Region 


Signal, Me = 170 GeV (fb) 


ft 


WW 


Z — > XX 


W+jets 


Signal-like 


28.65±0.80 


1.14±1.14 


29.35±1.59 


<1.74 


38±38 


Control 


1.47±0.27 


5.71±2.55 


61.13±2.33 


4.06±1.53 


<114 


b-tagged 


0 


6.85±2.80 


0.11 ±0.09 


1.16±0.82 


<114 



Table 4: Cross-sections (in fb) after all cuts for a number-counting analysis with a test mass of M# = 
170 GeV in the H + 0 j, H -> WW -> ev\i v channel. 



5.2 Cross-check with a Number- Counting Analysis 

A number counting analysis of this channel has been performed as a cross-check. The signal region is 
defined by the basic event selection in addition to the following few additional cuts: pj W > 10 GeV, 
Mn < 64 GeV, A<f>u < 1.5, and 50 < Mj < 180 GeV. A control region is defined by the requirements 
pWW > iQ GeV, 80 < Mu < 300 GeV, and A<f>u > 1.5. In order to normalize the top background, it 
is also useful to define a b-tagged region with similar kinematic cuts: pj W > 10 GeV and A0// > 1.5. 
Table 4 shows the cross-sections for signal and backgrounds in these regions. Only one of the IV+jets 
events passes all the selection cuts for the signal-like region; it corresponds to a cross-section of 38 
fb. None of the W+jets events survives in either control region. Because this background is modeled 
with Alpgen W + n jets, with n < 5, and because the Monte Carlo was generated with different effective 
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luminosities for different jet multiplicities, it is not straightforward to compute a meaningful upper bound 
on the W+jets cross-section in the control regions. The quoted cross-section is simply three times the 
cross-section observed in the signal region (which is based on one Monte Carlo event). 

The ratios of background cross-sections in the various regions have to come from a theoretical pre- 
diction; in a number-counting approach there is no simple way to constrain them with data. A similar 
background normalization strategy was studied in Refs. [27] and [28]; in Ref. [27], the theoretical error 
on the ratio of WW cross-sections in the signal-like region and the control region was 5%, the system- 
atic error on the ratio of the top background cross-sections in the signal-like and b-tagged regions was 
9%, and the uncertainty on the ratio of the top background cross-sections in the control region and the 
b-tagged regions was also 9%. These systematic errors have not been re-evaluated in the context of this 
analysis, nor have the relevant instrumental systematic errors been computed; this cross-check simply 
uses these values as the systematic uncertainties on the extrapolations. 

Using the cross-sections for M# = 170 GeV in Table 4 and taking into account the additional 95.2% 
trigger efficiency on signal and background, the luminosity-dependent errors on the normalizations of the 
control samples (ignoring W+jets in the Control and b-tagged regions), and the systematic uncertainties 
described above, the expected significance of the number-counting analysis (in the Gaussian approxima- 
tion, for fLdt = 10 fb _1 ) is 8.5a. This figure ignores the error on the W+jets background prediction; in 
a future study, it will be important to normalize the W+jets background using data. In the interest of per- 
forming a meaningful comparison with the fit analysis, it is helpful to compute the significance ignoring 
the W+jets background altogether: 10.4a. This is comparable to the result from the fit; if the contribution 
from the W+jets background is ignored there, the significance is 8.8a at the same luminosity. 

6 Leptonic W Pair Production with Two Hard Jets 

This Section presents the analysis of events where a W pair is produced in association with two hard jets 
and both W bosons decay leptonically. [29-32] In Section 6.1, the main discriminators in this channel 
are reviewed. For background extraction and significance estimation, two complementary approaches 
are considered; both of them are fit-based. In Section 6.2, an analysis that uses a Neural Network to 
combine several jet variables into a single discriminator and fits the Neural Network output and Mj is 
presented. This method has the advantage that the correlations among variables related to jets (described 
in Section 6.1) are naturally taken into account by the Neural Network, and the weak correlations between 
the jet variables and the lepton variables are largely taken into account by the way the control sample 
is used in the fit. However, it has the disadvantage that the Neural Network must be trained ahead of 
time; in principle, this may introduce some model-dependence to the procedure. Ideally, one would like 
to directly model all kinematic variables in a maximum Likelihood fit; Section 6.3 describes an analysis 
that takes this approach with a fit of five kinematic variables. This method has the advantage that there 
is no training stage and therefore has the potential to be more model independent than the analysis of 
Section 6.2. However, because of the difficulty in constructing a fully correlated probability distribution 
for a 5-dimensional space, the probability distributions used in the fit of Section 6.3 include only the 
largest correlations among the kinematic variables and ignore some smaller correlations. It is possible 
to make some corrections to the shapes of the distributions to model correlations not inherently present 
in the fit model; Section 6.3 explores the prospects for extracting such corrections from b-tagged control 
samples. The combined significance estimate at the end of this note will be based on the analysis of 
Section 6.2, but further studies of both approaches are needed in order to converge on a final strategy for 
the analysis of real data. 
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Figure 5: Pseudorapidity gap between tag jets (left top plot), invariant-mass distributions of tag jets 
(right top plot), azimuthal angle gap between tag jets (left bottom plot) and E, of the third jet in VBF 
H —> WW —> jltv /IV Pythia events (m(H)=170 GeV). A requirement TJ1TJ2 <0 is used in addition to the 
requirement jet E, > 20 GeV. 



6.1 Handles for Suppressing the Top Background 

In both of the fit-based approaches under study, it is the dominant background after the preselection 
cuts, and in both approaches, similar discriminators are used to suppress it. Two discriminators warrant 
special attention: jet kinematics, which is discussed in Section 6.1.1, and b-tagging, which is discussed 
in Section 6.1.2. 

6.1.1 Jet Kinematics 

One of the main advantages of any search for a Higgs in association with two jets is the possibility to 
suppress QCD backgrounds with cuts on jet kinematics. The Vector Boson Fusion Higgs signal features 
several distinctive characteristics: 

• The two jets arising from struck quarks (often referred to as "tag" jets) tend to be the highest-^r 
jets in the event, and they tend to be well-separated in pseudorapidity; 

• they tend to have a large invariant mass; 

• there is very little jet activity in the region between the two tag jets. 
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Figure 6: The distribution of leading versus sub-leading jet weights in the events for signal (left plot) 
and the tt background (right plot). The plots are divided in three regions: (I) where there is a non-default 
^-tagging weight for more than one jet, (II) where there is only ^-tagging information for one jet in the 
event and (III) where there are no jets with ^-tagging information in the event. 



Figure 5 shows the distributions of the pseudorapidity gap between the tag jets, their invariant mass, and 
the pr of the third-highest pj jet in the event. The figure also shows the distribution of the azimuthal 
angle between the tagjets, A<j)jj. It is possible to enhance the discrimination against backgrounds with 
a cut on A(j>jj, but in this study no such cut is applied on the grounds that this angle is needed in a 
measurement of the spin and CP properties of the Higgs boson [33]. 

Only one Higgs boson mass is shown for the signal process in Figure 5, but it is worth noting that 
the dependence of these variables on the Higgs boson mass is rather weak. It is also worth noting that 
Pythia 6.4 predicts a somewhat harder pj spectrum for the third jet than Herwig and Sherpa do. This 
would amount to a difference as large as a few tens of percent in the survival probability if a jet veto cut 
were applied. 

It has been shown in previous studies that hard cuts on the kinematic variables discussed here can 
provide a strong rejection against the top background [25,34,35]. As part of the present study, it has been 
checked that the high signal-to-background ratio that can be achieved in this channel is not sensitive to 
degradations arising from detector alignment effects. 

6.1.2 Rejecting Top with B-tagging 

Another effective way to reject a significant fraction of the tt events while retaining a high efficiency for 
the signal is to apply a veto on events containing jets with high Z>-tagging weights. The tt events naturally 
contain two jets originating from ^-quarks whereas the heavy flavor content in the signal events is limited 
at tree level to having c quarks in the tagging jets, and then typically in only one of the two tagging jets. 
This makes vetoing against events with more than one jet with high ^-tagging weight especially efficient. 
Figure 6 shows the distribution of leading versus sub-leading jet weights in the events for signal (left 
plot) and the tt background (right plot). The plots are divided in three regions: I where there is a non- 
default ^-tagging weight for more than one jet, II where there is only ^-tagging information for one jet in 
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Cut 


Signal (170 GeV) 


tt 


WW+jets 


Z^TT 


W+jets 


Lepton Selection 


30.20 


8317 


838.96 


(2096) 


1323 


Forward Jet Tagging 


17.27 


946.6 


32.77 


79.30 


31.83 


Leptons Between Jets 


16.47 


617.8 


22.92 


55.13 


27.91 


Z — ► TT Rejection 


15.68 


561.8 


21.20 


39.03 


27.91 


pf ss ,M T ,mf 


12.78 


425.9 


15.28 


0 


13.96 


b-veto 


12.67 


206.72 








signal box, b-jet Veto 


9.28±0.27 


28.5±5.7 


4.75±0.30 




4.3±4.3 


signal box, no b-jet Veto 


9.65 


114.2 


4.99 




6.07 


Control, b-jet Veto 


3.02±0.15 


89±10 


9.78±0.43 




7.9±5.0 


Control, no b-jet Veto 


3.13 


311.7 


10.28 




7.89 



Table 5: Cut flows (in fb) for M H = 170 GeV in the H + 2j, H -> WW -> eVjXV channel. Numbers in 
parentheses are affected by generator-level filter cuts. 



the event and III where there are no jets with Z>-tagging information in the event 1 . The fraction of events 
in each of the three regions is: 

• 38% of the events in region I, 48% of the events in region II and 14% of the events in region III 
for the Higgs signal. 

• 85% of the events in region I, 14% of the events in region II and 0.5% of the events in region III 
for the tt background. 

The selection in the leading versus sub-leading jet weights plane was optimized for the highest in- 
crease in the number-counting significance after the following cuts have been applied: 

• Two leptons, pj > 15 GeV 

• Missing transverse energy Ef lss > 30 GeV 

• At least two jets with p T > 20 GeV and |tj | < 4.8 

• The two jets with highest transverse momentum are required to be in opposite hemispheres, with 
Ari(jetl,jet2)>3 

• Require that both leptons are between the two leading jets in pseudorapidity. 
Two cuts are applied on the b-tagging weights: 

• weight(jetl)+0.6x weight(jet2)<3 for events in region I 

• weight(jetl)<8 for events in region II 

These cuts provide a strong rejection against the top background and a large acceptance for the Higgs 
signal. 

6.2 Two-dimensional Fit 

To select the events that are used in the two-dimensional fit, the same preselection cuts that were de- 
scribed in the previous section are applied, plus two additional cuts: 

'jets can lack i-tagging information if they fall outside the acceptance of the inner tracker, or because there are no tracks 
with high impact parameter significance in the jet. 
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Figure 7: An example fit to a toy Monte Carlo outcome corresponding to 10 fb _1 of integrated luminosity 
in the H + 2 j, H — > WW — > ev/i V channel. The pseudodata contains a Standard Model Higgs boson with 
a true mass of 170 GeV. Left: The Neural Network output distribution in the signal box, for events with 
50 < Mj < 180 GeV. Right: The transverse mass distribution for events in the signal box with Neural 
Network output larger than 0.8. The Likelihood Ratio for this outcome was 22.62, which is typical if the 
signal model used as truth is derived from a Monte Carlo sample generated with SHERPA. 



• To reject backgrounds from Z — > %%, assume that the leptons are coming from % decays and com- 
pute x\ and x\, the fractions of the tau energies earned by the visible leptons in the collinear 
approximation. If x\ and x\ are positive and M TT is close to the Z mass (|M TT — Mz\ < 25 GeV), 
the event is rejected. 

• Require that the transverse mass Mj is between 50 GeV and 600 GeV, and that the transverse mass 

(as defined in Ref. [25]) is at least 30 GeV. 

In this analysis, electrons are selected using the standared ATLAS medium electron selection. The defi- 
nition of muons, and the isolation cuts applied to both, are the same as in the H + Oj, H — > WW — > e\\i V 
analysis in Section 5. 

Events surviving the cuts above are used in the fit; they are partitioned into a signal box and a 
control region defined by additional cuts on the pseudorapidity gap between the leptons and the dilepton 
opening angle in the transverse plane. An event lies in the signal box if it has A(j>u < 1.5 and Arju < 1.4; 
otherwise, it lies in the control region. Table 5 shows the cross-sections for signal and background in 
these two regions. 

The trigger efficiency has been studied in the context of this analysis. The same trigger menus as in 
theH + Oj, H —>■ WW — >• evjuv analysis are used, and the results are similar: a trigger efficiency of 99.0% 
after Level 1, 96.8% after Level 2, and 94.5% after the Event Filter. These efficiencies are quite high, and 
as for the H + 0 j analysis, it has been checked that the trigger efficiency does not significantly change 
the shape of the most important kinematic variables. Table 5 does not include the trigger efficiency, but 
it is taken into account as an overall scale factor for signal and background when generating toy Monte 
Carlo to test the 2-dimensional fit described in the next section. 

After the preselection described above, a four- variable Neural Network is used to further enhance the 
separation between the signal and the background. The inputs to the Neural Network are: 

• Ar]jj, the pseudorapidity gap between the tagjets 

• Mjj, the invariant mass of the tagjets 

• p v -f°, the transverse momentum of the leading non-tag jet in the region \r\\ < 3.2, and 
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• T]* = T]3 — (t]i + T72) /2, the pseudorapidity gap between the tag-tag system and the third jet. It is 
set to -9 if no third jet is present. 

The Neural Network classifies events as signal-like if they have large Arijj, large Mjj, low p™ to , and 
large |tj*|. Figure 7 shows the Neural Network output and transverse mass distributions in the signal 
box. The fit is a two-dimensional fit to these two quantities; both the signal model and the background 
model are uncorrelated product probability density functions (PDFs). The fit does not distinguish among 
the various types of background. 

An in-situ normalization of fake backgrounds has not yet been implemented for this channel. For the 
present study, a static model has been used to approximate the fake background from W+jets. The same 
functional form used for the irreducible backgrounds was fitted to a set of W+jets Monte Carlo events that 
pass a loosened lepton selection, and its normalization was scaled to the cross-section obtained with the 
tighter selection cuts actually used in the analysis. In the present study, none of the parameters governing 
the shape or normalization of the fake background are allowed to float in the fit; systematic errors on the 
shape and normalization of the W+jets background are ignored. 

The Neural Network output distribution for the background in the signal box is taken to be the same 
as the distribution in the control region, but it is multiplied by a linear extrapolation factor. Apart from 
the slope of this extrapolation factor, all parameters governing the shape of the Neural Network output 
distribution in the two regions are required to be the same. 

A check on the impact of altered jet energy scales on the Neural Network output distributions for 
signal and background has been performed; changing the jet energy scale by 5% in the region with 
tracking and 10% elsewhere does not change the Neural Network output shape in a meaningful way. 
This is not surprising, since the Neural Network inputs are slowly varying functions of the jet energy. 
Similarly, raising the jet pj thresholds from 20 GeV to 30 or 40 GeV to mimic a degraded efficiency for 
low-pr jets does not have a large impact on the ratio of the Neural Network output distributions in the 
signal box and the control regions. 

Given the fact that the Neural Network output shape is insensitive to the jet energy scale, one would 
expect that the ratio of distributions in the signal box and control region would be insensitive to the 
jet energy scale as well. Other possible sources of uncertainty in the ratio of Neural Network output 
distributions are underlying event activity and the Q 2 scale uncertainty; these sources of uncertainty have 
not been checked explicitly at this time. However, it is reasonable to think they might be small because 
the cuts on jets in the signal box are the same as in the control region and because the distributions of 
variables related to jets are not strongly correlated to the lepton angular variables. If the uncertainty in 
the ratio of distributions is sufficiently small, then one can fix the parameter governing the extrapolation 
factor; if it is too large, one must allow it to float in the fit. In the latter case, it may be possible to 
constrain it somewhat by looking into a b-tagged control sample; this possibility has been studied in the 
context of the five-dimensional fit discussed in the next section. All of these possibilities require further 
study in the context of the two-dimensional fit; this note will simply consider both scenarios: allowing 
the slope of the extrapolation factor to float in the fit and fixing it to a predetermined value taken from 
Monte Carlo. In both cases, it is assumed that the transverse mass distributions are well-predicted, and 
that the Neural Network output distribution is not; in practice this means that the Mj parameters are set 
to fixed values while most of the parameters describing the Neural Network output shape are allowed to 
float in the fit. 

The expected significance is shown as a function of the true Higgs mass in Figure 8 for the two 
treatments of the extrapolation factor considered in this study. The sensitivity is comparable to what 
was quoted in previous studies of this channel, at least for the case where the extrapolation parameter is 
constrained to a fixed value. 
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Figure 8: Left: The Neural Network output distribution for the H + 2j, H — > WW -^eV}XV channel in the 
signal box. Right: The expected significance at 10 fb _1 . The blue and magenta triangles were computed 
using a fixed extrapolation. 



6.3 Five-dimensional Fit 

A conceptually simpler but more technically challenging strategy is to directly fit the most important 
kinematic variables. This section describes such an approach to the H + 2 j, H — »• WW — »• ivlv channel. 

The selection cuts used in this analysis are similar to those used in section 6.2, with a few slight 
differences: 

• Either two or three jets with transverse momentum pj > 20 GeV and pseudorapidity | r\ \ < 4.9 are 
allowed. 

• The missing transverse momentum E™ lss is greater than 20 GeV. 

• The pseudorapidity gap between the tagjets is required to satisfy |At] ;; | > 2.5, and the invariant 
mass of the tagjets is required to lie in the range ntjj £ [600, 3000] GeV. 

• A Magged jet is defined as having a displaced vertex significance greater than 4.5. 

The values of the Higgs boson mass, m#, and the cross section, o(VBF H—>WW), are determined with 
an unbinned maximum likelihood fit to the distributions of x = (Mr, Atpu, Ar\u, At] jj,nijj), as obtained 
from the selected data sample. A multidimensional kernel estimation technique [36] has been used to 
help model the kinematic distributions in the control samples. 

Events in the selected data sample are divided into categories based on four properties: the flavors of 
the reconstructed lepton pair {MX, jXe, and ee); the number of reconstructed good jets (2jet and 3jet); 
events with and without a Z?-tagged jet (btag and bveto, where a ^-tagged jet is defined as having a 
displaced vertex significance greater than 4.5); and events that fall into the signal region, 'sigbox', 
having |A0/;| < 1.5 and \Ar\u\ < 1.4, and those that fall outside, into the 'sideband' region. The b- 
vetoed sigbox region is denoted as region 1, the b-vetoed sideband as region 2, the b-tagged sigbox 
region as region 3, and the b-tagged sideband region as region 4. 

Most tt background events contain a ^-tagged jet, whereas most WW background events fall into 
the bveto category. The majority of Higgs boson events is expected to be found in physics region 1. 
Events outside of this region are not considered as signal candidates. These regions are defined to be the 
background control samples. 
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For tt, WW, and H — > WW events, the selected samples are mostly dominated by di-leptonic W 
decays. For this reason, the relative fractions of /i/x, ee, and \le events (/^, f ee , 1— f^fi— fee) are taken 
to be identical for background and signal events. 

The signal model is obtained from fully simulated Monte Carlo events, and is factorized into four 
shapes, modelling Mj, A<f>u, Ar\u and (Arjjj,mjj) respectively. The signal shape is assumed to be inde- 
pendent of the lepton pair category and the number of good jets per event. 

The Higgs boson transverse mass distribution, Mj, is described with a double-sided exponential, 
having two different lifetimes, convoluted with a Gaussian. The mean of the Gaussian is interpreted as 
the Higgs boson mass, m#, and is left free in the fit. The left-handed lifetime, Zl, comes from the missing 
^-component in the transverse mass calculation. The parameter Zi is a linear function of m#, and equals 
25 GeV for a Higgs boson mass of 130 GeV, and 45 GeV for a Higgs boson mass of 170 GeV. The 
right-handed component, Zr, is the result of the non-validity of the approximation rn.fi = m V v in the 
definition of Mj, and is fixed to 20 GeV for all Higgs boson masses. The width of the Gaussian, Omet, 
is interpreted as the resolution on the missing transverse energy, and is fixed to 15 GeV. 

The distributions of A<f>u and At?// in signal are described with single Gaussians, with means fixed to 
zero, and widths of = 1.09 and Oatj = 0.62 respectively. 

The tagging jet observables Atj^- and mjj in signal are strongly correlated, and are modeled with a 
2-dimensional (2D) kernel estimation function. 

For these observables, the largest unmodelled correlation is between A<f>u and At]/;, and is found to 
be 14% from the available signal Monte Carlo sample. 

The collection of background events in regions 2-4 serves as a data control sample for the determi- 
nation of: a) the probability density functions, and b) the number of background candidates in physics 
region 1 . The background shape is assumed to be independent of the lepton pair category and number of 
good jets per event. The shape is factorized into two pieces, describing (Mr, A<f>u, At?//) and (Arjjj,mjj) 
respectively. 

The background distribution of (Mt, A0//,At]/z) - observables mostly independent of the jet charac- 
teristics - is modelled with a 3D kernel estimation function, using the events in physics region 3. A small 
efficiency correction is applied to the observable Mt, calculated as the ratio the distributions of Mt in 
physics regions 2 and 4, both determined using ID kernel estimation functions. 

Accounting for the hard pt spectrum of the ^-tagged jet sample, the distribution of (Ar}jj,ntjj) is 
modelled with a 2D kernel estimation function using the background events in physics region 2. A 
correction function is applied to ATjjy, determined as the ratio the Arjjj distributions in physics regions 3 
and 4, again determined using ID kernel estimation functions. 

The unmodelled correlations between these observables are found to be smaller than 10%, as ob- 
tained from the tt and WW background Monte Carlo samples. 

The number of background events in physics region 1 is partially estimated from the number of 
background events in categories 2-4. For this we assume the ratio of events in the sideband to sigbox 
regions (/sigbox-sideband) to be identical for the btag and bveto categories. Given the ratio of events of 
physics regions 2 and 4 (/bveto-btag), determined independently for the 2 j et and 3 j et samples, estimates 
for the number of background events in physics region 1 can be obtained. These estimates are expressed 
as the total number of background events, n^g, and the relative fraction of background event in the 3 jet 
category, / bkg _ 3j . 

Finally, the fitted number of Higgs boson signal events, is determined seperately for the 2 jet 
and 3 j et categories. 

6.3.1 Toy Monte Carlo Studies 

A fit example to approximately 1 fb _1 of fully simulated Monte Carlo events, including Higgs boson 
signal events with a mass of 170 GeV, is demonstrated in the left side of Fig. 9. The plot shows the events 
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VBF H->WW (11) 1 ftf 1 LHC Data 




Figure 9: Left: The Higgs boson transverse mass distribution for events in the defined physics signal 
region, corresponding to approximately 1 fb -1 of data, as obtained from fully simulated Monte Carlo 
events. The solid (blue) curve is the total fit projection. The background contribution is represented 
by the dashed (red) curve. Right: The expected statistical sensitivity to Standard Model VBF H — > 
— > l + vl~v decays for 1 fb _1 of ATLAS data, using the five-dimensional fit of Section 6.3. 



Fit parameter 


Value 


Global corr. 


Parameter 


Value 


Global corr. 


fbveto—btag;2)et 


0.89 ±0.14 


72% 


n B 


90.5 ±7.4 


93% 


fbveto—btag;3]et 


3.02 ±0.24 


88% 


«#:2jet 


18.6±5.5 


25%> 


fbkg-3jet 


0.72 ±0.03 


80% 


»tf;3jet 


9.6±7.9 


38% 


fsigbox— sideband 


2.64 ±0.19 


86% 


OMET 


15000 




fee 


0.11 ±0.01 


34% 


a A0;, 


0.62 ±0.03 (fixed) 




fflU 


0.47 ±0.01 


34% 


a Arj„ 


1.09 ±0.06 (fixed) 




m H 


168 ±8 


12% 




20000 (fixed) 





Table 6: The free parameters for the five-dimensional fit described in Section 6.3, and their best-fit values 
for the example toy Monte Carlo outcome in the left side of Figure 9. 



in the defined physics region 1, with the total and background fit projections overlaid. Here the signal 
shape has been obtained from fully simulated Monte Carlo events. The background probability density 
function has been obtained using kernel estimation from the (Monte Carlo) events in the background 
control samples. 

The corresponding fit results are given in Table 6. The Higgs boson mass found is consistent with the 
generated value. The total numbers of Higgs boson and background events found are consistent with the 
input values of 27 and 86 respectively. The fit is most sensitive to Higgs boson events in the 2 jet signal 
region - where most Higgs boson events are expected, and contributions from tt background events are 
minimal. For Higgs boson masses in the range [160, 170] GeV, the signal to background ratio is this 
category is approximately 1:1. The error on the fitted number of background events in the signal region 
is found to be 7.4. When not including the estimated number of background event from the background 
control samples in the fit, the error on the number fitted number of background events doubles. The error 
on the fitted number of Higgs boson events rises by 25%, and the correlation of the fitted number of 
Higgs boson events with all other fit parameters - mostly correlated to the number of background events 
- rises from 25% to 45%. The con-elation of the Higgs boson mass with all other fit parameters is 12%. 

The statistical properties of this fitting algorithm have been studied with about 15k toy Monte Carlo 
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outcomes like the one discussed in the previous paragraph. The toy outcomes have been generated with 
the same number of events seen in, and the shapes obtained from, the fully simulated Monte Carlo signal 
and background samples remnant after the event selection, corresponding to 1 fb~ 1 of data. The mass of 
the generated Higgs boson sample is varied between 110 and 200 GeV, in steps of 10 GeV. The Higgs 
boson cross-section is adjusted accordingly [1]. The fit algorithm described above has been performed 
on each generated pseudo-experiment. 

The fitted number of Higgs boson events in the 2 j et signal region - most sensitive to the Higgs 
signal - shows an average bias of less than 0.8 event over the entire generated mass range. Roughly 12 
Higgs boson events are needed in this region before a clear signal peak can be picked up by the fitter. 
The eiTor on the fitted number of 2 jet Higgs boson events varies between 3.1 and 5.6 events for no 
generated Higgs boson events to the maximum cross-section between 160 and 170 GeV respectively. 

The average fit error on the Higgs boson mass parameter is about 14.0 GeV in background-only 
outcomes, or about 9.4 GeV in outcomes containing both background and a 160 GeV Higgs signal with 
cross-section as given by the Standard Model. In Fig. 9, note that the background distribution, dominated 
by it events, peaks at around 160 GeV. Pseudo-experiments with no generated signal events tend to peak 
at this mass, where statistical fluctuations are largest. For 1 fb _1 samples, starting from the generated 
Higgs boson mass of 110 GeV with a value of +la, the fit show a decreasing bias in the mass pull 
distribution, which is consistent with zero at 160 GeV, and becomes —la at 200 GeV. The width 
of the pull is consistent with one in the generated mass range of [150, 190] GeV, and grows for lower 
cross-section, upto 1.7a at 120 GeV. 

The average statistical sensitivity derived from the toy Monte Carlo study is shown in the right side 
of Figure 9. At 1 fb _1 of data, the sensitivity to VBF H — » — ► l + vl~v decays is greater than 
one for Higgs boson masses greater than 120 GeV. (Below 120 GeV the H —>■ W + W~ branching ratio 
becomes too small for this measurement to be effective.) It reaches a maximum of 2.5a at the Higgs 
boson mass of 160 GeV. 

7 W Pair Production with Two Hard Jets in the Lepton-Hadron Channel 

This section describes the analysis of events where the W pair is produced in association with two hard 
jets and one W boson decays leptonically (with the other W decaying to jets). [25, 29, 37-42] The dom- 
inant backgrounds to this final state are W+jets and it production; it is possible that QCD multijets will 
also play an important role. Because of the large jet multiplicity, there are large theoretical uncertainties 
in the predicted normalization of the backgrounds, especially for W+jets and QCD multijets. Therefore, 
this section will emphasize studies of the signal, the W+jets background, and the it background, with 
discussion of how to normalize the backgrounds given data. Quantitative predictions of the discovery 
sensitivity will not be included here. 

Because there is only one neutrino in the final state, this channel permits a better estimate of the 
Higgs candidate invariant mass than the dilepton channels. Taken together with the large sensitivity of 
the dilepton channels for Higgs masses near 160 GeV, this means that the H — ► WW — > Ivqq channel is 
most interesting for the study of Higgs bosons with masses in excess of roughly 250-300 GeV. 

The most obvious way to estimate the invariant mass of a H — > WW —> ivqq candidate is to assume 
that both W bosons are on-shell. One can then use the W mass constraint for the W —> iv system to 
estimate the z momentum of the neutrino. 2 

The analysis of this channel uses reconstructed jets to measure the invariant masses of Higgs boson 
candidates; it is therefore necessary to consider out-of-cone corrections to the jet energies that were 
not important in the dilepton channels. Eventually, these corrections should be taken from real data; 

-The W mass constraint yields a quadratic equation which can have 0, 1, or 2 real solutions. If there are two solutions, the 
one with smaller \p^\ is used; if there is no real solution, the imaginary term is ignored and only the real part is used. 
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Cut 


IV+jets 


ttbar 


Signal (M H = 300 GeV) 


Leptonic W Selection 


2353291* 


128654 


174.27 


Hadronic W Selection 


134483 


70872 


73.26 


Forward Jet Tagging 


1076.8 


1929 


23.16 


Lepton Between Jets 


867.0 


1679 


22.93 


Mjj 


131.0 


367.7 


9.16 


Central Jet Veto 


57.98 


58.24 


8.43 


Ar ln,i 


16.07 


47.96 


6.93 


b-jet Veto 


16.07 


14.84 


6.06 


Trigger Selection 


13.06 


12.40 


5.08 


167 < Mi vqq < 1000 GeV 


13.1+4.7 


12.4±3.4 


5.08±0.29 


Control Regions 




b-tagged 


0 


26±5 


0.75±0.12 


Small-Ar^ Control Region 


500.16 


778.76 


20.79 


Small-AT] ;; - Control Region(b-veto) 


441.64 


186.13 


19.95 


Small-Ar^- (b-veto and Trigger) 


369.38 


157.47 


15.06 


Small-AT] ;7 (167 < M, Vqq < 1000 GeV) 


358±29 


154±12 


15.1±0.5 


Small-AT7 ;; Control Region(b-tag) 


37±14 


493±22 


2.3±0.2 



Table 7: Cross-sections (in fb) for the signal and various backgrounds after successive cuts in the H —> 
WW — > Ivqq channel. A '*' indicates a number that is biased by generator-level cuts. 



however, such a study has not yet been performed. For the present study, the out-of-cone correction 
from ATLFAST-B [43] has been applied to all jets. After applying this correction to the reconstructed jet 
momenta, the following cuts are applied: 

• Leptonic W selection. The event must contain exactly one hard lepton, with transverse momentum 
Pt > 25 GeV for electrons and pj > 20 GeV for muons. It is also required that the missing 
transverse momentum, p'j ss , be larger than 30 GeV. 

• Hadronic W selection. Out of all jets with large transverse momentum, pj > 30 GeV, the two 
whose invariant mass is closest to the known value of the W mass are selected. It is required that 
the reconstructed invariant mass of these two jets be between 64 GeV and 90 GeV. 

• Forward Jet Tagging. Select the two jets (excluding those from the W decay) that have the highest 
transverse momentum. These are labelled as the "tagging" jets; by construction, they are disjoint 
from the jets that form the W —> qq candidate. Require that they be high-pr {pj > 50 GeV, 
p ] T > 30 GeV), that they lie in opposite hemispheres ■ r\j2 < 0), and that they be well-separated 
in pseudorapidity {\r\j\ — rj^l > 4.4). 

• Require that the lepton be between the tagjets in pseudorapidity. 

• Mjf. require that the invariant mass of the two tagging jets be greater than 1500 GeV. 

• Central Jet Veto: reject the event if it contains any extra jets (in addition to the two jets from the 
decay of the W and the two tagging jets) with pj > 30 GeV and |tj | < 3.2. 

• b-jet veto: apply the b-jet veto cuts described in Section 6.1.2. 
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Trigger Level 



Electron Channel Muon Channel 



LI 
L2 
EF 



97.1% 85.3% 
94.3% 81.2% 
92.4% 79.6% 



Table 8: Trigger efficiencies with respect to offline cuts for the H+2j lepton-hadron channel, for a Higgs 
boson mass of 170 GeV. 



• Arjyy: require that the pseudorapidity gap between the leading jet from the decay of the W and 
the lepton be no larger than 1.5. 

• Reconstruct the mass of the Higgs boson candidate, Mi vqq , by applying the W mass constraint to 
the lepton-/?™" system. The resolution is enhanced by performing a kinematic fit to the mass of 
the hadronically decaying W, minimizing the function % 2 = (M$ c — M^ ue ) /TV + (AE/i) 2 / a n + 
(AEj2) 2 /Oj2, where a, the energy resolution for the jets, is given for now by the parameterization 
used in ATLFAST, o/Ej = 0.03 © 0.5/ ^jEj/ GeV. The left plot in Figure 10 shows the distri- 
bution of the Higgs boson candidate invariant mass after all cuts for a representative Higgs boson 
mass of 300 GeV. The figure includes the mass peak obtained in a few distorted scenarios. 

Table 7 shows the cut flow for W+jets, for ti, and for signal at a representative Higgs boson mass of 
300 GeV. At the time of this writing, a detailed estimate of the background from QCD multijet events is 
not available. 

The trigger efficiency for this channel has been estimated, considering only the single-lepton triggers: 
EM25I, EM60, MU20, and MU40 at Level 1; e25i, e60, and mu20i for Level 2 and the Event Filter. 
Table 8 shows the trigger efficiency for signal (Mh = 170 GeV) events that pass the offline cuts. Because 
the present analysis relies only on single-lepton triggers, the trigger efficiency for this channel is not as 
high as for the two-lepton channels. However, it has been checked that the trigger efficiency does not 
distort the Mj Vqq shape for signal in a meaningful way. At the time of this writing, the trigger efficiency 
for the backgrounds has not been explicitly computed; in Table 7, the trigger efficiency for background 
is assumed to be the same as the trigger efficiency for signal. 

The control sample for this channel is a region with loosened cuts on Ar]yy and Mjj. In particular, 
the event selection is the same as in the signal-like region, except that the pseudorapidity gap between 
the tagjets is required to be less than 4 instead of larger than 4.4, and the lower bound on the invariant 
mass of the tagjets is lowered to 500 GeV. Table 7 shows the contributions from the various subprocesses 
after these cuts are applied. Since both the signal-like region and the control region have a nontrivial 
contribution from top events, it is necessary to define b-tagged control samples to normalize the top 
background. These samples have the same kinematic cuts as the signal-like and control regions, but the 
b-veto cut is reversed. The cross-sections in these two regions are also shown in the table. 

The shape of the W+jets background in the signal-like region is the same as the IV+jets background 
in the control region to within the available Monte Carlo statistics. Likewise, the top background in the 
b-tagged region with the same kinematic cuts as the signal-like region has the same shape as the top 
background in the b-vetoed signal-like region. In fast simulation, there is a small difference between the 
top background shapes in the control region and the b-tagged region with the same kinematic cuts as 
the control region, but that distortion is small compared to the statistical errors expected at the luminosi- 
ties considered here. These statements are robust against a variety of systematic uncertainties, namely 
changes in the Q 2 scale (a factor of 4 up and down for both top and W+jets) as well as altered jet energy 
scales (±5% in the region with tracking, ±10% elsewhere), degraded jet energy resolution (Gaussian 
smearing to roughly double the resolution), degraded E™ lss resolution (modeled by smearing the x and 
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Figure 10: Left: the reconstructed invariant mass of the M\ yqq system for fully simulated signal Monte 
Carlo events with a true Higgs boson mass of 300 GeV. The red dashed and black dotted curves show the 
mass peaks obtained when the jet energy scale is raised and lowered by 7% in the region with \ r) \ < 2.5 
and 15% elsewhere. The blue dot-dashed curve shows the result obtained when the jet energy resolution 
is smeared by 45%/y/E in the region with \r}\ < 2.5 and 63%/ VE elsewhere. Middle: A toy Monte 
Carlo outcome corresponding to 10 fb _1 of integrated luminosity for the signal-like region of the H + 2j, 
H — > Ivqq analysis described in Section 7. Here, the background from QCD multijets is assumed to be 
negligible. Right: the corresponding distribution in the control region. 



y components of Ef' ss by 5 GeV each), and Ej lss reconstruction that has been degraded by artificially 
inducing a shift of 3% in the reconstructed E™ lss . 

The background is normalized and the signal extracted with the help of a fitting algorithm which 
implements a simultaneous binned fit in the signal-like region and the main control region in the range 
167< Mt V(jq <1000 GeV. The fit proceeds in two steps: first, standalone fits of the top background 
model to the b-tagged regions are performed. All parameters governing the shape of the top background 
distributions are allowed to float in this fit. After the fits to the b-tagged regions, these parameters are 
fixed, and the histograms are rescaled by the ratio of cross-sections in the b-tagged and b-vetoed regions 
to obtain estimates of the top background normalization and shape in both b-vetoed regions. 

The main fit is a simultaneous fit of the M\ vqq distribution in the signal-like and control regions. All 
shape parameters for the IV+jets background are free to float in the fit, but the parameters governing 
the top background shape and normalization remain fixed. The VK+jets background shape is assumed to 
be the same in the signal-like region and the control region, but the normalizations in the two regions 
are independent. Likewise, the signal shape is assumed to be the same in both regions, but the ratio 
of signal cross-sections is parameterized as a function of Higgs boson mass based on Monte Carlo. 
Figure 10 shows the result of an example fit to a toy Monte Carlo outcome with a true Higgs boson mass 
of 300 GeV. 

The performance of the fitting algorithm has been evaluated by generating pseudo-experimental out- 
comes corresponding to a luminosity of 10 fb _1 with and without true signal events. However, because 
of the large theoretical uncertainty in the prediction of the IV+jets background and the lack of a precise 
estimate of the QCD multijet background, an estimate of the expected significance is omitted here. 

8 Conclusions 

The prospects for a search for a Standard Model Higgs boson in the WW decay mode have been studied, 
using a realistic model of the ATLAS detector, including effects such as trigger efficiencies, backgrounds 
from fakes, and realistic misalignments. Three channels have been considered: H + Oj with H — > WW — > 
ev\l V, H + 2j with H — > WW — ► evjiv, and H + 2j with H — ► ivqq. In-situ background normalization 
techniques for all three channels have been proposed, and their effectiveness and robustness has been 
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Figure 11: Left: The linearity of the mass determination for the combined fit of H + 0/2j, H — > WW — > 
ev/J-V. Right: The expected significance at 10 fb _1 . 



demonstrated with detailed toy Monte Carlo studies. 

The H + 0j, H —>■ ev\iv channel is very promising for Higgs boson masses in the region around the 
WW threshold. For other masses, this channel is still very promising, but the analysis is more difficult 
because of large systematic uncertainties in the background prediction. It has been shown that these 
uncertainties can be controlled, and that the background can be normalized, with a two-dimensional fit 
in the transverse mass and the pj of the WW system. With 10 fb -1 of integrated luminosity, one would 
expect to be able to reach a 5a discovery with the H —> WW — > ev\iv channel alone if there is a Standard 
Model Higgs with a mass betwen ~ 140 GeV and ~ 185 GeV. A measurement of the mass of the Higgs 
boson at this luminosity would have a precision of less than 2 GeV for a Standard Model Higgs boson 
with a mass of 160 GeV, or a precision of less than 4 GeV for a Standard Model Higgs boson with a mass 
of 140 GeV. 

The H + 2j, H — > evjiv channel has a smaller event rate than the H + 0j channel but a similar 
significance. With 10 fb -1 of integrated luminosity, one would expect to be able to reach a 5a discovery 
in the H —> WW — > ev/J,v channel alone if there is a Standard Model Higgs boson with a mass between 
150 GeV and 180 GeV. A measurement of the Standard Model Higgs boson mass at this luminosity 
would typically return a precision less than ~4-5 GeV if the Higgs boson mass is 160 GeV, or less than 
8 GeV if the Higgs boson mass is 140 GeV. 

In the range below M# = 200 GeV, both of the dilepton channels are important; it is therefore inter- 
esting to consider a combined fit of the two. In the combined fit, a shared mass parameter has been used 
for the two channels, but the signal normalizations have been allowed to float independently to preserve 
model-independence. Figure 1 1 shows the linearity of the mass determination and the expected signifi- 
cance of a combined fit of the two dilepton channels as a function of the true Higgs boson mass. As in the 
other linearity plots in this note, the green band represents the width of a gaussian fit to the region around 
the peak of the best-fit mass distribution and the error bars show the median fit error. The combined 
significance for a Standard Model Higgs is above the 5a level for Mh larger than about 140 GeV. 

The H + 2j, H — > l\qq channel is most interesting for Higgs masses above ~ 250 GeV. It has been 
shown that the background normalization can be estimated from data, but it is difficult to make a strong 
statement about the discovery potential of this analysis until a measurement of the W+4 jets background 
can be obtained from first data. 
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Abstract 



For a light Higgs boson, with m# < 135 GeV, the largest decay mode is 
H —>■ bb. Events where the Higgs boson is produced in association with a 
tt pair manifest a distinct signature due to the presence of two W bosons and 
four b quarks. Topological and kinematical quantities are used to reconstruct 
the tt system. The identification of an additional bb pair from the Higgs boson 
decay is used to further reduce the background. 

In this analysis we focus on the sensitivity to a light Standard Model Higgs 
boson with the ATLAS detector in the channel ttH(H — > bb) using the semi- 
leptonic final state with 30 fb~ 1 of integrated luminosity. The relevant back- 
grounds to the channel are investigated and the impact of their associated sys- 
tematic uncertainties is explored. 



At the LHC ttH production is dominated (90%) by gluon fusion, as illustrated in Fig. 1. The remaining 
10% arises from quark-antiquark interactions. For a Higgs boson mass between 115 GeV and 130 GeV 
the production cross-section times branching ratio to bb varies between roughly 0.4 and 0.2 pb at leading 
order. The top quarks decay almost exclusively to bW, and therefore the various final states can be 
classified according to the decays of the W bosons. 

The all-hadronic channel is the one with the highest branching fraction, with a value of 43%. Un- 
fortunately, the large QCD multijet cross-section does not allow easy triggering with jets. Only tight 
requirements on the jet pj and on the jet multiplicity could lead to reasonable rates in the first level of 
the trigger, but these requirements come at the expense of signal efficiency. This is being studied for 
this final state together with the use of ^-tagging at the second level of the trigger to reduce the jet pj 
threshold. 

The fully-leptonic final state analysis is probably the least feasible, despite presenting a simpler 
signature to trigger on, given the presence of two isolated leptons. The branching fraction (5%) is low 
and the two neutrinos prevent the reconstruction of the top quarks. 



Figure 1 : One of the Feynman diagrams for ttH production in the semi-leptonic final state. 

The semi-leptonic final state is a good compromise with a branching fraction of about 28% excluding 
tau leptons. The experimental signature consists of one energetic isolated lepton, a high jet multiplicity 
with multiple &-tags, and missing transverse energy from the escaping neutrino, as shown in Fig. 1. The 



1 Signal 
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trigger relies on the presence of the high-pj lepton. The non-Z?-tagged jets can be used for reconstructing 
the hadronically decaying W boson, decreasing the possible combinatorial permutations. 

2 Physics backgrounds 

The production of tt events is the main background for the ttH process. Given the high jet multiplicity 
in the signal process (> 6 jets), only tt events produced together with at least two extra jets contribute to 
the preselected data sample. Since most of these extra jets come from the hadronisation of light quarks, 
this contribution is greatly reduced by asking for four jets to be identified as ft-jets. 



The irreducible background comes from ttbb production. This can proceed via QCD or electroweak, 
(EW), interactions with a total cross-section of the order of 9 pb. Some of the Feynman diagrams involved 
in the two production mechanisms are shown in Fig. 2 and Fig. 3. While the QCD production cross- 
section is ten times larger than the EW production, the latter is also important. The two ft-jets not coming 
from the tt decay have large momenta and also have a total invariant mass which is typically close to the 
Z boson mass, and can therefore contaminate the signal region. 



The ttcc background cross-section is 60% higher than ttbb [1], so it could also play an important role. 
However, it is found upon investigation that due to the c-jet rejection factor, the ttcc background plays a 
negligible part in comparison with the ttbb background. No dedicated sample is therefore simulated for 
this study. Some ttcc events are however present in the inclusive tt +jets sample used. 

Several other backgrounds, such as W+jets, tW production and QCD multijet production, could also 
have a non-negligible impact on the analysis. Even though the W plus two jets inclusive cross-section 
is about 1200 pb per lepton flavor [2], it has been shown [3] and confirmed in this analysis that the 
contribution can be reduced to a negligible level if the four Z?-tags requirement is applied. This is also 
true for the less abundant tW background, which has a cross-section of 9.5 pb [4]. Even when four b-jets 
are requested in the event, contamination via QCD bbbb production, which has a cross-section of a few 
hundred nb [5], is still possible. The reconstruction of the tt system allows a certain degree of safety 
against non-top background. None of these samples are presented in what follows. 




Figure 2: Example of Feynman diagrams for the ttbb QCD production. 




Figure 3: Example of Feynman diagrams for the ttbb EW production. 
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3 Monte Carlo samples and cross-sections 

This study uses the leading order cross-sections for the signal and ttbb samples. No calculation has yet 
been performed for ttbb at next-to-leading order, (NLO). This study also uses a tt background, simulated 
at next-to-leading order. This is the only NLO Monte Carlo sample used, and the only tt large sample 
available in ATLAS at the time this analysis was performed. No K-factors are applied to the samples 
simulated at leading order in the significance estimates. 

The signal sample is generated for a Higgs boson mass of m# = 120 GeV with PYTHIA [6] 6.403. 
The exact generated process was pp —>ttHX — > ivbqq'bbbX, with I = e or jx. The factorization and 
renormalization scales used are identical and are listed in Table 1. The signal and the ttbb events are 
generated with a lepton filter requiring at least one electron or one muon with pseudorapidity \r\\ < 2.7 
and transverse momentum above 10 GeV. The leading order production cross-section used is a(ttH) = 
537 fb [7]. The branching ratios H -> bb of 67.5% at 120 GeV [7], £v of 10.66% [8], and 
hadrons of 67.6% [8] is applied. Finally the lepton filter efficiency of e = 0.953 is also applied. The 
resulting cross-section is 100 fb. 

For both ttbb QCD and EW samples, the exact process generated is gg^ tibbX-^> ivbqq'bbbX, 
with I = e or jx. Both processes can be initiated by a qq pair, but only the dominant gluon fusion is 
simulated, with the cross-sections being increased to allow for the qq pair production. For the ttbb QCD 
sample, AcerMC 3.4 [9] is used and interfaced to PYTHIA 6.403 for the simulation of the initial and 
final state radiation, hadronisation and decay. The ttbb EW sample is generated using AcerMC 3.3 and 
PYTHIA 6.403. The leading order ttbb QCD cross-section is o(pp ^ttbb) = %.2(gg)(+0.5(qq)) pb 
and the lepton filter efficiency is e = 0.946. For the ttbb EW sample, the leading order cross-section is 
a(pp ^ttbb) = 0.90(gg)(+0.04(g<?)) pb and the lepton filter efficiency is e = 0.943. 

The reducible tt background events are generated with the MC@NLO [10] program, interfaced 
to HERWIG [11] and Jimmy [12]. The events in this sample correspond to the processes pp -^tt —> 
(£v,qq')b£vb with £ = e,/x, T. The generator versions used are MC@NLO 3.1 and HERWIG 6.510. For 
the inclusive tt cross-section we use the NLO+NLL calculation of o{pp —>?/") = 833 pb. The tt sample 
is also produced using a filter requiring one electron or one muon with pseudorapidity \r] \ < 2.7 and 
transverse momentum above 14 GeV. The tt filter also applies requirements on the jets in the generated 
events which are reconstructed using a seeded fixed-cone algorithm with a cone size of AR - 0.4 [13], by 
requiring at least: 

• six jets with p T > 14 GeV and \rj\ < 5.2 

• four jets with p T > 14 GeV and \r] \ < 2.7 

The efficiency of this generator filter on inclusive tt events is 0. 146. 

For the tt sample, about 10% of events are ttbb and are removed following the overlap treatment 
explained in Ref. [14] together with their associated cross-section. 

Table 1 summarizes the cross-sections, calculated using the Monte Carlo generators, of the different 
processes considered for this analysis, together with the corresponding numbers of generated events and 
the equivalent integrated luminosity. All branching fractions and filter efficiencies are included. 

4 Analysis overview 

The analysis consists of an initial preselection requirement which is applied to the events to ensure that 
the fundamental physics objects associated with ttH are reconstructed. Following preselection, three 
different analysis techniques are implemented in order to reconstruct the top quark pairs and the Higgs 
boson through the identification of their decay products. 
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Table 1: Summary of the different samples used for the analysis. The cross-sections are taken from 
the generators and include all branching fractions and filter efficiencies. The fourth column shows the 
equivalent integrated luminosity, taking into account all corrections (see text). For the scale calculations, 
m.H = 120 GeV and m t = 175 GeV are used. ma.x{pj 2 t , Pt 2 j) corresponds to the higher of the two values 
of pr 2 when both the top and anti-top quarks are considered. 



Process 


a (fb) 


Events 


L(fb- [ ) 


Fact. & Renorm Scale 


PDF set 


ttH (LO) 


100 


92750 


931 


Q 2 = m 2 + ma\(p T 2 t , p T 2 f ) 


CTEQ6L1 


ttbb QCD (LO) 


2371 


98350 


42 


Q = m H /2 + m t = 235 GeV 


CTEQ6L1 


ttbb EW (LO) 


255 


24750 


97 


Q = m H /2 + m t = 235 GeV 


CTEQ6L1 


tt filtered (NLO) 


109487 


710321 


6.5 


Q 2 = m 2 + \{ P T 2 t +PT 2 t) 


CTEQ6M 



The identification and association of decay products is directly related to the quality of the recon- 
structed Higgs boson signal. It mainly suffers from the misassociation of the four ^-tagged jets to the 
original partons. For this reason, the initial cut-based approach is complemented by two multivariate 
algorithms, called the pairing likelihood and constrained mass fit. 

5 Preselection 

At the preselection level we require that the event passes the trigger requirement to identify at least one 
high-^r lepton (muon or electron) coming from the decay of one of the W bosons. We then require 
that the event reconstruction identifies exactly one isolated high-/^ lepton (muon or electron). Vetoing 
the presence of a second isolated lepton is intended to remove additional sources of background. After 
the lepton requirements are met, we require at least six calorimeter jets, of which at least four must be 
loosely ^-tagged jets from the decay of the top quarks and the Higgs boson. 

5.1 Trigger requirements 

The presence of one high-^r lepton, together with missing transverse momentum, is a distinct signature 
of W boson production. These leptons can generally be used to trigger on W production with high effi- 
ciency. A logical OR of the single isolated electron (e22i) [15], high-pr electron (e55) [15] and single 
muon (mu20) [16] triggers is used. The inclusion of the e55 trigger is found to improve the efficiency 
for high-pr electrons where the e22i trigger efficiency was reduced due to the isolation requirement. 
Missing energy triggers were not available at the time of writing, but could be used in future analysis. 

The trigger efficiency is approximately 82% for the semileptonic top decays for those events which 
would otherwise pass the offline analysis. This is included consistently in the following sections. 

5.2 Reconstructed high p T lepton selection 

In this Section we explain the selection criteria used for reconstructed electrons and muons produced in 
the semi-leptonic decay of the tt system. As previously mentioned, exactly one high-p r isolated electron 
or muon must be reconstructed for the event to pass the preselection. 

To be considered for the analysis, reconstructed electrons must have transverse momentum pr > 
25 GeV and pseudorapidity \r]\ < 2.5. Further calorimeter-based cuts are applied to the loose elec- 
tron [17] definition. An isolation cut is also applied to the candidate electrons in the form of an upper 
limit of 0.15 on the ratio of the pj of the additional tracks inside a cone of size 0.2 in AR, (-y/Arj 2 + A0 2 ), 
around the electron track to the electron p T . 
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Muon candidates are reconstructed using a combination of the Inner Detector and Muon Spectrom- 
eter [18]. They must pass the acceptance cuts pj > 20 GeV and \r\ \ < 2.5. In order to remove poorly 
reconstructed muons, cuts are applied to the muon track fit quality and its transverse impact parameter, 
which helps to discriminate against muons generated by the decay of long-lived mesons. 

An isolation cut of 0.30 on the ratio between the transverse energy deposited inside a cone of size 
0.2 in AR around the muon track and the muon pj is applied. 

5.3 Jets 

To reconstruct the energy of the partons produced in the original collision, calorimeter jets are recon- 
structed using a seeded fixed-cone algorithm with a cone size of AR = 0.4 [13]. Cuts on pj > 20 GeV 
and | T7 1 < 5.0 are initially applied. Only events with at least 6 jets are kept for the analysis. All electrons 
reconstructed as jets are identified and removed from the jet collection according to the electron overlap 
removal procedure described in Section 5.3.1. Reconstructed muons which are not isolated are combined 
into jets where applicable (see Section 5.3.3), and only after this step are the jet energies calibrated for 
residual effects. The jet multiplicity is shown in Fig. 4, calculated after electron overlap removal and the 
jet T7 and p T cuts. 

In the following analyses the concept of 'correct' jets is important. This is defined by finding the 
closest reconstructed jet to each partem, after final state radiation. This match is in AR space and must 
be closer than 0.4. A W or H boson is correctly matched if both the jets being used are associated to 
the partons from its decay, while for a top quark the matching refers to the b quark jet only. Normally 
'correct' is applied in this note only to one quark or boson at a time. 

5.3.1 Treatment of overlaps between jets and electrons 

Since most electrons are also reconstructed by the jet algorithm it becomes necessary to identify them in 
the jet collection in order to avoid double counting. The criteria for the jet-electron overlap removal is the 
following: each jet matching a well-reconstructed electron (i.e. fulfilling the cuts defined in Section 5.2) 
within a AR of 0.2, and for which the ratio of the electron to the jet transverse momenta is greater than 
0.75, is discarded from the jet collection. About 4% of the jets in the signal sample are removed by this 
selection, 99% of them being actual true electrons. 

5.3.2 A-tagging 

Z^-jets are identified using the IP3D+SV1 tagger [19], which exploits both the impact parameter of tracks 
and the properties of an inclusive secondary vertex, using a likelihood approach which leads to a single 
discriminating variable: the ^-tagging weight. In order to allow for a projected decrease in light jet 
rejection of approximately 30%, the ^-tagging weight for this study is increased by 0.9 for central jets 
(|?]| < 2.5) having no associated (AR < 0.3) heavy quark or lepton (b,c,z) in the Monte Carlo simulation 
history. The b-tag weight spectrum for b-, c-, and light jets in the signal sample is shown in Fig. 4. A cut 
on the weight defines which jets will be eventually identified as b-jets in the analysis. The rejection of c- 
and light jets versus the b-jet efficiency, obtained by varying the weight cut, is also shown in Fig. 5. The 
different samples exhibit very similar behaviour. The rejection for "purified" jets is shown in the bottom 
row of Fig. 5. Purified jets are those which have no heavy flavor (b, c, T) quark or lepton within 0.8 in 
AR. 

In the preselection, a loose set of criteria are initially used to define a sample of jets as a first step to 
identifying b jets for the analyses. The requirements are that the jet is in the central region of the detector 
\f]\ < 2.5, and has b-tag weight > 0. If there are fewer than four of these jets, the event is discarded. 
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The cut-based analysis and the pairing likelihood (Sections 8 and 9) require that there are at least four 
ft-jets having ft-tag weight > 5.5. The fe-tag weight > 0 working point implies a ^-tagging efficiency of 
about 85% and a rejection of light (o) jets of about 8.6 (2.4), whereas the working point at b-tag weight 
> 5.5 implies a ^-tagging efficiency of about 65% and a rejection of light (o) jets of about 60 (6). 




Figure 4: Left: Multiplicity of jets, inside pj and r\ acceptance. Right: Distribution of ^-tagging weight 
for b-, c- and light jets in ttH events, using the IP3D+SV1 tagger. 




Figure 5: Rejection of light and c-jets versus ^-tagging efficiency. Red open markers indicate the working 
point (&-tag weight > 5.5) used for the cut-based and paring likelihood analyses, open markers with a 
central dot (right plots) represent the performance when the 30% performance degradation is applied. 
The lower plots show results for purified jets, where no heavier quark existed within a wide cone of 
A/?<0.8. 



5.3.3 Treatment of low pj muons 

About 20% of the time a B-meson decay cascade gives rise to a muon. With a four ft-jet signature in 
this channel, these muons, also called soft muons are present in almost every event. In order to improve 
the estimate of the momentum of the original b quark, these muons must be used to correct the jet four- 
momenta by adding the muon four-momentum to a jet. Two different algorithms (high and I0W-/77 [20]) 
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identify the muons, which are required to be within AR < 0.4 of the jet axis. Among the candidates 
from the high-/^ algorithm which are separated from the selected hard lepton by AR > 0.1, only the one 
with the best track quality is considered for addition. All the neighbouring candidates from the low-pr 
algorithm are considered for addition, provided they fulfil pj > 4 GeV and pj < 100 GeV, \rj \ < 2.5 
and chi-squared per degree of freedom % 2 /n < 30 for the combined fit. In addition, a loose anti-isolation 
cut is applied, requiring that the energy reconstructed in the calorimeter within a cone of size 0.2 in 
AR around the muon track divided by the muon pj is higher than 0.1. Table 2 shows that adding 
low pj muons to jets improves both the mean jet p T and resolution. Fig. 6 shows that there is also an 
improvement in the Higgs boson mass and resolution, when the correct jet combination is chosen. 

5.3.4 Calibration 

A Monte Carlo based jet correction has been derived to take into account residual calibrations, e.g. 
out-of-cone effects and neutrinos. The parametrization was derived from full simulation, so that the 
jet four-momentum is corrected by a flavor dependent rescaling factor which scales all components of 
the four-momentum. Table 2 shows that for ft-jets, the residual calibration brings the jet and associated 
partem pj into agreement from an offset of 5.4 to —0.5 GeV. Each of the analyses uses the light jet 
correction for those jets which are assigned to the W boson and corrects the other four jets as b quark 
originated. The impact of the calibration on the Higgs boson mass peak where the correct jet combination 
is chosen is shown in Fig. 6. Both the mass peak location and the resolution are improved. 

Table 2: The true partem p T minus the measured jet p T with and without adding muons and making the 
out-of-cone correction. The quoted values are the results of a Gaussian fit in the region ±20 GeV. 



Treatment 


Value 


No Calibration 


Added muons 


Calibrated 


Both corrections 


All b jets 


Mean, GeV 
Sigma, GeV 


5.7±0.05 
10.0±0.05 


5.4±0.05 
9.8±0.05 


-0.1 ±0.04 
10.2±0.05 


-0.5±0.04 
10.0±0.04 


b jets with muons 


Mean, GeV 
Sigma, GeV 


26±2.6 
18.2±1.1 


7.6±0.2 
11.8±0.2 


11.4±0.5 
13.6±0.4 


2.3±0.2 
12.0±0.2 


All light jets 


Mean, GeV 
Sigma, GeV 


2.4±0.04 
8.4±0.04 


2.3±0.04 
8.4±0.04 


-1.4±0.04 
8.9±0.04 


-1.5±0.04 
8.3±0.04 


Light jets with muons 


Mean, GeV 
Sigma, GeV 


10.5±0.9 
11.3±0.8 


2.4±0.4 
10.9±0.5 


7.3±0.7 
11.5±0.6 


-1.0±0.4 
10.7±0.4 



5.4 Results of preselection on signal and background 

The effect of the event preselection on signal and background samples is illustrated in Table 3. The 
efficiency for the four b-tag cut is different in the signal and in the irreducible background because the 
two additional b-jets have different pj and \r\ \ spectra. The preselection, at a level of four loose ft-tags, 
removes a large fraction of the signal, but also reduces the backgrounds to a level where they can be 
handled more easily; for example 98% of the ttX background is removed. It also ensures that the selec- 
tions are a subset of the requirements placed at generator level. The selected sample has approximately 
a 0.6% signal component, with a little more than 8% of the background being irreducible ttbb. Further 
tightening the ^-tagging requirement to four jets with weights of at least 5.5 reduces the samples with 
four b quarks by a further factor of four while removing 90% of the remaining ttX background. 
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Figure 6: Effect of adding a reconstructed low pj muon to at least one of the b jets (left), and the effect 
of jet calibration (right) shown for the Higgs boson mass where the correct jet combination is chosen. 
Red solid (black open) markers and solid (dotted) line show the mass distribution and fitted values for 
jets after (before) correction. The effect of the accidental wrong muon matches can be seen in the shape 
distortion. All distributions are normalized to unity. 



Table 3: Cross-sections after each preselection cut for signal and background. The last row shows the 
further effect of tightening the &-tag requirements to the level of the final selection in the cut based and 
pairing likelihood analyses. In the last column the contribution of ttbb has been removed. The errors are 
statistical only. 



Preselection cut 


tffl(fb) 


ttbb(EW) (fb) 


ttbb(QCD) (fb) 


ttX (fb) 


lepton 

+ > 6 jets 

+ > 4 loose Z?-tags 


57. ± 0.2 
36 ± 0.2 
16.2 ± 0.2 


141 ± 1.0 
77 ± 0.9 
23 ± 0.7 


1356 ± 6 
665 ±4 
198 ±3 


63710 ±99 
26214 ± 64 
2589 ± 25 


+ > 4 tight ft-tags 


3.8 ± 0.06 


4.2 ± 0.2 


30 ± 0.8 


51 ±2 



6 Reconstruction of the hadronically decaying W bosons 

Reconstructing the W boson four-momenta is necessary to reconstruct the top quarks. The reconstruction 
of the hadronically decaying W boson is done in different ways by the three analyses. 

For the cut-based and pairing likelihood analyses (see Sections 8 and 9), the highest four ^-tagged 
jets (with b-jet weight > 5.5) are excluded from the hadronically decaying W reconstruction, however 
all other jets are paired to form W candidates. Figure 7 shows the mass distribution and multiplicity of 
all W candidates for the cut-based analysis. Only candidates within 25 GeV of the true W mass are kept. 
Even with these cuts the hadronically decaying W candidate multiplicity is still very high. All jets used 
to form these W candidates are calibrated with the light-jet calibration. The likelihood analyses do not 
require an explicit cut upon this mass as the definition of the likelihood imposes it automatically, but the 
constrained fit likelihood (Section 10) imposes a requirement that the mass be between 30 and 150 GeV 
to reduce the number of combinations to be evaluated. 
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Figure 7: Cross-sections of hadronically decaying W combinations per event giving candidates within 
25 GeV of the true W mass in the signal sample, (left). Right: Invariant mass spectrum for hadronically 
decaying W candidates normalized to unity. The dotted line shows combinations where the jets from the 
W are correctly matched. 

7 Reconstruction of the leptonically decaying W bosons 

When reconstructing the leptonically decaying W, we use the lepton four-momentum as measured in 
the detector. The neutrino, transverse momentum can be inferred by measuring the imbalance of the 
transverse energy in the event. This measured quantity is referred to as missing transverse energy. 

7.1 Neutrino p z estimation 

Once the missing transverse energy is identified with pj v , the invariant mass of the sum of the lepton 
and neutrino four-momenta can be constrained to the W boson mass [21]. Because of the limited mea- 
surement resolution on the transverse missing energy, for a significant fraction of events the quadratic 
constraint equation does not have a real solution. In this case the "A = 0 approximation" can be made 
by dropping the imaginary part of those solutions with complex roots. Another method (the "collinear 
approximation") assumes that the W boson decay products are produced preferentially in the same direc- 
tion (due to the large top quark mass boosting the W boson). For the collinear approximation, one can 
assume that p u = p zv . 

Considering ttH events where one lepton is reconstructed, 72% of the time p zv has real solutions. In 
this case, both solutions are carried forward into the analyses, and the best performing final state solution 
is used. For these events the p zv resolution is 19.5 GeV. In the other 28% of cases where there are no 
real p zv solutions, the A = 0 approximation is used and the p zv resolution is 40 GeV. This performs 
better than the collinear approximation where the p zv resolution is 54 GeV. The quality of the W -boson 
reconstruction can be seen in Fig. 8. For the events where there is no real solution for p zv , the direction 
and the mass of the W boson is better represented by the A = 0 approximation. 

Since the mass constraint is lost when using the A = 0 approximation (the same would be true for 
the collinear approximation) there is an actual cut for the reconstructed W boson mass. The cut-based 
and pairing likelihood analyses only consider W candidates having a mass less than 140 GeV, while 
the constrained fit analysis does not make an explicit requirement on this but the poor x 2 will remove 
extreme cases. 
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Figure 8: Distributions of AR between the true and the reconstructed W boson (left) and of the recon- 
structed leptonically decaying W mass (right) for events where a solution for p z is found (solid black 
line) and events where an approximation is used (dotted red for A = 0, dashed blue for the collinear 
approximation). All disuibutions are normalized to unity. 

8 Cut-based analysis 

This section describes the algorithms used to reconstruct the it system and the Higgs boson. The b-jets 
are associated with the leptonically and hadronically decaying W boson candidates, to build a list of 
top quark candidates. The combination of Z?-jets resulting in the best reconstruction for the top quark 
candidates is taken as the final choice. It is important to note that the b-jets themselves satisfy the cut 
on b-tag weight > 5.5 and that if there are more than four of these jets, then the four with the highest 
weight are treated as &-jets. Events where there is no combination giving a satisfactory top quark mass 
reconstruction are discarded. The two remaining Z?-jets are used to form the Higgs boson candidate. 

8.1 Top-antitop quark system and combinatorial background 

In each event, top quarks are reconstructed by pairing two fe-jets with the W boson candidates in the way 
which minimizes the x 2 expressed as: 

2= ( m Jjb ~ mtop \ 2 ( mivb - m top 
1 V °m }jb ) V °m lvb 

where o mjjb and o mivb are the reconstructed mass resolutions estimated in simulated signal events and are 
13 and 19 GeV respectively. Only combinations fulfilling \mjji, — m top \ < 25 GeV and \mi V b — m top \ < 25 
GeV are considered for the x 2 calculation. The top quark mass distributions for the chosen combination 
in the signal sample is shown in Fig. 9. The two remaining ^-tagged jets are used to form the Higgs boson 
candidates. The mass distribution for all Higgs boson candidates in the signal sample is shown in Fig. 10, 
and in the same plot the signal and physics background cross-sections are adjacent. Here the difficulty of 
the analysis is clearly shown, requiring dedicated studies to measure the background normalization and 
its shape in data. 

As a final cut, to discriminate against tt events where no Higgs boson is produced, only events in 
a mass window of 30 GeV from the nominal Higgs boson mass are used for the final estimation of the 
cut-based analysis significance. 

The effect of the final selection for the cut-based analysis on signal and background samples is shown 
in Table 4. The selections have reduced the signal by a factor of sixteen from the preselection, but the 
signal to background has increased from 0.006 to 0.11. The irreducible ttbb background is 46% of the 
total. 
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Table 4: Accepted cross-section after each successive mass-window selection cut for signal and back- 
ground in the cut-based analysis. In the last column the contribution of ttbb is removed. Errors are 
statistical only. 



cut 


ttH(fb) 


ttbb(EW) (fb) 


ttbb(QCD) (fb) 


ttX (fb) 


^had + ^lep 
+ «+Higgs 

+ Higgs boson mass window 


2.49 ± 0.05 
2.04 ± 0.05 
1.00 ± 0.03 


2.9 ± 0.2 
2.2 ± 0.2 
0.52 ± 0.07 


18.2 ±0.7 
14.7 ± 0.6 
3.6 ± 0.3 


22.5 ± 1.9 
14.3 ± 1.5 
4.9 ± 0.9 




Figure 9: Reconstructed invariant mass spectrum for selected leptonic (left) and hadronic (right) top 
quark candidates in the signal sample. The dotted red line indicates the candidates formed by assigning 
the correct b-jet to the top quark being considered. All distributions are given in cross-section. 




m(bb) [GeV] m(bb) [GeV] 

Figure 10: Left: Reconstructed invariant mass spectrum for Higgs boson candidates in the signal sample. 
The dotted red line indicates the candidates formed by assigning the correct ft-jets. Right: Reconstructed 
invariant mass spectrum for signal and backgrounds after the cut-based selection. All distributions are 
given in cross-section. 
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9 Pairing likelihood analysis 

In the previous Section we used a cut-based approach to identify the top quark decay products. A 
straightforward improvement to such an approach is to use several discriminating topological distribu- 
tions combined together in order to build a pairing likelihood. As a first step the analysis considers only 
top quark properties as likelihood templates. Even though Higgs boson properties could help associating 
b-jets, if used, those could lead to bias in the background distributions. A correct combination is ob- 
tained when the objects used for the reconstructed variables match the Monte Carlo partons, regardless 
of whether other objects are correctly associated. On the other hand all the wrongly reconstructed objects 
are used to form the wrong combination templates. The variables used, shown in Fig. 11, are: 




Figure 11: Pairing likelihood templates for top quark topological distributions, derived from the ttH sig- 
nal sample. Solid lines represent the correct combination while the dotted lines show the combinatorial 
background in the signal itself. See text for a description of the variables. 

• mjj\ The invariant mass of the light jets from the hadronic W decay. 

• ntjjb'- The invariant mass of the hadronic top decay products. 

• mivb: The invariant mass of the leptonic top decay products. 

• Z(j, j): The angle between the light jets from the hadronic W decay. 

• AR(jj,b): The distance in R between the hadronic W and b jet from the hadronic top decay. 

• AR(l,b): The distance in R between the lepton and the b jet from the leptonic top decay. 
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The output of the pairing likelihood for the correct and wrong ft-jet combinations is shown in Fig. 12. 
As shown in this plot, even though the correct distributions are peaked at 1 , the wrong combinations still 
have a large probability of being selected. The only combination used is the one which maximizes 
the likelihood output. In order to avoid the presence of a large combinatorial contribution a cut on the 
likelihood output of 0.9 is used to select well-reconstructed events. After this cut, ft-jets are associated 
to reconstruct the Higgs boson, as shown in Fig. 12, and the two top quarks, shown in Fig. 13. The 
invariant mass distribution for the selected Higgs boson candidates for signal and backgrounds is shown 
in Fig. 14. 




Figure 12: Left hand side: combinatorial likelihood output for ttH events. Black solid (red dotted) 
histogram indicates the correct (wrong) combinations. Right hand side: invariant mass for the Higgs 
boson candidates reconstructed using the maximum likelihood configuration, after applying a cut on the 
likelihood. Dotted histogram indicates the correct combinations. The differential cross-section is shown 
in fb. 




100 120 140 160 180 200 220 240 100 120 140 160 180 200 220 240 

m(lvb) [GeV] m(jjb) [GeV] 



Figure 13: On the left (right) hand side is shown the leptonic (hadronic) top quark candidates recon- 
structed invariant mass using the maximum likelihood configuration, after applying a cut on the likeli- 
hood output. The dotted histogram indicates the correct b quark jet for the top quark being considered. 
The differential cross-section is shown in fb. 

A final cut on the reconstructed Higgs boson mass, requiring it to be within 30 GeV of the Higgs 
boson nominal mass is applied. The event yield for the whole analysis using the pairing likelihood is 
shown in Table 5. This analysis reduces the signal by a factor thirteen, and produces a sample which has 
a signal to background ratio of 0.1. The irreducible ttbb background increases to 45% of the total. 
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Table 5: Cross-sections after each selection cut for signal and backgrounds for the pairing likelihood 
analysis. In the last column the contribution of ttbb has been removed. Errors are statistical only. 



applied cuts 


tiH(fb) 


ttbb(EW) (fb) 


ttbb(QCD) (fb) 


ttX (fb) 


Leptonic W 

+ Best likelihood > 0.9 

+ Higgs boson mass window 


3.6 ± 0.06 
2.3 ± 0.05 
1.2 ± 0.04 


4.1 ±0.2 
2.5 ±0.2 
0.68 ± 0.08 


29 ± 0.8 
16 ±0.6 
4.6 ± 0.3 


48 ± 2.7 
19 ± 1.7 
6.5 ± 1.0 




Figure 14: Reconstructed invariant mass spectrum for Higgs boson candidates for signal and back- 
grounds after pairing likelihood selection. The differential cross-section is shown in fb. 
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10 Constrained fit analysis 

An alternative analysis uses a mass-constrained fit to the measured missing energy and jet and lepton 
four-momenta to help with the jet combinatorics. There are six quarks produced from the top quark and 
Higgs boson decays, and these are matched to the reconstructed jets. The % 2 from this fit is used in a 
likelihood technique together with kinematic variables, ^-tagging and jet charge. Then, all jet combina- 
tions passing loose criteria are tested and the one with the best likelihood is chosen. The analysis starts 
from the preselection as described in Section 5. The signal events are then separated from background 
in a second likelihood step. 

10.1 Mass- Constrained fit technique 

The fit varies a scale factor for the four-momenta of the jets and the z component of the neutrino mo- 
mentum. Adjustments to the jet momenta and masses through the scale parameters, produce accom- 
panying changes in the missing energy and hence in the parameters used in the reconstruction of the 
leptonically decaying W as the transverse components of its neutrino are taken to be the missing energy. 
The longitudinal component of the momentum of decaying W boson's neutrino p zv is the last fit parame- 
ter. The parameters are constrained by the estimated jet errors and by the masses of the top quarks and W 
bosons. These later are included as approximate Gaussian % 2 contributions calculated using the masses 
inferred from the current parameters as indicated in the following equation: 



where the W and top quark widths (% and o t are 2.1 and 1.5 GeV respectively. 

To simplify the fit, the hadronic top quark and W are forced to be exactly on mass shell. The scale 
factor of the higher p T jet from the W is externally varied, while the other two scale factors are calculated 
to give the correct masses. The momenta of all six jets are varied, but these three are linked. There are 
therefore five free parameters and not seven. This implies that these two particles are fixed to their 
nominal masses, while the leptonic top quark and W are given widths. 

The calculation of the momentum of the neutrino from the leptonic W decay normally has two solu- 
tions, as discussed in Section 7. Both of these are used as starting points for the fit, to ensure that it does 
not find only one local minimum, and the fit with the larger x 2 is discarded. If the W neutrino solutions 
had complex roots then the real part of these is used as an initial value. 

The jet momenta are calibrated as discussed in Section 5.3.4, and the following errors are used in the 

fit: 



where the momenta are measured in GeV. This form comes from comparing reconstructed jet pr with 
simulated initial quark pj\ in other words it includes not only detector effects but also fragmentation. 
Both light and &-jet momentum errors are treated as Gaussian distributed; for b quark jets in particular 
this is not a good description as the frequent presence of a neutrino gives tails to the measured energy 
response. 

The fit adjusts the momenta of all the jets, including those from the Higgs boson, but the fitted Higgs 
boson mass is not used in the analysis, as it offers no improvement. 




(2) 



Op hg J Plight = 0.988/ V^F0 0.035 
o P JP h = 0.888/ ^pje 0.125 



(3) 



(4) 
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10.2 b-tag information 

In order to use the ^-tagger output, as described in Section 5.3.2, for a likelihood, the distributions of 
weights expected for b-jets and non Z^-jets in this environment are required. This is done using jets taken 
from the signal simulation, where only jets with a parent quark within 0.4 in AR and no second quark 
within 0.6 in AR are used. The ratio of smoothed weight distributions with b-jet over light flavored jets 
(u,d,s) is shown in Fig. 15. 




-10 0 10 20 30 40 
B weight 



Figure 15: The ^-tagging likelihood ratio extracted from signal simulation as the ratio of b- weight distri- 
butions for ft-jets and light jets. The degree of smoothing reflects the statistical precision at each point. 

In the analysis, four jets are taken to be from b quarks. For each jet we compute Jz^', the ratio of 
b/light for jet i as shown in Fig. 15. In selecting combinations the sum Z,log 10 J^' is taken over the four 
jets. There is no requirement made on the jets from the W boson. 

10.3 Jet charge 

The assignment of jets to quarks can benefit from the jet charge measurement as we know the expected 
charges of the quarks involved. The jet charge is the momentum weighted sum of the charged particle 
charges within the jet, and it shows some correlation with the initial quark charge. The b(b) quark 
has only a charge of (-)l/3, and furthermore, after hadronisation there are oscillations which reduce the 
sensitivity, but there is some information. 

The analysis requires exactly one high pj lepton which has charge Qi, and therefore we know the 
expected charge of both of the b quarks associated to the top quarks via the relationship sign{Qi) = 
sign{Qt le ( ) = — 1 x sign(Qt had ), and the sum of the charges of the jets from the hadronically decaying W 
boson Qyy =—1x2/. The measured values of these are then compared with the expectations using a 
likelihood. The sum of the Higgs boson jet charges is much less sensitive because the expected value is 
zero, but it is also used. 

The jet charge plots in Fig. 16 are calculated using jets from the signal sample. The W plots are made 
using only true light jets which were tagged as light jets, and the b plots only from tagged ft-jets associated 
with a b quark. This is to ensure that the jet charge is independent of b-tag information. The W wrong 
combinations distribution has spikes at integral values which generally involve at least one jet outside 
the tracker acceptance contributing a charge of zero. These are less frequent in the correct combinations 
which tend to be central. The other distributions reflect ^-tagged jets which must therefore have charged 
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q(W) q(b) q(H) 

Figure 16: Left: The jet charge of W boson candidates, based on the sum of the jet-charges of the two 
jets signed by the high-^r lepton. Center: Jet charges of individual Z?-jets from top quarks; signed so 
that those where quark charge agreeing with the lepton charge from W decay are positive. Right: The 
magnitude of the sum of the jet charges of the jets assigned to the Higgs boson. Wrong b-jet combinations 
have a somewhat flatter distribution. Correct combinations are solid (black) and wrong combinations are 
dashed (red). 



particle tracks. It is assumed that the jet charge information can be calibrated from the plentiful top quark 
pair events. The normalizations are arbitrary, as they offset every combination equally. The jet-charge is 
used as ^ charge = 5%*°* top x Jggg"* «* x Sf m x J? q{H) . 

10.4 Likelihood analysis for jet assignment 

All possible assignments of jets to quarks are evaluated in turn. Those combinations which fail a loose 
quality requirement are discarded. This quality requirement is that: 

• Both of the jets assigned to the Higgs boson must have a ^-likelihood greater than zero. From 
Fig. 15 that corresponds to a cut of about -2 on the ^-tagging variable. 

• Selections on the mass of the W and top quark which decay to jets. The W mass calculated from 
the jets without fitting must lie between 30 and 150 GeV, and the top quark mass between 100 and 
250 GeV. Note that the jet energy correction factor applied depends upon whether or not the jet is 
considered to originate from a b quark in this hypothesis. 

• Total ^-likelihood greater than 8. This is the sum of the log-likelihoods of the four jets which are 
assigned to ft-jets; this roughly corresponds to the mean ^-weight of the jets being 4 or greater. 

In the preselected signal sample there are a mean of 5811 combinations to be tested per event, but 
the above quality requirement reduces this to 233; a considerable saving in time. 90% of the preselected 
signal events have at least one combination passing the above requirements. 

Events which pass these selections are processed by the ttH fitting code. Correct or wrong combina- 
tions are then used to define a likelihood ratio. 

The elements of that likelihood ratio are as follows: 

• The log 10 of the % 2 from the fit. 
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Figure 17: Left: The % 2 of the fit. Center: jet-top| m j n . The cosine of the minimum angle between top 
quarks and their daughter jets. Correct combinations have more collimated tops. Right: The t{, ept cos0*. 
The decay angle in the rest frame of the leptonically decaying top quark of the b-]et relative to the top 
quark direction in the lab frame. Correct combinations are solid (black) and wrong combinations are 
dashed (red). 

• jet • top | m i n : The minimum cosine of the angle between the top quarks and any of their four jets in 
the ttH center of mass frame. 

• t{, ept cos0*: The decay angle in the rest frame of the leptonically decaying top quark of the b-]eX 
relative to the top quark direction in the lab frame. 

• t^ ept AR: the distance in AR space between the lepton and the b quark assigned to the same top 
quark. 

• | Y\ | max : The maximum | r\ | of the considered jets. Jets from the ttH system tend to be more central 
than those from the underlying event. 

• cos0j*i_j et : Measured in the Higgs boson rest frame, this is the cosine of the angle between the 
higher pj of the two jets from the Higgs boson and the boost applied to shift from the lab frame to 
the Higgs boson rest frame. 

• m t : The hadronic top quark mass before the fit is performed. 

The variables are displayed in Figs. 17 and 18 for correct and wrong combinations. In this case 
'correct' implies that all six quarks are correctly assigned. Fig. 19 shows how each variable would 
perform if used individually to separate correct and incorrect pairings. For each variable combinations 
are selected by the likelihood ratio found using that variable alone. The Fig. shows the fraction of wrong 
pairings which would be accepted as a function of the fraction of correct ones. The fit x 2 is the most 
powerful single variable over much of Fig. 19, but the masses of the hadronic top quark and W work 
well at high efficiency while jet • top | m j n is also rather powerful. Clearly the variables have correlations, 
and these are taken into account by evaluating the likelihood in a 3D space defined by the three variables 
under study. This explicitly includes the correlations, but is limited by the simulation statistics required to 
populate the space. The combination of # 2 , jet • top | m j n and t{, ept cos 6* was adopted as the most powerful 
set of three variables found. 
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Figure 18: Left: The distribution of distances, AR, between the lepton and the b quark from the leptonic 
top quark. Center: The maximum |tj| of any jet in the combination being tested. Correct combinations 
are more central. Right: The reconstructed mass of the hadronically decaying top quark, before any fit is 
performed. Correct combinations are solid (black) and wrong combinations are dashed (red). 




Figure 19: The performance of a range of possible variables if they are used individually to find the 
correct quark-jet pairing in a ttH event. For a given efficiency for selecting the correct pairing (x axis), 
what fraction of the incorrect pairings will also be chosen (y axis). The line labelled null shows the effect 
of selecting combinations at random. 
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The remaining variables were tested to see whether their addition as uncorrelated likelihood contri- 
butions made a significant improvement, and those that did were included. The final likelihood is defined 
as follows: 

^pairing =\0g^ D , ept +logjSf lept +log^ |max 

\og% z ,jet-top| min ,t b F cos0* t b F AR 



log if cos Q * + log Jgf W( + log J%_ fag + log Jgf /e( 



The combination which produces the largest likelihood for each event is adopted. The quality of the 
chosen combination is examined in Section 11. 



10.5 Signal and background separation 

The separation of signal from background is again done using the likelihood technique. There are two 
rather different backgrounds considered: the tijj component for which ^-tagging is the primary tool and 
the 'irreducible' ttbb background which differs from the signal only in kinematic ways, which can be 
exploited to give some separation. The variables used to separate signal and background are: 

• -^pairing '■ From combinatorics. 

• E,-log 10 JS^': Sum of the log-likelihoods of the four jets used as b's in the combination chosen. 

• tff_ tag : The sum of the Z?-tagging weight of the two jets from the Higgs boson. 

• ATj(H,top) mm : The difference in tj between the Higgs boson and the closer top quark. 

• cos(tH) max : The higher of the two angles between top quarks and Higgs boson in the center of 
mass of the ttH system. 

• Hp in C.o.M: The Higgs boson momentum in the center of mass frame of the ttH system. 

• cos0j![_j et : As defined in Section 10.4. 




J □«)] 

□ ttbbEW 


, ATLAS _ 
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sum of b-tags(H) 



Figure 20: Left: Distributions for signal and backgrounds of the likelihood used to find the combinatorics 
solution. Center: The sum of the b-tag likelihoods of the four jets used as b's in the chosen combination. 
There was cut at 8 in the combinatorics preselection. Right: The sum of the Z?-tags of the jets associated 
to the Higgs boson. This variable removes tijj more than ttbb. Histograms are filled for every event 
where there is a successful fit. In all histograms, the sum of the individual histograms is shown. They 
are stacked to indicate relative contributions. 
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Figure 21: Left: The maximum cosine of the angle between either top quark and the Higgs boson when 
boosted into the ttH center of mass frame. Center: The momentum of the Higgs boson candidate in the 
center of mass frame. Note that the true Higgs bosons have a lower momentum than the background. 
Right: The cos 6* of the higher pj of the jets from the Higgs boson. This tends to be more central for 
signal events. In all histograms, the sum of the individual histograms is shown. They are stacked to 
indicate relative contributions. 



Figures 20 and 21 show the data used to produce the likelihood ratio for most of the variables. 
The backgrounds fall into two basic classes - the ttjj and the ttbb, and to deal with these two 3D 
likelihoods are defined. The first includes the three quantities which contain ^-tagging information: 
^pairing ^i^ogw^l^b-tag- These are all powerful but are highly correlated and therefore benefit from a 
correct treatment of those correlations. The second is cos(tH) max ,H p ,cos ©n-jet' which carries discrimi- 
nation based on event kinematics. It too has important correlations. These likelihoods are combined as 
if independent, with one further likelihood, derived from At] (H,top) min , added as well. All the likeli- 
hoods have been smoothed so that the expected fluctuations are below 10%, and there is therefore little 
over-training, as separate test and training samples are maintained. 

The distributions used to define the likelihood are constructed using all events for which a constrained 
fit was made, and all the components are normalized to the cross-sections at that stage. The signal 
separation likelihood is: 



*>/» = V 3 ( lo ^|L^^ (6) 

The factor of 3 makes this an average, rather than a sum, and is there purely for convenience. Note 
that there is nothing in this definition to prefer correctly paired signal events. 

Figure 22 shows the distribution of the final likelihood. It is generally dominated by ttjj events, but 
at the largest likelihood values the ttbb and signal events are more prevalent. It can be seen that any 
ttH analysis will be selecting a tail of the signal, and controlling this will be important. The signal to 
background ratio, within the mass window, rises to about 25%, and any rise above that is in a region 
affected by lack of simulation statistics. 

The final choice of working point will depend upon the details of the systematic error evaluation. 
The tighter the selection on the likelihood the higher the signal to background ratio but the smaller the 
samples in data and simulation; the latter is an important consideration. 

The maximum significance which might be expected in a measurement (evaluated as s/y/b) ignoring 
all systematic uncertainties, is obtained by cutting at a log likelihood of -4.44. This would yield a 
significance of 2.78a, but at low purity. Reducing the mass range by requiring that the candidate has a 
mass within the range 90 to 150 GeV does not appear to improve the results when systematic errors are 
not considered. 
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Figure 22: Left: The total signal separation likelihood. The top figure shows all events while the bottom 
shows only those within a Higgs boson candidate mass window of 90 to 150 GeV. Right: The integrated 
version of the lower left plot, so the total event rates passing any cut can be seen. The bottom half of this 
plot is the signal to background ratio implied. 

If an arbitrary ten per cent error on the background level is assumed then the significance for this cut, 
evaluated as s/y / b + (db) 2 , decreases to below 0.5a, while the highest significance is around 1.8a for a 
cut at -4.05. This is shown numerically in Table 6, where the expected event rates are shown for three 
different cut values. No final choice is really possible without complete evaluation of systematic errors, 
but -4.2 with the mass window cut applied does seem to be a plausible working point. The statistical 
significance for this selection is 2.18a. At this point the signal is reduced by a factor of twelve from the 
preselection, but signal to background ratio has become 0.1 25 ±0.01. The irreducible ttbb background 
has increased to 50% of the total. 

Table 6: The accepted cross-sections for signal and the main backgrounds at various stages of the 
analysis. The ttjj cross-section suffers from limited statistics. 



Selection 


ttH(fb) 


ttbb(EW) (fb) 


ttbb(QCD) (fb) 


tiX (fb) 


Initial Sample 


100 


255 


2371 


109487 


Pass preselection 


16 


23 


198 


2589 


Fit quality requirements 


14 


20 


165 


1584 


JS^/A > -4.40 


4.9 


5.1 


35 


58 


Se sjh > -4.20 


2.5 


2.3 


13.9 


11.9 


& s/b > -4.10 


1.4 


0.96 


7.11 


4.5 


Mass window 90 to 150 GeV. 


££ sjb > -4.40 


2.3±0.07 


1.4±0.17 


10.8±0.7 


22±3.1 


S? s/b > -4.20 


1.3±0.05 


0.62±0.12 


4.6±0.5 


5.3±1.5 


££ sjb > -4.10 


0.71 ±0.04 


0.23±0.07 


2.5±0.35 


2.2±1.0 



The distribution of the masses of the candidates can be seen in Fig. 23, at a cut of S£ s j b > —4.2. The 
right hand side of Fig. 23 shows details of the mass distribution for signal only. 
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Figure 23: The mass distribution after cutting at -4.2 in 5£ s jb. Left: All samples, showing the contri- 
butions stacked. The signal distribution is also shown separately at the bottom. Right: Signal only, the 
dashed (red) line equals events where the correct jets from the Higgs boson are selected. 

11 Comparison between the three analysis techniques 

The performance of the cut-based, pairing likelihood and the constrained fit analyses in terms of purity 
versus selection efficiency can be seen in Fig. 24. For the likelihood analyses, the different working points 
are obtained by varying the final cut on the likelihood discriminant. In the case of the cut-based analysis, 
the same variation is achieved by loosening or tightening the mass-window cuts on the hadronically 
decaying W and the reconstructed top quarks. For this section the efficiency is defined as the selection 
efficiency relative to the total events simulated. The purity is defined in terms of the correctness of the 
assignment of b-jets used to reconstruct the final objects. For instance, one has a pure hadronic top 
quark when the Z?-jet matches the true b parton from the top quark decay, regardless of whether the same 
happens for the hadronic W boson decay products. 

Fig. 24 clearly shows the increase of performance when using more information (likelihood) than 
just the mass of the reconstructed particles (cut-based). 

The chosen working points are indicated with solid markers on Fig. 24. Those points have not been 
optimized in terms of statistical significance, because of a lack of statistics for the ttX background and 
because due consideration of the systematic errors should also influence the decision. However, the 
significance does not change much with the choice of the cut on the pairing likelihood output since this 
likelihood is not designed to discriminate signal from physics background events. 

The ability of the three analyses to correctly identify objects in the event is compared in Table 7. The 
likelihood-based assignments perform noticeably better than the cut-based analysis. The signal efficiency 
and statistical significance are also improved. 

12 Background Shapes 

The success of this analysis relies on the accurate knowledge of the background level and shape. Monte 
Carlo predictions are affected by large systematic uncertainties, as the background rejection depends 
critically also on the jet flavor composition. For this reason it is mandatory to develop methods to 
measure background directly from real data. 
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Figure 24: Comparison of the purity of reconstructed bb invariant mass before the final mass window cut 
versus selection efficiency for the cut-based, the pairing likelihood and the constrained fit analysis. The 
solid markers show the selected working points. 



Table 7: A comparison of quality criteria for the three analyses at their working points. The mass window 
of 90 to 150 GeV is only applied for the last two rows. 





Cut Based 


Pairing likelihood 


Constrained fit 


b jet from Hadronic top correct 

b jet from Leptonic top correct 

Higgs boson jets correctly chosen 

Four b quarks correct 

Higgs boson mass peak resolution, GeV 

Signal Efficiency 


44.4±1.1% 
50.5±1.2% 
29.4±1.0% 
23.3±1.0% 
22.8±1.6 
2.04±0.05% 


49.2±1.1% 
57.4±1.1% 
34.0±1.0% 
27.5±1.0% 
20.1±1.1 
2.32±0.05% 


51.0±1.5% 
56.2±1.5% 
32.0±1.4% 
27.1±1.3% 
22.3±2.1 
2.49±0.07% 


Signal to background 
s/y/b, 301b- 1 


0.110±0.014 
1.82 


0.103±0.014 
1.95 


0.123±0.019 
2.18 



One important result of the present study is that the Higgs boson candidate mass spectrum depends 
weakly upon the ^-tagging working point. This is shown in Fig. 25, which reports the difference in bb 
invariant mass shape for the ttbb and ff+jets processes after applying the pairing likelihood analysis with 
the loose and tight ^-tagging requirement as defined in Section 5.3.2 

The complete determination of the background shape from data depends crucially on the relative 
contributions of the ttbb and ?H-jets distributions, which in turn depends on the strength of the ^-tagging 
cut applied. The bb invariant mass can be studied for a b-tag requirement, the "medium &-tag", between 
the loose and tight, such that the possible presence of signal can be still neglected. We choose a medium 
working point corresponding to a ^-tagging weight cut of 3, such that the ratio of the contribution of ttbb 
with respect to ??+jets goes from 11% to 30%, with a signal contamination of less than 3%. 

One strategy contemplated is to use the ttbb/tt+jets fraction coming from the Monte Carlo prediction 
and the total number of events from the data to normalize the Monte Carlo at the loose working point 
where the signal level is less than 1%. Using the Monte Carlo jet flavor composition and the ratio of 
the b-, c- and light jet efficiencies {^^Hght^PT j 7 ?) / ' £ b"c"iight(PT j 71 )) at ^ e loose and medium working 
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points, it is then possible to predict the shape and normalization at the medium working point. 

The data reduction when moving the fe-tag quality from "loose" to "medium" is explained by the ratio 
of the b-tag efficiencies at these two working points applied to the ttbb and tt+jets data. With a 50 pb _1 
data sample, the ^-tagging efficiency of b- and c-jets will be known with an accuracy of 5% [19], while 
the rejection of light quark jets will be measured with a 10% uncertainty. We expect a significantly 
more accurate knowledge of the ^-tagging performance with a data sample of approximately 30 fb _1 . 
This will allow the measurement of the background level of ttbb and tt+jets as a function of the bb 
invariant mass for the loose and medium ^-tagging working points. These measurements can be used to 
verify and tune with data the background prediction given by the Monte Carlo simulation, which will be 
used to extrapolate the event yeld expectation of known processes when the ft-tag quality is moved from 
"medium" to "tight". This extrapolation can be monitored, and eventually further corrected, by looking 
at the comparison with the measured data outside the mass window, where the number of signal events 
expected is small (about 4%). If necessary, this procedure can be extended by asking for three ^-tagged 
jets to further constrain the background composition and its shape and absolute normalization, to achieve 
the 5% systematic uncertainty necessary for the analysis of this processes. 




Figure 25: Ratio of the invariant mass spectrum for Higgs boson candidates after combinatorial likeli- 
hood analysis and using a loose and tight cut on the &-tag weight. Left hand side: tt+jets, right hand side: 
ttbb. The signal region shows very consistent behaviour. 



13 Systematic uncertainties 

The evaluation of systematic uncertainties, especially in the background level, is of vital importance in 
this analysis. Unfortunately it has not yet been brought to a satisfactory level and a robust method to 
infer background shapes and normalization from data, vital for this channel, still needs to be developed. 
Following the estimation of systematic uncertainties due to the standard detector effects, Table 8 shows 
the various contributions for all three analyses. It is noticeable how important the jet uncertainties are for 
both signal and background. Indeed the knowledge of the jet energy and of the ^-tagging performance 
have a crucial impact on the kinematic quantities used for the reconstruction of the it system and for the 
correct identification of the b-jets used for the analysis. Large fluctuations on the background estimations 
arise due to the lack of statistics for the ttX sample, giving rise to a relative statistical error up to 20%. 

While the theoretical uncertainties for the signal and background normalization are quite large, their 
impact can be reduced by making direct measurements. This is certainly the case for the tt cross-section, 
where the theoretical uncertainties associated with the NLO+NLL calculation are around 12% [14] while 
with only 100 pb _1 of data, a direct measurement of the cross-section for the semileptonic final state 
using ^-tagging could be performed with a much smaller error [22]. The ttbb background is only cal- 
culated at LO, the cross-section calculation has a strong scale dependence, a factor 4 when changing 
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from Qq CD = s to 2g CD =< p| > [9]. Even though the signal cross-sections used for this work are 
LO, NLO calculations are already available with a theoretical uncertainty including errors coming from 
partem distribution functions of the order of 15-20% [23] (compared to the 100-200% uncertainty of the 
LO cross-section). 



Table 8: Effect of the various systematic uncertainties on the signal and background efficiencies. 



Source 


Cut-based 


Likelihood 


Constrained fit 






signal 


background 


signal 


background 


signal 


background 




energy scale 


± 0.5% 


±2% 


± 0.3% 


±3% 


± 1% 


±3% 


Electron 


resolution 


± 0.5% 


± 0.6% 


±0% 


± 1% 


± 0.2% 


±4% 




efficiency 


± 0.2% 


±2% 


± 0.2% 


± 1% 


± 0.5% 


± 0.2% 




energy scale 


± 0.7% 


± 3% 


± 0.6% 


± 0.2% 


± 0.4 % 


±4% 


Muon 


resolution 


± 0.8% 


± 0.6% 


± 0.3% 


± 0.4% 


± 1% 


±3% 




efficiency 


± 0.3% 


±0.1% 


± 0.8% 


±0.1% 


± 0.4% 


±0.1% 




energy scale 


±9% 


±5% 


±9% 


± 14% 


±9% 


±8% 


Jet 


resolution 


± 0.3% 


±7% 


± 1% 


± 5.5% 


±5% 


± 14% 




fc-tag 


± 16% 


±20% 


± 18% 


±20% 


± 16% 


±20% 




b mis-tag 


± 0.8% 


±5% 


± 1.1% 


±3% 


±3% 


± 10% 


summed 


n quadrature 


± 18% 


±22% 


±20% 


±25% 


± 19% 


±28% 



13.1 Effect of pile-up on signal 

The portion of the semi-leptonic ttH signal sample used here is simulated a second time, but with the 
anticipated effects of pile-up and cavern background included. It is important to stress that the same 
generated events are used as input for both pile-up and non pile-up samples. The pile-up actually applied 
to the events is that expected for running at instantaneous luminosity Jz? of 10 33 cm _1 .y _1 . 

The effect of pile-up on the preselection of events (applicable to all three analyses) is shown in 
Table 9, and as can be seen, the effect of the trigger requirement is the most significant. 

Table 9: The effect of pile-up on the samples at successive stages of preselection with relative 
efficiencies. 





ttH a(fb) 


Quantity \ Sample 


No pile-up 


pile-up 


Starting Sample Generated 


100 


100 


Pass Trigger (e22i, e55, mu20) 


65 (65%) 


62 (62%) 


One high- pj Lepton 


56 (87%) 


53 (86%) 


> 6 jets (p T > 20 GeV, \r] \ < 5) 


36 (64%) 


34 (64%) 


> 4 central b-jet candidates, (\rj \ < 2.5 & b-jet weight > 0) 


16 (45%) 


15 (44%) 


Preselected 


16 (45%) 


15 (44%) 



The distribution of the number of high pj leptons in the events before preselection is shown in 
Fig. 26 for the pile-up and non pile-up samples. They are very similar, suggesting that the electron and 
muon reconstruction are not significantly affected by pile-up; however the trigger requirement reduces 
the number of events with pile-up available to the rest of the preselection. 
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Figure 26: High ^rlepton multiplicity (e ,jU ) 
for pile-up and non pile-up samples (before 
preselection). 



Figure 27: Jet multiplicity (before preselec- 
tion) for pile-up and non pile-up samples. The 
cuts pt < 20 GeV and | r\ | < 5 are applied to the 
individual jets. 




Figure 28: b jet multiplicity (after preselection) 
for pile-up and non pile-up samples. The cuts 
p T < 20 GeV, \rf\ < 2.5 and Z?jet weight > 5.5 
are applied to the individual jets. 



Figure 29: Reconstructed Higgs boson mass 
peak (mbb) for pile-up and non pile-up samples. 



Figure 27 shows the jet multiplicity before preselection. It can be seen that the number of events 
having exactly 6 jets is reduced by approximately 10%, and the number of events having more than 7 
jets is increased. The net effect will be an increase in the combinatorial background, though the extra 
jets will typically have a low pj. 

For the cut-based and pairing likelihood analyses, candidate &-jets are designated as those jets lying 
in the central region of the detector (|tj| < 2.5), with pj> 20 GeV and b-jet weight > 5.5, however, if 
there are more than four of these then the jets with the highest b-jet weights are used. 

The number of &-jets in the events both with and without pile-up after the other preselection cuts are 
applied can be seen in Fig. 28, where there is a reduction in the number of events having the requisite 
four b-jets. 

The extent of the reduction in events at the various stages of the cut-based analysis is shown in Table 
10. The most pronounced difference comes from the reduction in the number of ft-jets, and the net 
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effect of pile-up is a ~12% reduction in the number of events where it is possible to reconstruct a Higgs 
particle, as shown in Fig. 29. This effect will also manifest itself in the backgrounds, since they all 
have either two or four ^-quarks. The most interesting background pile-up study would now be with the 
tijj sample, since this relies on mis-tagging of light jets to create a physics background, however this is 
beyond the scope of this study which only examines the signal. 

It should be noted that in the course of this study, the b-]et efficiency and light-jet rejections were 
studied, however no visible differences were observed. This can be explained by the fact that even a 1 % 
drop in efficiency from 50% to 49% causes almost an 8% drop in events having four &-jets. 

Table 10: The extent of the reduction in events for pile-up and non pile-up samples for the cut- 
based analysis with relative efficiencies in parentheses. The harshest reduction comes from the 
four b-]et requirement. 



Quantity \ Sample 


ttH ( 
No pile-up 


T(fb) 
pile-up 


Preselected events 
> 4 fc-jets O-jet weight > 5.5) 
Had & Lep W inside mass-window 
t, /"-quarks rec. in mass- window 


16.0 
3.7 (23.1%) 
2.5 (66.4%) 
2.0(81.6%) 


14.8 
3.2 (21.5%) 
2.1 (66.8%) 
1.8(83.1%) 



14 Significance estimates 

The number of remaining events in the Higgs boson mass window (30 GeV around the nominal Higgs 
boson mass) have been used to compute a crude estimate of the statistical significance for this channel 
with 30 fb _1 . For such a channel in which the signal and backgrounds are very alike, this naive estimate is 
not the most relevant figure of merit, but it is still useful to compare analyses. For the cut-based analysis, 
a significance of 1.8 is achieved with signal to background ratio of approximately 0.11. It is worth noting 
that the addition of the low pj muons to jets and the residual jet calibrations performed in Sections 5.3.3 
and 5.3.4 improved the cut-based analysis significance by 0.3. With the pairing likelihood approach 
the significance is 1.95 for a signal to background ratio of 0.1. Finally the constrained fit likelihood 
gives 2.2 (1.7) for a signal over background value of 0.12 (0.14), obtained with a cut on S£ s j h of -4.2 
(-4.1). Figure 30 shows the total significance S'/y / B + (AS) 2 as a function of the systematic error on the 
background (AS) for the different analyses. As is shown in the Fig., only a background uncertanity level 
below 10% allows exploitation of the statistical power of the mass constrained fit analysis with respect 
to the cut-based analysis, and even less for the case the pairing likelihood. Even for a robust analysis 
such as the cut-based approach, the large systematic uncertainties estimated in Table 8 provide a clear 
indication that a data driven background estimation is necessary. 

15 Conclusion 

We performed a baseline sensitivity study for the detection of a Standard Model Higgs boson decaying to 
bb when produced together with a it pair. After the definition of a common preselection, three different 
techniques are used, all aimed at the reconstruction of the it system. The first one is based on the 
reconstruction of the top quark and W candidate masses (cut-based analysis). The second one (pairing 
likelihood analysis) uses a more complete description of the kinematic properties of the it system to build 
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Figure 30: Comparison of the total significance as function of systematic uncertainties (AS), for the 
cut-based, the pairing likelihood and the constrained fit analysis. Markers indicate the significance cor- 
responding to the background uncertainty estimated in Table 8. 



a likelihood discriminant and isolate the jets coming from the Higgs boson decay. The third approach 
(constrained fit) uses the known masses and jet errors as constraints to produce a combinatoric likelihood, 
and a second likelihood to separate signal from background. While the cut-based analysis is certainly 
the most stable one, relying only on the reconstructed invariant masses of the top quark candidates, 
it also performs worse with respect to the other two likelihood based analyses. On the other hand, 
these likelihood based analyses can be used successfully only after all kinematical variables are well 
understood together with their correlations. Although beyond the scope of this work, the use of more 
advanced multivariate techniques is foreseen to reduce both the combinatorial and physics background. 

The statistical significance obtained for the three approaches was 1.82 for the cut-based, 1.95 for 
the pairing likelihood and 2.18 for the constrained mass fit at signal-to-background ratios of 0.11, 0.10 
and 0.12 respectively. All the analyses suffer drastic reduction in significance as the overall systematic 
uncertainty increases. The most important individual uncertainties are those for the jet energy scale and 
^-tagging efficiency. 

From this study emerges the necessity of a strong ^-tagging algorithm which is important not only 
to suppress the ff+jets physics background but also to help reduce the combinatorial background by 
improving the hadronically decaying W reconstruction. It is also clear that the combinatorial background, 
responsible for the dilution of the Higgs boson mass peak, needs to be further reduced, possibly using 
multivariate techniques, in order to improve the statistical significance of the channel. Improvements in 
the mass peak resolution would also enhance the ability of a shape analysis from two perspectives; firstly 
it would be easier to select a signal-depleted region for any shape fits, and secondly the mass peak itself 
would become more pronounced. 

The results presented in this work can be compared with a previous ATLAS study [3] performed 
using fast simulation with a parametrized ^-tagging efficiency which had a higher performance than 
the one used here and also used PYTHIA in order to simulate the tt+X background. It resulted in a 
significance of 1.9 and 2.6 respectively for the cut-based and likelihood analyses. The results presented 
in this note can also be compared with a recent CMS study [24] reporting a significance of 1.8 for the 
electron channel and 1.6 for the muon channel, in both cases for an integrated luminosity of 60 fb _1 . 
While a detailed comparison between the two experiments is not attempted in this work, it is noteworthy 
that the jet energy resolution quoted the CMS paper could be a key factor in explaining the improved 
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sensitivity seen by ATLAS for this channel. 

The measurement of the background normalization from data is vital for this channel. Subsequent 
studies must be performed in this regard. Further methods of extracting shape information from data 
must also be developed, in particular, the extraction of the signal in the presence of a quasi-signal-like 
background as is exhibited in the invariant mass plots at the ends of the analyses. The shape information 
and any estimate of the significance obtained from it could be used in conjunction with the counting 
experiment information to improve the overall significance. 
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Study of Signal and Background Conditions in 

ttH.H WW and WH,H WW& 

Abstract 

In this note we present Monte Carlo studies of the associated Standard Model 
Higgs boson production in the ttH and WH channels with the decay H — > 
WW. These channels are intended to provide information on the Higgs bo- 
son's couplings. We study the two- and three lepton final states in ttH and 
three lepton final states in WH, based on the full ATLAS detector simulation. 



1 Introduction 

The discovery and subsequent study of the Higgs boson is one of the main aims of the Large Hadron 
Collider (LHC) at CERN. The possible mass range of the Standard Model Higgs boson is bounded by the 
lower limit set at LEP of 1 14 GeV and reaches to about 1000 GeV [1]. The ATLAS experiment will use 
all possible channels to extract information on it, because comparing the rates in the different channels 
will allow information on the couplings to be extracted. 

The sensitivity of ATLAS to a Higgs boson produced in gluon fusion or via vector boson fusion 
and decaying to W quark pairs has been discussed elsewhere in this volume [2]. This note contains the 
results of studies of the Higgs boson in the same decay mode but produced in association with either top 
quarks, (ttH,H —> ffffW), or a W boson (WH,H —> WffW). The cross-sections for these processes 
are significantly lower than for inclusive Higgs production, and the additional activity makes them more 
complex to reconstruct, but the presence of extra signatures gives more possibilities for the reduction of 
the background. 

This note explores techniques to exploit these signatures, and the signal and background conditions 
are studied in both channels. A full simulation of the ATLAS experiment is employed to estimate these, 
which represents an improvement over the fast simulation used in previous studies of ttH [3] and WH 
[4,5]. The marginal production rates and numerous background sources, many with large cross-sections, 
make this analysis difficult, and both the background and signal need to be established in some detail. 
Nevertheless, if the background can be well estimated, then for integrated luminosities of several tens of 
fb _1 measurements should be possible. 

The backgrounds considered in detail here arise from the inclusive tt process, from tt produced in 
association with gauge bosons, and from gauge bosons produced inclusively or in pairs. Unfortunately, 
it has not been possible to model all the relevant backgrounds with a complete simulation at the statis- 
tical level required; this is true for example of inclusive QCD multijet events. Section 2 describes the 
considered signal and background processes. Sections 3 and 4 give the details of ttH and WH anal- 
ysis accordingly. Section 5 discusses the results, including the signal-to-background ratio that can be 
achieved in these two channels. 

2 Signal and background Monte Carlo samples 

Signal and background were produced with various generators, through a realistic ATLAS detector sim- 
ulation based on the GEANT 4 package [6]. 
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2.1 Signal generation 

Events with a Higgs boson decaying toaW pair produced in association with a tt pair or with a W boson 
can be searched for at hadron colliders by requiring the presence of lepton pairs (£ = e,jJ,). 

In particular, for the two-lepton final states, like-sign leptons are selected; this allows a strong reduc- 
tion of the large background produced by the Z or tt leptonic decays. In order to improve the efficiency of 
the Monte Carlo data sample production, generated events were filtered before their processing through 
the ATLAS detector simulation. 

For the WH channel, only events with three leptons in the final state were selected. These leptons 
had to pass loose Tj and pj cuts. 

Samples of ttH with at least two leptons were generated and filtered for different Higgs boson masses 
between 120 and 200 GeV using the PYTHIA 6.4 generator [7]. Results obtained with these samples 
were normalized to the Next-to-Leading Order (NLO) cross-sections and branching ratios reported in 
Ref. [1]. Only the = 170 GeV mass point was studied for the WH channel, where signal events were 
generated with the MC@NLO program [8]. 

Table 1 summarizes the most important characteristics of the signal samples used for this note. 



Table 1: Signal samples generated for the ttH and WH,H — » WW^ analyses. 



Process 


m H [GeV] 


<r, 0 ,(NLO) [fb] 




Generator 


axBRx E fUter [fb] 


iV(events ) 


ttH 


120, 130, 140 


669, 534, 431 


ttH -> AW (2L) 


PYTHIA 6.4 


3.60, 6.25, 8.51 


~40k 




150, 160, 170 


352, 291,243 






9.68, 10.49,9.31 


per m H 




180, 190, 200 


204, 174, 149 






7.62, 5.50, 4.42 




ttH 


120, 130, 140 


669, 534, 431 


ttH -» AW (3L) 


PYTHIA 6.4 


2.34, 4.05, 5.49 


~40k 




150, 160, 170 


352, 291,243 






6.31,6.91,6.15 


per m H 




180, 190, 200 


204, 174, 149 






5.00, 3.54, 2.86 




WH 


170 


511 


WH -> WWW (3L) 


MCONLO 


3.42 


80k 



2.2 Background samples for ttH, H -> W 

The main backgrounds for the ttH,H —> WW'*' final states are tt, ttW, ttZ, tttt and ttbb. Single top 
events have been neglected. Jets from QCD production and WZ production processes are also sources of 
background. However, lepton identification with isolation and a jet multiplicity requirement are expected 
to reject a large fraction of these. The background from QCD multijet production has not been properly 
estimated so far and it is hoped that the selection requirements reduce it to an acceptable level. 

A special MC@NLO sample is filtered for a pair of like-sign or more than two leptons with a 
Pj > 13 GeV and |tj | < 2.6 at the generator level. It results in a filter acceptance of 0.0384. In addition, 
when there are three or more generated leptons, events with oppositely charged leptons from W bosons 
falling into a special domain ( pj > 30 GeV and \\rj\ — 1.5| < 0.2 for electron, pj > 15 GeV and ||tj| — 
1.25 1 < 0.2 for muon ) were rejected. This results in a small bias, analysis dependent. 

The Wbb sample was produced by the ALPGEN generator with only leptonic W boson, a generator 
level filter led to an additional 0.02 acceptance, and a 2.57 K-factor [6] was also included. Leading 
order ttW + jets samples were produced with ALPGEN [9]. The minimum pj for the additional jets 
was 15 GeV, while the maximum |tj| was 6.0. The generated jets were also required to be separated 
by a distance AR = y'Arp + Aip larger than 0.4. MLM matching [9] was performed to avoid double 
counting of additional jets. 

Samples of ttZ, tttt, ttbb and ttbb(EW) were produced with the leading order generator ACERMC [10]. 
The ttZ events are normalized to the total cross-section recently calculated at NLO [11], while other AC- 
ERMC samples are normalized to LO. In the ttZ sample, the decay Z — > il was forced. The ttbb(EW) 
sample contains the electroweak contribution to the production of ttbb. For both ttbb samples, the final 
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states containing four Z>-jets, two light jets and a lepton (muon or electron) were generated. Table 2 
summarizes the characteristics of all background samples relevant for the ttH analysis. 

Table 2: The samples used to estimate the background contribution in the ttH,H — ► WW^ analysis. J£ 
denotes the effective integrated luminosity available from Monte Carlo statistics. 



Process 


Generator 


Cm [fb] 


axBRx e fu ter [fb] 


A'(events) 


ifffb" 1 ] 




MC@NLO 


833000 


450000 


440k 


0.98 


tt pre-filtered 


mc@nlo 


833000 


32000 


350k 


10.9 


ttbb(EW) 


ACERMC 3.3 


900 


244 


6.5k 


26.6 


ttbb 


ACERMC 3.3 


8200 


2244 


44k 


19.6 


Wbb 


ALPGEN 


2.1 x 10 5 


1387.8 


20k 


14.4 


ttW + Qjsts 


alpgen 


189 


25.3 


20k 


790 


ttW + 1 jets 


ALPGEN 


156 


20.7 


20k 


966 


ttW + > 2 jets 


ALPGEN 


237 


34.0 


18k 


529 


ttZ 


ACERMC 3.4 


1090 


87.0 


19k 


218 


88 -» tttt 


ACERMC 3.4 


2.2 


1.44 


21k 


14583 


qq -> tttt 


ACERMC 3.4 


0.48 


0.31 


7k 


22580 



2.3 Background samples for WH, H -> WW& 

The tt and Wbb samples as given in Table 2 are used also in this analysis. For the irreducible diboson 
WZ/ZZ backgrounds only the fully leptonic decays were considered; this was done with the MC@NLO 
generator. The ALPGEN tTW+0 jet sample described in Section 2.2 was analyzed to account for the 
ttW background and as it gives a negligible accepted cross-section the samples with additional jets were 
not considered. The huge W+jet background was generated with HERWIG [12] and was normalized to 
the NLO production cross-section [6] with a filter applied, requiring at least one electron (muon) with 
p T > 10 GeV and \r\ | < 2.7 ( p T > 5 GeV and |t]| < 2.8 ). 

An overview of all background samples used for the WH analysis is given in Table 3. 

Table 3: List of background samples for the WH analysis. Jzf denotes the effective integrated luminosity 



from Monte Car 


o statistics. 




Generator 


Otoi [fb] 


OXBRX Bfuter [fb] 


W(events) 


2 [fb- 1 ] 


// no all-hadronic 


MC@NLO 


833000 


450000 


440k 


0.98 


tt pre-filtered 


MC@NLO 


833000 


32000 


350k 


10.9 


WZ 


MC@NLO3.10 


47760 


750 


36k 


48 


ZZ 


MC@NLO3.10 


14750 


72.5 


50k 


690 


VK+jets 


HERWIG 


1.91 x 10* 


2.8 x 10 7 


60k 


0.0214 


Wbb 


ALPGEN 


2.1 x 10 5 


1387.8 


20k 


14.4 


ttW +0jet 


ALPGEN 


189 


25.3 


20k 


790 



3 Selection of the ttH, H — > WW^ two and three-lepton final states 

In this study, the high pj single lepton trigger is used for the ttH two-lepton (2L) events and three- 
lepton (3L) analyses, with a trigger efficiency larger than 96% for both channels at offline selection 
level. Cut-based analyses were performed, based on the standard ATLAS reconstruction of a medium 
quality electron [13], combined muons [14], and cone size of AR = ^/Ari 2 + A(j) 2 = 0.4 (cone-0.4) tower 
jets [15]. 

Signal data sets for nine Higgs boson masses in the range between 120 and 200 GeV were analysed. 
In the following, numbers will be given mainly for the most promising Higgs boson mass of 160 GeV 
including cut flow information for the 120 and 200 GeV mass points. 
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3.1 Event selection in ttH, H -> WW& 

The event selection is based on the analysis of final states with at least two reconstructed leptons and 
jets; from now on we refer to this analysis as "basic selection". Each lepton or jet is required to have 
transverse momentum pj > 15 GeV and to lie in the pseudorapidity region |tj| < 2.5. Finally, two-lepton 
(2L) events are required to have at least six reconstructed jets, while events with three leptons (3L) must 
have at least four jets. 

The 2L(3L) selection retains 36.1% (35%) of the Higgs boson events with = 160 GeV, while 
reducing the various backgrounds (see Tables 4 and 5). 

For both selections, further suppression of the main background sources can be done by isolation. 
The isolation criteria require that the transverse energy deposited in the calorimeter around the lepton 
in a cone size AT? = 0.2 be below 10 GeV (calorimeter isolation), the maximum pj of extra tracks 
reconstructed in the Inner Detector around the lepton track in a cone size AR = 0.2 be below 2 GeV 
(tracker isolation) and the angular separation A/?/ ep _ c /j between the lepton and the closest jet be greater 
than 0.2 for an electron or 0.25 for a muon (cone isolation). This is referred to as "standard isolation" 
and it allows the reduction of the tt background by more than a factor of 10 (170) in the 2L (3L) analysis. 
The ttZ/tiW backgrounds are suppressed by a factor 2 (ttZ, 2L) to 5 (ttW + 2jets, 3L). 

Further reduction of the tt background in the dilepton final state can be achieved by requiring exactly 
two like-sign isolated leptons. This requirement suppresses the large tt processes with two leptonic 
W-decays, as well as the contribution from the ttZ process. 




(a) Dilepton invariant mass distributions (b) px-distributions of isolated and non-isolated muons 



Figure 1 : (a) Dilepton invariant mass distributions in ttH (2L) and ttZ and (b) pi -distributions of muons 
passing the loose isolation criteria. The solid distribution shows electrons from W decays in the 160 GeV 
signal sample, the dotted distribution shows muons in tt, which could not be matched to a generator-level 
muon from a W- or T-decay. All distributions are normalized to unity. 

In both final states, ttZ can be suppressed further by an explicit Z-veto: events that contain a lepton 
pair of opposite charge and same flavour with an invariant mass between 75 GeV < mu < 100 GeV are 
rejected. This veto includes all leptons passing the selection criteria and loose px-cut, here set to 6 GeV. 
The dilepton invariant mass distributions in ttH and ttZ are shown in Figure 1(a). The Z-veto decreases 
the ffZ-contribution roughly by 75%, while 98% of the signal survive in the 2L analysis. In the 3L case, 
83% of the signal events pass the Z-veto, while 80% of the ttZ contribution is suppressed. 

At this stage of the 2L selection, 73% of the remaining tt events have at least one muon from a 
semi-leptonic heavy quark decay, while the fraction of events with electrons of this origin is only 20% 
(identification of electrons embedded in jets is more difficult than that of muons). 

Further rejection of these muons from tt events is achieved by requiring the reconstructed muon pj 
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to be larger than 20 GeV, as leptons from heavy quark decays tend to be softer than leptons from W 
decays (see Figure 1(b)). After this cut, the fraction of events with at least one muon from semileptonic 
decay versus the one with at least one electron from semileptonic decay is respectively 46% and 41% . 

The detailed cut flows, with the corresponding accepted cross-sections, are listed in Tables 4 and 5. 
From now on the "basic selection" is quoted with the filter efficiency allowed for. In the 2L analysis we 
have used the special MC@NLO« sample described in Section 2.2 and a small bias has been computed 
at various stages of the cut flow. The different values have been found to be compatible and a correction 
of 1.15 ±0.10 has been applied in the tt line in Table 4. In the 3L analysis the standard MC@NLO?? 
sample has been used, and therefore no correction has been applied. 

In both cases, the largest background contribution is expected from tt events. The accuracy of the MC 
prediction of the total background expectation is limited by the available statistics and by the intrinsic 
accuracy of the simulation tools. In the case of tt the basic cross-section error is large, but the ALPGEN 
predictions for higher additional jet multiplicities suffer from even larger uncertainties. There may also 
be background contributions from W bosons with multijets which have not been reliably estimated or 
QCD multijet production or other sources which it has not been possible to simulate. 

Table 4: Cut flow and expected cross-sections [fb] for the ttH (2L) analysis. The errors presented are 
statistical only. Some backgrounds, such as W+jets, bb and tt jj have not been included. 



Sample 


OTotal ■ 














4 


ttH(2L"""\ 120 GeV) 


3.9 


1.05 




0.65 


0.52 


0.52 




0.51 


0.45±0.01 


ttH(2L tru,h , 160 GeV) 


11.1 


4.01 


3.02 


2.57 


2.09 


2.09 




2.04 


1.85±0.03 


rf7/(2L""'\ 200GeV) 


4.7 


1.83 


1.43 


1.24 


1.05 


1.04 




1.02 


0.95±0.01 


ttbb (EW) 


259.0 


15.8 


4.1 


0.9 


0.3 


0.2 




0.2 


0.11 ±0.07 


ttbb 


2360. 


177. 


31.7 


6.3 




0.9 




0.9 


0.5±0.2 




833000. 


6170. 


1970. 


870. 


500. 


16.0 




16.0 


7.4±1.1 




2.68 


0.65 


0.33 


0.26 


0.20 


0.07 




0.07 


0.06±0.00 


ffW+Oj 


61.1 


1.17 


0.46 


0.30 


0.19 


0.10 




0.10 


0.09±0.01 


ttW+lj 


50.5 


2.09 


0.93 


0.66 


0.48 


0.23 




0.23 


0.21±0.02 


ttW+>2j 


76.9 


8.6 


4.9 


4.1 


3.3 


1.58 




1.54 


1.40±0.05 


ttz 


110. 


25.7 


20.5 


18.1 


13.7 


1.6 




1.2 


1.14±0.07 


Wbb 


66721. 


1.6 


0.14 














Total background 


10.3±1.1 



Table 5: Cut flow and expected cross-sections [fb] for the ttH (3L) analysis. The errors presented are 
statistical only; systematic uncertainties are also important. Some backgrounds, such as W+jets, bb and 
ttjj have not been included. 



Sample 


OTotal ■ BR 


Basic sel. 


Calo iso. 


Track iso. 


Cone iso. 


Z-vcto 


4 


ttH (3L"'""\ 120 GeV) 


2.5 


0.66 


0.46 


0.38 


0.29 


0.24 


0.20±0.00 


ttH (3L""">, 160 GeV) 


7.1 


2.53 


1.78 


1.47 


1.14 


0.95 


0.82±0.02 


ttH (3L tmh , 200 GeV) 


3.1 


1.16 


0.82 


0.70 


0.55 


0.43 


0.39±0.01 




833000. 


1600. 


230. 


50.0 


9.3 


7.2 


2.1 ±2.1 


ttW+Oj 


61.1 


0.78 


0.17 


0.08 


0.04 


0.03 


0.03±0.01 


ttW+l) 


50.5 


1.07 


0.28 


0.14 


0.08 


0.08 


0.06±0.01 


ttW+>2j 


76.9 


2.77 


0.85 


0.60 


0.50 


0.42 


0.38±0.03 


ttZ 


110. 


15.0 


8.6 


6.8 


5.3 


1.05 


0.86±0.06 


Total background 


3.4±2.1 



3.2 Projective likelihood estimator for electron isolation 

A projective likelihood estimator, called IsolationLikelihood [13], was developed in the course of the 
ttH,H — ► WW^ (2L) analysis. Alternative to the standard isolation, this tool is meant to combine the 
separation power of several isolation variables into a single, more powerful one. It uses the likelihood 
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ratio method to reject electrons from semi-leptonic heavy quark decays using a different set of isolation 
variables than those described in Ref. [13]: 

• The additional transverse energy deposited in a cone of size AR = 0.2 around the electron cluster. 

• The sum of the p\ of all additional tracks measured in a AR = 0.2 cone around the electron cluster. 

• The transverse impact parameter significance \Ip\/5(Ip) of the electron. 

In addition, the "cone isolation" cut is also used as in the standard isolation analysis. 

When tuned to give the same electron isolation efficiency obtained with standard isolation in the 
ttH analysis, the IsolationLikelihood allows a higher rejection of non isolated electron background by 
a factor 1.5 to 4, as shown in Figure 2. Using this projective likelihood estimator could suppress the 
tt background from 7.4 ± 1.1 pb to 5.7 ±1.0 pb, while keep the same signal and other backgrounds 
selection efficiencies. It shows a potential improvement of this analysis which could be adopted at a 
small increase in complexity, but it has not been used in this document. 
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Figure 2: Non-isolated electron rejections vs. signal efficiencies obtained by the IsolationLikelihood 
estimator for four different pi and rj intervals. The large points mark the working point of the standard 
isolation cuts for comparison and indicate the size of the error bars, which are not shown for the curves. 



4 WH analysis 

Only the three lepton final state, W(H — > WW^) — > 3 (Iv) is described below. The analysis of the larger 
cross-section dilepton final state, which has an important W + jet background is currently ongoing. 

The basic selection requires three leptons that satisfy the lepton identification criteria, i.e. medium 
electron [13] and standard muon [14]. The lepton px -thresholds were set to 35 GeV for the leading 
and 15 GeV for the other lepton. As seen from Fig. 3(a) the cut on the leading lepton reduces the 
backgrounds much more than the signal. The presence of these leptons also ensures that any signal is 
efficiently recorded by the ATLAS trigger system 

In addition to a 6 GeV calorimeter isolation and a 0.25 cone isolation as described in Section 3.1, 
p% ack I p l * pton < 0.05 were forced, where p'f ck is of the track with maximal p T in a cone of A/?=0.2 
(0.3) around the muon (electron). Furthermore, a cut on the lepton three-dimensional impact parameter 
Hd/gi < 2.5 was employed to reject leptons from bottom quarks. 
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In order to reduce the WZ background, a Z veto is applied by requiring that no opposite sign and 
same flavor lepton pair has an invariant mass between mass 65 and 105 GeV (Fig. 3(b)). In addition only 
events with £"™ ss > 30 GeV were kept. To further reduce the backgrounds, we ask the sum of the pi 
of all the jets (which were preselected above 20 GeV from cone-0.7 tower jets [15]) to be smaller than 
120 GeV, as seen in Figure 3(c). 

For additional rejection of it and ttW, events having at least one jet fulfilling a loose b-tag [16] 
are removed. In order to exploit the spin correlations in the H — > WW^ signal, the minimum angular 
separation ( AR ) between lepton pairs is required to be in the range of [ 0.1 ~ 1.5 ] (so called "H-S cut"). 




(b) lepton-lepton invariant mass (c) Jet pj-sum 

Figure 3: /^-distribution for the leading leptons in the WH{7>L) signal, it, WZ, ttW and W+jet- 
production(a), invariant mass of all the lepton pairs (b) and sum of the pi of all jets (c) for the WH 
(3L) signal and the relevant backgrounds. All these plots are done after loose cut. 



Table 6 summarises the cross-sections after the cut flow described above. The filtered MC@NLO/f 
sample described in Section. 2.2 has been used here. In order to take into account the bias introduced 
by this sample, a correction of 2.36 ±0.6 has been applied. The background rate from W bosons with 
multijets, QCD multijet production or other sources, which have not yet been possible to simulate, have 
their contribution still under study. The errors on the background are, at this stage, much larger than the 
size of the expected signal. 



5 Discussion 

5.1 Uncertainties in the analyses 

Several systematic uncertainties affect the results presented in this paper. There are theory uncertainties 
associated to the the choice of the Parton Distribution Functions (PDFs), to the choice of the renormal- 
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Table 6: WH (3L) cut flow and corresponding cross-sections. The errors presented are statistical only; 
systematic uncertainties are also important. Some backgrounds, such as WWW, single top and ttZ have 
not been i ncluded. 





Input [fb] 


Basic sel. 




Z-veto 


isr 


H-S 


(b-) jet veto 


WH (3L) 


5.04 


1.18 


0.62 


0.53 


0.47 


0.36 


0.31 ±0.02 


WZ 


750. 


165.5 


1.41 


0.74 


0.63 


0.21 






833000. 


3564.3 


6.45 


6.11 


5.10 


1.02 


0.34+0- 70 


zz 


72.5 


34.5 


0.13 


0.06 


0.013 


0.008 


0.005 ±0.001 


ttW 


61.1 


1.35 


0.22 


0.21 


0.19 


0.07 


n nm+ 0005 
u.uuj_ 0003 


Wbb 


66721. 


3.1 












W — > ev+jets 


2.05 TO 7 


17.6 












W — > ^V+jets 


2.05 10 7 


27.6 












Total background 


0.45 ±0.70 



ization and factorization scales, to the description of the initial and final state radiation and to the model 
used to simulate the heavy quark fragmentation. In order to evaluate the size of these uncertainties, the 
theory parameters above mentioned have been varied within intervals corresponding to sensible choices. 
Concerning the PDFs, the MRST2000-LO set was used at the place of the CTEQ6L1. 

For the ttH analysis, the theory uncertainties have been found to induce a 9% change of the signal 
cross-section, dominated by the PDF choice. The impact to the tt process, which is the most important 
source of background to this signal, has been found to be 12% in Ref. [6]. An additional 5%, found 
in study of the signal process, associated to the uncertainty of the initial and final state radiation, has 
been included in quadrature, giving an overall 13% uncertainty on the total cross-section. However, the 
background sample is dominated by tt with extra jets, and the uncertainty on this rate is of order a factor 
two. For WH, the PDF uncertainty was found to be less than 5%, and energy scale uncertainty even 
smaller [17]. Including these effects and others (ISR,FSR) we get a total theoretical uncertainty of 9%. 

The effect of experimental systematic uncertainties has been also investigated. The main sources of 
these uncertainties are represented by the knowledge of the integrated luminosity, the energy scale and 
the energy resolution of electrons, muons and jets, as well as the tag efficiency of &-jets and the rejection 
of light quarks. The level of these uncertainties and the impact on the overall event selection is presented 
in Tables 7 and 8. Pile-up events will decrease the detector performance and the impact needs to be 
properly addressed in future studies. However, the relatively low jet transverse momentum threshold of 
15 GeV in the ttH analyses may be sensitive to this. The overall systematic uncertainty expected in the 
ttH analysis is 10% (10%) for the 2L (3L) signal and 15% (18%) for those backgrounds which have 
been quantified. In the case of the WH analysis the overall systematic uncertainty is about 10% for the 
signal, and about 20% for those background systematics which have been estimated. In each case the 
total background uncertainty is much larger than this at present. 

5.2 Conclusion 

The ttH,H — ► WWW and WH,H — ► WffW processes have been studied using two- and three-lepton final 
states. The signal and main backgrounds have been estimated using a full GEANT based simulation of 
the detector. The estimated accepted cross-sections in fb of signal and background for these processes 
are 1.9:10 (ttH 2L), 0.8:3.4 (ttH 3L) and 0.3:0.4 (WH 3L) respectively. The signal is small and clear 
distinguishing features such as resonance peaks have not been established. The backgrounds are larger 
and their uncertainties have not been fully controlled. The analysis is therefore very challenging. 

Accurate estimations of the background level using large simulation samples (made with more effi- 
cient simulation packages) as well as direct measurements using control samples from real LHC data are 
essential if a good signal significance is to be reached. For example the production of W bosons with 
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Table 7: Overview of the experimental systematic uncertainties on the signal and background predictions 



d to the ttH channel in those c 


lannels studied. All numbers ai 


e in %. 


Source of the uncertainty 


ttH (2L) 


ttH (3L) 




A signal (%) 




A signal (%) 




Luminosity 


3 


3 


3 


3 


3 


Electron ID efficiency 


0.2 


0.2 


0.2 


0.3 


0.3 


Muon ID efficiency 


1 


1.0 


1.0 


1.5 


1.5 


Electron Ej scale 


0.5 


0.1 


0.1 


0.2 


0.3 


Muon Ej scale 


1 


0.5 


0.2 


0.7 


1.0 


Electron Ej resolution 




0.1 


0.1 


0.1 


0.2 


Muon p T resolution 




0.6 


2.2 


0.3 


0.9 


Jet energy scale 


7 


1.2 


4.9 


2.7 


10 


Jet energy resolution 




1.0 


1.4 


1.9 


5.7 


Electron isolation efficiency 


1 


1 


1 


1.5 


1.5 


Muon isolation efficiency 


1 


1 


1 


1.5 


1.5 


hi , 


±3.9 


±6.6 


±5.2 


±12.3 



Table 8: Overview of the experimental uncertainties on the signal and background predictions related to 
the WH channels in those channels studied. All numbers are in %. 



Source of the uncertainty | 


WH 3L selection 




A WH (3L) (%) 


A WZ (%) 


Att(%) 


A ZZ (%) 


A ttW (%) 


Luminosity 3 


3 


3 


3 


3 


3 


Electron ID efficiency 0.2 


0.3 


0.3 


0.2 


0.9 


1.1 


Muon ID efficiency 1 


1.5 


1.7 


1.9 


1.0 


1.7 


Electron energy scale 0.5 


0.06 


0.06 


0.2 


0.02 


0.07 


Muon energy scale 1 


0.2 


0.1 


1.0 


0.08 


0.7 


Muon p T resolution 


0.1 


0.03 


0.2 


0.02 


0.4 


Jet energy scale 7 


2.5 


2.6 


17.4 


2.3 


13.6 


Jet energy resolution 


0.005 


0.03 


1.9 


0.5 


0.7 


/>tag eff. / light jet rej. 5/32 


1.0 


1.0 


2.7 


0.8 


3.2 



| Experimental uncertainty || ±4.3 | ±14.5 



large numbers of jets need to be measured, as does the fake contribution from Z?-jets. These two channels 
should then contribute to the measurement of the Standard Model Higgs boson properties, in particular 
the couplings of this boson to top and to the W. 
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Abstract 

This note describes a study of the discovery potential for the supersymmetric 
Higgs bosons h/H /Am proton-proton collisions at a center-of-mass energy of 
14 TeV in final states with x lepton pairs with the ATLAS detector at the LHC. 
The Higgs bosons are produced in association with b quarks and decay into a 
XX final state where both x leptons decay leptonically. The signature of Higgs 
bosons with masses between 1 10 and 450 GeV is analyzed and the discovery 
potential is assessed. The analysis is based on an integrated luminosity of 
30ft- 1 . 

All results are obtained using full simulation of the ATLAS detector. No pile- 
up or cavern background has been considered in this analysis. In addition a 
procedure for estimating the shape and the normalization of the irreducible 
Z — > T + T~ background from data is investigated. The discovery potential as a 
function of and tan/3 is shown for the m™ ax MSSM benchmark scenario. 

1 Introduction 

In the Minimal Supersymmetric Standard Model (MSSM), the minimal extension of the Standard Model, 
two Higgs doublets are required, resulting in five observable Higgs bosons. Three of them are electrically 
neutral (h, H, and A) while two of them are charged At tree level their properties like masses, 

widths and branching fractions can be predicted in terms of only two parameters, typically chosen to be 
the mass of the CP-odd Higgs boson, iua, and the tangent of the ratio of the vacuum expectation values 
of the two Higgs doublets, tan /3. 

In the MSSM the couplings of the Higgs bosons to fermions and bosons are different from those 
in the Standard Model resulting in different production cross-sections and decay rates. While decays 
into ZZ or WW are dominant in the Standard Model for Higgs boson masses above m# ~ 160 GeV, 
in the MSSM these decay modes are either suppressed like cos(/3 — a) in the case of the H (where a 
is the mixing angle of the two CP-even Higgs bosons) or even absent in case of the A. Instead, the 
coupling of the Higgs bosons to third generation fermions is strongly enhanced for large regions of the 
parameter space. The decay of the neutral Higgs bosons into a pair of x leptons therefore constitutes 
an important discovery channel at the LHC. The production of the Higgs bosons can proceed via two 
different processes: gluon-fusion or production in association with b quarks. 

In this note, the discovery potential of neutral MSSM Higgs bosons, produced via associated produc- 
tion with b quarks and decaying into a pair of x leptons in ATLAS at the LHC is discussed. Only x lepton 
decays into electrons and muons are considered here. Higgs bosons in the mass range between 110 and 
450 GeV are analyzed for an integrated luminosity of 30ft _1 . Both the shape and the normalization of 
the Z — > xx background which is dominant for low Higgs boson masses are estimated from Z — > jijx and 
Z — ► ee events in data. The results are interpreted in the m™ 3 * scenario as a function of the two parameter 
of the model, and tan/3 [1]. Studies concerning the semileptonic and the fully hadronic final state are 
not included in this note. These studies are ongoing and will be published separately. 

This note is organized as follows. In Section 2 the signal and background processes are introduced 
and their cross-sections are discussed. In Section 3 the analysis is discussed. After a description of 
the selection, a procedure to estimate the shape and the normalization of the irreducible Z — > x + x~ 
background from data is detailed before the discovery potential in the — tan /3 plane is assessed. In 
Section 4 the results are summarized. 
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2 Signal and background processes 
2.1 Higgs boson production 

The production mechanism of Higgs bosons in the MSSM is discussed in the introductory section of this 
chapter. The Higgs boson masses, their production cross-section and their branching fraction into a pair 
of T leptons are summarized in Table 1. 

Table 1: Masses, cross-sections for ^-associated production, and branching fractions into the T + T~ final 
state for Higgs bosons in the m™ ax scenario and for tan /? = 20. 



Mass / GeV e™ s ^f ed /fb @(h/H/A t+t-)/% 

A H h AH h A H h 



110 


129.8 


109.0 


314810 


7579 


310707 


8.9 


9.1 


8.9 


130 


134.2 


124.7 


189602 


92897 


99992 


9.1 


9.2 


9.0 


160 


160.8 


128.0 


97480 


93102 


6650 


9.4 


9.5 


8.4 


200 


200.5 


128.4 


45685 


45095 


2188 


9.6 


9.7 


7.5 


300 


300.4 


128.6 


10312 


10253 


979 


8.2 


9.5 


6.3 


450 


449.8 


128.6 


2019 


2035 


723 


6.1 


6.2 


5.7 



The theoretical uncertainty on the inclusive production cross-section, i.e. without imposing any re- 
quirements on the pt of the b jets at generator level, is estimated taking into account contributions from 
the scale uncertainty and from the uncertainty on the parton distribution functions. The scale uncertainty 
is obtained from Ref. [2] as a function of the mass of the Higgs boson. The contribution from the Parton 
Density Functions (PDFs) is estimated by exchanging MRST2002 for MRST2004 parton distribution 
functions. Since the cross-sections obtained with MRST2004 are smaller than that with MRST2002 they 
are considered conservative. Therefore, half of the difference observed with this variation is taken as a 
systematic uncertainty. 

The total uncertainty on the cross-section, obtained by adding the PDF and scale uncertainties in 
quadrature, is displayed in Fig. 1 as a function of the Higgs boson mass m A - For Higgs boson masses as 
low as m A = 100 GeV the total theoretical uncertainty is of the order of 20%. This uncertainty decreases 
to values below 10% for m A = 400 GeV. For low Higgs boson masses the contribution from the scale 
uncertainty dominates over that from the parton distribution functions while for high Higgs boson masses 
the situation is reversed. 

2.2 Background processes 

The following background processes are relevant and have been considered in this analysis (for details 
see Ref. [3]). 

• Z — >• it The Drell-Yan production of Z bosons and their subsequent decay into a pair of leptons 
constitutes an important source of background. The production cross-section has been calculated 
to NNLO accuracy and was found to be Oz^u = (2015 ± 60)pb 1 . 

Events with the Z boson decaying to a pair of T leptons constitute an irreducible background. In 
particular for low Higgs boson masses, due to the limited invariant mass resolution in the T + T~ 
final state this background is problematic and needs therefore to be estimated directly from data. 

• tt production: The cross-section for this process has been calculated at NLO+NLL accuracy and 
was found to be o ft = (833 ± 100) pb. This background is dominant for Higgs boson masses 
beyond m A = 200 GeV. 

'The cross-section is given for a cut on the invariant mass of the lepton pair of mu > 60 GeV. 
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Figure 1 : Systematic uncertainty on the signal cross-section for associated Higgs boson production as 
a function of uia- The dashed line corresponds to the contribution from the scale uncertainty, the dash- 
dotted line to that introduced by the uncertainty on the partem distribution functions, and the solid line is 
the sum of both contributions added in quadrature. 

• W+jets production: The production cross-section for this process has been calculated at NNLO 
accuracy and was found to be c%+jets = 20510pb. This dataset was complemented by a Wbb 
sample whose cross-section has been calculated to NLO accuracy (a wfo g = 176. 9pb). 

2.3 Event generation 

The Monte Carlo samples have been generated using the Sherpa [4], Pythia [5], Herwig [6], Alp- 
GEN [7], and MC@NLO [8] Monte Carlo generators. Except for Sherpa, all external matrix element 
generators are interfaced to Herwig to produce the partem shower. The x leptons are decayed using ei- 
ther Sherpa or Tauola [9]. Initial and final state radiation of photons is simulated using Photos [10]. 
Event filters have been applied for all processes in order to increase the event generation efficiency. 
Details on Monte Carlo simulation are given in Ref. [3]. 

3 Analysis of the exclusive lepton lepton final state 

The experimental signature consists of two leptons from the x decays and missing transverse energy, 
due to the neutrinos from the x decays. At least one jet tagged as coming from a b quark is required in 
the event and therefore the b quark associated production is dominant here. 

3.1 Preselection 

The preselection cuts are grouped into 'Trigger Selection', '^-Tagging', 'Lepton Selection', and cuts 
related to the reconstruction of the invariant mass of the Higgs boson. 

Trigger selection: An isolated muon (electron) with transverse momentum of at least 20 GeV (25 GeV) 
or two isolated electrons with pj > 15 GeV, or one electron with pj > 15 GeV and a muon with 
Pt > 10 GeV are required. 

b-Tagging: Since the Higgs boson is produced in association with b quarks, at least one jet has to be 
identified as coming from a b quark in order to suppress backgrounds from processes involving light 
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quarks. Jets are reconstructed using a cone algorithm with radius AR = OA 2 , and a ^-tagging weight of 
> 3 is required in order for the jet to be labeled as coming from a b quark [11]. 



Lepton selection: Electrons are required to have pj > lOGeV, be within \t]\ < 2.5 and pass the 
medium electron selection [12]. No isolation criteria are applied. Muons are reconstructed using the 
algorithm described in Ref. [13]. A minimum pj of 10 GeV and |tj| < 2.5 are required. An isolation 
cone of opening angle AR = 0.2 around the muon track is used with a maximum Ej of 6 GeV deposited 
in the calorimeters. Two leptons of opposite charge are required if the event is to be considered for further 
analysis. In case more than two leptons fulfill the above requirements the lepton pair with the highest 
scalar sum of pj is selected. 



Collinear approximation: The invariant mass of the Higgs boson candidate is reconstructed using the 
collinear approximation [14]. In this approximation the masses of the particles involved in the decay of 
the X lepton are small compared to their momenta, so that the direction of the X lepton can be approxi- 
mated by the direction of its observed visible decay products. The method assumes furthermore that the 
missing energy observed in the event is entirely due to neutrinos from the x lepton decays. In addition 
the Higgs boson is required to have some amount of transverse momentum. If that is not the case the two 
X leptons from the Higgs boson decay are back-to-back. An accurate reconstruction of the transverse 
momenta of the x leptons is not possible in that case and the resolution of the invariant mass of the x pair 
will be poor. Neglecting the masses of all leptons, the invariant mass of the Higgs boson candidate can 
be reconstructed via 




The quantity x,- = Ptv-Jptt, * s tne fraction of the x lepton momentum carried by its visible decay products. 
They are calculated from $ T in the event and the transverse momentum of the visible leptons. For 
this calculation # T is decomposed into two components, each of them pointing along the direction of 
the charged decay products of the x lepton. This fraction is required to be within physical bounds 
(0 < Xi < 1). In order for the solution to be numerically stable a cut on the angle between the visible 
decay products of the x lepton of Atyu < 3 is imposed. This also improves the invariant m TT resolution. 

The accepted cross-section for the above preselection is detailed in Table 2 for the signal and the 
dominant background contributions. 



Table 2: Cross-section in fb passing the preselection criteria as described in the text. The numbers for 
the signal samples are given assuming tan j3 = 20. 



Process 


Trigger 


Lepton Selection 


1 or 2 jets 


> 0 b-tags 


Coll. Approx 


m A = 110 GeV 


1837.6 


1154.8 


628.6 


175.7 


118.4 


m A = 130 GeV 


1511.8 


971.6 


544.6 


172.1 


115.7 


m A = 160 GeV 


987.4 


656.4 


374.9 


119.1 


80.8 


m A = 200 GeV 


497.9 


340 


199.3 


63.9 


44.9 


m A = 300 GeV 


139.4 


98.8 


60.2 


20.4 


13.9 


m A = 450 GeV 


25.3 


18.3 


11.2 


3.7 


2.4 




255114 


48045.9 


7804.8 


5479 


1096.2 


Z — > TT 


47026.8 


27654.4 


14053.2 


665.1 


440.6 


Z^ee 


1.4E6 


797747 


421393 


16197.8 


2848 .4 3 




1.3 E6 


704275 


345491 


16811.5 


3223 .4 3 


W+Jets 


17.2 E6 


91042.8 


44612.4 


1537.5 


122.4 3 



2 The radius AR of the cone is defined as AR = ^/A<j> 2 + Ari 2 
3 Results were obtained using cut factorization. 
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3.2 Event selection 




Figure 2: Invariant m T + T - distribution for a Higgs boson of mass niA = 130 GeV (left) after preselection 
cuts. The width has been determined by a fit of a single Gaussian to the peak region of the distribution. 
The right-hand plot shows the m T + T - distribution as a function of the pj of the Higgs boson. 

Cuts on kinematic variables have been optimized in an iterative procedure in order to maximize the 
statistical significance S/y/B for a potential Higgs boson signal. Since the composition of the background 
depends on the signal mass hypothesis, this has been done separately for each mass point. In addition, 
the optimization has been done separately for the ee, }X}X, and for the mixed e}X final states. 

The following variables are considered: To suppress background from tt production only events 
with less than three jets are selected. The invariant dilepton mass ntu has to be well below mz in order 
to suppress Z —> ee and Z — > jiji. events. Since there are neutrinos from the x decays in the event, 
missing ttansverse energy is required in the event. The transverse momentum of the jet tagged as coming 
from a b quark has to be above a certain value which depends on the Higgs boson mass under study. 
Requirements on the maximum pj of the leading lepton as well as on that of the lepton-lepton system 
PT,ee are imposed. In addition, the angle A(j) between the two leptons is restricted. The cut values and 
the accepted cross-section for a Higgs boson mass of = 130 GeV are given in Table 3. 

The resolution of the m T + T - distribution using the collinear approximation for a Higgs boson of mass 
Ma = 130 GeV is illustrated in Fig. 2 (left) after applying preselection cuts. The width as extracted from a 
fit of a single Gaussian to the peak region is a = 25 GeV compared to the natural width of a Higgs boson 
of that mass between less than 100 MeV up to a few GeV depending on tan/3. The ra T + T - distribution 
versus the pj of the Higgs boson is illustrated on the right-hand side of Fig. 2, showing that the invariant 
mass resolution improves with pj of the Higgs boson. 

3.3 Selection results 

The accepted cross-sections after all cuts for different Higgs boson masses and the various backgrounds 
are summarized in Table 4 for tan j8 = 20. The reconstructed XX invariant mass distributions are dis- 
played in Fig. 3 for all masses considered. The vertical solid lines mark the mass window defined as 
m— 1.65(7 < m- - <m + 2a where a denotes the invariant ///-• - resolution for a given mass hypoth- 
esis as determined from Monte Carlo simulations. All candidate events falling inside this mass window 
are used to calculate the significances for a Higgs boson signal. 

In the low mass range, the invariant mass resolution is of the order of 25 GeV. A potential signal for 
a Higgs boson is nearly indistinguishable from the irreducible Z — ► T + T~ background which dominates 
in this mass range over the contribution from tt processes, which contribute at the (10 — 20)% level. This 
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makes it necessary to estimate the shape and the normalization of the Z — ► T + T background directly 
from data. A procedure has been developed and will be discussed in detail in Section 3.5. 



Table 3: Accepted cross-section in fb for optimized cuts for = 130 GeV and tan/3 = 20. The values 
of the cuts applied are stated for the ee/jiji (upper row) and e\i (lower row) subchannels. In case only 
one number is given it applies to all leptonic subchannels. 



Variable 


Selection 


H — > TT 


tt 


Z^TT 


Z — > ee 


Z^MM 


W+Jets 


Precuts 




115.7±5.1 


1096±35 


441±16 


3223±123 


2848±108 


122±40 


p T b-jet 


(15 -66) GeV 


90.0±4.5 


443±22 


337±14 


2756±113 


2481±101 


91±35 


m u 


(27-70) GeV 
(0-70) GeV 


72.6±4.1 


138±12 


326±14 


134±25 


92±19 


60±28 


xi -x 2 


(0.04-0.4) 
(0.0-0.5) 


64.1±3.8 


108±11 


251±12 


47±15 


36±12 


40±23 




(20 - oo) GeV 
(15-°o)GeV 


52.2±3.5 


102±11 


171±10 


4.3±4.5 


5.1±4.6 


33±21 




(O-oo)GeV 
(0 - 70) GeV 


47.7±3.3 


57.9±7.9 


159.4±9.8 


4.3±4.5 


4.6±4.3 


28±19 


Pt,u 


(0-45) GeV 
(0-60) GeV 


46.5±3.3 


38.2±6.5 


155.8±9.6 


3.9±4.3 


2.5±3.2 


23±17 


A<i> u 


(2.24-3) 
(2-3) 


43.3±3.1 


32.8±6.0 


107.5±8.0 


3.6±4.1 


3.8±4.0 


21±17 


Pt leading I 


(10-80) GeV 


43.3±3.1 


32.8±6.0 


107.5±8.0 


3.6±4.1 


3.8±4.0 


21±17 


Mass Window 


(111 -198) GeV 


28.4±2.6 


19.7±4.6 


22.1±3.6 


1.8±2.9 


2.0±2.8 


12±12 



Table 4: Accepted cross-section for all Higgs boson mass hypotheses analyzed. The cross-section in fb 
for signal and background after all selection cuts is given (except for the cut on the mass window) for 
tan/3 = 20. 







H — ► T + T~ 




Z — > T + T~ 


Z^e + e- 




W+jets 


m A 


= 110 GeV 


34.4 ± 2.9 


24.0 ±5.1 


62.1 ±6.1 


1.9 ±3.0 


2.7 ±3.3 


8 ± 11 


m A 


= 130 GeV 


28.4 ±2.6 


19.7 ±4.6 


22.1 ±3.6 


1.8 ±2.9 


2.0 ± 2.8 


12 ± 12 


m A 


= 160 GeV 


18.7 ± 1.2 


39.3 ±6.6 


8.4 ±2.2 


1.4 ±2.5 


2.0 ± 2.9 


1.8 ±4.9 


m A 


= 200 GeV 


10.9 ±0.6 


28.4 ±5.6 


5.4 ± 1.8 


2.0 ±3.1 


2.1 ±2.9 


3.7 ±7.0 


m A 


= 300 GeV 


2.7 ±0.1 


32.8 ±6.0 


3.0 ± 1.3 


0.4 ± 1.4 


1.7 ±2.6 


5.8 ±8.8 


m A 


= 450 GeV 


0.50 ± 0.03 


50.2 ±7.4 


1.8 ± 1.0 


0.4 ± 1.4 


0.3 ± 1.1 


4.1 ±7.3 



In the medium mass range, the contributions from Z — > T + T~ events and from tt processes become 
equally important. The mass resolution for signal events is now of the order of (30 — 40) GeV leading to 
a broad structure which is indistinguishable from that of background events. 

In the high mass range {iua = 300 to 450 GeV), the cross-section for the signal process decreases 
rapidly. The invariant mass resolution for signal events is between (50 — 80) GeV so that a discovery 
with an integrated luminosity of Jz? = 30 fb -1 in this channel will not be possible. 

3.4 Systematic uncertainties 

In order to assess the impact of systematic uncertainties, each of these has been applied in turn and 
the impact on the result of the analysis is evaluated. The uncertainties assumed on the energy and 
momentum resolution of muons, electrons, photons, and jets are conservative estimates assuming non- 
optimal performance of the corresponding algorithms at the beginning of data taking. 

1. For muons the uncertainty on the reconstructed pj is o{\/pj) = (0.01 l/pr (g>0.00017) with pj 



1379 



Higgs - Discovery Potential of h/A/H -> t+t~ -> 




0 50 100 150 200 250 300 0 50 100 150 200 250 300 

nv [GeV] m,, [GeV] 




nv [GeV] [ QeV l 



Figure 3: Invariant m T + T - distribution for signal and background events. The distributions are shown 
after all selection cuts with the nominal masses and tan j8 values as indicated in the plots. The vertical 
lines indicate the mass window used for calculating the signal significance. 
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Table 5: Effects of systematic uncertainties as described in the text for different Higgs boson masses 
for tan /3 = 20. The effects of the systematic uncertainties considered are listed in percent. The total 
uncertainty is obtained by adding the individual contributions in quadrature. 





m A 


= HOGeV 


m A 


= 130 GeV 


m A 


= 160 GeV 


Uncertainty / % 


Signal 


ff Bkg 


W+jets 


Signal 


ffBkg 


W+jets 


Signal 


ffBkg 


W+jets 


^-tagging efficiency 


2.8 


3.9 


0.8 


4.0 


4.1 


0.7 


6.6 


3.3 


0.5 


Jet energy scale 


0.3 


6.0 


0.9 


<0.1 


5.3 


1.6 


1.1 


4.0 


1.5 


Jet resolution 


8.3 


0.8 


0.3 


<0.1 


0.2 


1.2 


6.1 


0.4 


1.2 


Electron energy scale 


0.7 


0.5 


1.0 


0.4 


0.1 


1.0 


0.4 


0.5 


0.9 


Electron resolution 


0.7 


0.6 


1.0 


1.6 


0.2 


1.0 


0.4 


0.2 


0.4 


Muon energy scale 


0.7 


0.9 


0.7 


1.2 


0.6 


0.7 


0.4 


0.4 


0.7 


Muon resolution 


1.4 


0.6 


2.5 


0.8 


1.1 


2.4 


1.7 


0.4 


2.4 


Electron efficiency 


<0.1 


0.5 


0.4 


<0.1 


0.5 


0.4 


<0.1 


0.4 


0.4 


Muon efficiency 


<0.1 


0.8 


0.7 


0.8 


0.2 


0.7 


<0.1 


0.9 


0.7 


Light jet rejection 


<0.1 


<0.1 


3.4 


0.4 


<0.1 


3.4 


0.7 


<0.1 


3.4 


Total exp. uncertainty 


9 


7.4 


4.7 


4.6 


6.8 


4.9 


9.2 


5.3 


4.8 






m A 


= 200 GeV 


m A 


= 300 GeV 


m A 


= 450 GeV 


Uncertainty / % 


Signal 


ff Bkg 


W+jets 


Signal 


ffBkg 


W+jets 


Signal 


ffBkg 


W+jets 


^-tagging efficiency 


4.8 


3.2 


0.5 


4.2 


3.1 


0.3 


4.8 


3.8 


0.1 


Jet energy scale 


0.6 


3.6 


1.5 


0.9 


3.1 


0.6 


0.4 


2.1 


0.7 


Jet resolution 


8.0 


1.6 


2.7 


0.8 


0.4 


2.7 


0.7 


0.5 


2.4 


Electron energy scale 


0.5 


0.5 


0.9 


<0.1 


0.2 


0.8 


0.4 


0.2 


1.2 


Electron resolution 


0.3 


0.1 


0.4 


<0.1 


<0.1 


0.3 


<0.1 


0.4 


0.3 


Muon energy scale 


0.5 


0.4 


0.7 


0.5 


0.4 


0.7 


0.9 


0.3 


0.9 


Muon resolution 


0.3 


0.1 


2.4 


0.8 


0.3 


2.3 


0.4 


0.7 


2.2 


Electron efficiency 


0.3 


0.3 


0.4 


0.8 


<0.1 


0.4 


0.4 


0.1 


0.4 


Muon efficiency 


0.3 


1.0 


0.7 


0.8 


1.3 


0.7 


1.1 


1.0 


0.7 


Light jet rejection 


0.8 


<0.1 


3.4 


0.4 


<0.1 


3.4 


0.4 


<0.1 


3.4 


Total exp. uncertainty 


9.4 


5.3 


5.4 


4.6 


4.7 


5.1 


5.1 


4.6 


5.1 
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given in GeV. The uncertainty on the energy scale is estimated to be ±1%, and that on the recon- 
struction efficiency is assumed to be 1% and flat in pj. 

2. For electrons and photons the uncertainty on the reconstructed Ej is o{Ej) = 0.0073 -Ej. The 
uncertainty on the energy scale is estimated to be ±0.5%, and that on the reconstruction efficiency 
is assumed to be 0.2% and being flat in Ej. 

3. For jets with |tj| < 3.2 (|tj| > 3.2) the uncertainty on the jet energy scale is taken to be ±3% 
(±10%) and the jet energy resolution is assumed to be 45%y/E (63%\/E). 

4. For the ^-tagging a degradation of the tagging efficiencies of 5% is taken as systematic uncertainty. 
For the Z±light jets background an uncertainty on the rejection rate of ±10% is assumed. 

A detailed description of the sources of systematic uncertainties can be found in [3]. The impact of 
the systematic uncertainties on the number of signal and tt background events 4 inside the mass window 
is summarized in Table 5 for Higgs boson masses between jua = 110 and 450 GeV. A sample of 48M tt 
events from fast simulation has been used for these studies. The Z — > TT background in this analysis is 
estimated from sidebands in data as described in the next Section. The number of events from Z — > jiji 
and Z — ► ee processes after all selection cuts is small compared to that from tt. Their contribution to the 
total systematic uncertainty is small compared to that from tt processes. 

3.5 Estimation of Z — > t + t shape and normalization from data 




0 20 40 60 80 100 120 140 160 180 200 0 20 40 60 80 100 120 140 160 180 200 

Dimuon mass [GeV] p T ,miss,oaio [GeV] 



Figure 4: Distribution of the invariant dilepton mass ma (left). The shaded area illustrates the distribution 
from Z — ► jXji. events, and the solid line that from signal events. The contribution from Z — ► t + t~ and 
tt events is illustrated by the dashed and dotted lines, respectively. The cut on the invariant tl mass 
is indicated by the solid vertical lines. The distribution of p™ lss in the calorimeter for Z — > jijx and 
Z — > TT — > jXfl+X events (right). All plots shown are after preselection cuts. 

As discussed above the estimation of the shape and the normalization of the irreducible Z — > t + t~ 
events from data is of great importance in particular for low Higgs boson masses where this background 
is dominant. Procedures to estimate both the shape and the normalization of Z — > T + T~ events from data 
are needed. This procedure is based on Z —>■ fXjX and Z — > ee events selected from a sideband region 
which is free of Higgs boson events, in contrast to the signal region. The method proceeds in three steps: 

4 The impact on the number of Z — > t+t~ events is not listed here since this background is estimated from data. 
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1. As a first step, a pure sample of Z — > jUjll or Z — >• ee events is selected from a sideband region as 
described below. 

2. The shape of the m xt spectrum is estimated by adapting the four-momenta of the muons such that 
they appear like coming from Z — ► TT events. 

3. The normalization of the Z — > TT background is estimated from a double-ratio comparing the 
number events found in data and in Monte Carlo in the sideband region as well as in the signal 
region. 

This procedure is described in detail below. 

3.5.1 Definition of signal and sideband regions 

Events of the type Z — > ee and Z —> fiji are selected from a sideband region (called region B) with very 
high purity and with an event topology similar to that from Z —>■ T + T~ events in the signal region (called 
region A). The following cuts are applied in order to select Z —>■ ee and Z —> jiji events in region B: 

• The invariant mass of the lepton-lepton system is required to be within 75 GeV < mu < 100 GeV, 
where tt is either a ee or a jiji final state. 

• At least one jet identified as coming from a b quark has to be found in the event. 

• The number of jets allowed in the event has to be less than three. 

The main cut defining region B is that on the invariant lepton-lepton mass. A distribution illustrating 
the cut on mu is displayed in Fig. 4 (left) showing the large amount of Z —>■ jiji events selected by 
the cuts above that only have a small contamination of events coming from tt processes and events 
containing Higgs boson decays. The number of events from Z — ► ee and Z — > jijx is around 500000 for 
an integrated luminosity of 30fb _1 and thus much larger than the number of Z — > TT events expected in 
the signal region. The purity of the Z —>■ jiji (Z — »■ ee) control sample in region B after all cuts mentioned 
above is 99.1% (97.9%) with a contribution of 0.04% (0.05%) of events from the signal process 5 with 
the remainder coming from tt processes. 

3.5.2 Estimation of the Z — > t+t~ shape from data 

The method of estimating the shape of Z — ► T + T~ events from data is based on the assumption that in 
the calorimeter this type of events is indistinguishable from that of type Z — > jijx. This method has been 
proven to work in a vector boson fusion H — > T + T~ analysis [15]. 

The muons are minimum ionizing particles and their energy deposit in the calorimeter only weakly 
depends on their momentum. Therefore, the missing energy signatures of both types of events in this 
detector component are very similar as illustrated on the right-hand side of Fig. 4. Altering the energy 
of muons in Z —>■ jiji events so that they correspond to those from Z — > T + T~ — > jUjll +4v events leads 
to identical distributions of Pt,h, PtMss an d m x + z - for both classes of events. The following procedure 
is adopted: 

• Three dimensional reference histograms from Z — > T + T~ — ► + 4v events from Monte Carlo in 
region A are created. The following variables calculated in the Z rest frame are used: 

5 A Higgs boson mass of m A = 130 GeV and tan /3 = 20 is assumed here. The contamination of Higgs boson events is even 
smaller for the other signal masses considered in this analysis. 
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1. The absolute value of the Gottfried- Jackson angle £ between the Z boson and the negatively 
charged muon. 

2. The energy of the muon with cos £, > 0. 

3. The energy of the muon with cos £ < 0. 

• The components of the momentum vector of the muons in Z — ► /x/x events from region B are altered 
in a way that they match those from Z — ► t + t~ — ► jii/i +4v events using the above reference 
histogram: 

/^altered = T^T ' faltered- (2) 

For each event, the angle £ is calculated and then new energies for the muons are chosen randomly 
from the reference histogram. After applying this procedure, the muon momenta are boosted back 
into the lab frame. 

• The missing energy in the event is re-calculated according to the new muon momenta: 

PT, miss, altered = Pr.miss ~ J^Pr, altered + £/>r,old- (3) 

n n 

A comparison of /?T,miss> of x\ -xi from the collinear approximation and of the invariant m T + T - mass 
from altered Z — > iifi in comparison with Z — > t + t~ — ► /i/x +4v events is shown on the left-hand side, 
middle and right-hand side of Fig. 5, respectively. Good agreement within the statistical uncertainties of 
the samples used is observed. 




Figure 5: Comparison of the px.miss, M -x\, and m TT spectra for events of type Z — > fifx and Z — > %%. 
Good agreement within the statistical uncertainties is observed. 

The same shape as extracted from region B using Z — ► events to estimate Z —>■ TT —> jXjl +X 
events in region A is also used for Z — > tt — > + X and Z — > tt jie + X events. As shown in Figure 
6, the m xx shapes are identical within statistical uncertainties justifying this procedure. 



3.5.3 Estimation of the background normalization from data 

In order to estimate the number of background events from the Z — >• t + t~ background process for this 
analysis, the same definition of the sideband region has been used as described above. 

The number of Z — > t + t~ background events in the signal region A can then be obtained by re- 
weighting the number of events found in data in region B by the predicted ratio of the number of events 
found in Monte Carlo in region A relative to that found in region B. In order for this method to be valid, 
the following two conditions have to hold: 

(Z = (Z^T+T-^ + 4v)iL ta (4) 
(Z - U)l c (Z - t+t- - U + 4v)l c 
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Figure 6: Comparison of the m TT shape for Z — > tt — ► /i/x +X, Z — > TT — > ee+X, and Z — > tt —> jie+X 
events. Good agreement is observed within the statistical uncertainties of the samples used. 



(Z^)Data = (Z^T+T-^^ + 4v)A ata 

(Z - U)l c (Z - t+t- -> £e + 4v)* c ' 

where the first condition means that Z — ► it events behave like Z — > T + T~ — > + 4v events, and when 
combined with the second it implies that Z — ► T + T~ — >• ££ + 4v events in region A and in region B behave 
identically. 

The calculation of the number of events from the Z — ► T + T~ process in data is performed in bins of 
Pt of the leading lepton vs. the pj of the subleading lepton with a bin size of 2 x 2 GeV 2 which was 
found to give unbiased results in previous Monte Carlo based studies [16]. This ensures that the method 
is less dependent on the differences in pj in the signal and the sideband region. Once events from real 
data are available the method has to be validated and the influence of a possible difference between data 
and the Monte Carlo prediction on the results has to be checked. 

This procedure also allows to easily take into account differences in acceptance and trigger ef- 
ficiencies by simply applying the appropriate factors to the reweighting procedure. The number of 
Z — > T + T~ U + 4v events in region A is then given by 

(Z -> t+t- -> U + 4v)U = £ ^ ((z^lflSc)- ' ((Z " U)U)ii ' 

where i and j indicate the corresponding bin in pj. The statistical uncertainty on this method is calculated 
according to Gaussian error propagation. The method has been tested using two independent Monte 
Carlo samples, one to fill the reference histogram and one to test the reweighting procedure. Good 
agreement within statistical uncertainties between the expected number of events in region A and that 
actually found was observed. 

The application of this method is straight forward for the ee and the final state. For the e\L 
final state the procedure has to be adapted since there are no Z — > efX decays. In order to estimate 
the number of background events in that case, events from Z — > ee and Z — > jujU processes are used. 
Additional correction factors are applied to account for the differences in trigger efficiencies and selection 
efficiencies in that case. 
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3.5.4 Systematic uncertainties of the background estimation procedure 



Table 6: The effects of systematic uncertainties on the calculation of the normalization. 
Uncertainty Effect Value Systematic Uncertainty 

Muon resolution a ( £ ) = ^ (^) 2 + 0.00017 2 0.16 % 

Muon energy scale ±1% 1.35% 

Muon efficiency ±1% 0.7% 

Electron resolution a (E T ) = 0.0073 • E T 1.9 % 

Electron energy scale ±0.5% 0.85 % 

Electron efficiency ±0.2% 0.1% 

The impact of the systematic uncertainties on the estimation of the number of background events 
from Z — ► T + T~ processes as described in Section 3.4 has been evaluated. The uncertainties on jet 
energy scale and jet resolution, as well as on the b tagging efficiency are expected to be negligible since 
the same effects would apply to the signal region as well as to the sideband region. The remaining 
contributions to the systematic uncertainty on the background estimation procedure are coming from the 
energy scale, efficiency and resolution connected with the electrons and muons in the final state. These 
contributions as expected in 30 fb -1 of data are summarized in Table 6. 

In addition, systematic uncertainties due to the different acceptances and trigger efficiencies for the 
Z — ► ee/jiji samples from the sideband region compared to Z —>■ TT — > ee/ jiji / ejl + X samples in the 
signal region need to be taken into account. Since the reweighting of events is done as a function of the 
p T of the leading and the subleading lepton, these are easy to apply. Since this analysis is aimed at an 
integrated luminosity of 30fb _1 , it is assumed that by then these uncertainties are evaluated to a very 
high precision. 

The overall systematic uncertainty has been evaluated to be 2.6% by dividing the available MC events 
into two independent samples. This uncertainty has been taken into account for the final results. 



3.5.5 Summary of the background estimate from data 

Both methods described above are now combined to estimate the Z — > TT background and thereby the 
significance of a possible Higgs boson signal in the low mass region. First, the number of events of 
type Z — > T + T~ is estimated using the method described in Section 3.5.3. Then, the shape of this back- 
ground is determined as described in Section 3.5.2. The background is then subtracted from the ra T + T - 
distribution. 



3.6 Results and discovery potential 

The discovery potential for the h/H/A —>■ T + T~ —>■ ££4v channel is now assessed. All sub-channels, 
i.e. ee, e\x and \x\x are combined to calculate the significance of a Higgs boson signal. A mass window 
m — 1.65a < m t + T - < m + 2a is applied to calculate the final significance where a denotes the invariant 
mass resolution of the Higgs boson signal of the corresponding mass. 

The shape and the normalization of the Z — > T + T~ background is estimated from the sideband region 
in data as described in Section 3.5. There is no corresponding procedure available yet for the tt back- 
ground so that all experimental systematic uncertainties are taken into account for all calculations below. 
Theoretical uncertainties are treated separately. The significance of a potential Higgs boson signal in the 
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Figure 7: The five a discovery potential (left) and the 95% exclusion limit (right) as a function of and 
tan/3. The solid line represents the main result of the analysis. The dashed lines indicate the discovery 
potential and exclusion limit including an addition 10% uncertainty on the tt cross-section. The bands 
represent the influence of the systematic uncertainty on the signal cross-section. 

given mass window is calculated as 

Sign. = = = , (7) 

^N„-+ (A» s ) 2 +N^„ + (A|T TT ) 2 +Nw^tv + (A^ v ) 2 +N Z ^+N Z ^ 

where S is the number of signal events, N t j and A sys («) are the statistical and systematic uncertainties 
on the tt background. The quantities iYz_> T + T - and A sys (Z — > t + t~) are the statistical and systematic 
uncertainties on the Z —>■ t + t~ background, respectively. The term in the denominator is dominated 
by the contribution from tt events; the contributions from Nz->ze and Nz-*mi and their corresponding 
systematic uncertainties are negligible. 

The discovery potential and the 95% exclusion limit in the m™ 3 " scenario as a function of tan/3 and 
mA and for an integrated luminosity of 30 fb -1 is displayed in Fig. 7 on the left-hand side and right- 
hand side, respectively. The uncertainty on the signal cross-section is indicated as bands in the plots. 
The calculation of the significance includes both statistical and experimental systematic uncertainties 
on the background. The uncertainties on the background cross-sections are not taken into account in 
the main results since they are assumed to be measured with high precision at the time of the analysis. 
However, since the cross-section for the tt background might not be measured at high precision in the 
region of phase space relevant to this analysis, the discovery potential including a 10% uncertainty on 
that background is displayed in addition (dashed line). 

The discovery potential for Higgs bosons is shown in Fig. 8 as a function of tan /3 for various 
values, and in Fig. 9 as a function of for various tan /3 values. 

4 Conclusion 

In this note a study of the discovery potential for the supersymmetric Higgs bosons h/H /A in proton- 
proton collisions at a center-of-mass energy of 14 TeV with the ATLAS detector at the LHC has been 
presented. The final state h/H /A —>■ t + t~ in b quark associated production has been investigated with 
at least one jet identified as coming from a b quark and with both z leptons decaying leptonically. 

A significant improvement for the discovery potential can be achieved if this channel is combined 
with the ^-had and had-had channel, where one T lepton decays leptonically (either electron or muon) 
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Figure 8: The discovery potential as a function of tan/3 and for Higgs boson masses as indicated in the 
plots. The solid lines indicate the discovery potential including experimental systematic uncertainties. 
The dashed lines indicate the discovery potential including an additional systematic uncertainty on the it 
cross-section. The bands indicate the impact of the systematic uncertainty on the signal cross-section. 
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Figure 9: The discovery potential as a function of ma- The solid lines indicate the discovery potential in- 
cluding experimental systematic uncertainties for tan /3 values as indicated in the plots. The dashed lines 
indicate the discovery potential including an additional systematic uncertainty on the tt cross-section of 
10%. The bands indicate the impact of the systematic uncertainty on the signal cross-section. 
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and one hadronically, or both t leptons decay hadronically [17]. 
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Search for the Neutral MSSM Higgs Bosons in the Decay 
Channel A /H/h —> 

Abstract 

Motivated by the high muon momentum resolution and identification effi- 
ciency achievable with the ATLAS detector, the observability of A/ H/h —> 
channel is explored. The high experimental resolution in this decay 
mode compensates to some extent for the suppression of the branching ratio, 
with respect to the A/ H/h — > T + T~ decays. The analyses are performed in 
the Higgs mass range from 100 to 500 GeV. Two main analysis strategies are 
applied - the search for the dimuon final states resulting from the direct A/ H/h 
production is combined with the search for the associated bbA /H/h production 
mode. 

The studies are optimized for the early stage of data taking up to an integrated 
luminosity of 30 fb _1 . All results are obtained combining the fast and the 
full simulation of the ATLAS detector with the nominal detector layout and 
the trigger efficiencies included. In addition, dedicated data samples are pro- 
duced to study the impact of the pile-up and cavern background on the analysis 
performance. The estimation of the background contribution from the experi- 
mental data and the contribution of theoretical and experimental uncertainties 
are also addressed. The discovery potential is shown in the ra^-tan/3 plane in 
context of the m"' ax MSSM benchmark scenario. 

1 Introduction 

In the framework of the Standard Model, the observability of the Higgs boson in the decay channel 
H — > H + jJ.~ is very unlikely, since the branching ratio for the Higgs decay into muons is very small and 
the backgrounds from several Standard Model processes are large. As opposed to the Standard Model 
predictions, the decay of neutral MSSM Higgs bosons A, H and h into two muons is strongly enhanced 
in the MSSM for large values of tan/3 and can be used either as a discovery channel or for the exclusion 
of a large region of the m^-tanjS parameter space (see Ref [1]). 

Compared to the dimuon channel, the A/ H/h — > T + T~ decays have a substantially larger branching 
ratio which scales as (m T /m^) 2 and thus provide a promising discovery signature, as discussed in Ref [2]. 
Nevertheless, the X identification represents an experimental challenge. The jJ. + H~ final state, on the 
other hand, has the advantage of a very clear signature in the detector. Furthermore, a full reconstruction 
of the Higgs boson final state is possible, which allows for a direct mass measurement. The dimuon 
channel provides for the most accurate Higgs boson mass measurement. 

In this note, the potential for the discovery of the neutral MSSM Higgs bosons is evaluated in the 
dimuon decay channel. The study concentrates on the region of (ma — tan/3) plane with > 1 10 GeV 
and intermediate tan/3 values between 10 and 60, which is still uncovered by the current exprimental 
limits [3, 4]. A detailed study of the ATLAS discovery potential for this channel has been recently 
performed in the low mass region below 130 GeV [5]. The study includes also higher Higgs boson 
masses up to 400 GeV. 

In Section 2, the relevant production and decay rates of the MSSM Higgs boson are briefly discussed 
in the context of the m™ ax LHC scenario [6], as well as the production mechanisms of the major back- 
ground processes. In Section 3, the Monte Carlo simulation and data samples used for the analysis are 
described. Section 4 provides a short description of the detector performance obtained from the simu- 
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lation, related to the reconstructed particles which will be present in the final state. The event selection 
criteria, the resulting efficiency of the signal selection and the corresponding background rejection shall 
be described in Section 5. Discussion of the systematic uncertainties is presented in Section 6. Sec- 
tion 7 describes the methods for the estimation of different background contributions from the the real 
data. The obtained results are finally represented by the discovery contours in the (ra^-tan/3) plane (see 
Section 9). The note concludes with Section 10. 

2 Signal and background processes 

In this Section, the properties of the signal shall be briefly described, as well as the background processes 
relevant for the MSSM Higgs boson searches in the dimuon final state at the LHC. 

2.1 Signal production and decays 

The characteristic production and decay properties of all MSSM Higgs bosons are determined at tree- 
level by the values of the two free parameters tan /3 and the mass of the A boson. These properties 
have been calculated at NNLO with the Feynhiggs 2.6.2 package [7] in the mf"* scenario, as summarized 
inRef [1]. 

The direct gg — > A/H /h production via the gluon-gluon fusion is an analogue to the Standard Model 
Higgs boson production. This process is important in the region of low tan /3 values (below 10), where the 
Higgs bosons couple most strongly to up-type quarks. For larger values of tan/3, the rate of the bbA/H /h 
Higgs production in association with b-quarks becomes dominant, due to the enhanced couplings to the 
b-quarks. 

There are two approaches to calculate the signal rates for the associated bbA/H /h production mode. 
In the first approach, the production cross-section for the gg — > bbH process has been calculated at NLO 
in Ref [8, 9]. This calculation is most reliable in the case where both outgoing b-quarks have a high 
transverse momentum (above ~15 GeV). The inclusive cross-section without any cut on the transverse 
momenta of the b-quarks is less accurate, due to additional collinear logarithms which appear in the 
calculation due to the presence of low-momentum b-quarks. An alternative approach is the calculation 
of the inclusive cross-section for the bb — > H process, for which the collinear logarithms can be absorbed 
in a parton density function for the b-quarks and resummed to all orders of perturbation theory [10, 11]. 
This later calculation has been implemented in a parametrized way into the Feynhiggs package, which 
was finally used for the evaluation of the signal cross-sections, as mentioned previously. 

The different higher-order calculations mentioned above have been extensively compared in Ref [12]. 
The two approaches agree within uncertainties. For the gg — > bbH calculation, the uncertainty related 
to the variation of the renormalization and the factorization scale amounts to 20-30%. For the bb — > H 
calculation, which is used for the analysis, the scale uncertainty is much smaller, less than 10%. However, 
it should be noted that the uncertainty on the b parton density function has not been included here. In 
order to estimate this pdf-uncertainty, a calculation of the bb — > H cross-section is performed with two 
different parton density functions, MRST2002 and MRST2004. The observed difference of ~14% is 
taken as the estimate of the pdf-uncertainty. Adding the 10% scale uncertainty to this in quadrature, the 
total theory uncertainty for the signal is estimated to be ~17% for H boson masses up to 500 GeV. 

The production cross-section of H and A bosons increases approximately quadratically with increas- 
ing tan/3, while the h boson production is tan /3 -dependent only for <130 GeV. Also the branching 
ratio of H and A boson decays into jU + ;U~ pairs become enhanced with increasing values of tan/3. The 
h boson decay is rather insensitive to the two mentioned parameters. The increase of the cross-sections 
and branching ratios with tan/3 make the A/H /h — > jU + jU~ decay channel a promising Higgs signature 
in MSSM. 
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Additionally, the signal is enhanced due to the mass degeneracy of the neutral Higgs bosons. For A 
boson masses niA <130 GeV, the h and A bosons are degenerate in mass, while the heavy H boson mass 
is rather constant (~130 GeV). In case of ma ~130 GeV, all three bosons have very similar masses. For 
ritA > 130 GeV, the h boson mass reaches its maximum value of ~ 130 GeV, independent of the A boson 
mass, while the A and H bosons become degenerate in mass. Thus the signal can be observed as the sum 
of all two or three degenerate mass states. 

2.2 Background processes 

The processes with two muons in the final state, which give a major background contribution in the 
searches for the A/H/h signal are depicted in Figure 1. 




Figure 1 : Tree-level Feynman diagrams of the dominant background processes with two isolated 
muons in the final state: a) Drell-Yan Z boson production, b) Z boson production in association 
with jets, c) tt production and d) ZZ and WW production, q is a general symbol for u and d quarks, 
while Q stands for the b and c quarks. 

The dominant background process with a very large production rate of ~1 nb is the Drell-Yan Z 
boson production, with subsequent Z decay into two muons. The invariant dimuon mass peaks at the Z 
resonance, such that the search for the A/H/h becomes unfeasible for the Higgs masses below 100 GeV. 
Even for the higher Higgs boson masses, the tail of the Z resonance still provides an overwhelming 
background. The Drell-Yan background can be suppressed by requiring the presence of one or more 
additional b-jets, originating from the associated bbA production. The major backgrounds remaining 
after this requirement are the Z boson production in association with the light jets or b-jets and the tt — > 
(W + b)(W~b) — > {ix + vb){jl~vb) background. The tt background can be distinguished from the signal 
by a higher jet activity and a large missing energy caused by the neutrinos from W decays. Additional 
background from WW and ZZ diboson productions is expected to be small, due to the much lower 
production rates. 
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3 Data samples 

Two different Monte Carlo generators are used for the signal production. PYTHIA 6.4 [13] has been used 
to generate the direct gg —> (j) —> /x/x and associated gg — > bbij) —> bb\l\i processes (where 0 = A, H, h). 
The SHERPA [14] event generator (version 1.0.9) combines all three associated production mechanisms, 
gg — > bb§, bg — > b§ and bb — > (j), in a coherent way without double-counting. This is accomplished 
by the CKKW algorithm [15] for the matching of the parton showers to the quark emission from the 
matrix elements. Both generators provide leading-order cross-sections, which have been rescaled to 
the Feynhiggs NNLO values. The studies at the DO experiment [16] have shown that the differential 
SHERPA distributions are in a good agreement with the real data and can simply be normalized to the 
previously described inclusive higher-order cross-sections. The comparison of the samples produced by 
the two generators will be described in Section 6.1. A generator filter requiring at least two muons with 
Pt >5 GeV and \t]\ <2.7 is applied to each event for all signal data samples, after the showering and 
before writing out the events into permanent storage. The background samples are listed in Table 1, 
together with the corresponding NLO cross-sections. The details of the cross-section computation can 
be found in Ref [17]. In addition to already mentioned generators, the MC@NLO 3.1 [18] and AcerMC 
3.4 packages [19] have been used for the event generation. 

Full simulation of the detector response has been performed for all signal and background event 
topologies, within the ATHENA software framework which uses the GEANT4 [20] package for the 
description of the detector response. In addition, due to the large background production rates, it is nec- 
essary to increase the number of simulated events by means of the parametrized fast detector simulation 
(Atlfast [21]). The presented analyses are based on the combination of both simulation types. The sam- 
ples obtained with the detailed simulation have been used for the tuning of the parametrized description 
of the detector performance in the fast simulation. The tuning procedure provides a very good agreement 
with the full detector simulation. Any remaining differences are treated as systematic uncertainty. 
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tt; 2ju-filter 


MC@NLO 


833 


0.072 


500 000 


full sim. 


tt; U-ffltet 


MC@NLO 


833 


0.556 


600 000 


full sim. 


(Z-> MM )+0-3 light jets 


SHERPA 


2036 


0.490 


5 000 


full sim. 


(Z^nn)+l-3 b-jets 


SHERPA 


52.3 


0.914 


5 000 


full sim. 


bbiZ^w) 


AcerMC/PYTHIA 


45 


0.788 


280 000 


full sim. 


ZZ^bb\l\i 


PYTHIA 


0.151 


0.724 


10 000 


full sim. 


WW 


PYTHIA 


116.8 


0.35 


50 000 


full sim. 


tt, no filter 


MC@NLO 


833.0 


1.0 


100 000 000 


Atlfast 


(Z-> MM )+0-3 light jets 


SHERPA 


1165.9 


0.855 


30 000 000 


Atlfast 


(Z-> )+0-3 b-jets 


SHERPA 


52.3 


0.914 


1 000 000 


Atlfast 



Table 1: Background data samples with corresponding NLO cross-sections. 



All mentioned data samples have been simulated assuming there are no additional /^-interactions per 
event. However, at luminosities of 10 33 cm _2 s _1 one expects to have 2-3 such pile-up interactions super- 
imposed to the hard scattering. In addition, the neutron and photon background of the muon spectrometer 
(so called cavern background) may increase the muon trigger rate and degrade the muon reconstruction 
performance [22]. In order to study the impact of the pile-up and cavern background on the analysis 
performance, dedicated bbA, tt and Zbb data samples have been simulated with the realistic pile-up and 
cavern background contribution. The simulated cavern background is assumed to be five times higher 
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than the prediction of GCALOR [23] and FLUKA [24] simulations, to account for the uncertainty of the 
calculation. 

4 Detector performance 

A detailed description of the ATLAS detector and its performance is given in Ref [25]. The details of 
the detector layout, the software framework used for the Monte Carlo production, as well as the details 
of the reconstruction of fully simulated events can be found in Ref [26]. In this Section, the performance 
of the reconstruction algorithms is shortly described, concentrating on the key objects for the analyses: 
muon identification and momentum measurement, jet reconstruction, b-tagging and the measurement 
of the missing transverse energy (Ej lss ). First, the results obtained in absence of pile-up and cavern 
background in the detector are shown. These are subsequently compared to the results obtained when 
both pile-up and cavern background are taken into account. 

4.1 Reconstruction performance without pile-up and cavern background 

In ATLAS, the muon reconstruction is performed by combining the information of the muon spectrom- 
eter and the inner detector. Staco and MuTag [22] reconstruction packages are used for the study. The 
average muon reconstruction efficiency is (97.15±0.04)%. This is reduced to (95.44±0.05)% if a match 
between the muon spectrometer track and the inner detector track is required. The momentum resolu- 
tion of low-pr muons is mostly dominated by the inner detector performance, while the high-py muon 
reconstruction is more sensitive to the muon spectrometer performance. The average muon momentum 
resolution is better than 3%, which allows for an excellent dimuon mass resolution, as shown in Figure 2 
for the A-boson (mA=200 GeV) produced via the associated bbA and the direct gg — > A production mode. 
As expected, the experimental dimuon mass resolution does not depend on the Higgs production mode. 
Table 2 summarizes the dimuon mass resolutions obtained for different A boson masses. 



Mean 199.6±5e-03 
Sigma 6.3 + 5.5e-03 



gg->A->(in j 
m A = 200GeV j 
tan(p) = 30 : 

Jj , , , 

150 200 250 300 350 400 

nv u -(GeV) 

Figure 2: Dimuon mass distribution for the bbA and gg — > A signal samples with an A boson mass 
of 200 GeV and tan/3=30. The distributions are fitted by the Gauss function. 

Characteristic of the bbA signal are the b-jets with generally rather low transverse momenta, as shown 
in Figure 3(a). Since the efficiency of the b-jet reconstruction decreases with the pj, the number of 
reconstructed b-jets will in general be smaller for the signal than for the it or ZZ — »• bb\i\i backgrounds, 
where the b-jets are more energetic (see Figure 3(b)). A detailed study was performed to identify the 
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(GeV) 


110 


130 


A boson ma 
150 


ss (GeV) 
200 


300 


400 


Natural width 


2.16 


2.48 


2.80 


3.60 


5.61 


8.46 


Reconstructed a 


2.59 ± 0.02 


3.83 ± 0.03 


4.11 ±0.04 


6.29 ± 0.05 


10.2 ± 0.2 


15.0 ±0.3 


Reconstructed 
mass 


109.818 
± 0.006 


129.738 
± 0.005 


149.796 
± 0.006 


199.589 
± 0.005 


298.82 
±0.04 


399.37 
±0.04 



Table 2: The natural width of the A boson and the expected width of the dimuon resonance based 
on Monte Carlo simulated data are shown for the bbA signal at different mass points and with 
tanj3=30. 




P T (b-jet) (GeV) N(b-jet) 

Figure 3: a) Transverse momentum pj of the b-jets in the bbA signal and the dominant background 
processes and b) the number of reconstructed b-jets per event. The selection criteria for the b-jets 
are described in the text. 



optimum b-jet selection criteria. The best jet reconstruction performance is observed for the jet cone 
algorithms with the cone size of AR = y/ At] 2 + A(j) 2 = 0.4, compatible with the performance of the kj 
algorithm for the same cone size. After a jet is selected as described, the b-tagging algorithm is performed 
to determine whether the jet originates from the b-quark. The minimum pj value of 20 GeV is required 
for each b-jet in order to reduce the contribution of the calorimeter noise and of the mistagged light- 
or c-jets. The rejection of light- and c-jets is essential for the suppression of the Z + jet background. 
One could extend the lower pj -bound down to ~15 GeV without a large change of the rejection rate. 
However, the impact on the final signal significance will be rather small, while the agreement between 
the full and the fast simulation is shown to decrease. 

Several b-tagging algorithms have been studied in order to define the optimum selection of the low- 
Pt b-jets coming from the signal. The best rejection is obtained by IP3DSV1 [27], which is based on the 
information obtained from the transverse and longitudinal impact parameter significances of the tracks 
and from the reconstructed secondary vertex. The distribution of the b-tagging weight obtained by this 
algorithm is shown in Figure 4 for the b-jets and the light jets in the bbA signal sample at 200 GeV 
and in the tt background sample. The arrow indicates the optimum cut value of 4. Figure 5 shows the 
obtained b-tagging efficiency for the bbA signal sample, in dependence on the b-jet Ej (Figure 5a)) and 
77 (Figure 5b)). The kinematic cuts of p T >20 GeV and |tj| < 2.5, as well as the IP3DSV1 weight-cut 
of 4 have been applied for the b-jet selection. The resulting b-tagging efficiency is (64.1 ±0.81)% for the 
bbA signal sample, with a light-jet rejection of (80±1) for the Z + jet sample. Systematic detector-related 
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3D impact parameter + secondary 



Figure 4: Distribution of the b-tagging weight for the b-jets and the light jets in the bbA signal 
and tt background sample, obtained by the IP3DSV1 b-tagging algorithm. The arrow indicates 
the optimum cut value of 4, which is used for the selection of the b-jets in the analyses. 




Figure 5: B-tagging efficiency in dependence of b-jet transverse energy Ej (a) and pseudorapidity 
TJ (b), evaluated for the bbA signal sample at m A =200 GeV and tan/3 =30. IP3DSV1 b-tagging 
weight cut of >4 has been applied. 



uncertainties are not included here. 

The final important reconstruction object is the missing transverse energy (E™ lss ), which allows for 
the suppression of the tt background. In the signal processes, there is no neutrino contribution, such that 
the measured E™ ISS value is dominated by the experimental resolution. The reconstruction algorithm for 
the calculation of the missing transverse energy is described in detail in Ref [28]. The distributions of 
^T{xy) components in the signal samples have a Gaussian part with a width a=(7.8±0.1) GeV, while the 
non-Gaussian tails (above 5a) are found to contribute less than 1.5% to the overall distribution. E™ lss is 
sensitive to pile-up effects, as will be described in the next subsection. 

4.2 Reconstruction performance under influence of pile-up and cavern background 

In the following, the detector performance related to the analysis is evaluated in dependence on the pile- 
up at luminosities of 10 33 cm _2 s _1 and cavern background (five times higher than the expectation). In 
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Figure 6(left), the efficiency and the fake rate of the muon reconstruction is shown for the bbA signal 
sample simulated without and with the pile-up contribution as a function of the pseudorapidity. The 
corresponding momentum resolution is shown in Figure 6(middle). Similar results are obtained also 
for the background samples. As can be seen from the plots, muon reconstruction is only marginally 




tni " H ' nv(GeV) 



Figure 6: Efficiency and the fake rate (left) and resolution (middle) of the muon reconstruction as 
a function of the | r\ | (for pj >20 GeV), with and without the pile-up contribution in the bbA signal 
sample with ota=200 /GeV. The right plot shows the corresponding dimuon mass distribution. 

influenced by pile-up. Consequently, the dimuon invariant mass also remains unaffected, as shown in 
Figure 6(right) for m A =200 GeV. 

On the contrary, the reconstruction of the missing transverse energy is substantially affected by pile- 
up in the calorimeter. The degradation of the ^""-resolution mainly affects the selection of events 
with a small true missing energy (signal and the Z background) as shown in Figure 7(left); tt events, 
characterized by a large missing energy, are rather insensitive to pile-up (see Figure 7(right)). This 




Figure 7: Missing transverse energy distribution for the bbA signal at 200 GeV (left) and the tt 
background (right), with and without pile-up. 

effect must be taken into account during the optimization of the event selection criteria. For instance, an 
event selection cut at Ef' ss <30 GeV, which is reasonable without pile-up, would reject too many signal 
events, once the pile-up contribution is included. Therefore, this analysis cut should rather be set to at 
least 40 GeV in the realistic LHC environment. 
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The change of the calorimeter response under the influence of pile-up affects also the jet reconstruc- 
tion. Due to a higher calorimeter activity, one expects an increase in the number of reconstructed jets. 
This can be observed in Figure 8(left), showing the jet multiplicity in the Zbb background events. 




Figure 8: Total number of jets per event (left) and the number of b-jets (right) in the Zbb back- 
ground sample, with and without the pile-up contribution. 

No significant impact of the pile-up is observed on the b-tagging (see Figure 8(right)), due to the 
additional tracking and vertex information. 

5 Event selection 

The search for the MSSM Higgs bosons can be performed by several different approaches, related to the 
number of jets one requires to be present in the final state. As mentioned before, due to a large signal 
production rate in the associated production mode, the presence of the b-jets in the final state can help 
to suppress the Drell-Yan background. On the contrary, the remaining events with 0 b-jets provide for a 
high signal rate on top of the smoothly distributed background, even at low integrated luminosity. 
The event selection methods are optimized separately for the two cases: 

• Signatures with 0 b-jets in the final state. 

• Signatures with at least one b-jet in the final state. 

The two mentioned final states are uncorrected and therefore complementary. In the case of 0 b- 
jets in the final state, the dominant background is the Drell-Yan Z boson production, while in the case 
of at least one b-jet the it background has the biggest contribution, especially for Higgs masses above 
130 GeV, which are further away from the Z resonance. 

Before describing the selection criteria, the preselection of the events is discussed, common to the 
two signatures above. The preselection is defined by the kinematic cuts on muon pj and | r\ | , together 
with the muon isolation criteria. 

5.1 Preselection 

The main characteristics of the signal signatures is the presence of two isolated muons of opposite charge 
in the final state. The /^-distribution of the muons is shown in Figure 9 for the signal and background 
processes. The signal is characterized by the relatively high-pr muons, while the background has muons 
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Figure 9: Distribution of the muon transverse momentum pj for the muons in the signal and 
background events. 



of lower momenta. The muon pj distribution in the signal is highly correlated to the Higgs boson mass. 
Therefore, the lower bound on the muon pj is kept at a relatively low value, in order to allow for a 
general search in a broad Higgs mass range. At preselection level, both muons are required to have a 
Pt >20 GeV and to be in the pseudorapidity range |tj | <2.7. 

The selected high-p^ muons are required to be isolated, in order to reject the processes in which the 
muons originate from the hadronic decays. The applied isolation criteria require the calorimeter energy 
Ej deposited in a cone of size AR = 0.4 around a given muon, divided by the muon pj to be lower 
than 0.2. The distribution of this isolation variable is shown in Figure 10(left) for different signal and 
background processes. 

The isolation criteria significantly decrease the tt background, where one of the muons comes from 
the b-decays. The power of rejection of the non-isolated muons originating from the b-quarks in the 
tt background is shown in Figure lO(right) for the standard calorimeter isolation (Ej me{)A ), and for the 
isolation normalized by the muon pj. 

Due to the high muon momenta, the signal can be efficiently triggered by the single high-/^ muon 
trigger. The efficiency of the trigger selection for the dimuon signal events is shown to be around 95% 
for all studied mass points. The detailed study of the trigger selection efficiency for events which pass all 
offline event selection criteria will be presented in Section 5.3. In the following, the results of the event 
selection without the trigger requirement are presented at first. 

5.2 Signatures with 0 b-jets and with at least one b-jet in the final state 

The large Z boson background contribution can be reduced by requiring that the jets which are present 
in the final state are tagged as b-jets. Therefore, assuming a fully performing b-tagging algorithm, the 
following set of selection criteria can be applied: 
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Figure 10: (left) Muon isolation variable Ej° ne0A / 'px(n), shown for different signal and back- 
ground processes. Here, the Ej° ne0A is the energy measured in the calorimeters in cone AR = 0.4 
around a given muon. (right) Rejection of the non-isolated muons originating from the b-quarks 
in it events as a function of the selection efficiency for isolated muons, shown for the two isolation 
variables described in the text. The filled circle indicates the working point with the isolation cut 
atE ™" eQA /p T (n) <0.2. 



• Events are required to pass the preselection criteria and to have a missing transverse energy 
<40 GeV. This cut is particularly effective in rejecting the it and WW background, which 
are characterized by a high missing energy due to the presence of neutrinos in the final state (see 
Figure ll(left)). 




Figure 11: (left) Missing transverse energy, (middle) distribution of the b-tagging IP3DSV1 weight 
after requiring pj >20 GeV and |tj| <2.5 and (right) the multiplicity of reconstructed b-jets per 
event, after applying the pj- and T]-cuts and requiring the b-tagging IP3DSV1 weight greater than 
4. Distributions for the bbA signal (m^=200 GeV) and for the major background processes are 
shown. Arrows indicate the cuts applied in the analysis. 



• Subsequently, the number of b-jets is counted in each event, requiring pj >20 GeV, \r\ \ <2.5 and 
the b-tagging IP3DSV1 weight greater than 4. The distribution of the b-tagging weights before 
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and the b-jet multiplicity after applying the weight-cut are shown in Figure 11. As shown in the 
middle plot, the b-tagging weight is effective in reducing the background from Z + light jets, at the 
expense of a significant loss of the signal. This cut is scarcely effective against the tt background, 
which, however, can be reduced by the additional cuts discussed below. Therefore, the analysis 
is divided into a channel with 0 b-jets (in which the Z background is dominant) and the channel 
with at least one b-jet (in which the tt background plays an important role and can be further 
suppressed). 

• Further tt rejection criteria have been studied for the channel with at least one b-jet. 

- Two muons originating from the decay of the same particle (Higgs boson) tend to be emitted 
back-to-back, especially if this particle has a low transverse momentum. As opposed to that, 
the muons originating from the two different particles (as in the tt events) are not con-elated 
and can be separated by any angle. Therefore, the cut is applied on the angle A0^ between 
the two muons by requiring | sinA^^j <0.75. The | sinA0^| distributions for the signal and 
background processes are shown in Figure 12(a). 

- In addition, several discriminating variables related to the hadronic activity in the events have 
been studied: the pj distribution of the b-jets, the number of jets per event, or a sum of the 
transverse momenta of all jets in the event (£ p J j'). The distributions of two of these variables 
are shown in Figure 12b) and c). The latter is shown to provide the highest rejection against 
the tt background, while at the same time remaining relatively robust under the the influence 
of pile-up. A cut atj^pj' <90 GeV is required. 




Figure 12: Discriminating variables against the tt background: a) | sin A<f>\ shown for the bbA signal 
(/ma=200 GeV) and for the major background processes, b) the maximum pj of the b-jet, and c) 
Y,Pj ts -distribution for the signal and background processes. Arrows indicate the cuts applied in 
the analysis. 



• The final number of events which is used for the calculation of the signal significances is eval- 
uated in a mass window Am=mA±2cr UiU around the A boson mass, where is the expected 
m,A -dependent width of the dimuon resonance (see Table 2). 

The signal and background event rates after each of the cuts described above are shown in Tables 3 and 4. 

A signal selection efficiency of 7-10% is reached for the channel with at least one b-jet in the final 
state. The dominant background processes are almost equally the Z + jet and the tt events. The signal 
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Cut 


bbA (fb) 


bbA (fb) 


bbA (fb) 


bbA (fb) 


bbA (fb) 


8g—*A (fb) 












400 GeV 




All events 


13.4-10 1 


8.7- 10 1 


31.5-10° 


6.9-10° 


19.310 1 


32.3-10-' 


muon preselection 


s.s^yio 1 


5.9(1)- 10 1 


22.3(3)- 10° 


5.0(1)- 10° 


14.4(3)- 10- 1 


28.6(3)-10- 1 


Ef ss <40 GeV 


S3(2)-I0 l 


5.4(1). 10 1 


20.3(2)- 10° 


4.4(1)- 10° 


11.7(2)- lO -1 


25.9(3)-10-' 


nr. of b-jets=0 




4.3(1)-10 1 


15.5(2)- 10 u 


31.6(8)10 1 


8.2(2)- 10" 1 


25.1(3)-10" 1 




5.6(1) 10 1 


34.1(8)10° 


128.0(1) 10 1 


26.4(8)10-' 


6.9(2)10 1 


204.2(1) 10 2 


nr. of b-jets>l 


16.2(7)- 10 u 


1 1.2(5)- 10 u 


4.8(1)- 10° 


izicsmo- 1 


3.5(1)- 10" 1 


8.2(5)- lO" 2 


IsinA^I <0.75 


11.7(6)- 10° 


8.3(4)- 10° 


4.0(1)- 10° 


10.7(5)-10 _1 


3.2(1)-10^ 1 


4.8(4)- lO" 2 


Ipf" < 90 GeV 


9.3(5)- 10° 


6.5(3)- 10° 


2.9(1)- 10° 


7.1(4)-10- 1 


1.7(1)- 10 _I 


2.0(3)- lO" 2 




8.3(5) 10° 


5.3(3)- 10° 


23.9(8)-10~ 1 


5.9(4) 10 1 


14.8(8)- lO" 2 


1.7(3)10" 2 



Table 3: a x BR for the signal processes at tanj3=30 after each selection cut. Numbers in brackets 
represent the statistical error on the last digit. 



Cut 


Z+ light jet 


Z + b jet 




ZZ^bb\i\i 


WW 


Total 




(fb) 


(fb) 


(fb) 


(fb) 


(fb) 




All events 


2036.0- 10 3 


52.3- 10 3 


833.0- 10 3 


151.0 


116.8-10 3 






727.7(2)- 10 3 


333.8(4)- 10 2 


57.6(1)- 10 2 


61.3(8)- 10° 


6.7(2)- 10 2 




Ef ss <40 GeV 


726.2(2)- 10 3 


330.1(4)- 10 2 


132.7(6)- 10 1 


56. 1(8)- 10° 


3.0(2)- 10 2 




nr. of b-jets=0 


710.7(2)- 10 3 


242.3- 10 2 


25.6(3)- 10 1 


22.3(5)- 10° 


2.9(2)- 10 2 




Am (130 GeV) 


35.4(1)-10 2 


8.8(2)- 10 1 


20.2(8)- 10° 


1.5(4)- 10- 1 


1.8(4)10 1 


3.7(1)- 10 3 


Am (150 GeV) 


152.5(8)-10 1 


^(lJ-lO 1 


16.0(7)10° 


0.7(3)- lO" 1 


2.6(5)- 10 1 


15.9(1) 10 2 


Am (200 GeV) 


58.9(5) 10 1 


8.4(7) -10° 


11.6(6) 10° 


0.6(1) 10 1 


1.6(3) 10 1 


62.5(6) 10 1 


Am (300 GeV) 


18.1(3) 10 1 


1.8(3) -10° 


5.1(4)- 10° 


0.1(1) 10 1 


0.2(2)- 10 1 


19.0(4) 10 1 


Am (400 GeV) 


7.8(2)-10 1 


0.8(2)10° 


2.0(2)10° 


0.1(1) 10 1 


0.2(2)-10' 


8.4(3)10 1 


nr. ofb-jets>l 


154.9(3)- 10 2 


87.7(2)- 10 2 


107.1(6)- 10 1 


33.8(6)- 10° 


0.6(2)- 10 1 




|sinA^ M | <0.75 
Ipf" <90 GeV 


84.3(2)- 10 2 


49.8(2)- 10 2 


61.7(4)- 10 1 


19.0(5)- 10° 


0.3(2)- 10 1 




44.3(1)-10 2 


33.0(1)-10 2 


8.9(2)- 10 1 


10.7(3)- 10° 


0.3(2)- 10 1 




Am (130 GeV) 


3.1(1) 10 1 


15.1(9) 10° 


7.7(5)- 10° 


0.7(3)10° 


<0.710° 


5.5(2)10 1 


Am (150 GeV) 


14.8(8) -10° 


6.0(6) 10° 


6.2(4)10° 


0.2(1)-10-' 


<0.710° 


2.8(1)- 10 1 


Am (200 GeV) 
Am (300 GeV) 


6.7(5) -10° 
2.2(3)10° 


2.0(3) -10° 
0.6(2)10° 


5.8(4)- 10° 
2.3(3)- 10° 


<0.1 10" 1 
<0.1 10" 1 


<0.710° 
<0.710° 


1.5(1) 10 1 
5.8(8)- 10° 


Am (400 GeV) 


1.1(2)10° 


0.1(1) 10° 


0.4(1)- 10° 


0.1(1) 10 1 


<0.7-10° 


2.3(7)- 10° 



Table 4: a x BR for the background processes obtained after each event selection cut. The upper limits 
are evaluated at 90% CL. Numbers in brackets represent the statistical error on the last digit. 



events are mainly lost due to the limited b-jet reconstruction efficiency, as discussed previously. This is 
in agreement with a much larger signal selection efficiency of 40-50% in the case of the channel with 0 
b-jets. Here, the Z background is observed to have a dominant contribution. The invariant dimuon mass 
distributions obtained for the two channels are shown in Figure 13. 

In the initial phase of the detector running, the b-tagging algorithms still may not have the optimum 
performance. Therefore, the discovery potential has also been evaluated for the case where no b-tagging 
requirement is imposed on the reconstructed jets. Due to the large Z background, the signatures with no 
b-tagging requirement will provide a similar discovery potential as the analysis with 0 b-jets in the final 

5.3 Trigger selection 

Previously calculated event selection efficiencies have been obtained assuming that each analysed event 
can be triggered. In this Section, a realistic trigger description is included to evaluate the effect of the 
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Figure 13: Invariant dimuon mass distributions of the main backgrounds and the A boson signal 
at masses mA=150, 200 and 300 GeV and tan/3 = 30, obtained for the integrated luminosity of 
30 fb _1 . B-tagging has been applied for the event selection. The production rates of H and A 
bosons have been added together, a) for the 0 b-jet final state and b) for the final state with at least 
1 b-jet. 



limited trigger efficiency on the final event selection. 

Since the signal processes are characterized by the two high-pr muons in the final state (see Fig- 
ure 9), the most reliable trigger item is a single high-pj muon with pj >20 GeV. The single-muon 
trigger efficiency is mostly limited by the geometrical acceptance of the trigger chambers in the muon 
spectrometer, as discussed in detail in [29]. The efficiency of the trigger selection for events passing 
all previously described event selection criteria is shown in Table 5 for the signal at 200 GeV and for 
the background samples. A very similar trigger selection efficiency can be observed for all signal and 



Dataset 


LI 


High level trigger 


bbA, 200 GeV 


97.2 


95.0 


it 

bbZ 


97.1 
97.1 


95.1 
94.8 



Table 5: Trigger selection efficiency (%) of events which pass all event selection criteria described 
in Section 5.2. The results are listed for the signal and the two background processes. 

background samples. The relatively high final trigger selection efficiency of ~95% corresponds to the 
decrease of the signal significance by 2-3% compared to the previously shown results. It has been shown 
that the observed efficiencies are effectively stable to a level of about ±0.5% at any level of the event 
selection, since the selection criteria are not affecting the muon kinematics. Furthermore, the trigger 
selection does not induce any bias to the dimuon invariant mass distribution. 

The studies described above have been performed under the assumption that the trigger threshold for 
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a single high-pr muon is 20 GeV. If this threshold should be higher, some of the signal events might 
be lost by the trigger selection. In order to evaluate the sensitivity of the signal selection on the varying 
trigger thresholds, a dedicated study has been performed. New trigger thresholds of 20, 26 and 30 GeV 
are emulated by the requirement that at least one reconstructed muon with a pj above the given threshold 
exists in the event. This muon is also required to have a matching LI region-of-interest in the muon 
spectrometer, in order to take into account the holes in the geometrical trigger acceptance. The trigger 
selection efficiencies for the new trigger thresholds are shown in Figure 14 for events remaining after 
each of the analysis cuts applied on the bbA signal with mass mA=200 GeV. No visible degradation of the 
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Figure 14: Trigger selection efficiency of the bbA signal events (01,4=200 GeV) in dependence 
on the offline analysis selection. Different curves show the results obtained for different trigger 
thresholds. 



final signal selection is observed for the trigger threshold variations of up to 30 GeV. Similar behavior 
is observed also for the signal at other mass points. This result can be explained by a rather high muon 
momenta in the signal samples, such that there is only a small fraction of signal events in which both 
muons have a pj below 30 GeV. The dependence of the tt and Zbb background selection on the trigger 
threshold of up to 30 GeV is shown to be smaller than 2% after all analysis cuts. 

6 Systematic uncertainties 

Several sources of systematic uncertainties can affect the total yields of both the signal and the back- 
ground events after applying the selection criteria specified in the previous Section. In this Section, the 
influence of the theoretical and the detector-related systematic uncertainties is evaluated. 

6.1 Theoretical uncertainties 

As mentioned previously, the Higgs boson production in association with the b-quarks has been simu- 
lated with PYTHIA and SHERPA Monte Carlo generators and the obtained differential distributions are 
scaled to the Feynhiggs NNLO cross-sections. It is important to remark that no cuts have been applied 
on the transverse momenta of the generated b-quarks, since also the event topologies with less than two 
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observed b-jets are studied in the presented analyses. (An outgoing b-quark can be considered as observ- 
able if its momentum is above ~15 GeV). Therefore, the corresponding NNLO cross-sections also have 
to be calculated in an inclusive way, with no constraints on the kinematics of the b-quarks. 

As discussed in Section 2, the total theoretical uncertainty on the inclusive cross-section amounts to 
~17%, depending on the Higgs boson mass. In addition, since the experimental search distinguishes 
between the final states with 0 and those with 1 or more b-jets (with pj >20 GeV), it is important to 
show that the proportion between the mentioned different event topologies which is obtained from the 
Monte Carlo generators also agrees with theory predictions. For this purpose, the fraction of generated 
events is calculated which have exactly one b-quark with pj >15 GeV and |tj| <2.5 in the final state 
(before hadronization). This number is then compared to the ratio of the MCFM cross-section calculated 
for the bg — »• bH process with the same cuts on the b-quarks, divided by the inclusive cross-section. The 
result of the comparison is shown in Figure 15, being comparable to the 20% uncertainty. The SHERPA 
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Figure 15: Fraction of events 0\b with exactly one b-quark with pj >15 GeV and \r\\ <2.5, 
relative to the total inclusive cross-section o tota i , shown in dependence on the Higgs boson mass 
Mn. The lines show the theory prediction as the ratio between the MCFM bg — > Hb and bb — > H 
cross-sections. Solid line: MRST2004 pdf set, dashed line: MRST2002 pdf set. Circles: result 
from SHERPA 1.0.9. Triangles: PYTHIA 6.4 gg -► bbH process. 
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prediction is about 20% higher than PYTHIA and agrees better with the theory calculation with the 
MRST2004 pdf set. The MRST2002 pdf set gives a better compatibility with PYTHIA. Taking into 
account the theory uncertainties mentioned above, one can conclude that the samples produced with both 
generators can reproduce the NLO predictions from a single b-quark rate. 

The observed differences between SHERPA and PYTHIA are visible also in the differential b-quark 
distributions. The differential pj and Y\ -distributions of the leading b-quark, as obtained by the PYTHIA 
and SHERPA generators are compared in Figure 16 for the signal at the mass of 200 GeV. The com- 
parison is performed on a parton level, before hadronization. A slightly harder transverse momentum 
spectrum and a more central pseudorapidity of the b-quarks is observed in the SHERPA events. 

6.2 Detector-related systematic uncertainties 

Systematic uncertainties related to the muon and the jet reconstruction, as well as the uncertainties on 
the b-tagging performance have been evaluated with rather conservative estimates on the level of under- 
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Figure 16: Differential pj and r\ distribution of the leading b-quark, obtained by SHERPA (gray 
histogram) and by the PYTHIA gg — ► bbH calculation (solid line). All histograms have been 
normalized to unity. 



standing of the detector performance. This level is assumed to correspond approximately to an integrated 
luminosity of 1 fb _1 . For each of the above effects, the corresponding change of the reconstructed miss- 
ing transverse energy is taken into account. Further more, systematic effects due to the differences 
between the fast and the full simulation are taken into account by means of the comparison between the 
full and fast simulation. 

Muon reconstruction uncertainties are treated separately for the reconstruction efficiency, muon 
momentum resolution and the muon momentum scale. The efficiency of the muon identification is 
assumed to be known with an accuracy of ±1%, based on the results of the tag-and-probe method for the 
muon efficiency measurement from the real data [30]. Systematic errors of the muon momentum scale 
are taken to be ±1%, arising for instance from the non-perfect knowledge of the magnetic field. The 
incomplete understanding of the material distributions inside the detector, as well as possible residual 
detector misalignment can lead to an additional smearing of the muon momentum resolution. Based 
on early detector calibration, the additional smearing is expected to be cr(^) = ^j^- ©0.00017 (in 
l/(GeV)). The first term enhances the effect of the Coulomb scattering, while the second enhances the 
contribution from the misalignment. 

Jet reconstruction uncertainties are estimated by the jet performance group [25]. In the pseudo- 
rapidity region below |t]|=3.2, the jet energy scale uncertainty of ±3% and a resolution uncertainty of 
g{E) = 0.45 • y/E are assumed. For \r\ \ >3.2, the corresponding values are ±10% and <j(E) = 0.63 • \[~E 
respectively. 

b-tagging efficiency and the fake rate are crucial for the described analysis. Conservative relative 
uncertainty of ±5% on the b-tagging efficiency and ±10% uncertainty on the rejection of the light jets 
have been assumed. 

The results obtained after implementing each of the above systematic uncertainties separately into 
the analysis are shown in Table 6 for the signal at m^=150 GeV and for the backgrounds within the 
corresponding mass window. Signals at different mass points are affected by a similar amount. The 
background uncertainties are also rather independent of the dimuon mass region, but from one exception. 
The muon momentum scale mostly affects the background in the lower mass region, close to the Z 
resonance, while the deviations become smaller for the higher signal masses, i.e. one observes ±11%, 
±5% and ±3% for the Higgs masses of 110, 130 and 200 GeV respectively. 
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Systematic uncertainty [%] 


Signal 


tt 


Z+ b jet 


Z+light jet 


Muon efficiency 


±2 


±1 


±2 


±2 


Muon pr scale 


±2 


±3 


±3 


±5 


Muon resolution 


-2 


-4 


3 


3 


Jet energy scale 


±1 


±5 


±2 


±2 


Jet energy resolution 


-1 


-3 


-1 


-1 


b-tagging efficiency 


±4 


±3 


±4 


±2 


b-tagging fake rate 


±1 


±0 


±0 


±6 


Full-Atlfast corrections 


0 


+8 


-10 


+5 


Total 


±5 


±12 


±12 


±11 



Table 6: Relative deviation of the selection efficiency in % for the signal at ra,4=150 GeV and for 
the background events after imposing each of the systematic uncertainties separately, as described 
in the text. The total deviation is given as quadratic sum of the separate contributions, including 
the one-sided corrections. 



7 Background estimation based on the measured data 

As previously discussed, theoretical and experimental uncertainties can lead to systematic errors in de- 
termination of the background rates. Additional information on the shape and the size of the background 
contributions can be collected from the real data. As shown before, two major backgrounds surviving all 
event selection criteria are the Z + jet and the tt processes. Two strategies to estimate their contributions 
from the measured data shall now be described. 

The first method makes use of the fact that the branching ratio for A/H boson decays into two 
electrons is negligible compared to the dimuon decay channel. Therefore, since one doesn't expect 
any signal in the dielectron final state, one can use this signature to determine the total background 
contribution. Additionally, the signatures with one electron and one muon in the final state provide 
the contribution of the tt background alone, since the Z + jet processes do not contribute to this final 
state. Thus, one can separately measure the two background contributions. The background estimation 
based on the -channel has been discussed in detail in [31]. Good agreement has been demonstrated 
between the dilepton invariant mass distributions for the Z —> jit/i and the Z — > ee processes. In this paper, 
the emphasis is given to a similar procedure for the determination of the tt background. 

The goal of the second method is to define a set of event selection criteria which allow for the 
higher selection efficiency for the particular background process, while simultaneously rejecting all other 
signal and background contributions. Such background-enriched control sample can be used to better 
understand the shape of the invariant dimuon mass distribution. 

7.1 Background estimation based on the e + e and /i ± e =F signatures 

The estimation of the tt background is important for the analysis channel with at least one b-jet in the 
final state. A study is performed using fast simulation of the tt background, in order to obtain a reliable 
statistical accuracy. The detector performance given by the fast simulation has been adjusted such to 
reproduce the performance obtained with the full simulation, as mentioned previously. Based on the 
studies in [31], the shape of the dilepton invariant mass distribution obtained for the jU + jU~, e + e~ and 
jX ± e T final state in the tt process are expected to be very similar. The total number of background events 
selected in each of the three final states will be different due to different reconstruction efficiencies for 
muons and electrons. However, since these efficiencies can be experimentally measured with an accuracy 



1408 



Higgs - Search for the Neutral MSSM Higgs Bosons in the Decay Channel . 



of better than 1%, this effect can be corrected for rather precisely. Additional differences may occur due 
to an additional calorimeter activity in presence of electrons. This can be taken into account by rejecting 
all reconstructed jets which overlap with reconstructed electrons. 
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Figure 17: (a) The jU + jU~ invariant mass distribution (stars) of the tt background, and its estimates 
obtained from the e + e~ (triangles) and \i^e T (full circles) final states, (b) Corresponding ratios 
of estimated and actual \l + \l~ invariant mass distributions. 



Figure 17(a) shows the invariant mass distributions obtained for the H + /J.~, e + e~ and \i^e T final 
states in it events. The dielectron distribution has been scaled down by a factor 0.84 2 , to account for 
the difference between the electron and muon reconstruction efficiency. Similarly, the -distribution 
has been scaled down by 0.5x0.84. Figure 17(b) shows the corresponding ratios of estimated and actual 
jU + jU~ invariant mass distribution. The subtraction of the \i^e T sample from the total background is 
illustrated in Figure 18 for =30 fb _1 , for the analysis requiring identification of at least one b-jet. 




nrvn (GeV) 



Figure 18: Invariant dimuon mass as in Figure 13(b), after the subtraction of the tt background 
estimated in the [^e^ final state. The distributions correspond to an integrated luminosity of 
jgf =30 fb" 1 . 
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7.2 Background estimation from the control samples 

In analyses with 0 b-jets in the final state the dominant background is the Z + jet process. Since the 
topology of this process is very similar to the signal topology, it is rather difficult to define a set of 
selection criteria which would allow for the extraction of the Z background and simultaneous rejection 
of the signal. 

Contrary to that, the it process is characterized by a relatively high missing transverse energy and a 
high jet activity. Since this background is important for the analysis with at least one jet in the final state, 
the event selection criteria of this analysis are modified by selecting only events with a missing transverse 
energy above 60 GeV and by removing the cut on the Y,Pt'~ variable. All other selection criteria remain 
the same. The purity of the it control sample obtained after the described event selection is shown in 
Figure 19(a). All remaining processes are suppressed to a negligible amount. Figures 19(b) and (c) 




m w , (GeV) m w , (GeV) m w , (GeV) 

Figure 19: (a) it control sample (stacked histogram) obtained as described in text. Full circles indi- 
cate the actually measured ju + jU~ invariant mass distribution for the it background, after applying 
the standard analysis selection cuts, (b) The measured (full circles) and the control it distributions 
(open squares), normalized to the same number of events, (c) The ratio of normalized distributions 
from (b). 



show the it background obtained with the selection criteria described in Section 5.2 for at least one 
b-jet in the final state (it measured) compared to the distribution obtained from the it control sample. 
Both distributions are normalized to the same number of events. The shape of the it background can be 
estimated by the described procedure with an accuracy of 10-20%. 



7.3 Fit function for the parametrization of the background 

The background can be parametrized by the function fs consisting of a Breit-Wigner and an exponential 
contribution, 

MX) = a i\ ^-Ml)+MlTl +a2 - eXp{ - a '- x) ] ' (1) 
where x is the running dimuon mass, while a\, a-i and a^, are the free parametrization parameters. The 
mean Mz and the width Tz of the Breit-Wigner distribution describe the Z resonance. The parameters 
^2,3 describing the exponential part can be determined by the fit on the background estimated from data, 
as described in the previous subsection. The overall normalization factor a\ is determined by the fit on 
the side bands of the dimuon mass distribution. Figure 20 shows the result of the fit on the background 
distribution obtained after all analysis cuts for the case with at least one b-jet in the final state. The 
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two dashed lines correspond to the errors on the «2,3 parameters, as obtained from the e + e data at an 
integrated luminosity of 10 fb _1 . 




150 200 250 300 



Figure 20: Fit (full line) of the background function f^g (see Eq. 1) on the background distribution 
(full circles) resulting from the analysis with at least one b-jet in the final state. The dashed lines 
represent the shape variations given by the errors on the ct2 and c?3 parameters, from the fit on the 



The accuracy of the background parametrization by the described method has been tested by means 
of large number of toy Monte Carlo experiments at different integrated luminosities Jzf. Typical results 
of the background extraction in the mass window from 188-212 GeV are shown in Figure 21 for an inte- 
grated luminosity of 15 fb _1 (left) and 3 fb _1 (right). The empirically evaluated expected uncertainty of 




Figure 21: Normalized difference of the expected number of background events (BKG expecte d) in 
the mass window from 188-212 GeV, and the number {BKGfuted) obtained from the fit method 
described in the text (left) for an integrated luminosity of =15 fb 1 and (b) =3 fb 



the background determination from the fit to data is ~8%/v / -£?[fb~ 1 ]- The variation of the exponential 
shape, due to the errors on the fit parameters «2 and c?3, plays a no n-margin al role only for dimuon masses 
above 300 GeV, decreasing the relative uncertainty to ~10%/v / -£^[fb~ 1 ]. Conservatively the latter ex- 
pression will be used in the analyses discussed below. Similar fit procedure has been performed also for 
the analysis with 0 b-je ts in the f inal state. Due to the larger amount of background, a smaller background 
uncertainty of ~2%/A/Jz^[fb~ 1 ] can be obtained in this case, including the systematic uncertainty on the 
shape parameters c?2 and 03. 



1411 



Higgs - Search for the Neutral MSSM Higgs Bosons in the Decay Channel . 



8 Evaluation of the signal significance 

Two approaches have been applied to evaluate the statistical significance of the observed signal. The 
"fixed mass" approach provides the significance based on the number of signal and background events 
which are expected in a given range of the dimuon invariant mass. However, the location of the signal 
mass peak can be determined only if the signal rates are sufficiently high, or if the signal is already 
discovered in the A/H/h — ► xx decay channel. In a more general "floating mass" approach, no constraints 
are applied on the dimuon invariant mass. 

The number of signal and background events can be determined from the fit of the function fss to 
the data, 



where fs is the Gaussian distribution describing the signal. p2 is the mass of the signal (which can 
be fixed or left as a free, "floating" parameter) and Oa is the width of the Higgs resonance, po is the 
background scale and p\ the total number of signal events. The number of signal (Ns) and background 
(Nb) events used for the calculation of the signal significance is extracted from the fit, by integration in 
a window of ±2<Ja- The signal significance is evaluated by means of the profile likelihood method [32], 
using the obtained number of s ignal and background events as an input and taking into account the 
background uncertainty (10%/y/Jf [fb -1 ]) as discussed in Section 7.3 above. 

The results obtained for the signal significance have been cross-checked by the large number of 
Monte Carlo pseudoexperiments for several different integrated luminosities. The pseudoexperiments 
are based on the Monte Carlo distributions and are divided into the "background-only" experiments 
(BO), containing only background contributions, and the "signal-plus-background" experiments (SpB) 
having both the signal and the background contribution. In addition to the profile likelihood calculation 
in each of these experiments, the signal significance can be estimated also from the log-likelihood ratios. 
The log-likelihood ratio InQso and IuQs p b, both defined as (Ns + Nb) In Ns ^ B — Ns, are evaluated for 
each BO- and S/?fi-pseudoexperiment. The signal significance is obtained from the probability of a Type- 
II error, defined by the fraction of BO pseudoexperiments which have a log-likelihood ratio InQso larger 
than the median of the In Qspfi-distribution. The probability of a Type-I error, i.e. the number of SpB- 
pseudoexperiments which fall below the median of the In Qso-distribution, was used to determine the 
95% CL limits. 

Figure 22(left) shows the comparison of the signal significances obtained in the fixed-mass approach. 
The solid line shows the results of the profile likelihood method, while the dots are the results given by 
the Type-II error probabilities obtained in the pseudoexperiments. At very low luminosities, the profile- 
likelihood estimation based on average values seems to slightly overestimate the significance. However, 
at luminosities close to those needed for the 5a-significance, the two calculations give equivalent re- 
sults. Figure 22(right) shows the degradation of the signal significance observed once the floating-mass 
approach is applied in which the Higgs mass is left as a free parameter of the fit (usually reffered to as 
a look-elsewhere effect). In general, the ratio of the Type-II error probabilities obtained from the fixed- 
mass and the floating-mass approaches is constant and approximatelly equal to the explored mass range 
divided by the signal mass width. Correspondingly, the ratio of signal significances is lower at low lu- 
minosities, while the difference between the two approaches is reduced to 5% or less at the luminosities 
close to those needed for a 5a-discovery. 

8.1 Influence of the systematic uncertainties on the signal significance 

In order to include the systematic uncertainties in the calculation of the signal significance, one should 
perform additional large number of pseudoexperiments for each of the systematic effects. However, the 



fsB(x) =Po-/b + P\ ■ fs = Po ■ fB + P\ j== ■ exp 

a A V2n 




(2) 
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Figure 22: (left) Signal significance obtained in the fixed-mass approach for the signal at 
m^=200 GeV and tan/3=30, as a function of integrated luminosity. The full line corresponds to 
the results obtained by the profile likelihood method using the average number of expected signal 
and background events, while the dots show the results obtained from the Type-II error probability 
from a large amount of pseudoexperiments. Both estimations include the background uncertainty 
from the /sB-fit. (right) Ratio of signal significances obtained with the floating- and the fixed-mass 
approach, as a function of the integrated luminosity. 



difference between the signal significance in the floating-mass and in the fixed-mass approach is small 
compared to the effect of the additional systematic uncertainties. Thus, in the following the fixed-mass 
approach is used and the full treatment of the look-elsewhere effect is left for the future studies. 

By means of the background estimation from the data, the amount of the background events under- 
neath the signal can be determined with an accuracy of ~ 1 0%/ ^/ <5f [fb _1 ], as described in Section 7. 
This can be achieved independently of the systematic uncertainties discussed in Section 6. 

Therefore, the influence of the systematic uncertainties on the signal significance is given by the 
corresponding changes in the expected number of signal and background events. The numbers entering 
the significance calculation are therefore changed accordingly, taking into account that the systematic 
uncertainty is different for different background processes. The signal significance is calculated sepa- 
rately for each systematic effect and the deviations from the original signal significance are added in 
quadrature. Figure 23 shows the signal significance obtained for two different masses rriA, as a function 
of tan j8 . The contributions of the uncertainty in the background determination from the fit to data and of 
the experimental systematic uncertainties to the expected significance are shown separately. 

9 Discovery potential and the exclusion limits 

The previously described methods for the evaluation of the signal significance and of the exclusion limits 
have been applied to all mass points studied. Table 7 summarizes the signal significances obtained for 
different signal mass points at tan/3=30 and «Sf=10 fb _1 , for the analysis with 0 b-jets and with at least 
one b-jet in the final state. The luminosities needed to reach the 5a signal significance and the 95% 
CL exclusion limit are given in Table 8, for different signal masses at tan j3=30. The results shown so 
far do not include the possible degradation due to the influence of pile-up and cavern background. The 
degraded resolution of the missing transverse energy is expected to cause a ~15% change in the final 
selection of the signal and the Z background. The tt background is characterized by a large Ej lss and is 
therefore somewhat less sensitive to the Ej lss performance under pile-up (~10% change in the selection 
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L=10 fb" 1 , m A =300 GeV 

- No background uncertainty 

- Bkg from the fit to data 
Systematics included 



Figure 23: Signal significance as a function of tan/3 for the integrated luminosity of 10 fb -1 and 
m^=150 GeV (left) and mA=300 GeV (right). The dotted curves are obtained assuming a negligible 
error of the background determination. The dashed curves show the result once the background 
uncertainty is taken into account, as obtained from the fit to the data. The solid curves additionally 
include the systematic uncertainties. The width of the solid curves indicates the errors in the 
background shape parametrization. 











Signal 


significance 








m A 


No background 


Backj 


*r. uncertainty 


Experimental 


Theoretical 


(GeV) 


uncertainty 


of lO%/y/^[fb~ l ] 


syst. uncertainty 


uncertainty 




0b 


>lb 


0b 


>lb 


0b 


>lb 


0b 


>lb 


110 


5.5 


6.4 


2.7 


4.5 


2.4-3.1 


3.8-5.3 


2.2 - 3.2 


3.7-5.3 


130 


5.6 


6.6 


3.6 


5.3 


3.4-3.8 


4.8-5.9 


3.1-4.1 


4.5-6.1 


150 


5.2 


5.8 


4.1 


5.2 


3.9-4.3 


4.8-5.6 


3.9-4.7 


4.5 - 5.9 


200 


3.1 


3.8 


2.8 


3.6 


2.7 - 2.9 


3.3-3.9 


2.5-3.1 


3.2-4.0 


300 


1.2 


1.8 


1.1 


1.8 


1.0-1.2 


1.6-2.0 


1.0- 1.2 


1.7 - 1.9 


400 


0.5 


0.8 


0.5 


0.8 


0.4 - 0.5 


0.7 - 0.9 


0.4 - 0.5 


0.7 - 0.9 



Table 7: Signal significance for signal at different mass points, with tan/3=30 and Jz?=10 fb -1 . 
The numbers are shown for different levels of the background uncertainty. For degenerate A, H 
and h boson states with the same mass, the production rates have been summed. 



efficiency). Table 9 summarizes the results obtained with pile-up effects taken into account. Only small 
changes are observed compared to the previous results. 

The results shown in Tables 7 and 8 can be extrapolated to other tan /3 values. The only difference 
which occurs for the a given signal mass point when changing the tan/3 value is the change of the 
production rates and of the natural width of the A/H/h bosons. In the («u — tan/3) plane which is of 
interest for this analysis, the resolution of the mass measurement is mostly dominated by the experimental 
resolution. Nevertheless, the variation of the natural Higgs width is taken into account in the calculation 
by increasing the background contribution correspondingly to the expected change of the mass window. 

Using the previously described production cross-sections and branching ratios, the 5a-discovery 
curves are obtained as shown in Figure 24(left) separately for the analysis with 0 and with at least one b- 
jet in the final state. The luminosity needed for the exclusion of the signal hypothesis at a 95% confidence 
level is shown in Figure 24(right). 

The signatures with the b-jets in the final state allow for the highest discovery potential. The 0 b-jet 
final state plays nevertheless an important role. Since the search in this final state is uncorrected to the 
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m A 


^[fb- 1 


for a 5a discovery 




JSfffb- 


'] for 95% C.L. exclusion 


(GeV) 


(No systematics) 


No systematics 


With systematics 




Ob 


Mb 


comb. 


0b 


>lb 


comb. 


0b 


>lb comb. 


110 


20.9 


12.2 


7.7 


1.8 


2.0 


0.9 


2.7 


2.8 1.4 


130 


12.0 


8.9 


5.1 


1.9 


1.4 


0.8 


3.6 


1.9 1.2 


150 


10.5 


9.3 


4.9 


1.6 


1.5 


0.8 


2.2 


2.0 1.0 


200 


25.9 


19.1 


11.0 


3.7 


3.1 


1.7 


4.9 


3.8 2.1 


300 


174.6 


81.6 


55.6 


38.6 


13.8 


10.2 


43.8 


16.4 12.0 


400 


1124.0 


444.8 


318.7 


320.0 


75.1 


60.8 


361.0 


86.5 69.8 



Table 8: Integrated luminosity (in fb ) needed for a 5a signal significance and 95% CL exclusion 
of the signal hypothesis, shown for the signal at tan/3=30. For degenerate A, H and h boson states 
with the same mass, the production rates have been summed. 



Jz^ffb '] for 5a discovery 



^[ftr 1 ] for 95% CL exclusion 



m A {GeV) 



110 
130 
150 
200 



8.7 
5.8 
5.6 
12.5 
62.9 
359.5 



1.9 
1.6 
1.3 
2.6 
13.6 



Table 9: Luminosity needed for the 5a signal significance (no systematics) and the 95% CL 
exclusion of the signal hypothesis (with systematics), obtained for the combined analyses, with 
the pile-up effects taken into account. The results are shown for tan /3=30. 
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Figure 24: tan/3 values needed for the 5a-discovery (left) and for the 95% CL exclusion of the 
signal hypothesis (right), shown in dependence on the A boson mass. 



previous one, one can quadratically add the signal significances obtained from the two analyses. The 
5a-discovery curves obtained from the combination of both analyses, as well as the combined 95% CL 
exclusion limits are shown in Figure 25. At an integrated luminosity of 10 fb -1 , the discovery can be 
reached for m A masses up to 350 GeV with tan/3 values between 25 and 60. For m A masses below 
110 GeV the sensitivity drops rapidly as shown in Ref [5], due to the increasing Drell-Yan background 
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Figure 25: Combined analyses results: (left) tan/3 values needed for the 5a-discovery at 
Jz?=10 fb -1 and Jz?=30 fb -1 , shown in dependence on the A boson mass and (right) combined 
95% CL exlucion hmits. 



close to the pole mass of the Z boson. The tan/3 values above ~16 can be excluded already with 10 fb 1 
of integrated luminosity in case of the Higgs boson masses up to 200 GeV. 

10 Conclusions 

In this note, the potential for the discovery of the neutral MSSM Higgs boson is evaluated in the dimuon 
decay channel. As opposed to the Standard Model predictions, the decay of neutral MSSM Higgs bosons 
A, H and h into two muons is strongly enhanced in the MSSM. In addition, the jU + jU~ final state provides 
a very clean signature in the detector. 

The event selection criteria are optimized in the signal mass range from 100 to 500 GeV, separately 
for the signatures with 0 b-jets and with at least one b-jet in the final state. The obtained combined result 
shows that an integrated luminosity of 10 fb -1 allows for the discovery for niA masses up to 350 GeV 
with tan /3 values between 30 and 60. Three times higher luminosity allows for an increased sensitivity 
down to tan /3 =20. The theoretical and detector-related systematic uncertainties are shown to degrade 
the signal significance by up to 20%. This takes into account that the background contribution can be 
estimated from the data with an accuracy of ~2-10%/J2?[fb~ 1 ]. 
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Sensitivity to an Invisibly Decaying Higgs Boson 

Abstract 

Many extensions of the Standard Model include Higgs bosons decaying pre- 
dominantly or partially to non-interacting particles such as the SUSY Lightest 
Supersymmetric Particle (LSP). To set limits on the production cross-section 
times the branching fraction to invisible decay products of such Higgs bosons 
with the ATLAS detector requires an examination of specific production modes 
such as the associated production (ZH) or the vector boson fusion (VBF) pro- 
cess. The predominant Standard Model backgrounds for these processes are 
ZZ — > iiw for the ZH channel and jets from QCD processes and W ± or Z 
bosons produced in association with jets for the VBF channel. The sensitivity 
to an invisibly decaying Higgs boson is investigated in this paper using fully 
simulated ATLAS data for both signal and background. The ATLAS potential 
for triggering these events is also discussed. 

1 Introduction 

Some extensions of the Standard Model predict that Higgs bosons could decay into stable neutral weakly 
interacting particles, leading to invisible Higgs boson decays. The Higgs boson decay products could 
be for example neutralinos, gravitinos, gravitons or Majorons [1-3]. In the case of the Minimal Su- 
persymmetric Standard Model (MSSM), if R-parity is conserved, Higgs bosons decaying into a pair of 
neutralinos may in some cases even dominate [1]. Being the lightest supersymmetric particles, neutrali- 
nos would be stable and would leave the detector without decaying, remaining invisible. If R-parity is 
violated, then Higgs bosons could decay into Majorons, which would interact too weakly to allow detec- 
tion [2]. Some theories with extra dimensions also predict invisible Higgs boson decays, and have the 
added advantage of generating neutrino masses [3]. This search is sensitive to any boson coupling to Z 
or W and decaying invisibly. The combined LEP Higgs boson mass limit in this channel is 1 14.4 GeV 
[4]. 

At the Large Hadron Collider, Higgs boson production could occur through several mechanisms. 
To select and identify events with an invisibly decaying Higgs boson one must be able to trigger on a 
signature that is visible in the event. This is possible for channels such as Vector Boson Fusion (VBF) 
qqH [5], ttH [6] and the associated production processes, ZH and W ± H [7, 8]. Although gluon fusion 
has a much higher Higgs boson production cross-section than these modes [9] , it is not possible to trigger 
on these events when the Higgs boson decays invisibly. 

In this paper, the ATLAS sensitivity to an invisibly decaying Higgs boson is determined in a way that 
does not depend on a specific extension of the Standard Model. The analysis uses the variable <§ 2 which 
is defined as, 

§ 2 = aR(ff-™)— (i) 

where Gbsm represents the "Beyond the Standard Model" cross-section and Osm represents the Standard 
Model cross-section. In the case for which the Higgs boson decays entirely to the invisible mode, E, 2 
is the ratio between the non-Standard Model cross-section and the Standard Model cross-section. Only 
two of the three possible production modes are considered in this paper, VBF and associated production. 
In addition, for associated production, only the ZH mode is considered as the background to the W ± H 
signal is overwhelming [10]. 

In this paper, the Monte Carlo samples, for both the VBF and the ZH channels, the trigger, the event 
selection, the systematic uncertainties and the results are discussed. We conclude by summarizing the 
limits on £ 2 for an invisibly decaying Higgs boson. 
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This analysis compares signals and backgrounds that have been generated using Standard Model 
processes. The Higgs boson signal events are simulated to be invisible by changing the properties of the 
Higgs boson decay chain. In reality an invisibly decaying Higgs boson would be expected to result from 
a process not contained within the Standard Model, and in this case backgrounds associated with this new 
physics would be important. However, consideration of "Beyond the Standard Model" backgrounds is 
beyond the scope of this paper as they would have to be considered in the context of each specific model. 
This analysis assumes Standard Model backgrounds and serves as a limiting case for the indication of a 
particle that behaves like a Higgs boson that is not consistent with the Standard Model. 

2 The Vector Boson Fusion qqH production channel 

The vector boson fusion (VBF) channel has the second largest production mode after gluon fusion and 
has the largest production cross-section for observable invisible Higgs boson decays. The VBF invisible 
Higgs boson production mode, Figure 1 , is characterized by two outgoing jets resulting from the inter- 
acting quarks, and large missing transverse energy from the Higgs boson. The topology of the jets is 
particularly useful in selecting the events as the jets are preferentially separated in pseudo-rapidity (77) 
and are correlated in the azimuthal angle 0. In addition, the lack of colour flow between the two jets leads 
to minimal jet activity between the two tagging jets which is potentially useful for selecting events. At 
high luminosity however, central jet activity resulting from overlapping events may become problematic 
for cuts based on this event characteristic. 

92 94 



91 93 

Figure 1: Feynman diagram of the VBF process. The V represents either a Z or W boson. 

The study of the VBF channel for this paper includes a mass scan with an estimate of the sensitivity to 
Higgs boson masses between 1 10 to 250 GeV This is based on fully reconstructed signal and background 
events. In addition to the sensitivity, the trigger acceptance for this channel has been investigated. 

2.1 Monte Carlo generation for the VBF analysis 

Signal and background samples were generated using the standard version of ATLAS software used for 
this set of papers. The samples were used to determine the sensitivity of ATLAS to an invisible Higgs 
boson, taking into account the trigger and analysis efficiencies. A number of backgrounds with signatures 
similar to the signal have been studied and are listed here. In all cases, £ represents e or jj,. 

1 . Dijet production from QCD processes form a major background due to the very large cross-section 
for these processes. Fake missing energy measurements can arise from poorly instrumented re- 
gions or inefficiencies in the detector. 

2. W+jet processes with W —>■ £v mimic the signal when the lepton is outside the detector acceptance. 
The neutrino provides the missing energy signature. 
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Higgs boson Mass [GeV ] 


Cross-section [pb] 


# Events 


Generator 


110 


4.63 


10k 


JIMMY 


130 


3.96 


30k 


JIMMY 


130 


3.93 


10k 


PYTHIA6.4 


200 


2.41 


10k 


JIMMY 


250 


1.79 


10k 


JIMMY 



Table 1: Invisible Higgs boson samples generated for this analysis. Leading-order cross-sections 
for the Higgs boson produced via Vector Boson Fusion were evaluated by the ATLAS Higgs 
Working group [15]. 



3. Z+jet processes with Z — > vv constitute an irreducible background. 

4. Z+jet with Z — >• 11 forms a background to the signal when the leptons are not within the acceptance 
of the detector. 

Event generation for the VBF channel has proved challenging given that the predicted X] distributions 
of tagging jets differ greatly according to the event simulation model used. The HERWIG- JIMMY 
package [11], [12], [13] represents an average response of the available models and has been used to 
generate data for the Higgs boson mass scan. Signal events have been generated with both HERWIG 
and PYTHIA [14] at a Higgs boson mass of 130 GeV to estimate the contribution of this effect to the 
systematic uncertainty. To generate an invisible Higgs boson sample, the Higgs boson is forced to decay 
into two Z bosons which are subsequently forced to decay into neutrinos. The set of data samples 
produced for this analysis is summarized in Table 1. The VBF signal Monte-Carlo was produced to 
leading order. The difference between LO and NLO cross-section is negligible, ~ 1%[9], therefore the 
LO cross-sections were been used for the signal in this note. Table 1 includes the Standard Model VBF 
Higgs boson production cross-section that were used [9]. 

For an ideal detector, event selection cuts efficiently remove the QCD dijet background. However, 
this background is considered because of the large cross-section for the process and the presence of 
poorly instrumented regions and dead regions generating false missing transverse energy (Ej lss ) signals. 
In order to provide enough statistics throughout the full transverse momentum (pj) spectrum, the QCD 
background was divided into several pr ranges to produce several sub-samples, of approximately equal 
number of events, as shown in Table 2 [16]. The surviving background comes from the high pj bins 
allowing reasonable statistics in the final sample. Thus the binning is used to generate the QCD back- 
ground within a reasonable amount of computer time and allow for the very high rejection factor for this 
background in this analysis. 

In previous ATLAS studies of the invisible Higgs boson produced via the VBF process, the PYTHIA 
package has been used to generate both the W+jet and Z+jet backgrounds. However, the PYTHIA 
implementation for these backgrounds only includes the matrix element term for the qq — > gV and qg —> 
qV processes. The PYTHIA implementation tends to underestimate the Z+2jets process because it does 
not include a complete matrix element calculation. In contrast, ALPGEN [17] provides an exact matrix 
calculation at tree level for up to 3 partons. For this reason, ALPGEN was used to produce both the W+jet 
and Z+jet backgrounds. Within ALPGEN, there are two different Z+jet implementations, one which only 
includes QCD matrix element terms and the second which includes the QCD and EW matrix element 
terms. In the second case, only on-shell bosons are created without Z/y* interference, in contrast, the first 
case does include these effects. By comparing events generated with the two different implementations it 
was found that the QCD-only process underestimates the background by ~25%. The effect of a non-zero 
Z boson width was checked by varying the Z boson mass. This showed that the result was insensitive to 
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Jet Sample | o(pb) | Arrange [GeVT 



JO 


1.76 xlO 10 


8-17 


Jl 


1.38xlO y 


17-35 


J2 


9.33x10' 


35-70 


J3 


5.88xl0 6 


70-140 


J4 


3.08xl0 5 


140-280 


J5 


1.25xl0 4 


280-560 


J6 


3.60xl0 2 


560-1120 


J7 


5.71x10° 


1120-2240 



Table 2: Cross-section and pj range for the QCD dijet background samples generated with 
PYTHIA. The cross-sections given in the second column are for the specific pj range with no 
other cuts. 



the use of on-shell bosons. Therefore, all ALPGEN samples used in this study were generated using the 
option that included both QCD+EW terms. 

For the Z+jet background, three samples were produced for each of the two decay modes, Z — > vv and 
Z — ► it. Two exclusive samples were produced for the one and two partem final states and one inclusive 
sample was produced for three or more partons. Default ALPGEN settings were used to generate events 
except for a cut to remove very low Ef lss events by setting E™ lss > 10 GeV and a change in acceptance 
to ensure complete Tj coverage by opening the phase space setting for both jets and leptons (\r]j\ < 6 and 
M < 6). 

In this study, only the leptonic decay of the W boson from the W+jet background was considered. 
The Ej lss arises from a combination of the Ef' ss associated with the neutrino and the lepton energy in 
the case where the lepton escapes detection. The W+jet background was generated in the same manner 
as for the Z+jet background. 

2.2 Trigger 

The major challenge for triggering candidate events for the VBF invisible Higgs boson analysis is to 
retain signal events whilst reducing the very large QCD background to an acceptable level. These prob- 
lems are particularly acute with the first level trigger (LI) which can easily be overwhelmed by the QCD 
background. A trigger for these signal events is possible using a relatively high Ef lss cut while selecting 
one or two jets of moderate transverse energy. For triggers of this type, QCD backgrounds dominate. 
In order to produce an acceptable rate for the High Level Trigger (HLT), the trigger menu items used to 
select invisible Higgs boson events should add no more than a few Hz of trigger rate, even at the highest 
luminosities. 

The trigger study of this note is based on the standard full ATLAS simulation of the LI trigger. 
The HLT has not been considered, as at the time of the study HLT algorithms for Ef lss had not been 
fully implemented and there had been no simulation of forward jets. Jets are classified into central jets 
|tj| < 3.2 and forward jets 3.2 < |tj| < 5. 

Data for the trigger study consisted of the sample of VBF Invisible Higgs boson events with a Higgs 
boson mass of 130 GeV produced using HERWIG and a sample of QCD dijet produced using PYTHIA 
as described in Section 2.1. The results of this study are shown in Table 3 for 10 31 cm~ 2 s _1 luminosity. 
The acceptances shown in Table 3 give the effect of the trigger on the VBF Higgs boson samples used 
in the VBF analysis. As such the acceptance is defined as the number of signal events that survive both 
the trigger and the data selection cuts described in Section 2.3.1, divided by the number of events that 
survive the selection cuts alone. The trigger rates in Table 3 are the expected raw rates for the specific 
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Trigger Menu 


Acceptance[%] 


Rates [Hz] for 




normalized 


Jz?= 10 31 cm~ 2 s _1 




to offline cuts 




L1_XE60 


99 


5.0 ± 1.3 


L1JXE70 


98 


1.5 ± 0.6 


L1_XE80 


96 


0.6 ±0.1 


L1J£E100 


84 


0.2 ±0.1 


L1_XE120 


70 


0.1 ±0.1 


L1_FJ23+XE70 


78 


0.9 ± 0.6 


L1J23+L1JXE70 


83 


1.4 ±0.6 


L1J23+L1JXE100 


73 


0.2 ±0.1 


L1_FJ23+L1J<;E100 


66 


0.0 ± 0.0 


LI _FJ23+L1 J23+L1JXE70 


62 


0.9 ± 0.6 


L1_FJ23+L1 J23+L1JXE100 


55 


0.0 ± 0.0 



Table 3: Signal acceptance and level one trigger rates for the VBF invisible Higgs boson channel 
based on full ATLAS simulations and with mu = 130 GeV . The LI trigger menu items are Ef us 
(L1_XE) central jet (L1_J) and forward jet (L1_FJ), see text. The number following the menu 
object indicates the trigger threshold given in GeV. Both single and combined triggers are shown 
in this Table. The values are given for a luminosity of 10 31 cm~ 2 s _1 and do not account for 
pile-up effects. 



trigger. One expects an overlap with other trigger signatures such that the additional rate produced by 
these signatures will be less than the calculated raw rates. 

The numbers given in Table 3 are the best estimation we currently have of the trigger rates. In reality 
beam conditions and detector effects could lead to much higher values. It is clear that the trigger strategy 
will depend on these background effects and on the luminosity. At low luminosities, (10 31 cm~ 2 s _1 ) it 
is likely that a simple trigger based on Ef lss alone such as Ej lss > 70 GeV (L1_XE70) will be sufficient. 
However if the trigger rate for this item is higher than expected, VBF invisible Higgs boson events 
could still be triggered using a higher E™ lss trigger such as Ej lss > 80 GeV (L1_XE80) or Ef lss > 100 
GeV (L1JXE100) whichever one can be used without pre-scaling. As the luminosity increases or if 
backgrounds are worse than expected, it will be necessary to use a combined trigger for this channel. 
The numbers in Table 3 suggest that triggers based on E™ lss and either a forward or central jet would be 
sufficient. However the addition of a single jet to an Ef lss trigger provides a relatively small reduction in 
rate due to correlations that can occur when high energy jets are mis-measured. There is concern that this 
could be amplified by pile-up effects. Requiring a forward jet plus a central jet plus E™ lss is expected to 
solve these problems albeit with a reduction of signal acceptance. For a luminosity of 10 33 cm~ 2 s~ 1 , the 
most conservative trigger option is a combined trigger with Ej lss > 100 GeV and a forward and a central 
jet each with a pj > 23 GeV. This trigger will have an acceptance rate for the signal of 55% as shown 
in Table 3. Note that the uncertainties shown in Table 3 are statistical only. A further major uncertainty 
in trigger rates are pile-up effects which have not been considered here. In practice, adjustments will be 
required to select the optimum trigger based on the experimental rates observed at the LHC. 

Invisibly decaying Higgs boson events produced via vector boson fusion can be selected using a 
combination of E™ lss and jet triggers with a small impact on the overall LI trigger rate. For low lumi- 
nosities (10 31 cm~ 2 s _1 ) it is expected that a E™ lss trigger of 70 GeV or greater will be sufficient. For 
higher luminosities such as 10 33 cm~ 2 s _1 , a trigger with Ej iss > 100 GeV and a forward and central jet 
each with pj > 23 GeV will be required. 
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2.3 Event selection for the VBF channel 

Two separate methods are used to extract the signal: the first method is called the cut-based analysis, the 
second is the shape analysis. Both analyses are conducted by first applying the selection cuts described 
in the next sub-section. For the cut-based analysis, the signal is extracted using all the cuts including a 
cut on (j>jj, the angle between the two tagged jets in the transverse plane. This is described in Section 
2.5. For the shape analysis, this last cut is not applied. Instead, the shape of the <j>jj distribution is used 
to extract the fraction of signal events. This is described in Section 2.6. Results are derived separately 
with these two methods. 

The selection cuts described in this section were developed using a signal sample with m# = 130 
GeV. 

2.3.1 Selection cuts 

The event selection is based on standard ATLAS definitions for jets, leptons and missing transverse 
energy [18]. A primary characteristic of a signal event is the presence of two jets from the VBF process. 
Events are selected based on each of the two highest pj jets in the event which are referred to as the 
"tagging jets". These tagging jets are required to have a pj > 40 GeV and be in the rapidity range 
V\i\?\ < 5- Cuts on the product and difference of the pseudorapidity of the two jets are used, 7]ji ■ r\j2 < 0 
and AT7 > 4.4, respectively. Kinematic distributions of the tagged jets for the signal and background are 
shown in Figure 2. In the upper two plots of this Figure, it can be seen that signal events and the 
W+jet and Z+jet backgrounds have very similar pj distributions. When the W+jet and Z+jet events are 
generated with only one parton the pj distributions are much softer. When the two parton and three 
parton components are added the pj distribution becomes harder and similar to the signal. 

The second major event characteristic used to select events is a large Z^^from the invisible decay 
of the Higgs boson. A cut on this variable significantly reduces the QCD background as no real Ej lss \% 
expected for QCD events. In this analysis there is a requirement that Ef lss > 100 GeV. The Ef lss distri- 
butions for the signal and backgrounds are shown in Figure 3. 

The majority of QCD dijet background events will produce soft jets resulting in the tagging jets 
having a low invariant mass. This feature can be used to reject QCD events by requiring a minimum 
invariant mass of 1200 GeV for the tagging jets. The invariant mass distribution of the tagging jets is 
shown in Figure 3. The QCD dijet background can be further reduced by requiring that the direction of 
the measured Ef lss is not correlated with the tagging jets. A missing transverse energy isolation variable, 
I, is defined for this purpose as I = min[${Ef lss ) - 0(71,2)] • Events with a small value of I are expected 
to result from mis-measured jets caused by dead material and cracks in the detector. This is illustrated 
in Figure 4 which shows that QCD dijet events preferentially have a small value of I. A selection of 
I < 1 rad has been used which is compatible with previous analyses. The W+jet and Z+jet backgrounds 
can be reduced by rejecting events with any identified lepton. For this reason, events with electrons or 
muons with a pj > 20 GeV are rejected, as are events containing T-jets with a pj > 30 GeV. These cuts 
are based on an earlier ATLAS fast simulation study. 

A key aspect of the VBF Higgs boson search is the electroweak nature of the signal,and this can be 
used to suppress backgrounds by using the fact that the signal has no color flow between the interact- 
ing quarks at tree level. Although the W+jet and Z+jet backgrounds include both electroweak and QCD 
terms, the cross-section is dominated by the QCD contribution. Therefore, unlike the signal, the majority 
of background events have QCD activity in the central region. The presence of this extra QCD radiation 
between the two tagging jets provides, in principle, a powerful tool to suppress this background. In prac- 
tice the difference is diluted both by the underlying event and pile-up. The Underlying Event (UE) arises 
from interactions of the spectator partons and is not consistently modeled by the available event gener- 
ators. For example, the ratio of the average jet multiplicity from the UE between HERWIG/PYTHIA is 
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Figure 2: Comparison of tagged jet properties for signal events (m# = 130 GeV) and the three 
major backgrounds. The upper left plot shows the pj of the leading tagged jet, the upper right 
plot shows the p T of the jet with the second highest p T . The lower left plot shows the product of 
the directions of the two tagged jets in pseudo-rapidity (77,1 x r\j 2 ), and the lower right plot shows 
the difference in 77 between the two tagged jets (Atj). The enhancement at At? of 0.5 in the VBF 
signal results from a single high pj jet being reconstructed as two jets. The filter cut described in 
Section 2.1 has been applied to the W+jet and Z+jet Monte-Carlo data, but no trigger cuts have 
been applied. The distributions are normalized to unity. The vertical dotted lines show the cut 
values used in the analysis. 




Figure 3: The reconstructed invariant mass of the tagging jets (left) and the Ef lss (right) for the 
invisible Higgs boson signal (m# = 130 GeV) and the three main backgrounds. Single events in 
the high Ef lss tail of each individual sample (JO, Jl etc) can result in a spike with a large error. 
The filter cut described in Section 2.1 has been applied to the W+jet and Z+jet Monte-Carlo data, 
but no trigger cuts have been applied. The distributions are normalized to unity. 
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Figure 4: The distribution of the reconstructed Ef lss isolation variable (I) is shown in the right 
hand plot and the azimuthal angle between the tagging jets (fyj) is shown in the right hand plot for 
the invisible Higgs boson signal (m# = 130 GeV) and the three main backgrounds. The filter cut 
described in Section 2.1 has been applied to the W+jet and Z+jet Monte-Carlo data, but no trigger 
cuts have been applied. The distributions are normalized to unity. 



between 1.38 and 1.85. Therefore PYTHIA generates events with fewer jets from the UE, but these jets 
have on average a higher pj . If a cut is applied to remove events that have a central jet that exceed a 
specific pj value, the so called Central Jet Veto (CJV) cut, fewer PYTHIA events will survive than HER- 
WIG events. Although there is a clear difference in the topology between the signal and background, the 
added contribution from the UE has a large effect on the efficiency of this cut. In the same way pile-up, 
which results from central activity unrelated to the event of interest can also reduce the effectiveness of 
this CJV cut. The effect of pile-up has not been studied, as suitable data samples were not available. For 
this analysis, a central jet veto is used requiring that there are no additional jets with pj > 30 GeV for 
\r\ \ < 3.2. It should be stressed that this cut is applied after the selection of the two tagging jets which can 
be located anywhere within the full tj range including |tj | < 3.2. So this cut does not bias the selection 
of the tagging jets, nor does it introduce a bias with respect to the trigger which has elements that allow 
jets to be located within an \ r\ \ < 3.2. 

Unlike the signal which is uniquely produced via Vector Boson Fusion, the W+jet and Z+jet back- 
grounds can be produced by the qq —> gV and qg —> qV processes in which the second jet comes from 
a radiative process. Therefore, the difference in 0 between the two tagged jets is different for the signal 
and the radiative background as can be seen in Figure 4. This difference provides additional discriminat- 
ing power and is used in the analysis presented in Section 2.5 requiring <j>jj < 1 rad. Moreover, the in- 
variable motivates the shape analysis presented in Section 2.6. 

The selection cuts along with the surviving cross-sections after each cut are shown in Table 4 for 
a Higgs boson mass m# = 130 GeV and the three main backgrounds. Table 5 shows the effect of 
the cuts for the four Higgs boson mass values considered in this study. The cross-sections for W+jet 
and Z+jet processes were calculated to LO but have been normalized to the results calculated with the 
generator FEWZ at NNLO which results in a value for the total cross-section which is known to within 
- \0% l [16]. 

The first cut applied to the data simulates the effect of the LI trigger with the most conservative menu 
option given in Table 3 and discussed in the previous section namely, a E' T USS > 100 GeV, a central jet 
with pj > 23 GeV and a forward jet with pj > 23 GeV. This cut reduces the QCD dijet background rate 
by approximately 7 orders of magnitude. The effect of the trigger on the W+jet and Z+jet backgrounds 
is smaller with a reduction of two orders of magnitude, by contrast the signal is reduced by about 50%. 

^his includes the PDF and QCD scale uncertainties. 
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Initial a (fb) 


3.93 xlO 3 


1.24 xlO 6 


4.08 x10 s 


1.91 xlO 13 


LI Trigger 


2.71 xlO 2 (0.07) 


1.42xl0 4 (0.01) 


6.31 xlO 3 (0.02) 


1.52xlO b «0.01) 


+ Tagged jets 


1.47 xlO 2 (0.54) 


1.35 xlO 3 (0.10) 


6.16xl0 2 (0.10) 


1.81 xlO 5 (0.12) 


+ Mjj 


l.llxlO 2 (0.76) 


6.64 xlO 2 (0.49) 


3.76xl0 2 (0.61) 


1.28xl0 5 (0.71) 


+ Ef ss > 100 GeV 


1.08 xlO 2 (0.97) 


4.70xl0 2 (0.71) 


2.69 xlO 2 (0.72) 


2.84 xlO 3 (0.02) 




1.07xl0 2 (1.00) 


3.01 xlO 2 (0.64) 


2.62x 10 2 (0.97) 


2.76xl0 3 (0.97) 


+ / > 1 rad 


9.60X10 1 (0.89) 


1.49 xlO 2 (0.49) 


2.11xl0 2 (0.81) 


3.61 «0.01) 


+ Central jet veto 


8.93 xlO 1 (0.93) 


LlOxlO 2 (0.74) 


1.32xl0 2 (0.63) 


0.07 (0.02) 


+ (j)jj < 1 rad 


4.50x10' (0.50) 


1.94X10 1 (0.18) 


4.21x10' (0.32) 


0.07(1.00) 



Table 4: Cross-section in fb for a Higgs boson (m# = 130 GeV) and background samples at each step of 
the selection process. Initial cross-section for W/Z+jet are quoted after VBF filter and NNLO corrections. 
The first cut is the effect of the LI trigger simulation with a Ef lss of 100 GeV, a central jet with pj > 23 
GeV and a forward jet with pj > 23 GeV (Table 3). The central jet veto is applied to jets other than the 
two tagging jets and does not bias any of the other cuts. Numbers in parentheses are the efficiencies for 
each cut. 



Higgs boson mass [GeV] 


100 


130 


200 


250 


Initial a(fb) 


4630 


3930 


2410 


1780 


LI Trigger 


322 


271 


240 


168 


+ Tagged jets 


166 


147 


134 


93 




126 


111 


100 


73 


+ E? lss > 100 GeV 


121 


108 


98 


70 


+ Lepton veto 


121 


107 


98 


70 


+ / > 1 rad 


108 


96 


86 


63 


+ Central jet veto 


94 


89 


79 


59 


+ <pjj < 1 rad 


43 


45 


39 


30 



Table 5: Cross-section in fb for each signal mass at each step in the selection cuts. The first cut is the 
effect of the LI trigger simulation with a E™ lss of 100 GeV, a central jet with pj > 23 GeV and a forward 
jet with p T > 23 GeV. 

The jet tagging cuts reduces all three backgrounds by a factor of 10. Although a LI E™' ss is applied a 
large fraction of events still survive because of the the LI £'™'"resolution. The other cuts that have a 
large impact on the QCD rate are the E™ lss cut and the Ef lss isolation cut. Together they reduce this 
background to a negligible level. The effect of these selection cuts on the Z+jet and W+jet backgrounds 
are less dramatic. The lepton veto reduces the W+jet and Z+jet by ~ 36% and ~ 3%, respectively. The 
lepton veto cut removes few events in the Z+jet channel as the Ef lss cut removes most of the Z^ it de- 
cay mode. The remaining Z+jet events are dominated by the Z^ vv mode. Leptons are only identified 
for | tj | < 2.5, so the lepton veto cut does not remove all the W+jet background events due to this limited 
T7 range. Therefore, electrons and T-jet in the forward region (|t]| > 2.5) are mis-identified as jets most 
of the time. In a similar manner, muons in the forward direction are generally not identified and result in 
fake Ef iss . 



2.4 Systematic uncertainties 

Three major types of systematic uncertainties are considered. One arises from the implemention of the 
Monte Carlo generators, the second from the experimental systematic uncertainties and the third from 
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| Selection Cuts | HERWIG130GeV | PYTHIA130GeV 



Initial a(fb) 


3.93 xlO 3 (1.000) 


3.93 x 10 3 (1.000) 


Pre-Cut(£f"> 80 GeV) 


1.76 xlO 3 (0.448) 


1.78 xlO 3 (0.453) 


+ Tagged Jets 


4.07 xlO 2 (0.231) 


4.10 x 10 3 (0.230) 




2.45 xlO 2 (0.602) 


2.45 x 10 3 (0.598) 


+ E™ SS > 100 GeV 


2.05 xlO 2 (0.837) 


2.14 x 10 3 (0.873) 


+ Lepton Veto 


2.05 xlO 2 (1.000) 


2.12 x 10 2 (0.991) 


+ / > 1 rad 


1.84xl0 2 (0.898) 


1.80 x 10 2 (0.849) 


+ Central Jet Veto 


1.59 xlO 2 (0.864) 


1.07 x 10 1 (0.594) 


+ <pjj < 1 rad 


7.43 xlO 1 (0.467) 


4.93 x 10 1 (0.461) 



Table 6: Comparison between HERWIG and PYTHIA generated samples on the selection cuts for 
the 130 GeV Higgs boson mass. The cross-section results are quoted in fb and the cut efficiency 
is given in parenthesis. The major difference occurs in the last two rows of this table. 



the theoretical knowledge of the production cross-sections. 

Two event generator effects are discussed, the first is the treatment of the Underlying Event (UE) and 
the second is the effect of using a fixed Z boson mass. To illustrate the effect of the UE, signal events 
have been generated with two different event generators, HERWIG and PYTHIA that treat the UE in 
different ways. Table 6 shows the efficiency of each selection cut used in the VBF analyses for the two 
generators. The difference in cross-sections between the two samples are within ~ 2% of each other 
for cuts up to the Ef tss isolation cut. However, once the central jet veto is applied, fewer Pythia events 
survive resulting in a large difference of ~ 49%. This is believed to be the result of the difference in 
the modeling of the UE in the two generators; Pythia tends to produce fewer but harder jets, resulting 
in more events being removed by the central jet veto cut, see Section 2.3. If the number of background 
events is underestimated due to a combination of this cut and the choice of generator the sensisitivity to 
the signal will be artificially enhanced. A systematic study of the effect of generator choice on the central 
jet veto cut would require a large number of data and background samples to be produced with a variety 
of generators and this is beyond the scope of this paper. It is not clear which generator represents reality 
best. For consistency both the background and signal samples were generated using HERWIG. When 
real data becomes available it will be possible to measure the magnitude of central jet activity directly 
and use this to tune the generators. 

The use of ALPGEN requires the use of a fixed Z mass. The effect of the missing off-shell terms 
from the background samples was checked using the Z —> vv analysis by adding a Ef lss contribution 
randomly generated by a Breit-Wigner distribution using the Z mass and width parameters. This study 
indicated that the effect of using a fixed mass Z boson was negligible. 

Two methods are considered in this paper to extract the signal significance. The first, a cut-based 
analysis, relies on the number of signal and background events after all cuts have been made. The 
second, a shape analysis relies on the ratio of the number of background events contained in two regions 
of the ^variable distribution; namely the number of events for <f>jj < 1 divided by all events. The 
systematic uncertainties of interest are the ones related to these three quantities; the number of signal 
and background events and the background shape ratio. They are shown in Table 7. 

The event reconstruction variables that result in the largest systematic uncertainties are the jet reso- 
lution and the jet energy scale. For the jet energy resolution, the systematic uncertainty was estimated by 
smearing the momentum of the jets using a Gaussian distribution with a width given by c(E) = 0.45y/E 
for \r] \ < 3.2 and o(E) = 0.63\/£ for \r\ \ > 3.2. Changing the jet energy magnitude also affects the 
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Systematics 


Higgs boson 130 GeV 


Background 


Cut-Based 


Shape 


Luminosity 




~0% 


Jet energy resolution: 








a{E) = QA5^/E for \r\ \ < 3.2 


0.8 % 


5.3 % 


4.5 % 


o{E) = 0.63 VE for |tj| > 3.2 








Jets energy scale: 








+7% for |tj| <3.2 


4.0% 


3.2% 


0.2 % 


+15% for |T7| >3.2 








Jets energy scale: 
-7% for |tj| <3.2 


10.0 % 


19.5 % 


2.8 % 


— 15% for |T7| >3.2 








Total 


10.5 % 


20.4 % 


5.3 % 



Table 7: Experimental contributions to the systematic uncertainty. The systematic uncertainty on 
the Jet Energy Scale (JES) is asymmetric. Only the largest (negative) JES systematic uncertainty 
is included in the total experimental uncertainty shown in the last row of the table. 



Ej lss , so for each event, Ef lss was recalculated for the x- and y-components. The analysis was then 
repeated to determine the change in the number of signal and background events and the shape ratio. In 
the same way the effect of changes in the jet energy scale was investigated by shifting the overall scale 
by ±7% for |tj| < 3.2 and ±15% for \r\ \ > 3.2. Again this affects the Ef iss , and this change was taken 
into account. As can be seen in Table 7, there is an asymmetric dependence on the Jet Energy Scale, 
(JES), so positive and negative deviations are considered separately. Lepton reconstruction was analyzed 
and found not to contribute to the final systematic uncertainty. The JES uncertainties used here are con- 
servative for 30 fb _1 , but the impact of this choice on the sensitivity limits presented in this paper is 
small. Finally, 3% was assigned to the uncertainty in the luminosity. To get the experimental systematic 
uncertainty for each analysis the terms were added in quadrature giving an overall systematic uncertainty 
of 20% on the number of background events which applies to the cut-based analysis and an uncertainty 
of 5.3% on the background shape ratio that applies to the shape analysis. 

In addition to the uncertainty in the UE and the reconstruction algorithms, there is a systematic un- 
certainty which arises from the uncertainty in the absolute cross-section of the backgrounds. The main 
backgrounds to the invisible Higgs boson channel are Z+jet and W+jet. The total cross-sections for 
these processes have been corrected to NNLO and are known to ~ 10%, (see Section 2.3). However 
the cuts used to select the VBF process, result in a very restricted phase space which makes it difficult 
to determine the systematic uncertainty on the cross-section for the final data samples. This means that 
the systematic uncertainty on the number of background events due to the cross-section is currently un- 
known, could be very large and is likely to dominate other uncertainties. The shape of the <f>jj distribution 
on the other hand is quite well constrained by theory and based on previous studies has a systematic un- 
certainty of 10% [5]. At NLO it is expected to be 5%. In this analysis a conservative value of 10% 
is assumed for the uncertainty due to the cross-section which when combined with the much smaller 
experimental effect leads to an overall systematic uncertainty of 11.3% on the background shape. 
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2.5 Cut-based analysis 

This analysis uses the selection cuts summarized in Section 2.3.1. The signal significance is calculated 
based on the number of signal and background events that remain after all cuts. The limitation of this 
analysis is that the systematic uncertainty on the background cross-sections described in the previous 
section could be very large. One way of dealing with this uncertainty is to use experimental data of the 
Z+jet channel where the Z decays to two leptons. After correction for the detector acceptance, these 
events can be used to infer the value of the cross-section for the irreducible background in which the 
Z decays to two neutrinos. A similar analysis can be done for the W+jet background. These so called 
"data driven" corrections will be the subject of a future publication. In the next section we report on an 
alternative method which uses the shape of the azimuthal angle distribution between the jets to reduce 
the dependence of the systematic error on the cross-section. In the current section the sensitivity to 
an invisible Higgs boson without systematic errors are calculated to provide baseline numbers for the 
sensitivity to an invisible Higgs boson produced via the VBF process. 

The number of signal and background events after the <f>jj cut is shown in Tables 4 and 5. These 
numbers can be used to calculate the 95 % CL sensitivity of £ 2 for the invisible Higgs boson, given the 
assumed backgrounds. This is done by calculating the number of signal events required to increase the 
total event count by a factor 1.64 times the uncertainty on the number of background events as shown in 
Equation 2. 

\.6Ao B = Ns¥ (2) 

Here Ns is the number of signal events after the selection cuts and Og = \* r Ns . The results of this analysis 
gives a § 2 for an integrated luminosity of 30 fb _1 at 95 % CL. for Higgs mass between 1 10 GeV and 250 
GeV of ~ 5 — 8% in the case when systematic uncertainties are not included. An additional 6% statistical 
uncertainty 2 arises from the limited number of events in the data samples. 



2.6 Shape analysis 

The shape analysis is motivated by a marked difference in the <j)jj distribution between the signal and the 
W/Z+jet background as shown in Figure 5, which is taken from reference [5]. This plot shows that the 
backgrounds peak above a <f>jj of 1 while the signal is higher at low <f>jj values. To characterize the shape 
of the (j)jj distribution the ratio R has been defined as the number of events with <j>jj < 1 divided by the 
total number of events as shown in Equation 3. As the proportion of signal in the sample increases the 
value of R increases. 

r i da_ 

JO d(j>jj 

The advantage of the shape analysis described in this section over other analyses is that it does 
not require a knowledge of the absolute cross-section but rather the ratio of the number of events for 
(<j)jj < 1) to all events. As such, the systematic error associated with the absolute cross-section is reduced 
to a negligible amount. However, as discussed in Section 2.4, there is a systematic uncertainty associated 
with the knowledge of the (j>jj distribution which is known to ~10% or better. In addition, the systematic 
uncertainties due to detector effects are much smaller for this ratio than they are for the number of 
background events, which is the relevant variable for a pure cut-based analysis (Table 7). The overall 
systematic uncertainty on the ratio R has been calculated to be 1 1.3%. 

Equation 3 can be re-written in the context of this analysis and expanded to provide a background- 

2 Based on a binomial error calculation. 
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Zjj 
Wjj 



M H =120 GeV 




Figure 5: The <j>jj distribution for the signal and background in radians [5]. The solid and dash- 
dotted lines represent the expected distribution for the Higgs boson signal with Higgs boson 
masses of ra# = 120 and ra# = 300 GeV respectively. The dotted and dashed lines represent the 
distributions expected from the backgrounds. The plot shows the distributions after VBF selection 
cuts have been made. Note that for the <j>jj plot shown earlier did not have these cuts applied, 
(Figure 4). 



only term, as shown in Equation 4. 

ivi r - /A/1 A/?\ 1 

(4) 

Here N B and Ng are the number of events within <j>jj < 1 and Ng and N$ are the number of events 
within the entire fyj range. The first term of Equation 4 provides the expected ratio for the background 
contribution. However, since the ratio between the signal and background are not the same in the presence 
of a signal, a non-zero value is expected in the second term. The variation from the 'background only' 
ratio dictates the sensitivity to new physics. The ratio N B /Ng can be determined using the Stndard Model 
theoretical prediction or by a data driven technique. 

The shape analysis applies the selection cuts discussed in Section 2.3.1 but not the (j>jj cut. Therefore, 
the results from Table 4 before this last cut are used. The first term of Equation 4 is calculated to be 
0.254 ±0.007. To determined the 95% C.L. sensitivity limit a variation of 1.64<Tr is required, where Or 
is the uncertainty on the ratio R from Equation 4. Therefore, the first order t, 2 terms from Equation 4 is 
set to the required 95% CL sensitivity limit, that is 1.64or, as shown in Equation 5. 

— <i-I)0!) 

Solving for E, 2 provides the 95 % CL sensitivity limit for the invisible Higgs boson. The results of this 
analysis are shown in Figure 6. Without systematic errors, the shape analysis gives a value of t, 2 that 
ranges from 11 to 19%. This can be compared with the simple cut-based analysis which gave a E, 2 
value that ranged from 5 to 8%. So although the shape analysis method removes the dependence on the 
absolute cross-section of the backgrounds there is a reduction in the sensitivity to the signal. To include 



1431 



Higgs - Sensitivity to an Invisibly Decaying Higgs Boson 



100p 




# Full Simulation Using Shape Analysis No Systematics 
^ Full Simulation Using Shape Analysis With Systematics 



ATLAS 



▲ = 



70 ~~ 



60^ 



▲ 



50^- 



▲ 



40 



30 



20^- 



• - 



10=- 



n H H 

TOO 120 140 160 180 200 220 240 260 
Higgs Mass [GeV] 



Figure 6: Sensitivity for an invisible Higgs boson at 95% C.L. via the VBF channel using shape 
analysis for an integrated luminosity of 30 fb~ l with and without systematic uncertainties. The 
black triangles (circles) are the results from this analysis with (without) systematic uncertainties. 

the systematic uncertainties, the uncertainty on the background becomes a R = ^Jo R 2 + a 2 R 2 , where a 
is the fractional systematic uncertainty given in Table 7. The result obtained that includes systematic 
uncertainties gives value of % 2 of around 60% for m# between 100 and 200 GeV. This sensitivity is 
dominated by the systematic uncertainty that arises from the theoretical knowledge of the shape of the (j>jj 
distribution. Using calculations at NLO could reduce this uncertainty by a factor of 2 greatly enhancing 
the sensitivity of this method of analysis. 

2.7 Summary for the VBF invisible Higgs boson channel 

The study described above has investigated the sensitivity of the ATLAS detector to a Higgs boson 
particle produced by the VBF process that has an invisible decay mode. It should be stressed that these 
results do not include pile-up which can reduce the sensitivity. It has been shown that with 30 fb _1 
of data it is possible to detect this process over a wide range of masses if the Beyond Standard Model 
cross-section is more than 60% of the Standard Model cross-section for a Higgs mass range of up to 200 
GeV and 100% of the Higgs boson decays are invisible. Triggering for this channel is possible using a 
trigger requiring large E^plus a forward and a central jet of moderate pj. Triggers of this kind would 
be useful up to luminosities of at least 10 33 cm -2 s -1 . 

3 The associated ZH production channel 

The Feynman diagram for associated production in the ZH channel is shown in Figure 7. The signal of 
an invisibly decaying Higgs boson in the ZH channel can be detected when the Z boson decays into two 
leptons, which can be used for triggering the event. The presence of an invisibly decaying Higgs boson 
is detected from the missing transverse energy. 
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Figure 7: (Left): The Feynman diagram for Higgs boson associated production with a Z boson. (Right): 
A representation of the decay of a Higgs boson into two invisible neutralinos represented by %° recoiling 
against the two leptons coming from the Z decay. 



Various backgrounds with signatures similar to the signal have been studied and are listed here. In 
all cases, unless otherwise specified, I represents e or jj,. 

1. The ZZ —> llw final state gives the same signature as the signal (irreducible background) and is 
the main background; 

2. The tt —* bi + vbi~v process mimics the signal when the two b-jets are not reconstructed, or when 
a second lepton results from a b quark decay; 

3. The — >• iviv process mimics the signal but can be greatly reduced by cutting on the Z 
mass; 

4. The ZZ — >• vvxx and x — >• ivv can also mimic the signal. 

5. The ZZ — >• ilxx and X — > ivv can pass the selection criteria if some particles are missed; 

6. The ZW — > Ulv decay mode also simulates the signal when one lepton is not detected; 

7. The Z plus jets background, with Z —> it (Drell-Yan process) final state can be mistaken for the 
signal when poor jet reconstruction leads to missing transverse energy. 



3.1 Monte Carlo generation for the ZH channel 

The signal and the background events have been generated using different particle generators chosen 
according to which process they simulate best. The events are fully simulated then reconstructed using 
ATHENA. The diboson production cross-sections are taken from Ref. [16]. All events were passed 
through a filter immediately after generation. The two filters used are described in detail in Section 
3.2.2. Only the few samples generated with the simpler lepton filter will be mentioned here. All other 
samples were generated with the filter containing m z and Ef' ss cuts. 

Details on the generated events and pre-defined parameters are given below. All generators use the 
CTEQ6M structure functions to generate the processes. 

• Seven signal samples were generated using PYTHIA, with m H = 1 10, 120, 130, 140, 150, 200 and 
250 GeV. Only the samples with mg = 130 and 140 GeV used the lepton filter. To generate the 
invisibility of the Higgs boson, the H is produced as a stable particle, which goes undetected. 

• ZZ — > t + i~vv is generated using PYTHIA. This is the main and irreducible background. One Z 
decays to a lepton pair while the other is allowed to decay to any flavor of neutrino pairs. 
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• it is produced with the MC@NLO generator with all top quark decay modes allowed. The filter 
described in detail in Section 3.2.2 retains mostly it —> b£ + vb£~v events with £ = e or \i only. 
Hadronic top quark decays with a subsequent b — >• c£v decay are also included. 

• W+W- -► £+v£-v is generated with MC@NLO. 

• ZZ — > T + T~ vv and ZZ — »■ ££t + t~ events are generated with PYTHIA. The T was allowed to decay 
hadronically and leptonically in both samples. 

• ZW ± —> £ + t 't^v samples are produced with the MC@NLO generator. Since the cross-sections 
for ZW + and ZW~ production are different, two different datasets were generated. The ZW sam- 
ples were not filtered since MC@NLO does not include the Z width at generation, leading to 
problems when trying to apply a cut on the Z mass at the generator level. For these datasets, we 
use samples generated with the lepton filter which selected events containing at least two leptons. 
These events need to be reweighted after full reconstruction to account for the distorted Z mass 
distribution. 

• Z —> £ + £~ + jet events are produced using the SHERPA generator. 

3.2 Event selection for ZH channel 

The event selection is made in three stages: 

1. For simulated samples, a filter is applied immediately after the event Monte Carlo generation to 
avoid unnecessary simulation of a large fraction of the background which would otherwise be 
readily rejected at the preselection level. The event filter cuts are loose preselection cuts applied 
on true Monte Carlo quantities. 

2. The preselection cuts use fully reconstructed variables and aim at rejecting most backgrounds, 
retaining only the most likely events to be used at the final selection level. 

3. The final selection uses a multivariate analysis (Boosted Decision Tree) to refine the selection cuts 
while retaining a high signal efficiency. 

3.2.1 Analysis framework for the ZH channel 

The standard ATLAS selection criteria are used to identify these objects [18]. 

3.2.2 Filter for the ZH channel 

The filter decision is based on true Monte Carlo quantities. The filter must not reject events that would 
have passed the preselection cuts to avoid introducing biases, and must have a large rejection efficiency 
against background. To do so, the filter uses cuts looser than, but similar to, the preselection cuts, namely: 

• The events must contain at least two leptons with pj >4.5 GeV of same flavor but opposite charge 
within tj < 2.7. 

• The reconstructed mass of these two leptons must be within ±25 GeV of the Z mass. 

• The events must satisfy Ef iss > 50 GeV. The Ef lss is computed from a vectorial sum over all 
invisible, stable particles such as Higgs bosons and neutrinos, and all lost particles falling outside 
the calorimeter fiducial region of p T > 5.0 GeV and T7 > 5.0. 
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channel 


ZH^ 11 inv. 


ZZ 




WW 


ZZ 


ZZ 


ZW 


Z+jet 




m H =\3Q GeV 


ffvv 




1 V V 


TTVV 


££tt 


1 I f V 


ee+jet 


# of generated events 


10 K 


50 K 


104 K 


70.4 K 


50 K 


50 K 


250 K 


27.25K 


filter efficiency 


75.4% 


29.2'/* 


1.13% 


4.5% 


5.16% 


71.4% 


100% 


0.0154% 


(7*BR (fb) 


46.2 


728.0 


833000.0 


5245.6 


364.0 


123.0 


820.0 


3105063 


filter cuts 


34.9 


212.6 


9412.9 


236.5 


18.8 


87.8 


820.0 


478.4 


after trigger 


32.5 


198.5 


8620.6 


217.1 


14.2 


77.8 


735.3 


460.6 


E mis S > 9 Q Ge y cut 


14.0 


83.8 


3254.5 


46.6 


1.9 


4.1 


85.9 


105.8 


pr lepton cut 


10.1 


61.6 


1596.2 


30.5 


0.4 


1.5 


24.0 


46.2 


m z ±20 GeVcut 


9.6 


60.2 


1187.4 


19.9 


0.0 


1.2 


19.7 


43.2 




9.3 


58.6 


358.0 


18.7 


0.0 


1.1 


19.0 


17.7 



Table 8: Monte Carlo estimates of the cross-section times branching ratio in (fb) for the signal 
with m.H =130 GeV and background processes following successive preselection cuts described 
in Section 3.2.4 for the ZH analysis. The number of generated events refers to filtered events. The 
cross-sections are given at NLO as calculated in Ref. [16] for Standard Model processes and from 
Ref. [9] for the ZH production cross-sections. 





ZH^ 11 inv. 


m H (GeV) 


110 


120 


130 


140 


150 


200 


250 


# of generated events 


10 K 


50 K 


10 K 


10 K 


10 K 


10 K 


10 K 


filter efficiency 


47.0% 


49.6% 


75.4% 


75.9% 


56.4% 


64.6% 


70.1% 


a-BR(fb) 


77.3 


59.4 


46.2 


36.6 


29.1 


11.0 


5.2 


after filter cuts 


36.3 


29.4 


34.9 


27.7 


16.4 


7.1 


3.6 


after trigger cuts 


34.0 


27.8 


32.5 


26.0 


15.6 


6.8 


3.5 


after Ef ss > 90 cut 


18.3 


15.6 


14.0 


12.5 


10.0 


4.9 


2.7 


after p T lepton cut 


13.5 


11.6 


10.1 


9.3 


7.4 


3.7 


2.0 


after m z ± 20 GeV cut 


13.2 


11.4 


9.6 


8.7 


7.3 


3.6 


2.0 


after b-tag cut 


13.0 


11.1 


9.3 


8.5 


7.1 


3.5 


2.0 



Table 9: Monte Carlo estimates of the cross-section (fb) for the signal with seven different Higgs 
mass hypotheses following successive preselection cuts described in Section 3.2.4 for the ZH 
channel analysis. The number of generated events refers to filtered events. Two samples were 
generated using a simpler filter that retained events containing at least two leptons, (m# = 130 and 
140 GeV) whereas all other samples used a filter that required finding two leptons forming a Z 
boson and large Ef lss , as described in Section 3.2.2. The efficiency for this filter increases with 
the Higgs mass hypothesis. 



A simpler lepton filter with only the first selection cut was used for the ZW —>■ lllv samples, for two 
signal samples with Higgs mass hypothesis m# = 130 and 140 GeV, and for the ZZ — > Ml sample used 
for a normalization study. 

The filter reduces the total CPU time needed for full reconstruction by more than a factor of 1000. 
The results are summarized for a signal with a Higgs boson mass hypothesis of 130 GeV and the back- 
ground samples in Table 8. In Table 9, the effect of the filter, trigger and preselection cuts are shown for 
the signal at other Higgs boson masses. 

3.2.3 Trigger for ZH channel 

An invisibly decaying Higgs boson in the ZH channel can be detected when the Z decays into two 
leptons. We trigger on such events using the full simulation of the trigger and by requiring either one or 
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two isolated, high pj leptons satisfying any of the following trigger signatures: 

• single electron trigger: one isolated lepton with pj > 22 GeV. 

• single muon trigger: one muon with pj > 20 GeV. 

• di-electron trigger: two isolated electrons with pj > 15 GeV. 

• missing transverse energy trigger: Ef lss > 100 GeV. 

The di-muon trigger with a lower pj momentum cut was not implemented in this analysis but will 
be used in the future. The overall trigger efficiency of 92.8% compares well with what is retained 
when applying cuts on fully reconstructed variables, selecting events containing either one electron with 
pj > 25 GeV, two electrons with pj > 15 GeV or one muon with pj > 20 GeV. The effect of the trigger 
on all samples studied is given in Table 8. 

3.2.4 Event preselection 

A first preselection is applied to reject most backgrounds, in particular the tt and (Z+jet) backgrounds. 
The following cuts are applied: 

• the event must satisfy one of the trigger signatures described in the previous section; 

• large missing transverse energy, i.e. Ej lss > 90 GeV. 

• the event must contain exactly two leptons of the same flavor but opposite charge with p T > 15 



• an anti-b-tag is applied to further suppress the ft background. 

• a loose cut on the invariant mass of the two leptons, namely \m u -m z \ < 20 GeV, is applied to 
reject some tt background without reducing the signal efficiency. 

The (7 • BR for the signal and the background processes listed in Section 3 are shown in Table 8. The 
effects of the filter and preselection cuts are also shown in this table. 

3.2.5 Final event selection 

In order to improve the sensitivity of this channel, the most discriminative variables are used to form a 
multivariate analysis (Boosted Decision Tree or BDT). Sixteen different variables are used as inputs to 
the BDT, namely: 

• the missing transverse energy Ef lss , 



• the transverse mass mj = \1pj • Ef iss {\ — cos A0) where A0 is the azimuthal angle between the 



dilepton system and p T miSi ', 

• the cosine of the angle between p T miss and the most energetic lepton, 

• the reconstructed Z mass, 

• the transverse momentum of each lepton, 

• the cosine of the angle between the two leptons in the transverse plane and in 3-dimensions, 



GeV; 
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• the cosine of the angle between p T miss and pj Z , 

• the cosine of the angle between the most energetic jet and pr miss , 

• the energy found in a cone AR = 0. 1 rad around each lepton; the lepton isolation variable is partic- 
ularly useful for muons where very little energy is found around isolated muons but not so much 
for electrons where the energy deposit is much wider, 

• the energy of each of the three most energetic jets, and 

• the total number of jets. 

These variables are shown in Figures 8 to 1 1 for the signal and the background after applying the pres- 
election cuts listed in Section 3.2.4. Each distribution has been normalized to unity. The various back- 
grounds have been regrouped: the irreducible background, ZZ — > ££vv, the non-resonant background: tt 
and WW, and finally all other backgrounds containing at least one Z boson: ZZ — >• ttll, ZZ — >• TTVV, 
ZW and (Z + jet). 

Each of the main backgrounds after the preselection cuts, namely tt —> btv btv, WW —> ivlv, 
ZZ —> till, ZW —> ttlv and (Z+jet) are compared to the signal to train a Boosted Decision Tree 
(BDT) [19]. Each one of these backgrounds is trained separately. Nothing is gained from training a 
BDT against the irreducible background, ZZ —> ttvv, so this background is not used. An ensemble (for- 
est) of decision trees is successively generated from the training sample, where each new tree is trained 
by giving increased weights to the events that have been misclassified in the previous tree. The classifier 
response is obtained as the sum of the classification results for each tree, weighted by the purity obtained 
for all training events in that tree. The large number of decision trees in the forest increases the perfor- 
mance of the classifier and stabilizes the response with respect to statistical fluctuations in the training 
sample. 

For each background type, a set of weights is established. Half the Monte Carlo events contained in 
each file is used for the training and the other half for the analysis. The file containing the signal and 
all the backgrounds, including the less important ones, is then analyzed using these weights. Each event 
is assigned a weight corresponding to the likelihood of being identified as signal or background. The 
BDT weight distributions are shown in Figure 12. Each plot shows the output variable distribution for 
Boosted Decision Trees trained against different backgrounds, namely, from top plot to bottom plot, the 
it -> btvbtv, WW -> tvtv, ZZ -> tt%%, ZW Ittv and Z+jet. The ZZ TTVV background is not 
used to train a specific BDT since too few events survive the preselection cuts. The cuts on the five BDT 
outputs are adjusted to minimize the value of t, 2 , defined in Equation 2. 

The same procedure is repeated using different Higgs boson mass hypotheses ranging from m# = 110 
GeV to niH = 250 GeV, optimizing the cuts each time. The numbers of events surviving all Boosted 
Decision Trees cuts for each Monte Carlo sample and these seven Higgs boson mass hypotheses are 
given in Table 11. The BDT inputs variables are also ranked during each training against a particular 
background. Each time, the input variables are assigned a weight proportional to their importance in 
separating power. The sum of these weights are given in Table 10, showing which variables offer the 
best separation power. The order of importance varies for each of the trees but all variables are useful 
in at least one tree. The Z mass is the overall most discriminative variable, mostly due to its very high 
ranking in the BDT trained against tt and WW backgrounds. 

3.3 Systematic uncertainties 
3.3.1 Background cross-section 

Since this is a counting experiment, one is looking for events in excess of what is expected by the 
Standard Model. However, exactly what is expected from Standard Model backgrounds is not well 
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Figure 8: Input variables used by the Boosted Decision Tree for the signal with m# = 130 GeV and the 
main backgrounds. Top left: Missing Ej. Top right: transverse mass, defined as the reconstructed mass 
in the transverse plane, namely m^ = Ej— p\. Bottom left: cosine of the angle between the missing Ej 
and highest momentum lepton in the transverse plane. Bottom right: reconstructed Z mass. Each plot 
has been normalized to unity. The combined samples had first been scaled to the same luminosity. 
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Figure 9: Input variables used by the Boosted Decision Tree for the signal with m# = 130 GeV and the 
main backgrounds. Top left: Transverse momentum of the most energetic lepton. Top right: Transverse 
momentum of the second lepton. Bottom left: Cosine of the angle between the two leptons in the 
transverse plane and, Bottom right: in 3-dimensions. Each plot has been normalized to unity. The 
combined samples had first been scaled to the same luminosity. 
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cos(E™ ss - p z ) ' cosGtf - Ef ss ) 





energy in cone around lepton 1 (GeV) 




energy in cone around lepton 2 (C 



Figure 10: Input variables used by the Boosted Decision Tree for the signal with m# = 130 GeV and 
the main backgrounds. Top left: The cosine of the angle between the direction of missing Ej and the 
reconstructed Z transverse momentum. Top right: The cosine of the angle between the most energetic 
jet and the direction of missing Ej. Bottom left: The energy contained in a cone of 0.10 rad around 
the most energetic lepton. Bottom right: The energy contained in a cone of 0.10 rad around the second 
lepton. Each plot has been normalized to unity. The combined samples had first been scaled to the same 
luminosity. 
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E jet3 (GeV) number of jets 



Figure 11: Input variables used by the Boosted Decision Tree for the signal with ma = 130 GeV and the 
main backgrounds. Top left: The energy distribution for the most energetic jet; Top right: for the second 
and, Bottom left: third most energetic jets. Bottom right: the number of jets in the event. Each plot has 
been normalized to unity. The combined samples had first been scaled to the same luminosity. 
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Figure 12: The Boosted Decision Tree (BDT) output variables obtained after comparing half the signal 
events to five different backgrounds separately, namely, from top to bottom: it — > bivbiv, WW — > ^v£v, 
ZZ Urz, ZW -> tttv and Z^U + jets. The BDT assigns values close to +1 for a signal-like event and 
-1 for background-like events. The distributions are shown for the signal and all types of background 
when using the other half of the events for the analysis. The Boosted Decision Trees trained against 
the ZW background offers the best separation power. The % 2 decreases further once additional cuts on 
the other BDT output variables are applied, namely the WW BDT output, then the ZZ -> llxx BDT 
output, the (Z+jet) BDT output and finally the tt BDT output. All BDT output cut values are indicated 
by a vertical dashed line. Nothing is gained from training a BDT against the irreducible background, 
ZZ — ► llw for all Higgs boson mass hypotheses, so it is not used. 
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input variable to BDT 


sum of weights 


Z mass 


0.57 


cos U (in 2D) 


0.46 


cos 11 (in 3D) 


0.42 


cos(Ef ss -p z ) 


0.41 


# of jets 


0.40 


transverse mass 


0.39 


cos( jet — Ef lss ) 


0.32 


r-miss lepton#l 

cosEj — p T 


0.31 




0.28 


p l r n * 1 


0.27 


Ejet#l 


0.26 


Ej et #2 


0.23 


lepton#2 
P T 


0.21 


energy in a cone around lepton # 1 


0.17 


energy in a cone around lepton # 2 


0.16 


Ejem 


0.13 



Table 10: Order of importance for the 16 input variables used to train the separate Boosted Deci- 
sion Trees used for the analysis at m# = 130 GeV. The second column gives the sum of the weights 
given to each input variable by the five separate BDT used for the analysis. These weights are not 
used for the analysis per se but give an idea of the relative importance of each input variable. In 
particular, the first four variables are used to mostly reject the non-resonant background. 
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channel 


ZH 


ZZ 


tt 


WW 


ZZ 


ZZ 


ZW 


Z+jet 




ilim. 


eiw 




£v£v 


TTVV 


Uzz 


£Uv 


U+jet 




expressed as cross-sections given in fb 


m H = 110 GeV 


1.31 


3.44 


0.00 


0.01 


0.00 


0.02 


0.37 


0.03 


m H = 120 GeV 


2.99 


13.64 


0.18 


0.28 


0.00 


0.16 


2.16 


0.10 


m H = 130 GeV 


0.89 


2.93 


0.00 


0.00 


0.00 


0.02 


0.33 


0.00 


m H = 140 GeV 


0.98 


3.51 


0.00 


0.00 


0.00 


0.03 


0.48 


0.00 


m H = 150 GeV 


1.32 


6.37 


0.18 


0.00 


0.00 


0.06 


0.62 


0.00 


m H = 200 GeV 


0.62 


4.83 


0.18 


0.00 


0.00 


0.04 


0.42 


0.00 


m H = 250 GeV 


0.31 


2.50 


0.00 


0.00 


0.00 


0.03 


0.24 


0.02 




# of events corresponding to 30 fb 1 


m H = 110 GeV 


39.2 


103.1 


<2.7 


0.2 


<0.01 


0.8 


11.0 


0.8 


m w = 120 GeV 


89.6 


409.3 


5.4 


8.5 


0.1 


4.7 


64.7 


3.0 


m w = 130 GeV 


26.8 


87.9 


<2.7 


<0.06 


<0.01 


0.8 


9.9 


<0.36 


m H = 140 GeV 


39.5 


191.1 


5.4 


<0.06 


<0.01 


1.7 


18.7 


<0.36 


m# = 150 GeV 


36.5 


173.0 


5.4 


0.2 


<0.01 


1.8 


19.0 


<0.36 


m H = 200 GeV 


18.5 


145.0 


5.4 


<0.06 


<0.01 


1.3 


12.7 


<0.36 


m H = 250 GeV 


9.3 


74.9 


<2.7 


<0.06 


<0.01 


1.0 


7.2 


0.7 



Table 1 1 : Monte Carlo estimates of the cross-sections in fb surviving the final Boosted Decision 
Tree selection cuts for each background process and seven mass hypotheses for the ZH channel. 
The corresponding numbers of events for 30 fb~ l of total integrated luminosity are also given. 
The final cuts on the Boosted Decision Tree output variables were set separately for each BDT 
output variable and for each mass hypothesis, each time optimizing the sensitivity t, 2 . 
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known, given the theoretical uncertainties on the Standard Model production cross-sections and this leads 
to the main source of systematic uncertainty. The current best estimates for each of these cross-sections 
from the next-to-leading order calculation is about 6% for ZZ and 5% for ZW [20]. The uncertainty on 
the (Z+jet) cross-section has no impact on the final results since this background is negligible. Several 
control samples can be used to constrain the ZZ and ZW cross-sections. For ZZ, one can use the four 
lepton final state (even including x) but this will require a large data sample (of the order of at least 
30 fb~ l ) to reduce the statistical uncertainty. For the ZW cross-section, one can use a ZW control 
sample with events containing three identified leptons. Both these cross-sections will be measured in 
ATLAS data. Uncertainties associated with kinematic distributions have not been taken into account at 
this point. A combined theoretical uncertainty of 5.8% obtained from a weighted average is assigned to 
the background production cross-section. 

One could in principle use ZZ —> ££££ events from data to calibrate the number of events coming 
from ZZ —> ££w decays. Such an approach was proposed in [21] where one would first select a pure 
sample of ZZ — > ££££ events by finding two Z bosons, then declaring one Z to decay invisibly. This 
would work in the absence of other backgrounds but it is not possible to completely eliminate the ZW — > 
£££v background. More importantly, such a technique has a very low efficiency: about 1.8% of all 
ZZ — >• ££££ survive the preselection cuts, with £ here being e,/J, or x. Only a dozen of events would 
survive all of the BDT selection cuts for 30 fb" 1 of data. Hence, it is deemed impossible to calibrate 
quantitatively the ZZ — > ££££ cross-section using this technique. However, one could still check the effect 
of the preselection cuts on ZZ — > ££££ events with two leptons declared invisible as described above to 
ensure that the main and irreducible background, ZZ —> Uvv, behaves as expected under these cuts. 
About 85 ZZ — > ££££ events are expected to pass the preselection cuts, as opposed to 163 ZZ —> Uvv 
events for 30 fb -1 of integrated luminosity. After the preselection cuts, the ZZ — > Uvv background 
corresponds to about 36% of the total number of selected events in the absence of non Standard Model 
contributions, as seen from Table 8. This method would provide a normalization of the cross-section 
using data at about 11% uncertainty level. 

3.3.2 Effect related to the training of the Boosted Decision Tree 

Since half the events are used for training the BDT, and the other half for testing, this arbitrary choice 
has a slight effect on the outcome. For the central value of this analysis, we used every other event for 
the training. To estimate the effect of this choice, the analysis was redone using the first half of the events 
for training, and the second half for testing. Since we are only using Monte Carlo events, this second 
choice does not introduce additional time-dependent effects that one would expect with real data. The 
difference in the results, namely +0.2% signal events and +0.7% background events, is ascribed as a 
contribution to the systematic uncertainty. 

3.3.3 Lepton momentum resolution effect and energy scale effect 

Different tests are done to assess the contributions to the systematic uncertainty from the lepton momen- 
tum resolution and the uncertainty on the lepton energy scale. Each time, new modified input variables 
are used to retrain the BDT and assess the overall effect by comparing the new number of selected signal 
and background events to the original numbers of events selected. All contributions to the systematic 
uncertainty are summarized in Table 12. 
The tests performed are: 

• The lepton momenta are smeared using a Gaussian distribution. A constant sigma of 0.73% is 
used for electrons. For muons, the sigma is calculated using the following formula: o(pr) = 
[(0.011 • p T ) 2 + (0.00017 • Pt) 2 ] 1,2 /pt with p T in GeV. The smearing is applied to one type of 
leptons at a time. 
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signal 


background 


electron reconstruction efficiency 


±0.2% 


±0.2% 


electron pj resolution (±0.73%) 


+0.5% 


+1.7% 


electron energy scale (±0.5%) 


+1.1% 


+2.1% 


sub-total for electrons (43% of events) 


+1.2% -0.2% 


+2.7% - 0.2% 


muon reconstruction efficiency 


±1.0% 


±1.0% 


muon pj resolution (see formula in text) 


+1.1% 


+1.9% 


muon energy scale (±1%) 


+1.0% 


+2.2% 


sub-total for muons (57% of events) 


+1.8% - 1.0% 


+3.1%- 1.0% 


combined contributions for leptons 


+1.5% -0.7% 


+2.9% - 0.7% 


jet energy scale (±7% or ±15%) 


+0.8% 


+0.2% - 2.2% 


jet energy resolution effect on Ef lss 


-2.2% 


-0.4% 


luminosity 




±3.0% 


cross-section 




±5.8% 


filter effects 


±1.4% 


±1.4% 


Boosted Decision Tree training effects 


±0.2% 


±0.7% 


total 


+2.2% - 2.6 % 


+7.3% -7.1% 



Table 12: Contributions to the systematic uncertainties. The Higgs boson mass was set to 130 
GeV to assess these uncertainties. The final background uncertainty is rounded-off to ±7.2%. 



• For each type of lepton, a multiplicative scaling factor is applied to simulate an energy scale 
uncertainty of ±0.5% for electrons and ±1.0% for muons. 

3.3.4 Jet momentum resolution effect and energy scale effect 

Three different modifications are done in turn to the jet energy to evaluate the contributions from the 
jet energy scale and jet energy resolution to the missing energy evaluation. After each modification, the 
missing Ej is recalculatedEach contribution is shown in Table 12. The three modifications made to the 
jet energy are: 

• Jet energy scale: the jet energy is increased by ±7% for jets within r\ < 3.2 and ±15% for jets 
within T7 > 3.2. 

• Jet energy resolution: the jet energy is smeared using a Gaussian by 0.45 • \[E for jets within 
tj < 3.2 and 0.63 • y/E for tj > 3.2. 

3.4 Results for the ZH channel 

The sensitivity with 30 fb _1 of data is evaluated in terms of E, 2 with t, 2 = \.6Aob/Ns for a 95% CL as for 
the VBF analysis where Og is the combined statistical and systematic uncertainty as detailed in Section 
2.5. The values of E, 2 are summarized in Table 13. 

3.5 Cross-checks with a cut-based analysis 

To ensure that the Boosted Decision Tree performed as expected, we duplicated a previous ATLAS 
analysis performed using a cuts-based approach [21]. The same cuts as were applied to our current 
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# signal 


# background 


o B 




llOGeV 


39.2 


115.9 


13.6 


56.7% 


120 GeV 


82.5 


449.4 


38.3 


76.2% 


130 GeV 


26.8 


98.6 


12.2 


74.4% 


140 GeV 


39.5 


217.0 


21.3 


88.5% 


150 GeV 


36.5 


199.4 


20.0 


89.9% 


200 GeV 


18.5 


164.4 


17.3 


153.4% 


250 GeV 


9.3 


83.8 


10.9 


191.6% 



Table 13: The sensitivity with 30 fb 1 at 95% confidence level calculated in terms of <§ 2 for seven 
different mass hypotheses for the ZH channel. 



Monte Carlo samples after the filter and trigger cuts of this analysis, and using the signal generated with 
mji = 130 GeV. These cuts are: 

1 . Filter cuts as in this analysis 

2. Trigger cuts as in this analysis 

3. Lepton cuts: select events containing no more than two leptons with pj>l GeV. Electrons must 
have p T > 15 GeV within |t]| < 2.5 and muons are selected if pr > 10 GeV and |tj| < 2.4. Two 
leptons of the same flavor but opposite charge are required. 

4. Z mass: the recontructed Z mass must be within 10 GeV from the pole mass. 

5. Ef ss > 100 GeV. 

6. Jet veto: all events containing a jet having pj > 30 GeV within \r\ \ < 4.9 are rejected. 

7. b-jetveto: all events containing a b-taggedjet having at p T > 15 GeV within |t]| < 4.9 are rejected. 

8. Transverse mass: m T > 200 GeV. 

The two analyses can be compared after the MET cut. The sensitivity with 30 fb -1 at 95% confidence 
level calculated in terms of <§ 2 for this cut-based analysis is 87.9% for m# = 130 GeV. This compares 
well with what was obtained with the BDT technique (<^ 2 = 74.4% for the same mass value with the BDT 
approach). The difference in sensitivity increases further for higher Higgs mass hypotheses. The results 
are given in Table 14. 

4 Comparison of results and summary 

The sensitivity of ATLAS to an invisibly decaying Higgs boson produced via the VBF and ZH channel 
has been examined. A comparison between the sensitivities of the two channels can be seen in Figure 13. 
This plot shows that the channels have a similar sensitivity for low Higgs boson masses. It is possible 
to look at combined statistics for the ZH analysis and the VBF shape analyses although the analysis 
techniques are different. Clearly the improvement in sensitivity by combining statistics is not large. Of 
far greater significance in the analysis of real ATLAS data would be the observation of a significant 
excess of events in two different and distinct channels. An observation of this kind would give credibility 
to the hypothesis that a particle is being generated that behaves like a Higgs boson and decays invisibly. 
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channel 


ZH 


ZZ 




WW 


ZZ 


ZZ 


zw 


Z+jet 




(f\m. 


ttvv 




Ivlv 


TTVV 


tlxx 


lilv 


«+jet 


<7*BR in fb 


46.2 


728.0 


833000.0 


5245.6 


364.0 


123.0 


820.0 


3105062.8 


after filter 


34.9 


212.6 


9412.9 


236.5 


18.8 


87.8 


820.0 


478.4 


after trigger 


32.5 


198.5 


8620.6 


217.1 


14.2 


77.8 


735.3 


460.6 


Pt lepton +ID + charge cut 


23.8 


148.1 


4451.2 


158.4 


2.7 


37.6 


177.5 


221.6 


after m z ± 10 GeV cut 


20.7 


133.0 


1654.7 


51.1 


0.0 


28.8 


125.4 


192.9 


after Ej iss > 100 cut 


7.4 


44.9 


460.7 


7.3 


0.0 


0.8 


13.7 


27.9 


no jet with p T > 30 GeV 


4.2 


23.7 


5.8 


0.8 


0.0 


0.3 


4.2 


0.1 


b-tag cut for jet with p T > 15 GeV 


4.2 


23.6 


4.8 


0.8 


0.0 


0.3 


4.1 


0.1 


after m T > 200 GeV cut 


3.9 


21.3 


0.5 


0.4 


0.0 


0.3 


3.4 


0.1 



Table 14: Monte Carlo estimates of the cross-sections in fb after applying simple cuts for each 
background process and one mass hypothesis of m# = 130 GeV for the ZH channel. The corre- 
sponding E, 2 would be 87.9%. 
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Figure 13: Sensitivity to an invisible Higgs boson with ATLAS for both the VBF and ZH channels 
with 30 fb _1 of data assuming only Standard Model backgrounds. The open crosses show the 
sensitivity for the ZH analysis and the solid triangles show the sensitivity for the VBF shape 
analysis for 95 % CL. Both these results include systematic uncertainties. 
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In summary, a study using fully simulated ATLAS data has shown that the ATLAS experiment will 
be sensitive to an invisibly decaying Higgs boson in both the VBF and ZH production channels assuming 
only Standard Model backgrounds. It is clear that the analysis will require a good understanding of the 
experimental systematic uncertainties. If the decay of a Higgs boson was entirely in the invisible mode, 
this analysis has shown that with 30 fb _1 of data, ATLAS will be sensitive to a situation in which the 
beyond the Standard Model cross-section is of the order of 80% the Standard Model Higgs boson cross- 
sections for a Higgs Boson mass of less than 150 GeV. The VBF analysis has a sensitivity of better than 
90% up to a Higgs Boson mass of 250 GeV. 
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Abstract 

The discovery of a charged Higgs boson would be tangible proof of physics 
beyond the Standard Model. This note presents the ATLAS potential for dis- 
covering a charged Higgs boson, utilizing five different final states of the signal 
arising from the three dominating fermionic decay modes of the charged Higgs 
boson. The search covers the region below the top quark mass, taking into ac- 
count the present experimental constraints, the transition region with a charged 
Higgs boson mass of the order of the top quark mass, and the high-mass region 
with a charged Higgs boson mass up to 600 GeV. All studies are performed 
with a realistic simulation of the detector response including all three trigger 
levels and taking into account all dominant systematic uncertainties. Results 
are given in terms of discovery and exclusion contours for each channel indi- 
vidually and for all channels combined, showing that the ATLAS experiment 
is capable of detecting the charged Higgs boson in a significant fraction of the 
(tan/3, m H ±) parameter space with its first 10 fb" 1 of data. The so-called in- 
termediate tan/3 region (around tan/3 = 7) is experimentally hard to reach but 
exclusion sensitivity is given in this area. 

1 Introduction 

Charged Higgs bosons (Z/*) 1 are naturally predicted in many non-minimal Higgs scenarios, such as 
Two Higgs Doublet Models (2HDM), and models with Higgs triplets including Little Higgs models. 
Their discovery would be a definite signal for the existence of New Physics beyond the Standard Model, 
possibly the first experimental evidence for the Minimal Supersymmetric Standard Model (MSSM) if it 
is realised in nature, and the supersymmetry mass scale is high enough that sparticles escape discovery. 
The following analyses will only consider the 2HDM, in particular the so-called type II-2HDM, which 
is the Higgs sector of the MSSM. 

The search strategies for charged Higgs bosons depend on their hypothesized mass, which dictates 
both the production rate and the available decay modes. Below the top quark mass, the main production 
mode is through top quark decays, t — >• H + b, and in this range the H + — >• TV decay mode is dominant. 
Above the top quark threshold, production mainly takes place through gb fusion (gb — > tH + ), and for 
such high charged Higgs boson masses the decay into a top quark and a b quark dominates, H + — > tb . 
A more detailed discussion of the different signal final states used for this study including backgrounds, 
cross-sections and branching ratios can be found in Reference [1]. 

Charged Higgs boson searches involve several higher level reconstructed physics objects such as elec- 
trons, muons, jets, jets tagged as b jets and finally jets identified as T jets. These objects are reconstructed 
by dedicated ATLAS algorithms and details about their performance can be found in References [2-6]. 
In the following, only efficiencies will be quoted for the different objects. 

The note is organized as follows: Section 2 summarizes the trigger menus used for the signatures 
under investigation. The analysis sections are divided into light (Section 3) and heavy (Section 4) H + 
studies, which are in turn divided according to the signal final state: First, tt —>■ bH + bW —>■ bx(had)vbqq 
is discussed in Section 3.1, then tt — > bH + bW — > bz(lep)vbqq in Section 3.2. Section 3.3 addresses 
tt — > bH + bW — > bz(had)vb£v. Two studies for heavy H + are described, namely for gg/gb — > t[b]H + — ► 
bqq[b]z(had)v (Section 4.1) and gg/gb -> t[b]H+ -> t[b]tb -> bW[b]bWb -> b£v[b]bqqb (Section 4.2). 
In Section 5, systematic uncertainties and their impact are discussed, followed by a description of the 

'in the following, the charged Higgs boson will be denoted H + , but H~ is always implicitly included. 
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data-driven ti control sample method which is needed for the extraction of the background shape and 
normalization in all H + studies. Section 6 provides combined results from the different search topolo- 
gies. The charged Higgs boson results are summarized in Section 7. 

2 Trigger 

The production and decay modes of the charged Higgs boson studied here lead to final states containing 
the following: two to four b jets, light jets from hadronic decays of W bosons, one or more neutrinos from 
W or H + decays, and for most channels a tau lepton decaying either hadronically or into an electron or 
muon plus neutrinos. These event characteristics suggest the following ATLAS Trigger [7] menus, one 
for an instantaneous luminosity of L = 10 31 cm~ 2 s _1 , and two alternatives for L = 10 33 cm _2 s _1 , based 
on these trigger signatures: 

• L = lO^cm-V 1 : xE70, e22i, mu20, xE30_Ll_TAU13, xE20_3j20_Ll_TAU13 

• L = 10 33 cm- 2 s _1 : xE80, e55, mu40, xE50_Ll_TAU30, xE40_3j20_Ll_TAU30 or 

xE80, e22i_xE30, mu20_xE30, xE50_Ll_TAU30, xE40_3j20_Ll_TAU30 
where xE represents a missing transverse energy trigger selection, and e, mu, tau and j correspond to 
electron, muon, tau, and jet triggers, respectively. A number before each symbol indicates the required 
trigger multiplicity, as in 3j for three jets. Values after these symbols are the approximate trigger thresh- 
olds in transverse momentum, and a final i means that isolation requirements are applied. Ll_ indicates 
that the stated threshold for the following trigger selection is applied at the first trigger level while the 
high level trigger selection is softer. Combined triggers are indicated by the juxtaposition of two or more 
selections. Each analysis uses a subset of items of the two 10 33 menus. 

The menus were chosen based on a careful study of the signal event characteristics and the most 
realistic trigger signature rates available. The chosen signatures meet the requirements of the trigger 
bandwidth budget [7] at all trigger levels. For each of the channels studied in the following sections, the 
trigger efficiencies will be evaluated. 

3 Light Charged Higgs Boson Searches 

In this section, charged Higgs boson searches in the three light H + channels (m H + < m t ) selected for 
investigation are presented: 

• it '-► bH+bW ->■ bz{had)vbqq 

• tt^bH+bW -> bz(lep)vbqq 

• tt^bH+bW br(had)vb£v 

where for a tau lepton, x{had) indicates its hadronic decay and xilep) its decay to an electron or muon 
plus neutrinos. If the charged Higgs boson is light, then BR(? — > bW) might not be close to unity, as it 
is in the Standard Model prediction. This means that the expected background from Standard Model it 
decays (the dominant background to all H + searches) is reduced. Since the number of events after the 
event selection is A^| M for the background-only hypothesis, and Nh+ + A^ ssm in the signal+background 
case, the number of observable excess events is thus not simply the number of H + events (/V#+) — it is 
decreased by the difference in the SM and the MSSM ti prediction: N excess = N H+ - (A/g M - A^ SSM ). 
This is taken into account consistently. 

All plots, tables and results in this section are based on the trigger menu for an instantaneous lumi- 
nosity of L=10 33 cm~ 2 s _1 presented in Section 2, on the signal cross-sections for the m^-max MSSM 
scenario (see Reference [1]) and the background cross-sections given in Reference [8]. 



1452 



Higgs - Charged Higgs Boson Searches 



3.1 tt -> bH+bW -> bz(had) vbqq 

The channel in which both the x and the W boson decay hadronically has a relatively high cross-section, 
making it a priori one of the most promising in searches for H + lighter than the top quark. On the other 
hand, the absence of leptons, as well as the high hadronic activity, is a challenge, particularly for the 
trigger. The events are characterised by a x jet, four other jets, two of which are initiated by b quarks, 
and missing energy. The main expected backgrounds for this channel are tt events where one top quark 
decays hadronically and the other to a hadronically decaying x. For this study all tt decay modes have 
been considered as background, as well as single top, W+jets and QCD dijet events. For ATLAS, this 
channel has previously only been studied using fast simulation [9]. 

3.1.1 Preselection 

Trigger Either the xE50_Ll_TAU30 or the xE40_3j20_Ll_TAU30 trigger signature (see Section 2) is 
required for this study. 

Cuts I Following the trigger, a set of cuts was used to preselect the signal and suppress the background. 
This first cut set relates primarily to the multiplicity of the offline analysis objects (which are different 
from the trigger objects, leading in some cases to lower optimum cut values for the offline objects). These 
are given in Table 3.1.1. 

Cuts II After the first set of cuts, the W is reconstructed from the pair of light jets with invariant mass 
rrijj closest to the nominal W mass, mw- The parent top quark is then found by pairing the reconstructed 
W with the b quark which leads to an apparent top quark mass rrijjb closest to the nominal value m t . Cuts 
are made both on the W and the top quark reconstructed masses. The top quark on the H + side cannot 
be fully reconstructed due to the presence of the neutrino, but information on its azimuthal angle, 0, and 
transverse momentum, pj, can still be extracted. 

At this stage a large number of background events still remains, mainly from Standard Model tt as 
well as from QCD and single-top production. In order to discriminate against the latter two, two cuts 
(items 8 and 9 in Table 3.1.1) are applied aiming to enhance the topology of tt decays. 



Table 1: tt — ► bH + bW — > bx(had)vbqq: Applied precuts. A ^-tagging cut is applied to all jets, 
yielding b jet reconstruction efficiency of about 70%. For x jets, a cut on the T-tagging variable 
is applied such that a reconstruction efficiency of about 32% is obtained for a pseudorapidity 
| tj | < 1 .5, and a tighter cut (due to the high QCD jet fake-rate for large 17 ) for 1 17 1 > 1 .5 leads to an 
efficiency of about 20% in this region. The Ef lss cut is increased to 50 GeV if only the signature 
xE50Xl_TAU30 has triggered the event. 

Cuts I Cuts II 

1. exactly 1 T-tagged jet with pj > 35 GeV 

2. exactly 2 ^-tagged jets with pj > 15 GeV 

3. at least 2 non-tagged jets with pj > 15 GeV 

4. veto on isolated leptons with p T > 5 GeV 

5. Ef ss > 40/50 GeV 



6. \m% c -m w \ < 30 GeV 

7. \m r t ec -m t \ < 40 GeV 

„ . ,, hardest top softest top . 

8. A(j>(p T ^,p T *J>2.5 

„ hardest top , softest top „. 

9. p T Ip t F < 2 
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(a) (b) 

Figure 1: tt — ► bH + bW — ► bx(had)vbqq: Likelihood distribution (a) and transverse mass (b) for an H + 
mass of 130 GeV and the corresponding background, stacked and normalized to the cross-section with 
tan/3 = 20. The hatched area shows what would be expected from a Standard Model only scenario. A 
cut at 0.6 is made on the likelihood and at 65 GeV on the transverse mass. 



3.1.2 Likelihood Discriminant 

Following these cuts, the remaining background is dominated by tt events, particularly those in which one 
W decays hadronically and the other to a T and a neutrino, and all reducible backgrounds are effectively 
suppressed. In order to discriminate between this background and the signal, a Likelihood Discriminant 
method has been implemented. The variables used (listed in Table 2) have been selected to reflect 
the two characteristics which distinguish the signal from the background: the heavier mass of the H + 
compared to the W, with its effects on the kinematics of the event; and the difference in x polarization 
depending on the parent. This difference, exploited in variable I, will cause the leading track within the 
X jet to be harder [10]. Also selected combinations of variables have been included, as correlations are 
not included in the definition of the likelihood used. Likelihood discriminants are constructed for the 
five different studied H + mass points (90, 110,120,130 and 150 GeV). Figure la shows the resulting 
likelihood distribution for an H + mass of 130 GeV and the background. 

Table 2: tt — > bH + bW — > bx(had)vbqq: Variables used for the Likelihood Discriminant, x denotes 
the rjetinthe event, b H + is theft jet coming from the same top as the presumed H + , andM(T, b H +) 
is the invariant mass of the x jet and the b H + jet. AR = \J (A0) 2 + (Atj) 2 represents a distance. 

T leading T track . T , T r / b H + 

I. p T lp\ V. VtIVt 

II. 1-cos(A<Kt,/^" 4 ')) VI. M{x,b H+ ) 

III. l-cos(A0(tf+,fe ff +)) VII. AR(x,b H +) 

IV. M(x,b m )-AR(x,b H+ ) 



A cut is applied on the obtained likelihood, requiring a value higher than 0.6 for an event to be 
retained (0.8 for a charged Higgs boson mass m H + = 150 GeV, due to the better signal-background 
separation). The transverse mass of the H + is then calculated for the retained events (shown in Fig. 
lb). A final cut is then applied to this reconstructed transverse mass of the H + candidate (at 50 GeV for 
m H+ = 90 GeV, at 60 GeV for m H + = 1 10 and 120 GeV, at 65 GeV for m H+ = 130 GeV and at 75 GeV 
for m H + = 150 GeV). These cut values have been selected to optimize the significance considering both 
systematic and statistical uncertainties. 
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3.1.3 Results 

Table 3: it — > bH + bW — > bz(had)vbqq: Selection cut flow. For each sample, the cross-sections 
after cuts are given in fb and for tan j3 = 20 in the first line and the relative cut efficiencies in the 
second line (in italics). Standard Model cross-sections are given for all the backgrounds. There 
are insufficient Monte Carlo statistics for the QCD events - see discussion in the text for a cross- 
section estimate. 



Channel 




Trigger 


Cuts I 


Cuts II 


rj+ an n^\f 


[fb] 


38388 










[/] 




0.088 


0.119 


0.337 


H+ llOGeV 


[fb] 


27147 


2871 


286 


112 




[/] 




0.106 


0.100 


0.391 


H+ 120 GeV 


lib] 


21363 


2563 


255 


94 




[/] 




0.120 


0.099 


0.368 


H+ 130 GeV 


[fb] 


15666 


2136 


217 


79 




m 




0.136 


0.102 


0.364 


H+ 150 GeV 


[fb] 


5869 


982 


88 


30 




[/] 




0.167 


0.089 


0.342 


tt > 1 lepton 


[fb] 


4.52- 10 s 


56287 


852 


307 




[/] 




0.125 


0.015 


0.360 


tt hadronic 


[fb] 


3.81-10 5 


1746 


37 


21 




[/] 




0.005 


0.021 


0.571 


single top 


[fb] 


112500 


7700 


63 


17 




[/] 




0.068 


0.008 


0.277 


W+jets 


[fb] 


277800 


15489 


73 


30 




[/] 




0.056 


0.005 


0.409 


QCD dijets 


lib] 


3.2-10 8 


3.18-10 5 


5 






m 




0.001 


1.7-10- 4 





Table 3 shows the selection cut flow for the five different signal mass points and all backgrounds 
studied in this note. The expected cross-section after the trigger and after the first and second set of cuts 
is presented, as well as the relative efficiency of each step. 

It is difficult to draw conclusions about the suppression of the QCD background, due to its very high 
cross-section at the LHC. This leads to large statistical uncertainties since no simulated event survives 
the entire analysis chain. In fact already the expected cross-section after Cuts I (see Table 3) is governed 
by large statistical uncertainties. A very conservative upper limit for the expected number of QCD events 
can be obtained by assuming that the expected efficiency of QCD events is limited from above by the 
efficiency of the hadronic tt events for all cuts for which the number of simulated QCD events is not 
sufficient to draw definitive conclusions 2 . This assumption is very conservative since the second set of 
cuts have been specifically designed to identify the tt topology in order to reject QCD. Nevertheless, 
using this assumption, the cross-section of surviving QCD events is limited by 55 fb after the likelihood 
cut (for m H + = 130 GeV), while after the cut on the transverse mass this estimate is down to 50 fb. 
Given the conservative nature of this estimate, the assumption that the background from QCD events is 
negligible compared to the one from tt events is justified. 

Table 4 shows the final expected cross-section following the cut on the likelihood discriminant and 
the further cut on the transverse mass of the H + candidate. The shape of the likelihood discriminant 

2 After Cuts II, even more conservatively, this limit is taken from the efficiency for the tt > 1 c/ju/t events, due to the 
limited number of surviving hadronic tt events at this point. 
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Table 4: tt — > bH + bW — > bz{had)vbqq: Final event selection results. The cross-sections after all 
cuts are given in fb and for tan j8 = 20 as well as the relative cut efficiencies. Standard Model 
cross-sections are given for tt . Backgrounds not tabulated have been found to be negligible. 



Channel 


Cut 


Signal 


tt> 1 


e/n/x 






[fb] 


[/] 


[fb] 


[/] 


H+ 90 GeV 


LH > 0.6 


56.2 


0.413 


55.8 


0.182 




mT >50 GeV 


35.3 


0.628 


32.1 


0.574 


H+ 110 GeV 


LH > 0.6 


53.6 


0.478 


52.7 


0.172 




mT >60 GeV 


35.1 


0.655 


27.9 


0.529 


H+ 120 GeV 


LH > 0.6 


42.6 


0.455 


45.5 


0.148 




mT >60 GeV 


32.5 


0.764 


29.0 


0.636 


H+ 130 GeV 


LH > 0.6 


38.3 


0.483 


50.7 


0.165 




mT >65 GeV 


31.4 


0.819 


25.9 


0.510 


H+ 150 GeV 


LH > 0.8 


14.0 


0.467 


26.9 


0.088 




mT >75 GeV 


9.3 


0.662 


10.3 


0.385 




Figure 2: tt — > bH + bW — > br(had)vbqq: Discovery (left) and exclusion contour (right) for Scenario 
B (mj-max) [1]. Systematic and statistical uncertainties are included. The systematic uncertainty is 
assumed to be 10% for the background, and 24% for the signal (see Sections 5.2 and 5.1). The lines 
indicate a 5a significance for the discovery and a 95% CL for the exclusion contour. 



depends on the mass point for which the analysis is performed, i.e. one will need to run a separate 
analysis using a different likelihood discriminant for each mass point. Therefore the background rejection 
will naturally depend on the mass point for which it is evaluated. 

The shape-based Profile Likelihood method (see Section 6) is applied on the entire transverse mass 
histogram for each masspoint, in order to extract the significance of the signal hypothesis. Fig. 2 shows 
the discovery contour in the (tan/3, m H +) plane for an integrated luminosity of Jzf = 10 fb -1 as well as 
the exclusion reach for Jzf = 1 fb -1 . 

3.2 tt -> bH+bW -> bxilep) vbqq 

The events of the leptonic % channel are characterized by a single isolated lepton, and large missing 
energy due to three neutrinos in the final state. A full reconstruction of the event is therefore impossible. 
Instead, kinematic properties of the event are used to discriminate between the signal and the main 
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background, which is the Standard Model semi-leptonic tt process. 

The branching ratio to T leptons of a light charged Higgs boson is expected to be almost 100%, 
while the Standard Model W decay is universal with respect to lepton flavor. This means that the most 
prominent signature of the signal is an excess of kinematically T-like events. 

Furthermore, it is possible to construct a quantity that is bound by the charged Higgs boson mass 
(in analogy to the transverse mass in a W decay). This 'charged Higgs boson transverse mass' can be 
used to provide further discrimination power against the background, as well as a direct indication for 
the charged Higgs boson mass. 



3.2.1 Preselection 

Trigger The trigger requirement is based on either an isolated lepton or missing transverse energy (xE). 
Events are required to pass one of the following three trigger signatures (see Section 2): e22i_xE30, 
mu20_xE30, or xE80. 



Offline Events that have passed the trigger are selected based on the following cuts: 

• Exactly one isolated lepton with pr > 5 GeV. If the event has not been triggered by the xe80 
signature then the requirement is raised to 20 GeV or 25 GeV (for the e25i+xe30 and mu20i+xe30 
requirement, respectively). 

• Missing transverse energy Ef lss > 120 GeV 

• At least 4 jets with p T > 40 GeV and | tj | < 2.5 

• Exactly two out of the four leading jets are tagged as b jets 

Events surviving these selection cuts are expected to be dominantly tt events. 



3.2.2 Method for final state reconstruction 

First, the hadronic W is reconstructed from the light (non ^-tagged) jets. All possible pairs of light jets 
are considered and the one which gives an invariant mass closest to mw is selected. 

The assignment of the two b jets to the hadronic & leptonic sides is done using the angular correlation 
between the b jets and their associated particles, i.e. the lepton and the reconstructed W. The charge 
correlation between the lepton and its associated b jet is also used. The charge of the b jet is defined in 
the following way: 

_I(p9V 



L(pir 



(1) 



where the sum is over tracks i belonging to the jet, q' is the charge associated to the track, and p' L is 
the longitudinal component of the track momentum with respect to the jet direction. The parameter a is 
optimized to give maximal separation between b and b jets. The value obtained from optimization with 
MC data is a = 0.5. 

A likelihood ratio combining the angular and charge correlations is used. This likelihood ratio is 
defined between two hypotheses corresponding to the two possibilities of assigning the b jets. The 
hypothesis which achieves the higher likelihood ratio score is adopted. This algorithm selects the correct 
b jets assignment for about 70% of the events, when no cut is applied to the likelihood score of the 
selected hypothesis. Higher purity can be achieved at the cost of lower efficiency by placing such a cut, 
but this is not done for this analysis. 
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Figure 3: ft — ► bH + bW — ► bx(lep)vbqq: (a) cos i/a distribution, (b) W transverse mass distribution, for 
signal and background, after selection cuts, for charged Higgs boson masses of 90 and 150 GeV. 



Once the b jets are assigned, the reconstructed top quark mass m r t e 0 c p on the hadronic side of the event 
can be evaluated, and only those events which satisfy 100 GeV < m r t e 0 c p < 300 GeV are retained for further 
analysis. 



3.2.3 Further reduction of ft background 

In order to reduce the number of background events in which the lepton comes directly from a W decay, 
the decay angle cos y is used 

cosy = ^ 7 -1. (2) 

mfop ~ mw 

In such events, y is the angle between the lepton and top quark directions in the W rest frame (In the 
limit that the b quark is massless). The top quark, due to its large mass, couples mostly to the longitudinal 
polarization component of the W boson. As a result the cos y distribution would have a large (« 70%) 
contribution that is symmetric around cos y = 0 [1 1-13]. However, both the left-handed component of 
the W boson and the indirect leptons coming from x decays contribute to the lower cos y region, and this 
region is further enhanced by the selection cuts that favor events with large missing energy and therefore 
leptonic x decays. The signal contribution is similar to that of the indirect leptons, and the less energetic 
b jets, due to the higher H + mass, push cos y to even lower values. This can be seen in Fig. 3 (a). As a 
further selection cut, events are required to have cos y < -0.8. 

The transverse mass of the W (in the hypothesis of a leptonic W decay), calculated using the 
missing transverse momentum and the lepton transverse momentum, provides further discrimination 
against the it background with a direct lepton from a W. The separation is most distinct for low charged 
Higgs boson masses and is shown for two different H + masses in Fig. 3 (b). 



Charged Higgs Boson Transverse Mass A generalized transverse mass for the charged Higgs boson 
(in the hypothesis of a H + —> TV decay) in this channel can be defined. The derivation and properties of 
this variable are described in Reference [14]. 

K + ) 2 = (yfir&r + (p'{ p + p\ + pf ss ) 2 ~ Pt? ~ {p7 SS + Pt P ? O) 

This transverse mass satisfies m# + < nij + < m top . Figure 4 shows the distributions of ntj + for several 
charged Higgs boson masses. 
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m T [GeVI m y [GeV] m T [GeV] 

Figure 4: tt —> bH + bW —> bz(lep)vbqq: distribution, after selection cuts, for m H + = 90, 120, and 
150 GeV. The dotted line represents the nominal simulated mass. 



3.2.4 Results 

The cross-sections of signal and background events surviving selection cuts are shown in Table 5. 



Table 5: tt — ► bH + bW — > bz(lep)vbqq: Selection cut flow. For each sample, the cross-sections 
after cuts (in fb) are given in the first line and the relative cut efficiencies in the second line (in 
italics). The signal cross-sections correspond to tan j8 = 20. 



Channel 


All events 


Trigger 


pre-selection 


reco. cuts 


cosy 


m T 


H+ 90 GeV 


[fb] 


20760 


8408 


239 


190 


74 


70 




[/] 




0.405 


0.028 


0.79 


0.39 


0.94 


110 GeV 


[fb] 


14710 


6401 


138 


104 


42 


37 




[/] 




0.43 


0.021 


0.75 


0.40 


0.89 


120 GeV 


[fb] 


11560 


5305 


125 


82 


32 


23 




1/1 




0.46 


0.023 


0.66 


0.39 


0.77 


130 GeV 


[lb] 


8510 


4103 


75 


49 


23 


21 




[f] 




0.48 


0.018 


0.65 


0.47 


0.88 


150 GeV 


lib] 


3180 


1747 


33 


23 


12 


10 




[/] 




0.55 


0.079 


0.70 


0.54 


0.79 


tt> 1 e/pL/x 


lib] 


452000 


209339 


1963 


1317 


257 


144 




[/] 




0.46 


0.009 


0.67 


0.19 


0.56 


QCD dijet p r =280-1120 GeV 


[lb] 


12.9 -10 6 


213000 


<50 


<50 


<50 


<50 




[/] 




0.017 


<2.5-10- 4 








W+jets 


lib] 


31.2 -10 6 


7.69 -10 6 


173 


86.4 


<80 


<80 




[f] 




0.25 


2.2 -10- 5 


0.50 







Figure 5 (a) shows the differential cross-section for tan j8 = 20. Figure 5 (b) shows the corre- 
sponding distributions for nij . 

The statistical significance of the signal is calculated from both the W transverse mass and the H + 
transverse mass distributions, after event reconstruction and all selection cuts. The final significance is 
taken as the maximum of these two. In Fig. 6, the 5a discovery contour is plotted in the (m#±,tanj6) 
plane. 
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Figure 5: tt — > bH + bW — > bx(lep)vbqq: Transverse mass differential cross-section for signal and back- 
ground, for tan/3 = 20, and for the hypothesis of (a) W, and (b) H + . 




Figure 6: it —>■ bH + bW — > bx(lep)vbqq: Discovery (left) and exclusion contour (right) for Scenario 
B (m^-max) [1]. Systematic and statistical uncertainties are included. The systematic uncertainty is 
assumed to be 10% for the background, and 35% for the signal (see Sections 5.2 and 5.1). The lines 
indicate a 5a significance for the discovery and a 95% CL for the exclusion contour. 



3.3 ft -> bH+bW -> bx(had)vb£v 

In this channel a light charged Higgs boson is produced in it decay. The W boson from one top quark 
decays into leptons, while the x lepton originating from the H + decay results in a jet. At least three 
neutrinos are present in the event and thus the reconstr uction of the complete event is impossible. Due to 
the high branching ratio of this charged Higgs boson decay (BR(// + —> xv ) « 100% for low m#+)> me 
signal can be observed as an excess of tau leptons in the final state over the main background of Standard 
Model it production. 

Due to the production mechanism, the Standard Model it events will be the most important back- 
ground to this channel. However, significant contributions from other backgrounds are still possible, and 
thus important to study: In the inclusive process pp — > VK+jets, the final state is similar to the signal 
signature if one of the jets is mis-tagged as a x jet or a lepton and the W boson decays to the appropriate 
other object. Furthermore, the backgrounds constituted by single top quark events are considered. Single 
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top quarks can be produced with an associated W boson or b jet, and thus produce final states similar 
to the signal. Their contribution to the total background is however expected to be small due to the 
smaller cross-sections. Finally, the overwhelming background of QCD dijet is expected to be completely 
suppressed by demanding a high quantity of missing transverse energy and an isolated lepton. 

3.3.1 Event Selection 

Trigger: Events are required to pass one of the following three trigger items (see Section 2): 
e22i_xE30, mu20_xE30, or xE40_3j20_Ll_TAU30. 

Selection: To select signal events and suppress the Standard Model ti background several small dif- 
ferences between the processes are exploited. The following cuts are applied: 

1- N e;jU > 1: At least one isolated lepton with pj > 10 GeV; muons with |tj| < 2.7 and electrons with 
\r} \ < 2.5 are considered. 

2. Njet s =Ni i g ht j ets +N bjets +N Tjets > 3: At least three jets with p T > 20 GeV. 

3- N T j e t > 1: At least one of the jets is required to be T-tagged. A T jet quality cut is applied, leading 
to a T jet reconstruction efficiency of about 30%. 

4. N b j e t > 1: At least one b jet with /? T > 20 GeV is required, and a b jet quality cut is applied, 
leading to a b jet efficiency of about 60%. 

5. p\ > 40 GeV 

6. p e T > 25 GeV or pj > 20 GeV: This cut is only applied if the event was triggered by the appropriate 
lepton signature. If both lepton trigger requirements are met then only the cut on p e T is applied. 

7. q r + qi = 0: The x jet and the lepton are required to have opposite charge. 

8. Ef ss > 175 GeV 

Figure 7 (a) shows the jet multiplicity of the signal and the main it background. While there are more jets 
for the it background than the signal a cut on jet multiplicity suppresses other backgrounds. Figure 7 
(b) displays the x transverse momentum {p}). Cut 6 is only applied, if the event was triggered by a 
lepton signature. For such events, a p T cut is applied to the reconstructed lepton according to the trigger 
threshold. 

3.3.2 Results 

In Table 6 the cross-sections for signal and background after each cut are shown. The first two cuts select 
events which include a lepton and three jets, assumed to be the two b jets and a X jet. These cuts are 
motivated by the final state under study, but do not reduce the most important leptonic ti background. 
Requiring one jet to be a x jet removes more than 95% of the ti background while at least one quarter of 
the signal remains, depending on the charged Higgs boson mass. Only one jet is required to be 6-tagged, 
which is motivated by the fact that m H + > and thus the b quark produced in t — > bH + is in average 
softer (the most probable value for p h T is about 55 GeV for t — > bW, and between 15 GeV for m H + = 150 
GeV and 45 GeV for m H + = 90 GeV for t —> bH + ). The ^-tagging efficiency decreases quickly for low 
Pj [5] and thus the number of reconstructed b jets decreases with increasing m H +. 

Due to the mass difference mentioned above and the x polarizations being different depending on 
whether the x jet originated from an H + or W, the T-jet is expected to be harder for the signal than the 
background, motivating a cut on p\. A harder x jet in turn leads to more missing energy, thus making 
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Figure8: tt bH+bW bz(had)vb£v: Ef iss differential cross-section after all cuts for tan j8 = 20. The 
signal contributions for the different mass hypothesis are individually stacked on top of the background 
distribution. 



the Ef lss an attractive variable to cut on. A high value is required for Ef lss . This is mainly needed to 
optimize the significance with respect to the systematic error. Multidimensional optimization attempts 
have shown that above respective thresholds, the significance is not very sensitive to changes in the values 
of other continuous selection cuts (such as p\). 

The normalized number of events as a function of the missing transverse energy is shown in Fig. 8. A 
clear excess is observable for low charged Higgs boson masses, while close to the top quark mass there is 
only sensitivity for very high values of tan /3 . Figure 9 shows the final discovery and exclusion contours 
for the channel under investigation. A significant region of the parameter space above tan j3 = 30 is 
covered by this decay mode, while the sensitivity for low and intermediate tan j3 is limited. 

4 Heavy Charged Higgs Boson Searches 

In this chapter, charged Higgs boson searches in the two heavy H + {m H + > m t ) channels selected for 
investigation are presented: 

• gg/gb^t[b]H+^bqq[b}z(had)v 

• gg/gb -► t[b\H + -► t[b]tb -► bW[b]bWb -► b£v[b]bqqb 

The notation [b] implies the additional b given in the production mode gg — > tbH + , which is not produced 
in the mode gb —>■ tH + . All plots, tables and results in this section are based on the trigger menu for an 
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Table 6: it — > bH + bW — > bT(had)vb£v: Event selection cut flow. For each sample, the cross- 
sections after cuts are given in fb and for tan j8 = 20 in the first line and the relative cut efficiencies 
in the second line (in italics). 



Channel 


All events 


Trigger 


>1 e,ju 


> 3 jets 


> It 


>lb 




Lq 




H + 90 GeV 


[fb] 


12098 


6219 


4972 


4248 


1092 


929 


586 


582 


44 




[/] 




0.57 


0.80 


0.85 


0.26 


0.85 


0.63 


0.99 


0.08 


llOGeV 


[fb] 


8570 


4510 


3534 


2986 


772 


650 


439 


431 


30 




[/] 




0.53 


0.78 


0.84 


0.26 


0.84 


0.67 


0.98 


0.07 


120 GeV 


[fb] 


6737 


3611 


2868 


2440 


654 


535 


360 


354 


23 




[/] 




0.54 


0.79 


0.85 


0.27 


0.82 


0.67 


0.98 


0.06 


130 GeV 


[fb] 


4954 


2670 


2112 


1730 


512 


399 


270 


265 


20 




[/] 




0.54 


0.79 


0.82 


0.30 


0.78 


0.67 


0.98 


0.07 


150 GeV 


[fb] 


1853 


1048 


836 


626 


177 


130 


94 


94 


7 




[/] 




0.57 


0.80 


0.75 


0.28 


0.74 


0.72 


7.00 


0.07 


tt> 1 efn/% 


[fb] 


452000 


169612 


137928 


122547 


4760 


4006 


1915 


1730 


78 




[/] 




0.37 


0.81 


0.89 


0.04 


0.84 


0.48 


0.90 


0.04 


single top 


[fb] 


112500 


30180 


25065 


18081 


271 


168 


47 


38 


0 




[/] 




0.27 


0.83 


0.72 


0.02 


0.67 


0.28 


0.87 


0.0 


W^ev+jets 


[fb] 


476012 


144997 


114152 


53060 


780 


90 


40 


29 


0 




[/] 




0.30 


0.79 


0.46 


0.07 


0.72 


0.44 


0.74 


0.0 


jUV+jets 


[fb] 


157800 


48372 


43003 


41493 


582 


70 


40 


26 


0 




[/] 




0.31 


0.89 


0.96 


0.07 


0.72 


0.57 


0.64 


0.0 


TV+jets 


[fb] 


277755 


23187 


9443 


6920 


187 


20 


12 


3 


0 




[/] 




0.08 


0.41 


0.73 


0.03 


0.70 


0.67 


0.22 


0.0 





Figure 9: «" — > bH + bW —>■ bz(had)vb£v: Discovery (left) and exclusion contour (right) for Scenario 
B (mj-max) [1]. Systematic and statistical uncertainties are included. The systematic uncertainty is 
assumed to be 10% for the background, and 41% for the signal (see Sections 5.2 and 5.1). The lines 
indicate a 5a significance for the discovery and a 95% CL for the exclusion contour. 
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Figure 10: gg/gb — > t[b]H + — > fe^[fc]T(/iaJ)v: Probability density functions for ff and two signal mass 
points. The plots show, from the upper left to the lower right, \og{pj), \og{Ef lss ), and 1 — cos(A0). 



instantaneous luminosity of L=10 33 cm 2 s 1 presented in Section 2, on the signal cross-sections for the 
ra/j-max MSSM scenario (see Reference [1]) and the background cross-sections given in Reference [8]. 

4.1 gg/gb -> t[b]H+ -> bqq[b]x(had)v 

The following section describes the event selection for the H + —> xv channel for the case m H + > m t . 
The signal final state is characterised by a hard x jet, large missing transverse momentum (due to the 
neutrino), one or two b jets, two light jets, a W boson and a top quark (the mass of which can be 
reconstructed) and has previously been investigated in Reference [15]. 

The main background to this signal channel are tt decays, in particular when one of the top quarks 
decays to a x jet, t —>■ bx(had)v, and the other one hadronically, t — > bqq. However, other tt modes 
can also contribute when some of the objects in the event are not correctly reconstructed, e.g. a light 
jet as a x jet. Other backgrounds to be considered are single top, W+jets and QCD multi-jet events. A 
parametrized detector simulation has been used to evaluate the leptonic (e,/JL,x) tt background. The 
parametrization has been adjusted (using a smaller full detector simulation tt sample) such that the 
distributions of all the quantities in the events agree with the full detector simulation. 

4.1.1 Preselection 

Trigger: The trigger selection employs items of the H + trigger menu (see Section 2). Events are 
required to pass at least one of the two following trigger item combinations: xE40_3j20_Ll_TAU30 or 
xE50_Ll_TAU30. 

T jet reconstruction: Only x jet candidates with transverse momenta greater than 15 GeV with a pseu- 
dorapidity outside of the crack region 1.4 < T] < 1.6 are considered, and a high cut on the % quality 
variable is placed yielding a x reconstruction efficiency of about 20% and leading to a high rejection of 
parton jets. Exactly one x jet is required in the event, followed by a cut of p\ > 50 GeV in order to 
reduce the QCD background at an early stage. 

Jet reconstruction: At least three more jets with pj > 15 GeV are required, and exactly one of them 
has to be ^-tagged. For this purpose, a ^-tagging cut is applied to all jets with a ^-tagging efficiency of 
about 70%. 

Missing Transverse Energy: A soft cut of 40 GeV on the transverse missing energy E™' ss is applied 
to remove most of the QCD background already in the preselection, while it affects the tt background 
only slightly (about 30% of the tt events with leptons are removed). 



1464 



Higgs - Charged Higgs Boson Searches 




Figure 11: gg/gb — > t[b]H + — > T(/iod)v: Likelihood distributions. The areas are normalized to 

unity. The background likelihood for 170 GeV does not peak at 0 because of the trigger selection (and to 
a smaller degree the less stringent preselection cuts for the PDF determination) which already removes 
most of the events in the first bins. 



W boson and top quark reconstruction: For further discrimination of all backgrounds without hadron- 
ically decaying top quarks, an attempt is made to reconstruct a W boson and a top quark. For all combi- 
nations of the b jet with two other jets, the minimum value of 

% 2 = (mjj-mw) 2 + {m Kjjy -m t ) 2 

is calculated for each event (mw and m, are the nominal W and t masses, (jj) r are the two jets used in 
the W reconstruction, rescaled to the W mass, and o mw and o,„ t represent the resolution of the W and t 
mass reconstruction, 10 GeV and 15 GeV). The resulting x 2 values are required to be smaller than 3. 
Lepton veto: To eliminate events with leptons (in particular leptonic it modes), events with at least 
one isolated lepton (e, ju) with Pj>1 GeV are rejected. 

The preselection aims at suppressing the reducible background such that only tt events involving 
one hadronic and one semileptonic top quark decay survive. W+jets events are successfully removed by 
requiring a ^-tagged jet and a reconstructed top quark in the events, single top events by requiring a hard 
T jet and high E™ m together with a reconstructed hadronically decaying top quark, and QCD events by 
requiring high E™ ISS (and additionally by requiring b- and T-tags and a reconstructed top quark). 

4.1.2 Likelihood for further reduction of the tt background 

After the preselection cuts, the background is dominated by tt events with one W decaying hadronically, 
and the other one to a hadronically decaying % lepton and a neutrino (75% of the remaining background). 
An uncorrected likelihood approach has been chosen to reduce this background, employing the follow- 
ing discriminant variables: (i) p\, (ii) £"™ 1SS , (hi) A(p (azimuthal angle between the T jet and the missing 
momentum), (iv) Hj (scalar sum of the pr of all jets in the event (excluding x jets), and (v) p r f bo (ratio 
between the transverse momenta of the % jet and the hardest jet not used for the top quark reconstruction). 

The probability density functions (PDFs) have been created for each signal mass point and for the 
tt sample (with at least one leptonic W decay) using full detector simulation. Slightly less stringent 
preselection cuts (jp\ >40 GeV instead of 50 GeV, at least 1 b jet instead of exactly 1, and a maximum 
W/top reconstruction % 2 of 6 instead of 3) and no trigger selection are used to obtain the PDFs in order to 
keep a sufficiently high number of simulated events. The PDFs for two mass points and the tt background 
are shown in Fig. 10. 
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Figure 12: gg/gb — ► t[b]H + — > ^^[Z?]t(/jck/)v: # + transverse mass distributions for three signal mass 
points and the corresponding background fortan/3 = 35. The signal events are stacked on top of the 
background events. 



The likelihood distributions for two example mass points are shown in Fig. 11. A cut requiring the 
likelihood to be higher than 0.9 (m H + = 400 and 600 GeV) or 0.95 (other mass cases) is applied and the 
charged Higgs boson transverse mass of the remaining events is plotted as a final distribution to extract 
the significance of the H + signal. These transverse mass distributions are shown in Fig. 12. A clear 
excess can be observed for lower H + masses, but the shapes are similar for signal and background. 
Varying cuts to change the signal-to-background ratio would be a quick way to establish the excess if it 
was statistically unambiguous, followed by data-driven background estimation methods as described in 
Section 5.2. For higher H + masses, sensitivity is only given for higher values of tan/3, but the signal 
and background shapes are clearly distinguishable. 

Event selection summary: The following list summarizes the event selection: 

• Trigger: Trigger xE40_3j20_Ll_TAU30 or xE50_Ll_TAU30 

• Cut A: exactly one x jet with p\ > 50 GeV, and Ef iss > 40 GeV 

• Cut B: at least three additional jets 

• Cut C: exactly one of the additional jets ^-tagged 

• Cut D: veto on a lepton (e, /i) with p e T >l GeV 

• Cut E: W boson and top quark reconstructed with % 2 < 3 

• Cut F: likelihood value greater than 0.95 (0.9 for m H+ 400 and 600 GeV) 

• Cut G: m H + in a certain mass window 

4.1.3 Results 

The cross-section of signal and background events surviving preselection cuts are shown in Table 7. All 
backgrounds except for tt with at least one leptonic W decay mode are efficiently suppressed at this 
stage. The QCD dijet selection efficiency is too small to draw definite conclusions due to its high cross- 
section, but assuming that dijet events in pj -bins lower than 140 GeV cannot produce a hard x jet and 
three jets with the top quark-invariant mass (plus large missing Ej ), that the numbers presented for Cut 
B are of the right order of magnitude and that the relative efficiencies for the remaining cuts is smaller 
than for tt events the conclusion can be drawn that the background from dijet events is negligible. 

The efficiencies of the remaining Cuts F (likelihood) and G (m H + ) are shown separately in Table 8; 
as for these cuts the background efficiencies depend on the H + mass hypothesis. At this point, all other 
backgrounds except for tt events with W decays to e, }X or x are negligible. 

A signal-to-background ratio of the order of 1 has been achieved, resulting in robustness with respect 
to systematic uncertainties. The cross-section for very heavy H + — > XV production is small but a very 
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Table 7: gg/gb — > t[b]H + — > &gg[&]T(/iafi?)v: Preselection cut flow. For each sample, the cross- 
sections after cuts are given in fb and for tan j3 = 35 in the first line and the relative cut efficiencies 
in the second line (in italics). The size of the QCD dijet sample is too small to draw accurate 
conclusions. 



Channel 


All events 


Trigger 


Cut A 


CutB 


CutC 


CutD 


CutE 


H+ 170 GeV 


[fb] 


1346 


280 


76 


67 


36 


35 


14.7 




[/] 




0.21 


0.27 


0.88 


0.54 


0.96 


0.42 


200 GeV 


[fb] 


551 


139 


40 


33 


19 


18 


1A 




[/] 




0.25 


0.28 


0.83 


0.57 


0.97 


0.41 


250 GeV 


[fb] 


184 


58 


17 


15 


7.8 


7.6 


2.9 




[/] 




0.32 


0.30 


0.84 


0.54 


0.97 


0.38 


400 GeV 


[fb] 


28 


11 


3.3 


2.8 


1.5 


1.4 


0.58 




[/] 




0.39 


0.31 


0.84 


0.52 


0.98 


0.41 


600 GeV 


[fb] 


4.5 


1.7 


0.52 


0.46 


0.24 


0.23 


0.10 




[/] 




0.39 


0.30 


0.87 


0.52 


0.97 


0.43 


tt> 1 e/fi/% 


[fb] 


452000 


56300 


1669 


1532 


697 


518 


188 




[/] 




0.12 


0.03 


0.92 


0.46 


0.74 


0.36 


tt hadronic 


[fb] 


381000 


1746 


37 


37 


16 


16 


5 




[/] 




0.005 


0.02 


1.00 


0.43 


1.00 


0.33 


QCD dijet pr =140-1 120 GeV 


[fb] 


3.2- 10 8 


3.2 -10 s 


1285 


1234 


106 


106 






[/] 




0.001 


0.004 


0.96 


0.09 


1.00 




W+jets 


[fb] 


341200 


19170 


2518 


1892 


314 


314 


13 




[/] 




0.06 


0.13 


0.75 


0.17 


1.00 


0.04 


single top 


[fb] 


112500 


7570 


161 


132 


39 


37 


12 




[/] 




0.07 


0.02 


0.81 


0.30 


0.93 


0.32 



good discrimination against the background can be obtained with the likelihood method. The resulting 
discovery contours are presented in Fig. 13 and show that a sizable region of the MSSM space which 
has not been explored experimentally before can be covered. Sensitivity is given for large tan /3 and 
m H+ < 500 GeV. 

4.2 gg/gb -> t[b]H+ -> t[b]tb -> bW[b]bWb -> b£v[b]bqqb 

In this section the analysis strategy for the H + — > channel is outlined. This is a particularly challenging 
signal final state which includes 3 (or 4) b quarks, 2 light quarks, 1 high pr lepton and one neutrino. The 
channel has previously been studied for ATLAS using fast detector simulation [16], and several aspects 
of the previous analysis have been adopted. However, the use of full simulation for the present study 
showed the need to add several new cuts to improve the signal to background ratio. 

4.2.1 Trigger 

Events are required to pass one of the following trigger signatures (see Section 2): e22i_xE30, mu20_xE30, 
or xE40_3j20_Ll_TAU30. 

4.2.2 Preselection 

Following the trigger, the events are required to pass a set of preselection criteria which define the mini- 
mum requirements needed for the event reconstruction: 

• exactly 1 isolated lepton (e or ju) with p e T > 25 GeV, p£ > 20 GeV and \rj \ < 2.5. 
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Table 8: gg/gb ^t[b]H + — > bqq[b]x(had)v. Event selection results. The cross-sections after cuts 
are given in fb and for tan /3 = 35 as well as the relative cut efficiencies. The remaining background 
consists only of it events with at least one W decay to e, }JL or x plus v. 



Channel 


Cut 


Signal 


Back 


ground 






[fb] 


[/] 


[fb] 


[/] 


H+ 170 GeV 


LH>0.95 


3.8 


0.26 


2.3 


0.012 




mf > 100 GeV 


3.8 


0.99 


2.1 


0.91 


200 GeV 


LH>0.95 


3.1 


0.42 


3.2 


0.017 




mf > 120 GeV 


2.9 


0.92 


2.4 


0.74 


250 GeV 


LH>0.95 


1.4 


0.47 


3.1 


0.017 




mf > 150 GeV 


1.1 


0.77 


2.0 


0.63 


400 GeV 


LH>0.9 


0.47 


0.80 


4.5 


0.024 




m$ + > 250 GeV 


0.26 


0.56 


0.33 


0.074 


600 GeV 


LH>0.9 


0.075 


0.76 


2.5 


0.013 




mf > 300 GeV 


0.044 


0.58 


0.15 


0.062 




Figure 13: gg/gb — > t[b]H + — > bqq[b]x(had)v: Discovery (left) and exclusion contour (right) for Sce- 
nario B (m/,-max) [1]. Systematic and statistical uncertainties are included. The systematic uncertainty 
is assumed to be 10% for the background, and 44% for the signal (see Sections 5.2 and 5.1). The lines 
indicate a 5a significance for the discovery and a 95% CL for the exclusion contour. 



• at least 5 jets with p T > 20 GeV and | T] | < 5. 

• at least 3 ^-tagged jets with |tj| < 2.5. 

Lepton selection: Electron candidates that pass the cuts on transverse momentum and pseudorapidity 
mentioned above are required to pass further identification and isolation criteria based on the shower 
shape in the electromagnetic calorimeter and the quality of the track in the inner detector. Concerning 
isolation, the energy in a cone of AR = 0.20 around the electron is required to be less than 20% of the 
electron transverse energy. The same isolation criteria is also applied to muon candidates. 

Jet selection and ^-tagging criteria: Jet multiplicity is one of the main sources of combinatorial back- 
ground and the ^-tagging efficiency is the major factor reducing the overall efficiency of the preselection. 
From jets passing the preselection cuts mentioned above, those within the range \ rj \ < 2.5 are considered 



1468 



Higgs - Charged Higgs Boson Searches 



^-tagged if passing a ^-tagging quality cut leading to a b jet reconstruction efficiency of about 60%. Only 
events with at least 3 ^-tagged jets are accepted for the rest of the analysis. 

4.2.3 Reconstruction 

Apart from the objects expected in the event final state, jets from the underlying event are also present 
leading to an increase of the jet multiplicity. 

Reconstruction of the leptonic W: After the preselection is performed, all physics objects necessary 
for the complete reconstruction of the event are present except for the neutrino coming from the leptoni- 
cally decaying W. In order to reconstruct the four- momentum of the neutrino, its transverse component 
is identified with the missing transverse momentum, and the longitudinal component is computed using 
the W mass constraint, leading to 0, 1 or 2 real solutions. In about 25% of the cases no real solution 
can be found. In order to recover those events, the approximation of neglecting the imaginary part of the 
solution is applied and has been found to have only a small effect on the top quark mass resolution. 

The Combinatorial Likelihood: The next step is to associate the reconstructed physics objects with 
objects of the event, assuming a signal event topology. The number of possible combinations is very large 
and depends strongly on the number of light jets in the event. In order to overcome this combinatorial 
background, a likelihood function is defined. The likelihood formalism used in this analysis is based on 
m variables and should discriminate between n classes of events. For each of the m variables, first the 
probability density functions //(*,-) for each of the n classes are determined: Then the probability for an 
event to be of class j when the value x,- is measured for variable i is given by: 

The information about all m variables are combined, ignoring correlations, to define the likelihood Jzfy 
that an event belongs to class j when measuring the values x t for variables i=l,...,m: 

% = rciiP/fo) (6) 
3 iLinti*/) 

A combinatorial likelihood is used to discriminate between the two classes of events: the correct and the 
wrong combinations. The likelihood is based on 8 variables: 

• mjf. The invariant mass of two light jets. 

• nijj h : The invariant mass of two light jets and one b jet. 

• m£vb'- The invariant mass of the lepton, one of the two solutions of the neutrino and one b jet. 

• PT{bn)'- The transverse momentum of the b jet associated to the charged Higgs boson decay. 



• AR(jJ): The distance in the azimuthal-pseudorapidity plane (AR = y / A0 2 + Atj 2 ) between two 
light jets. 

• AR(jj,b): AR between the sum of two light jets and one b jet. 

• AR(£,b): AR between the isolated lepton and one b jet. 

• AR(pH,tH)'- AR between the & jet and the top quark associated to the charged Higgs boson decay. 
The likelihood is computed for each combination, and the combination with the highest likelihood in the 
event is chosen. If the maximum likelihood in the event is found to be less than 0.7, the event is rejected. 
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4.2.4 Final Event Selection 

After the event reconstruction is complete the physical backgrounds must be suppressed, of which the 
largest is tt + jets. In order to reduce this background requiring 4 b jets in the event is found to be crucial. 
The events passing the 4 b jets requirements are then passed to a likelihood cut. The likelihood function 
used is based on 5 variables: 

• r\b H '■ The pseudorapidity of the b jet associated to the charged Higgs boson decay. 

• Hb w b- The sum of the ^-tagging weights of the 3 b jets associated to the top quark and charged 
Higgs boson decay. 

• < Jzf >: The average combinatorial likelihood in the event. 

• AR(bH,b t b t ): The distance in the (<j),ri) plane between the b jet associated to the charged Higgs 
boson decay and the system of the two b jets associated to the top quark decays. 

• Pt IPt'- The pr ratio of the two b jets not associated to the top quark decay, b\ being the one with 
the lowest pj. 

A cut on the output of this likelihood is applied. Its value is optimized to maximize the charged Higgs 
boson signal significance. This final step improves the significance by only 10 to 15%. The limited 
number of simulated events at this stage of the analysis, especially for the backgrounds, made it very 
difficult to optimize the choice of variables to include in the likelihood. Larger background samples 
will be needed to be able to define a more performant likelihood as it was done in Reference [16] with 
parametrized detector simulation. 

4.2.5 Results 

Table 9 shows the selection cut flow after the different steps of the analysis for all simulated signal 
masses, and for the background for one H + mass hypothesis. Table 10 presents the results for all 
simulated H + hypotheses. In Fig. 14, the reconstructed charged Higgs boson mass for signal and physics 
background is shown. Since the limited number of simulated events does not allow the construction of 
a performing final selection likelihood, currently no H + discovery or exclusion power can be extracted 
from this channel on its own and thus no contours are shown. It, however, contributes to the combined 
H + sensitivity. 

5 Systematic Uncertainties and Background Extraction From Data 

The observation of a charged Higgs boson signal will be subject to statistical and systematic uncertainties. 
The systematic uncertainties stem from two sources: theoretical and experimental, and both are discussed 
in Section 5.1. Section 5.2 addresses how to extract the dominating ti background from real data using 
a novel technique with so-called control samples. 

5.1 Systematic Uncertainties 

5.1.1 Theoretical Systematic Uncertainties 

Uncertainties in the expected production cross-sections for background and signal processes affect the 
discovery /exclusion potential of the channels under investigation. For all channels the uncertainty of the 
tt background is particularly interesting since this is the dominant background. A 12% uncertainty on 
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Table 9: gg/gb -> t[b]H+ -> f [fc]ffc -> &W[fe]feWfe -> b£v[b]bqqb: Selection cut flow. The cross- 
sections in fb are given after each cut and for tan/3 = 35 as well as the relative cut efficiencies. 
The backgrounds are shown for a charged Higgs mass boson hypothesis of m H + = 250 GeV. 



Channel 


All events 


Trigger 


Preselection 


Reconstruction 


4 b tags 


Selection 




200 GeV 


[fb] 


105 


64 


2.3 


2.2 


0.18 


0.16 






[/] 




0.61 


0.036 


0.93 


0.08 


0.89 




250 GeV 


[fb] 


170 


108 


8.1 


7.3 


1.06 


0.65 






[/] 




0.63 


0.075 


0.88 


0.11 


0.61 




400 GeV 


[fb] 


65 


45 


4.5 


3.9 


0.43 


0.26 






[/] 




0.69 


0.10 


0.88 


0.11 


0.59 




600 GeV 


[fb] 


22 


16 


1.8 


1.7 


0.27 


0.18 






[/] 




0.75 


0.12 


0.92 


0.16 


0.67 


it 


+ jets 


[fb] 


112000 


74400 


1040 


875 


35.0 


11.7 






[/] 




0.66 


0.014 


0.84 


0.04 


0.33 


tt 


bb (QCD) 


[fb] 


2240 


1575 


130 


117.8 


17.6 


6.8 






[/] 




0.70 


0.083 


0.90 


0.15 


0.38 


it 


bb (EW) 


[fb] 


244 


155 


14.4 


13.0 


2.0 


0.39 






[/] 




0.63 


0.09 


0.90 


0.15 


0.19 



Table 10: gg/gb ->■ ->■ f [fc]f& ->■ ->■ blv[b]bqqb: Event selection results. The 

cross-sections after all cuts are given in fb and for tan/3 = 35 as well as the global selection 
efficiencies. 



Channel 


Signal 


tt 


+ jets 


tt bb (QCD) 


tt bb (EW) 




[fb] 


[/] 


[fb] 


[/] 


[fb] 


[/] 


[fb] 


[/] 


200 GeV 


0.16 


0.0015 


21.0 


0.00019 


12.2 


0.0054 


1.05 


0.0043 


250 GeV 


0.65 


0.0038 


11.7 


0.00010 


6.76 


0.0030 


0.74 


0.0030 


400 GeV 


0.26 


0.0039 


15.9 


0.00014 


8.91 


0.0040 


1.05 


0.0043 


600 GeV 


0.18 


0.0084 


19.8 


0.00018 


10.5 


0.0047 


1.21 


0.0049 



the NLL calculations is expected 3 leading to o t j = 833 ± 100 pb [17]. Other backgrounds considered 
have similar or smaller uncertainties. 

The branching ratios BR(f — > H + b) and BR(// + —> rv,cs,tb) have been determined with the Feyn- 
Higgs package, and similar systematic uncertainties apply [18]: 

• ABR(f -► H+b) /BR < 10% 

• ABR(#+ -► TV) /BR < 5% 

• ABR(#+ -► cs,tb)/ER < 10% 

In the high-mass region, the dominant systematic uncertainties on the charged Higgs boson production 
cross-section stem from the renormalization scale and factorization scale dependence and are calculated 
to be smaller than 20% in the whole MSSM space. The decrease of the cross-section due to supersym- 
metry loop corrections has been taken into account by adjusting the cross-sections with an additional 

3 The tt cross-section will be measured in early LHC studies and transform this theoretical uncertainty into a much smaller 
experimental uncertainty, despite possible H + effects in the measurement. 
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Figure 14: gg/gb t[b]H + -> t[b]tb -> bW[b]bWb -> b£v[b]bqqb: Reconstructed H + mass. The value 
of tan/3 has been chosen such that the pure statistical significance results in a value of 5. 



factor as proposed in Reference [19], reflecting the altered relation between the bottom quark mass and 
its Yukawa coupling, Am/,. Remaining supersymmetry loop corrections are shown to be negligible. 

5.1.2 Experimental Systematic Uncertainties 

Several quantities are subject to experimental systematic uncertainties: Tagging and reconstruction ef- 
ficiencies, energy scales, energy resolutions and the luminosity determination. A detailed list of the 
systematical uncertainties considered, including their numerical value, is given in Table 1 1 . Each sys- 
tematic effect has been evaluated individually using the given uncertainty on an event-by-event basis. 

The systematic uncertainty of the missing transverse energy is indirectly considered by taking the 
effects of the other systematic effects in the missing transverse momentum calculation into account. 
The systematic uncertainty estimates are generally conservative, in particular for the results assuming an 
integrated luminosity of 30 fb~ 1 . However, due to the usage of control samples to estimate the tt back- 
ground, the uncertainty values have a negligible impact on the H + discovery sensitivity. Furthermore, 
tests have shown that even for the exclusion sensitivity the effect is very small: Assuming that the total 
experimental systematic uncertainty on the results could be halved, the change in sensitivity would only 
be of the order of 0.1 in tan/3 for fixed values of m H +. 

The dominant systematic uncertainty for all H + channels is the jet energy scale, with values between 
10% and 30%. Similarly, channels with hadronic T decays are strongly affected by the x jet energy scale. 
The channel H + — > tb, requiring 4 &-tags, is strongly affected by uncertainties in ^-tagging efficiency 
and rejection of light jets. The total experimental systematic uncertainty for the different H + channels 
is between about 15% and 40% for the signal, affecting mainly the exclusion sensitivity. Similar values 
apply for the main background, tt , which would remove most of the discovery potential. Thus a tech- 
nique for a data-driven estimation of the background has been developed, greatly reducing the systematic 
uncertainty on the background. These "tt control samples" are discussed in the following section. 
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Table 11: Effects of systematic uncertainties for all channels under investigation. The num- 
bers are given in terms of percentage changes in cross-section. The channels are: 1: tt — > 
bH+bW -»■ bx{had)vbqq (see Section 3.1), 2: tt '-► bH + bW -► bx{lep)vbqq (see Section 3.2), 
3: tt^bH + bW -> bx(had)vb£v (see Section 3.3), 4: gg/gb -► t[b]H + -► bqq[b]x(had)v (see 
Section 4.1) and 5: -► -► f[J>]rf> -► -»• b£v[b]bqqb (see Section 4.2). 



Uncertainty 


Value 


S 


1 

B 


s 


B 


S 


B 


S 


1 

B 


S 


5 

B 


T E Resolution 


0.45 x \/E 


-2 


+3 






+8 


-3 


-4 


-1 






T E Scale 


-5% 
+5% 


-2 
-5 


+5 
-5 






0 
+8 


-9 
+1 


-15 
+4 


-21 
+28 






T-tag Efficiency 


±5% 


-5 








-8 


-1 


-8 


-5 






Jet E Resolution 


0.45^£, |T7| < 3.2 
0.63\/E, |i? 1 > 3.2 




-3 


-8 


+5 


+8 


+3 


-12 


-3 


-2 


-4 


Jet E Scale 


+7(15)%,|rj| < (>)3.2 


-9 


+12 


+29 


+22 


+35 


+19 


+4 


-18 


+9 


+8 


-7(15)%,|r/|<(>)3.2 


-5 


-5 


-21 


-12 


-19 


-17 


-31 


+15 


-8 


-6 


fo-tag Efficiency 


±5%e btag 


0 


-14 


+4 


-6 


0 


-3 


-7 


+3 


-8 


-10 


fo-tag Rejection 


-10% 


-7 


+10 


0 


+ 1 


0 


0 


-2 


-3 


-4 


+6 


+10% 


+7 


-2 


0 


0 


0 


-1 


-3 


-1 


0 


-5 


,u E Resolution 


0.01 l/P T 00.00017 


0 


0 


-4 


+ 1 


0 


+1 


0 


0 


-4 


-5 


jU E Scale 


-1% 


0 


0 


0 


+ 1 


+4 


-1 


0 


0 


-4 


-6 


+1% 


0 


0 


-4 


-1 


0 


0 


0 


0 


+4 


+7 


jU Efficiency 


±1% 


0 


0 


0 


-1 


0 


0 


0 


-2 


-2 


-1 


e E Resolution 


0.0073 x E T 


0 


0 


0 


0 


0 


-1 


0 


0 


-4 


-4 


e E Scale 


-0.5% 


0 


0 


0 


+1 


0 


-1 


0 


0 


-4 


-5 


+0.5% 


0 


0 


0 


-1 


+4 


-1 


0 


0 


+4 


+6 


e Efficiency 


±0.2% 


0 


0 


0 


0 


0 


0 


0 


0 


0 


-1 


Luminosity 


-3% 
+3% 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 


-3 
+3 



5.2 tt Control Samples 

The tt process, in particular with one or more x leptons in the final state, is the dominant background 
to all analyses presented in this note. As the relative contributions from this background in different 
jet multiplicities are not known, the subtraction of these backgrounds using a data-driven method is 
necessary. 

The method is data-driven in the sense that it uses tt —> WbWb —> \ivb\ivb and tt —> WbWb —> 
jxvbqqb events collected by ATLAS to model the tt backgrounds with one or more taus in the final state. 
After applying a minimal set of event selection criteria on the data (optimized for both efficiency and 
purity), one or two leptons from the events are removed and the 4 momenta of the removed objects are 
scaled into tau leptons with corrections for the mass. The tau leptons are fed into TAU0LA [20] for decay 
and the decay products are passed to the ATLAS detector simulation and reconstruction software. Finally 
the result is merged with the original event from which the leptons were removed to constitute a control 
sample. 

The result is a data-driven control sample for each of the tt final states which potentially constitute 
a background to one of the H + analyses, using events from data that can be easily and efficiently 
triggered. With this method both the shape and the normalization of these backgrounds for all x decays 
(i.e., leptonic and hadronic decays, or in the case of analyses requiring two taus the lepton-hadron and 
hadron-hadron modes) can be modelled. 
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5.2.1 Obtaining ft Control Samples from Data 

To extract the ti control samples from data a set of selection criteria is applied. These are designed to 
optimize the efficiency and purity of the samples. 

Dimuonic channel In order to extract the it —> WbWb — »■ \ivb\ivb sample, events with at least two 
isolated muons with p T > 20 GeV are selected. To reject muons coming from Z decays, events with a 
dimuonic invariant mass in the range 70-1 10 GeV are rejected. A further requirement is that the missing 
transverse energy in the event is larger than 40 GeV. 

Table 12 summarizes the result of the above selection. The efficiency of the signal (dimuonic ti 
events) to survive this selection is 28% and the sample purity is estimated to be 71%. 

Table 12: Efficiency and purity for collecting ti dimuonic events. Each m«20 indicates a generator 
level cut requiring one muon with pj > 20 GeV. 



Process 


cross-section [fb] 


efficiency 


events [fb-1] 


it signal 


9310 


0.284 


2641 


ti background 


823690 


4.96 10" 4 


407 


W+Jets 


202400 


1.61 • 10 4 


33 


Z+Jets 


210290 


1.45 10" 3 


305 


bb(mu20mu20) 


261000 


1.23 -10- 3 


322 


Total background 






1067 



jU+jet channel The selection criteria for the ti — > WbWb — > jxvbqqb sample are designed to reject 
bb — >• lju +X events which have a large cross-section. Events are accepted if an isolated muon is found 
and two jets in the event with transverse momenta above 40 GeV have an invariant mass within 20 GeV 
of the nominal W mass. Events with high pj muons in the jets are rejected. A missing transverse energy 
cut of 40 GeV is applied, as well as a requirements of least two more jets with transverse momenta above 
40 GeV. At least one of these jets is required to be a ^-tagged. The overall transverse energy of the event 
is required to be larger than 250 GeV and events with a high pj isolated electron are rejected. 

The results of the above selection are summarized in Table 13. The selection efficiency for signal 
events is 8.6% and the signal purity is 74%. 



Table 13: Efficiency and purity for collecting tt muon+jets events. 



Process 


cross-section [fb] 


efficiency 


events [fb-1] 


ti signal 


119040 


8.62 10" 2 


10263 


ti background 


713960 


1.80 10" 3 


1287 


W+Jets 


202400 


5.61 • 10 3 


1134 


Z+Jets 


210290 


2.84 -10- 4 


74 


bb(mu20) 


13600000 


8.40- 10~ 5 


1147 


Total background 






3642 
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5.2.2 Method Validation 

To thoroughly test the tt control sample method, the produced events have to be run through the different 
analyses and the obtained shapes and normalizations have to be compared to the ones obtained in these 
analyses. However, a global check can be done by comparing various distributions from the two cases 
(a) "real" tt events and (b) "scaled" tt control sample events where muons have been replaced by z 
leptons. This has been done separately for three final states of interest. 

Basic preselection cuts on quantities like transverse momenta have been applied, lower than or at 
most equal to the preselection cuts applied in the analyses which use the quantities plotted in the follow- 
ing. The MC@NLO event weights have not been taken into account in order to increase the available 
statistics since this has been shown not to bias the comparison. 




Figure 15: tt — »■ bz(lep)vbqq: Left: W — »■ z(lep)v transverse mass, both for the real and the scaled tt 
events. Right: The corresponding bin-by-bin ratio. The gray band represents ±10% around a ratio of 1. 




Figure 16: tt —> bz(had)vbqq: Left: t — > bz(had)v momentum, both for the real and the scaled tt events. 
Right: The corresponding bin-by-bin ratio. The gray band represents ±10% around a ratio of 1. 

In Fig. 15, the W — > z(lep)v transverse mass is shown for the tt — »■ bz(lep)vbqq mode. Here, for the 
scaled events, a muon has been replaced by a leptonically decaying z. For the modes tt —> bz(had)vbqq 
and tt — ► bz(had)vb£v, in the scaled events a muon has been replaced by a hadronically decaying z. The 
t — > bz(had)v momentum is shown in Fig. 16, demonstrating the success of this replacement. 

The presented plots demonstrate that the dominant background of all H + studies, tt , can be mod- 
elled with the tt control sample method. Even without the intended further refinement of the method, in 
the regions of interest quantities of the H + analyses can be modelled within a 10% error margin. This is 
remarkable in particular for complex quantities, i.e. variables extracted from the combination of several 
objects (like the top quark mass), and gives confidence that the tt control sample method allows to re- 
produce the relevant correlations in the event. Thus for all results a systematic tt background uncertainty 
of 10% is assumed (while the signal systematic uncertainty is extracted from Monte Carlo events, see 
Section 5.1). 
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6 Combined Results 




Figure 17: Scenario A: Combined Results. Left: Discovery contour, Right: Exclusion contour. System- 
atic and statistical uncertainties are included. 




Figure 18: Scenario B (ra^-max): Combined Results. Left: Discovery contour, Right: Exclusion contour. 
Systematic and statistical uncertainties are included. 

The sensitivities for discovery and exclusion are calculated with the Profile Likelihood method [21- 
23], which includes statistical and systematic uncertainties. The results are summarized in combined 
discovery and exclusion contours for all H + channels for two MSSM Scenarios A and B [1], and 
three different integrated luminosities (1, 10, and 30 fb _1 ). Figure 17 shows the result for the MSSM 
Scenario A. 5a discovery contours and 95% CL exclusion contours are shown. Figure 18 shows the 
same results for the MSSM Scenario B (m/,-max). Previous studies have shown that the dependence of 
the H + discovery sensitivity on the specific choice of the MSSM parameter values is generally very 
small, with the exception of the Higgsino mixing parameter ji. [24,25]. The discovery significance is 
calculated for both cases assuming a systematic background uncertainty of 10% following the study of 
tt control samples (Section 5.2). The signal systematic uncertainties are discussed in Section 5.1. The 
statistical uncertainties arising from the use of Monte Carlo samples with a finite number of events have 
been consistently taken into account. 

A discovery sensitivity is given for a large part of the m H + -tan/3 space for both scenarios, but 
the difficult intermediate tan/3 region, where the H + cross-section has its minimum, is not covered. 



1476 



Higgs - Charged Higgs Boson Searches 




Figure 21: Model-independent: Heavy H + results in the {m H +, a)-plane. Left: Discovery contour, 
Right: Exclusion contour. Systematic uncertainties and statistical uncertainties are included. 
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However, a charged Higgs boson in this region could be excluded for values up to the top quark mass. 
Additional integrated luminosity does not give any further H + sensitivity for a low-mass H + after a 
few fb -1 have been recorded. The reason is the statistical error from the small it Monte Carlo sample 
which is equivalent to about 1 fb -1 at the LHC and makes extrapolation to higher luminosities difficult. 
Repeating the low-mass H + studies with larger Monte Carlo samples is thus expected to lead to a 
significantly larger discovery and exclusion reach, as can be seen in Fig. 19 in which the statistical 
uncertainty arising from simulation statistics is neglected (i.e. it is assumed that the number of simulated 
events is much larger than the number of expected real-data events). 

Figure 20 shows the discovery and exclusion contour in terms of the branching ratio t — ► H + b as a 
function of m H +. With one year of low luminosity data, it will be possible to discover the charged Higgs 
boson if BR(? — > H + b) is larger than about 1-3%, and to exclude it even if this branching ratio is well 
below the percent level. Similar contours for a heavy H + are presented in Fig. 21; here the y-axis shows 
the cross-section for the process gg/gb — ► t[b]H + — > t[b]zv. Sensitivity is given for a cross-section of 
the order of O.lpb. Both figures are model- independent in the sense that they can be interpreted in the 
context of any MSSM, other SUSY, or even non-SUSY scenario. 

7 Conclusions 

The ATLAS potential for discovering or excluding the existence of a charged Higgs boson in two dif- 
ferent MSSM scenarios has been evaluated for five different final states of the H + signal. Significant 
improvements of present day constraints can already be achieved with limited data (less than 1 fb -1 , 
about one month at low luminosity at the LHC) although it may not qualify as early physics due to its 
dependency on higher level reconstruction objects. 

Below the top quark mass charged Higgs bosons are predominantly produced in top quark decays 
and the main decay mode is H + — > TV . Three different signal final states have been studied and analyzed 
separately, each of them separately outperforming the present sensitivity from the Tevatron experiments 
already with 1 fb -1 of data. The combined performance of the three channels yields a discovery reach 
for 10 fb -1 which covers tan/3 values down to 20 and up to 4 for all charged Higgs boson masses up to 
about 150 GeV. For intermediate tan/3 region (around tan/3 = 7), no discovery sensitivity is present, but 
a charged Higgs boson could be excluded in this region. The current sensitivity is primarily limited by 
simulation statistical uncertainties, it is thus expected that a larger production of simulated events will 
greatly improve the situation and give access to the intermediate tan /3 region. 

Two analyses have been conducted in the search for a heavy charged Higgs boson (m H + > m t ), a 
region presently uncovered in direct searches. Here, the main production mode is through gb fusion 
{gb —> tH + ) and the decay into a top and a b quark dominates. However, the dominant tb decay mode 
suffers from large irreducible backgrounds, and the combinatorial background. Consequently the dis- 
covery potential for a heavy charged Higgs boson is dominated by the TV decay mode, which despite its 
significantly smaller branching ratio allows for more efficient background suppression. The discovery 
reach in the context of the MSSM (m^-max) strongly depends on the charged Higgs boson mass and 
reaches from {m H + = 200 GeV, tan/3 = 28) to (m H + = 350 GeV, tan/3 = 58) for an integrated luminosity 
of 30 fb -1 . Additionally, the model-independent discovery reach for a charged Higgs boson as a function 
of its production cross-section has been evaluated. A light H + sensitivity for a BR(t — > H + b) down to 
the percent level is given for a discovery, and well below that level for an exclusion. For a heavy H + 
decaying to TV, sensitivity is given for cross-sections of the order of O.lpb. 

The results presented in this note give confidence that the LHC and the ATLAS detector will be 
able to probe an extended Higgs sector over a sizable region of the MSSM parameter space. For a high 
SUSY mass scale, the charged Higgs boson could be the first signal of New Physics (and indication for 
Supersymmetry) discovered. 
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Statistical Combination of Several Important Standard 
Model Higgs Boson Search Channels 

Abstract 

In this note we describe statistical procedures for combination of results from 
independent searches for the Higgs boson. Here only the Standard Model 
Higgs is considered, although the methods can easily be extended to non- 
standard Higgs models as well as to other searches. The methods are ap- 
plied to Monte Carlo studies of four important search channels: H — > T + T~, 
H -»■ -> eVjUV, H -> 77 and H -> ZZW -> 4 leptons. The statistical 

treatment relies on a large sample approximation that is expected to be valid 
for an integrated luminosity of at least 2 fb~ 1 . Results are presented for the 
expected statistical significance of discovery and expected exclusion limits. 

1 Introduction 

Higgs searches will exploit a number of statistically independent decay channels. One wishes to combine 
all of the information from them to provide a single measure of the significance of a discovery or limits 
on Higgs production. The approach taken in this paper is based on frequentist statistical methods, where 
effects of systematic uncertainties are incorporated by use of the profile likelihood ratio. 

The statistical procedures used for establishing discovery and setting limits are described in Section 2. 
These methods are very general and can be applied to the combination of results of essentially any search 
that will be carried out at the LHC. Section 3 summarizes the four search channels for the Standard Model 
Higgs boson considered in this note: H — > T + T~, H — > — > ev/xv, H — > 77 and H — > ZZM — > 

4 leptons. 

The statistical treatment requires knowledge of the distribution of a test statistic based on the profile 
likelihood ratio. To determine these distributions by Monte Carlo so as to establish discovery at a high 
level of significance would require an enormous amount of simulated data, which is not practical at 
present. Therefore the distributions have been estimated using the functional form expected to hold in 
the large sample limit. Investigations shown in Section 3 indicate that this approximation should be 
reliable for an integrated luminosity above 2 fb _1 . 

In Section 4 we show the result of the combination. For different values of the integrated luminosity 
and hypothesized Higgs mass, we present the signal significance expected assuming the Standard Model 
Higgs production rate, as well as expected upper limits on the Higgs production cross section, under the 
hypothesis of no Higgs signal. 

The channels considered here focus on the search for a Higgs boson in the low-mass range. It is 
planned to include other channels in the future, e.g., further final states from the W+W and ZZ modes. 
This will improve sensitivity especially at higher Higgs mass values. 

2 Statistical methods 

In this section we describe the general statistical model and likelihood function, first for a single channel 
and then generalized to multiple channels. In Section 2.2 we give the procedure used to establish discov- 
ery based on a frequentist significance test, where the effects of systematic uncertainties are incorporated 
by use of the profile likelihood ratio. Section 2.3 covers the corresponding methods for setting limits. For 
both discovery and exclusion one requires the sampling distribution of the statistic used in the test; this 
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is described in Section 2.4. Section 2.5 discusses a series of approximations used to determine expected 
values of the discovery significance and exclusion limits. 

The approach taken in this note is to carry out tests for discovery and exclusion for fixed values of 
the Higgs mass m H . In principle the entire procedure is then repeated for all masses, resulting in limits 
on or a measurement of m#. In practice, an interpolation is made between finite steps in m^. 

2.1 The statistical model and likelihood function 

First we consider the case of a single search channel. The measurement results in a set of numbers 
of events found in kinematic regions where signal could be present. These typically correspond to a 
histogram of a variable such as the mass of the reconstructed Higgs candidate, with the numbers of 
entries denoted by n = (m, . . . In some cases one may consider a histogram with only one bin, i.e., 
the measured outcome is simply a number of candidate events found. The number of entries in bin i, n,-, 
is modeled as a Poisson variable with mean value 

E [m] = \lLZiC0 + bi = }iSi + bi , (1) 

where L is the integrated luminosity, £,-, a, and SB are the signal efficiency, Higgs cross section, and 
branching ratio, and bi is the expected number of background events. Here \i is a signal strength parame- 
ter defined such that fx = 0 corresponds to the absence of a signal; }X = 1 gives the signal rate 57 expected 
from the Standard Model. If we consider a fixed Higgs mass m#, the only parameter of interest is jj,. All 
other adjustable parameters needed to specify the model are called nuisance parameters. 

In principle the expected background values bi can be predicted using Monte Carlo models for Stan- 
dard Model processes. In the measurements considered here, however, the systematic uncertainty in the 
Standard Model prediction is in many cases quite large, and this would severely limit the sensitivity of 
the search. Therefore data regions where one expects only a very small amount of signal {control regions) 
are used to constrain the background in the signal region (see also below). 

For the ith bin of a histogram of a discriminating variable x, the expected signal and background can 
be written 



si = s tot J b _f s (x;d s )dx, (2) 
bi = b tot J^f h (x;G h )dx, (3) 

where s tot and b tot are the total expected numbers of events in the histograms, f s (x; 6 S ) and fb(x; 6b) are 
the probability density functions (pdfs) of x for signal and background, and 6 S and 6 h represent sets of 
shape parameters. 

The parametric forms of the pdfs f s (x; 0 S ) and fb(x; Ob) are determined from Monte Carlo simula- 
tions or data control samples. In the following we will use B = (d s , db,b tot ) to refer to all of the nuisance 
parameters. The signal normalization s tot here is not an adjustable parameter, but rather is fixed equal to 
the Standard Model prediction. 

In addition to the measured histogram n, some search channels also make use of a set of subsidiary 
measurements m = (mi, . . .,m^) in control regions where one expects mainly background events. These 
can be modeled as being Poisson distributed with mean values 

E[mi]=Ui(d), (4) 

where the w,- are calculable quantities depending on a set of parameters, at least some of which are the 
same as those entering into the predictions for s; and bi above. In practice the subsidiary measurements 
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are constructed so as to provide information on the background normalization b tot and sometimes also 
on its shape. 

If the measurement is based on counting events in a given kinematic region, i.e., without using the 
shape of a distribution, in the formalism above the histograms have a single bin. The value s = s tot is 
then the Standard Model prediction for the signal and b = b tot is the (unknown) expected background. 
There are then no shape parameters, and b itself plays the role of 6 as the single nuisance parameter. In 
this case the subsidiary measurement m is made in a control region where signal is absent (or can to good 
approximation be neglected), and has an expectation value 

E[m] = u = Tb, (5) 

where % is a scaling constant whose value can be estimated from a Monte Carlo simulation. 
The likelihood function is the product of Poisson probabilities for all bins: 

L ( Jlj e) = f\ ^±p)l e -^j+bj) T\ U 2-e-^. (6) 
y=i njl l\m k l 

Equivalently the log-likelihood is 

N M 

lnL(/i, d) = £ (njln^Sj + bj) - {ixsj + bj)) + £ (m k \nu k -u k ) + C , (7) 

7=1 k=l 

where C represents terms that do not depend on the parameters and thus can be dropped. Here and in (6) 
the parameters 6 enter through Eqs. (2), (3), and (4). 

In the case where the presence of signal in the histogram n gives a peak sitting on a smooth back- 
ground, one does not need a subsidiary measurement m. Rather, as long as the number of parameters in 
the models for the signal and background distributions is smaller than the total number of bins measured, 
one can determine the strength parameter }X from the histogram n alone. Here the regions away from 
the peak (the sidebands) play the role of the subsidiary measurement by providing information on the 
background level. Of course if an additional subsidiary measurement is available, this will improve the 
accuracy of the background determination, which will increase the sensitivity of the analysis. 

In the case of several independent search channels, the method described above is generalized in a 
straightforward manner. For each channel i there is a likelihood function L,-(/i, 0,-). Its general form is 
given by Eq. (6), except that all quantities carry an additional index i to label the channel except the 
global strength parameter jx, which is assumed to be the same for all channels. Since the channels are 
statistically independent, the full likelihood function is given by the product 

L^,d) = l\L i ^,6 i ), (8) 

where 0 here represents all of the nuisance parameters. 

Systematic uncertainties are effectively included in the analysis through the nuisance parameters 
0. The model must be sufficiently flexible, i.e., it must contain enough parameters, so that for at least 
some point in its parameter space it can be regarded as representing the truth. One must exercise some 
restraint in achieving this, however, as an increasing number of nuisance parameters leads to a decrease in 
sensitivity to the parameters of interest. Some of the components of 0 may be common among different 
channels, e.g., parameters relating to uncertainty in the integrated luminosity. These then represent a 
common (correlated) systematic uncertainty. 

As an example, consider the signal efficiency e that enters in the relation between the cross section 
and expected number of signal events. Suppose the efficiency has been estimated to have a value e and 
systematic uncertainty Cg. To incorporate this uncertainty into the model, we can regard the measured 
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value £ as a random variable whose true value e is treated as a nuisance parameter. For the pdf f e (e; £ , <jg) 
one could use, e.g., a Gaussian distribution centred about £, or for a quantity such as the efficiency which 
must lie in the range 0 < £ < 1 one could use a pdf that automatically satisfies this constraint (e.g., a beta 
distribution). For whatever choice is deemed appropriate, the likelihood (6) is multiplied by f e (e;£, Cg), 
evaluated with the best estimate e, and the parameter £ is included in the set of nuisance parameters d. 
To test a hypothesized value of jx we construct the profile likelihood ratio, 



Here 8 in the numerator denotes the value of 6 that maximizes L for the specified }X, i.e., it is the 
conditional maximum-likelihood estimator (MLE) of 6 (and thus is a function of jx). The denominator 
is the maximized (full) likelihood function, i.e., fx and 6 are the MLEs. The presence of the nuisance 
parameters broadens the profile likelihood ratio as a function of jx relative to what one would have if 
their values were fixed. This reflects the loss of information about jx due to the systematic uncertainties. 

The likelihood ratio (9) and procedures for incorporating systematic uncertainties applied here differ 
somewhat from those used for the searches carried out at LEP. Some of these differences are discussed 
further in Appendix A. 

From the definition of the profile likelihood ratio one can see that 0 < A < 1, with A (/x) = 1 implying 
good agreement between the data and the hypothesized value of }X. Equivalently it is convenient to work 
with the quantity 



so that high values of correspond to poor agreement between the data and the hypothesized jx. The 
statistic will have a sampling distribution /(^|ju'). Here jx refers to the strength parameter used 
to define the statistic q^, entering in the numerator of the likelihood ratio, and jx' is the value used to 
define the data generated to obtain the distribution (i.e., the 'true' value). For the special case jx' = jx and 
for a sufficiently large data sample, the pdf f(q^ approaches a limiting form related to the chi-square 
distribution, discussed further in Section 2.4. For jx' j^/x, the distribution of q^ is shifted to higher values, 
reflecting the decreased agreement between the data generated with \x' and the hypothesis tested by q^, 
as indicated in Fig. 1. The two cases of particular interest are /x = 0, the background-only hypothesis, 
and ll = 1, the hypothesis of background plus signal present at the Standard Model rate. 

The level of compatibility between data that give an observed value g^obs for q^ and a hypothesized 
value of jx is quantified by giving the /rvalue 



This is the probability, under the assumption of /x, of seeing data with equal or greater incompatibility, 
as measured by q^, relative to the data actually obtained. This is illustrated in Fig. 1, where the shaded 
area indicates the p- value of the hypothesized jx. The figure also indicates the median value of q^ under 
the assumption of a different value of the strength parameter jx f used to generate the data. For jx and jx f 
values that are increasingly different, the median med[^|ju'] moves further to the right. An observed 
value of q^ at this median would give a correspondingly small p- value for jx. 

2.2 Establishing discovery 

To establish discovery we try to reject the jx = 0 (background-only) hypothesis, i.e., that there is no 
Higgs signal present. To do this we use the statistic qo = — 21nA(0). One expects to find a low value 



= -21nA(ju) 



(10) 




(11) 
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f(q>) 



Figure 1 : Illustration of the determination of 
the p- value of a hypothesized value of fl. The 
left-hand curve indicates the pdf of for data 
generated with the same value of fl as was used 
to define the statistic ; this is used to deter- 
mine the p- value of [l, shown as the shaded re- 
gion. The right-hand curve indicates the pdf of 



q M 



qfx for data generated with a different value of 
the strength parameter, fl'. 



of A(0) (high qo) if the data include signal. Here even though one is testing the hypothesis that the 
Higgs does not exist, the definition of go depends on the hypothesized Higgs mass ra#. It enters through 
the denominator of the likelihood ratio (9), which contains the maximum-likelihood estimator (x for the 
strength of a Higgs signal at the mass m#. By defining the test statistic in this way one maximizes the 
probability of rejecting the /I = 0 hypothesis if the Higgs boson exists at the specified mass. This search 
procedure is then carried out for all values of m# (in practice an interpolation is carried out between finite 
steps in 

A given data set will result in an observed value <7o,obs of qo- The level of compatibility between the 
data and the no-Higgs hypothesis is quantified by giving the /rvalue 



This is the probability, under the assumption of /I = 0 (background only), of seeing data as signal-like 
or more so relative to the data actually obtained. A small value is interpreted as evidence against }X = 0, 
i.e., a discovery of the signal. 

One can define the significance corresponding to a given /rvalue as the number of standard deviations 
Z at which a Gaussian random variable of zero mean would give a one-sided tail area equal to p. That is, 
the significance Z is related to the p-value by 



where <I> is the cumulative distribution for the standard (zero mean, unit variance) Gaussian. Equivalently 
one has 



where <£ 1-1 is the quantile of the standard Gaussian (inverse of the cumulative distribution). In (13) and 
(14) the subscript 0 was dropped as these relations hold for all /j-values, not only those of the jl = 0 
hypothesis. The relation between Z and p is illustrated in Fig. 2. 

A significance of Z = 5 corresponds to p = 2.87 x 10~ 7 . For a sufficiently large data sample, one 
would obtain a /j-value of 0.5 for data in perfect agreement with the expected background. With the 
definition of Z given above, this gives Z = 0. If the data fluctuate below the expected background, Z 
becomes negative. 

Note that according to the definition (14), a /j-value of 0.05 corresponds to Z = 1.64. This should not 
be confused with a 1.96a fluctuation of a Gaussian variable that gives 0.05 for the two-sided tail area. 

The significance of a discovery Z depends on the data obtained. To quantify our ability to discover a 
hypothesized signal in advance of seeing the data, we report the median significance under the assump- 
tion that the signal is present at the Standard Model rate, ju = 1. Since Z is a monotonic function of po, 
and po is also a monotonic function of qo, we have for the median significance, 




(12) 




(13) 



z = <t>-\i- P ), 



(14) 
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Figure 2: Illustration of the corre- 
spondence between the significance Z 
and a p- value. 



Zmed = * -1 (l-/0 =< S>~\l-P0(q0 m J) ■ 



(15) 



This can be obtained from the median value of qo found using data generated under the assumption of 
M = l. 

A complete evaluation of the median significance is computationally difficult, as it requires a large 
number of repeated simulations of the full set of experimental outputs and determination of Z(qo) from 
the combination of all channels. Therefore in this note we have used the approximate methods described 
in Section 2.5, which allow one to estimate quickly the median significance. 

2.3 Setting limits 

In addition to establishing discovery by rejecting the /I = 0 hypothesis, we can consider the alternative 
hypothesis of some non-zero jx and try to reject it. A /7-value is computed for each jx, and the set of jx 
values for which the p-value is greater than or equal to a fixed value 1 — CL form a confidence interval 
for jx, where typically one takes a confidence level CL = 95% . The upper end of this interval /x up is the 
upper limit (i.e., jx < jU up at 95% CL). 

To compute the p- value for a hypothesized jx we first consider again the test statistic = — 2 In X{jx) 
as initially defined in (9) and (10). For purposes of computing limits, we introduce a modification to this 
definition as described below. 

If the data are incompatible with the hypothesized fx, one expects a large value of — 21nA(ju), i.e., 
X (ju) close to zero. If a data set generated according to the hypothesis jx gives a large value of — 2 In X (jx), 
this can be the result of either an upward or downward fluctuation in jx relative to jx. This is illustrated 
in the scatterplot of jx versus — 21nA(ju) shown in Fig. 3(a), which is from a toy Monte Carlo study with 
jx = 0.8. The projection of the points on the jx axis is shown in Fig. 3(b). Note that jx > 0 is imposed; 
the reasons for and consequences of this requirement are discussed in Section 2.4. 

For purposes of setting an upper limit, however, we want to determine the smallest jx such that there 
is a fixed small probability (one minus the confidence level) to find data as compatible with that value of 
[X or less, relative to the degree of compatibility found with the real data. Therefore the data with upward 
fluctuations in jx are not counted when computing the /rvalue, because they would be compatible with 
some larger jx. Therefore for purposes of computing limits we redefine q^ to be 1 



The distribution of the new q^ thus corresponds to the lower branch only of the U-shaped scatterplot 
shown in Fig. 3(a). 

1 Equivalently, one could retain the definition = — 21n(L(/i, 8) /L(fi, 8)) by placing an upper bound on fi equal to fl, i.e., 
by imposing 0 < fi < fx. In this way, when fi = [l then one has = 0 just as in the case of discovery when testing fl = 0. 




otherwise. 



(16) 



1485 



Higgs - Statistical Combination of Several Important Standard Model Higgs . 




Figure 3: (a) Scatterplot of fi versus -21nA(/j); (b) the distribution of fi (see text). 



Using the new definition (16), the /7-value is given by the integral of f(q^\jx) from the observed 
value g^obs to infinity as in Eq. (11) and as illustrated in Fig. 1. The /rvalue is computed in this manner 
for all values of /x, and the upper limit jx up at 95% confidence level is the largest value of /x for which the 
rvalue is at least 0.05. 

The result can be summarized by giving the upper limit on as a function of the Higgs mass m H . 
Specifically, if we can reject the hypothesis /x = 1 at a certain confidence level, then the corresponding 
value of niH is regarded as excluded for a Standard Model Higgs. The lowest mass value not excluded is 
the lower limit m\ 0 . 

One is also interested in the median limit under the assumption that there is no Higgs. As in the case 
of the discovery significance, a full calculation of the median limit is difficult as it requires a large number 
of repeated simulations based on the full profile likelihood ratio. For purposes of this note, therefore, we 
use the approximation techniques described in Section 2.5. 

2.4 Sampling distribution of the likelihood ratio 

To determine the values required for both discovery and exclusion we need the sampling distribution, 
assuming data generated according to a given value of /x, of the statistic q^, i.e., f(q^ \/x). For the case of 
discovery significance we use go = — 21nA(0), and for setting limits we use q^ = — 2lnX(jx) for fi < fx 
and q^ = 0, otherwise. 

To claim discovery we require ^-values for /x = 0 down to around 10~ 7 , and therefore to do this with 
a Monte Carlo simulation requires an extremely large number of simulated measurements. In practice 
this is only carried out for simple test cases. Even for setting limits at 95% confidence level, it is often 
not practical to use Monte Carlo. 

Under a set of regularity conditions and for a sufficiently large data sample, Wilks' theorem says that 
for a hypothesized value of /X, the pdf of the statistic — 21nA(/x) approaches the chi-square pdf for one 
degree of freedom [2]. More generally, if there are n parameters of interest, i.e., those parameters that do 
not get a double hat in the numerator of the likelihood ratio (9), then —2\nX{jx) asymptotically follows 
a chi-square distribution for n degrees of freedom. A proof and details of the regularity conditions can 
be found in standard texts such as [3]. 

In the searches considered here, the data samples are generally large enough to ensure the validity of 
the asymptotic formulae for the likelihood-ratio distributions. In our case, however, the distributions are 
modified because of constraints imposed on the expected number of events. 

Usually when searching for a new type of particle reaction one regards the mean number of events 
contributed to any bin from any source, signal or background, to be greater than or equal to zero. In 
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some analyses it could be meaningful to consider a new effect that suppresses the expected number 
of events, e.g., the presence of a new decay channel could mean that the number of decays to known 
channels is reduced. Here, however, we will regard any contribution to an expected number of events as 
non-negative. 

Assuming only non-negative event rates, the maximum-likelihood estimators for the parameters are 
constrained, e.g., /t > 0. As a consequence, if the observed number of events is below the level predicted 
by the background alone, then the maximum of the likelihood occurs for jx = 0, i.e., negative [X is not 
allowed. We can consider the effect of having fx = 0 on the distribution of for two cases: }X = 0 and 
H >0. 

For fx = 0, i.e., when computing the discovery significance, if fx = 0 one has (see (9)), 
L(A,0) L(0,0) 

since fx =0 and therefore 6 = 0. The statistic qo = — 21nA(0) is therefore equal to zero. This can be 
seen in the scatterplot of q 0 versus fx in Fig. 4(a). Figure 4(b) shows the corresponding q Q distribution 
with the peak visible at q 0 = 0. The superimposed curve is a chi-square distribution multiplied by one 
half, corresponding to the half of the events with fx > 0. 




Figure 4: (a) Scatterplot of fx versus from a Monte Carlo study with jx = 0; (b) the distribution of qo (see text). 

From Fig. 4(b) one can see that except for the spike at qo = 0 (when fx = 0), the pdf of go can be well 
approximated by the chi-square pdf. Assuming a fraction w for the cases with fx > 0 one has the pdf 



f(q 0 \0) = wf x 2(qo) + (l-w)8(q 0 ) . (18) 

In the usual case where upward and downward fluctuations of fx are equally likely we have w = 1/2. The 
p-value of the background-only hypothesis given an observation q 0phs greater than zero is therefore 

P = I wf x 2{q 0 )dq 0 = w(l -F^o.obs)) , (19) 

where F % i is the cumulative chi-square distribution for one degree of freedom. 

The second case to consider is /x > 0, e.g., when one wants to set an upper limit on /x. Under the 
hypothesis /x, one obtains fx > jX and fx < /X with approximately equal probability. Figure 5 shows the 
distributions of q^ for both cases fx > }X and fx < jX obtained from the scatterplot Fig. 3(a), from a Monte 
Carlo study with jx = 0.8. 
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(a) 




Figure 5: Distributions of — 2\nX(jx) for (a) jl > jx and (b) jl < Ll. The superimposed curves are chi-square 
distributions for one degree of freedom normalized to half the number of entries in the original distribution (see 
Fig. 3(a)). 

From Fig. 5(a) one can see that for fx > jx, the data follow the chi-square pdf quite accurately. This 
portion of the distribution is ignored, however, when setting upper limits on jx, because of the modified 
definition of (20) used for limits, 



q» = 



-21nA(/i) 
0 



otherwise . 



(20) 



Suppose now fx < jx with a probability w; in practice this is close to one half. (Note for the case of qo, 
w is the probability of jl > Ll. The different definitions of w are used so as to give similar forms for 
f(qo\0) and /(q^/x).) Thus for jl > jx one has from (20) q^ = 0, and therefore the distribution has a 
delta function at q^ = 0 with weight 1 - w. The pdf of f(q^\lx) can therefore be written 

=w/(^| M ,A <n) + (l-w)8( qtl ) . (21) 
where f{q^ |jU,/j. < /l) is the conditional pdf for given fx < Ll. 

For fx < jx, one may sometimes find fx equal to zero, i.e., the lower edge of the allowed range, as can 
be seen in the scatterplot of fx versus q^ shown in Fig. 3(a). Although for the case Ll = 0 this gave a peak 
at q Q = 0, here it gives 



A 00 = 



(22) 



_ L(jM) = Un,d) 
" L{fx,6) L(O,0) ' 

which in contrast to (17) is not equal to unity. The effect of having fx = 0 on the distribution of q^ is 
therefore more complicated than was the case for go- 

In general for fx < Ll, the distribution of q^ falls off more steeply than the chi-square distribution. 
This is seen in Fig. 5(b). Therefore a /rvalue based on the chi-square formula will be larger than the 
true /rvalue, and the corresponding significance Z will be smaller. The upper limits obtained for fx are 
therefore larger, i.e., a smaller set of jx values is excluded. 

If jx is sufficiently large, then fx is very rarely pushed to zero and f(q^\/x,fl < Ll) approaches a 
chi-square distribution for one degree of freedom. For purposes of the present study, the chi-square 
approximation is adequate, but gives somewhat conservative limits. That is, we take the distribution of 
q\i to be 



f{q^)=wf x 2{q }l ) + {\-w)8{q ll ) 



(23) 
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and use w = 0.5. One has therefore the same pdf for using the modified definition (20) as was found 
in (18) for qo based on the original definition, go = — 21nA(0). 

To summarize the result above, the pdf of q^ can be approximated by a mixture of a chi-square pdf 
for one degree of freedom with weight w and a delta function at zero with weight 1 — w. This holds both 
for discovery {ji = 0) and setting limits {ji > 0). 

Consider now the variable 



" = v^=\/- 21nA (i u )> ( 24 ) 

which has the pdf 

f(u) = ®{u) W ^e^l 2 + (1 - w)d(u) , (25) 

where ®(u) = 1 for u > 0 and is zero otherwise. The second term in (25) follows from the fact that the 
values go = 0 and u = 0 occur with equal probability, 1 — w. Furthermore if a variable x follows the 
standard Gaussian distribution, then one can show x 2 follows a chi-square distribution for one degree of 
freedom. Therefore if x 2 follows a % 2 distribution, then Vx 2 follows a Gaussian scaled up by a factor of 
two for x > 0 so as to have a total area of unity. 

The p- value of the hypothesis jj, for a non-zero observation g^obs is therefore 

p=P{q t i > <^,obs) =P(u > y/qn,obs) =2w J^_-^=e^ ull2 du = 2w{l-^(^/q^;)) . (26) 

Combining this with Eq. (14) for the significance Z gives 

Z = 0- 1 (l-2w(l-<I>(^bI))). (27) 
In the usual case where the weights of the chi-square and delta-function terms are equal, i.e., w = 1/2, 
Eq. (27) reduces to to the simple formula 

Z = ^,obs • (28) 

2.5 Approximate methods 

To determine the discovery significance or to set limits using a given data set, one must carry out the 
global fit described above. For this one needs first to combine the likelihood functions for the individual 
channels into the full likelihood function containing a single strength parameter jj,, and use this to find 
the profile likelihood ratio. It is possible, however, to find approximate values for the median discovery 
significance and limits in a way that only requires as input the separate profile likelihood ratio values 
from each of the channels. This is very useful especially in the planning phase of a search that combines 
multiple channels. 

The procedure relies on two separate approximations. First, we estimate the median value of the pro- 
file likelihood ratio by evaluating the likelihood function with a single, artificial data set in which 
all statistical fluctuations are suppressed, as described in Section 2.5.1. Second, to determine the signif- 
icance values from the likelihood ratios, we use the asymptotic form of the distribution of — 21nA(/x) 
valid for sufficiently large data samples. This is described in Section 2.5.2, and its validity is checked 
for the individual channels in Section 3. Here the limitations of the approximation are investigated and 
for one case where it is found to be insufficiently accurate (the discovery significance for the channel 
H — >• plus no jets), an alternate procedure is followed. 
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2.5.1 Approximation for the median likelihood ratio 

To find the median discovery significance and limits, the median likelihood ratios are first found 

for each channel separately, and then combined to give the full median likelihood ratio. One can estimate 
the median A,-(jU) by the value of the likelihood ratio evaluated with a single artificially constructed data 
set, in which all statistical fluctuations are suppressed and the data values n and m are replaced by their 
expectation values for a given integrated luminosity and a hypothesized strength parameter /x A . We refer 
to this as an 'Asimov' data set. 2 It replaces having to simulate a large number of experiments from which 
one would determine the median. 

As before, Ha = 0 is the background only hypothesis and Ha = 1 corresponds to background plus 
signal present at the Standard Model rate. The median referred to thus pertains to what one would obtain 
with a large number of experiments generated under the assumption of ;ii A . The approximation is in fact 
more accurate if one uses noninteger values for numbers of events in the log-likelihood (the factorial 
terms are in any case absent) so that the Asimov likelihood La is found by substituting 



nj = llASj + bj (29) 

m k = u k , (30) 

into the likelihood function (6) for each channel. Here for Sj, bj and u k , one needs in principle the 
expectation values, i.e., these quantities should have no statistical errors. In practice they are estimated 
using a Monte Carlo sample corresponding to an integrated luminosity substantially larger than what is 
considered for the data. The numbers of signal and background events are then scaled to the desired 
luminosity. The other nuisance parameters such as shape parameters are estimated as would be done 
with any other data set; we refer below to the resulting values as 0 A . Because the Asimov data set has 
no statistical fluctuations, the 6a are simply the values one would derive from a very large Monte Carlo 
data sample. 

The estimate of the median likelihood ratio used for the ith channel is therefore 

i (u) Mm J) 3 

AA ' i(M) -L A ,(A,0)~WMA,e A )' (31) 

where L A> ; denotes the likelihood function (6) evaluated with the Asimov data values (29) and (30). The 
approximation used for the final step in (31) exploits the fact that ML estimate of /t is very close to the 
input value /x^ when the likelihood function is constructed using the Asimov data set. 

Note that if the likelihood functions for the individual channels were to be constructed with data 
containing statistical fluctuations rather than with the artificial Asimov data, then the ML estimate of the 
strength parameter, /t, would in general be different for each channel. The full likelihood function (8) 
used for the combination, however, contains a single global ji. We can now exploit the fact that for the 
Asimov data one has fx « /x A for all of the channels and thus obtain the median likelihood ratio for the 
combination as the product of the individual A Al -(jU), 

AaOO = n^(M) ■ (32) 

Monte Carlo studies show that Eq. (32) provides an excellent approximation to the median value one 
would find from data generated with /x A as the strength parameter. 

2 The name of the Asimov data set is inspired by the short story Franchise, by Isaac Asimov [1], In it, elections are held by 
selecting a single voter to represent the entire electorate. 
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For purposes of quantifying how likely we are to discover the Higgs if it exists, we report the signif- 
icance obtained from the p- value of jx = 0 with an Asimov data set that corresponds to Ha = 1 , 

Here the subscript s + b refers to the Asimov data set; the argument 0 denotes the value of }X being tested. 
That is, Eq. (33) approximates what one would obtain with data generated with signal and background 
for the median value of A(0), which is used to test the background-only (/x = 0) hypothesis. Equation 
(33) provides the median qo meA = — 2lnX s+ b(0), and from this the /rvalue and significance Z are found 
using equation (15). 

To determine the limits on jj, that we expect to set if the Higgs does not exist (or is beyond our 
reach), we find the ^-value of a hypothesized jl using the likelihood ratio A(ju) based on Asimov data 
for background only (^a = 0), 

That is, Ab(jll) approximates the median value of A(/i) one would obtain from data generated according 
to the background-only hypothesis. The value of Afe(/x) is used to determine the median g Mmed , which is 
used to find the median p- value, p^ med . This is computed for all ji. and the point where p^ mai = 0.05 gives 
the 95% CL upper limit. 

Because p. « ;Ua holds for each channel individually when using Asimov data, it is possible to de- 
termine the values of the likelihood ratio entering into (32) separately for each channel, which simplifies 
greatly the task of estimating the median significance that would result from the full combination. It 
should be emphasized, however, that the discovery significance or exclusion limits determined from real 
data require one to construct the full likelihood function containing a single parameter jl, and this must 
be used in a global fit to find the profile likelihood ratio. 

Furthermore, some systematic errors, e.g., the uncertainty in the integrated luminosity, are common 
to all channels and correspond to a common nuisance parameter. When using Asimov data, the values of 
such parameters will be fitted to the same values in all channels. Thus the correlations between common 
systematics are taken into account just as they would be in a global fit of all channels. 

A limitation of the procedure with Asimov data is that it only provides an estimate of the median 
likelihood ratio. To obtain an uncertainty band on the expected (median) discovery and exclusion sensi- 
tivities as a function of m# one would have to simulate a large number of experiments. 

2.5.2 Approximate relation between likelihood ratio and significance 

To compute the values we need the distribution f(q^i\n) of = — 21nA(ju). For a sufficiently large 
data sample the pdf of q^ takes on a well defined limiting form related to the chi-square distribution, as 
discussed in Section 2.4. Assuming this form, the /rvalue of the hypothesis }X is found to be 

p^l-^(^qj), (35) 
and the significance Z is given by the formula 



Z = <D- 1 (l-^)« v /-21nA(Ai). (36) 

For estimating the median discovery significance we use equations (35) and (36) together with the equa- 
tion (33), the likelihood ratio based on Asimov data containing signal and background. To find the 
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median limit on ji, we use again equations (35) and (36) but with the likelihood ratio based on Asimov 
background data only, equation (34). 

The validity of this approximation is investigated for each channel by generating distributions of 
q^i for n = 0, 1 using a fast Monte Carlo simulation and comparing the resulting histograms with the 
expected asymptotic form. These comparisons are shown in Section 3. 

2.6 Consequences of testing many Higgs mass values 

The statistical significance of a potential discovery is quantified by giving the /rvalue of the no-Higgs 
hypothesis, i.e., the probability, under the assumption of background only, that that one would see data 
with equal or less compatibility with this hypothesis relative to the data obtained. Finding this proba- 
bility below a specified threshold (e.g., the 5a threshold, or p < 2.87 x 10~ 7 ) corresponds to claiming 
discovery of a Higgs boson. 

The approach taken in this analysis is to compute the /rvalue of the no-Higgs hypothesis separately 
as a function of the Higgs mass. The threshold /rvalue is thus the false discovery rate for Higgs boson 
of a given mass. Further one should also estimate the probability, under the assumption of background 
only, that this p-value will fall below the discovery threshold for any mass within the range considered. 
By searching for the Higgs within a broad range of hypothetical masses, one increases the probability of 
observing what appears to be a signal at some mass, and so the effective significance of the discovery is 
reduced. In HEP this is sometimes referred to as the "look-elsewhere effect". 

To first approximation the effective increase in the false-discovery rate is given by the number of 
statistically separate mass ranges explored. If a certain data set would give a /rvalue of mn below the 
discovery threshold, then the same data would in general also indicate discovery for other masses very 
close by. Roughly speaking, the mass range in which a given data set would indicate discovery is set by 
the mass resolution for the Higgs candidate. So the factor by which the p-value is inflated is given by the 
mass range explored divided by the average mass resolution. Monte Carlo studies can be used to validate 
and refine this approximation; this approach is planned for future analyses. 

An alternative to considering fixed Higgs masses is to treat both the strength parameter ji and the 
Higgs mass m H as free parameters in the likelihood ratio. For example, to establish discovery one com- 
putes the /rvalue of the no-Higgs hypothesis. As before, this is the probability, under the assumption of 
no Higgs, of finding data with equal or lesser compatibility with jj, = 0 relative to the data obtained. In 
contrast to the fixed-mass case, however, "less compatible" here means having a lower likelihood ratio 
for any allowed value of the Higgs mass; the lowest value comes when the denominator contains the 
fitted maximum-likelihood estimator In practice the fitted value of the Higgs mass is restricted to 
lie within a stated range. This has been done for the Higgs searches using the 77 [4] and W+W [5] 
channels, with the aim of extending this method to a combination of all channels. 

3 Combination of Higgs search channels 

In this section a brief description of each of the four search channels is given. For each channel, the 
method used to obtain the likelihood ratio is described, and values of the test variable as defined 
by Eq. (10) for discovery and by Eq. (16) for limits are tabulated for several values of the integrated 
luminosity L and Higgs mass mn- For the discovery sensitivity where one tests jj, = 0, the median value 
of go is given under the assumption of }X = 1; for exclusion sensitivity, the median of qi is given under 
the assumption of ji = 0. 

In addition, for each channel we show distributions of q^ under the assumption of for the two 
cases jit = 0 and jj, = 1 . For the approximations used in this note to be valid, these should be close to the 
asymptotic form described in Section 2.4. This limiting form for the distribution f{q >1 \jx) is a mixture of 
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a delta function at = 0 and a chi-square distribution for one degree of freedom, where each component 
has equal weight. We refer to this as a distribution. For the case of \l = 0 we compare directly the 
Monte Carlo distribution of q Q with the \%\ distribution and the delta-function term at q 0 = 0 is clearly 
visible. For jj, = 1 we show the equivalent comparison but for reasons of convenience only the events 
with fx < \i are shown, i.e., the events with fx> jl that contribute to the delta function at qi = 0 are left 
off. That is, for exclusion it is the conditional pdf f(qi\jJ. = 1,/t < 1) that is compared to a chi-square 
distribution for one degree of freedom; there is no delta function term. 

All channels use data driven background estimation methods. This way, the uncertainties in the back- 
ground shape and normalization are treated within the framework of the profile likelihood as nuisance 
parameters. Using control samples the effect of many uncertainties like energy scales and fake rates on 
the background estimate can be constrained by the control samples. Uncertainties on the signal efficiency 
do not affect the discovery sensitivity which is testing the presence or absence of a signal; however, this 
is not the case for exclusion sensitivity. As one would expect, uncertainty in the signal efficiency does 
reduce the exclusion sensitivity. This uncertainty was incorporated into the profile likelihood calculation 
by adding an extra term to the likelihood function for every channel as described in Section 2.1: a Gaus- 
sian relating the nominal efficiency estimated in an auxiliary measurement, the true efficiency, and the 
uncertainty of that auxiliary measurement. 



Details on the H — > yy channel are given in Ref. [4]. We perform an unbinned maximum-likelihood 
fit to extract the signal and background event yields by using the diphoton invariant mass, m rr , as a 
discriminating variable. The H —>■ yy distribution of m rr forms a Gaussian peak with tails to lower 
values from photon energy losses before the calorimeter. It is well modelled by a Crystal Ball function. 
The signal probability density function, pH{m n ), is given by 



where t = (m rr — mH — dmn) /c(m rr ), N is a normalisation parameter, m# is the Higgs boson mass, 
Smn is an offset and a represents the diphoton invariant mass resolution. The non-Gaussian tail is 
parametrised by n and a. We include an additional, broader Gaussian term in Eq. 37 to improve the 
description of the tails of the distribution. Within a sufficiently narrow mass window, the background of 
m,yy is modeled by an exponential distribution with a single slope parameter E, . 

The resulting median profile likelihood ratios for discovery, A(jU = 0) (using toy s + b Monte Carlo 
experiments and taking the median of the A (jJ, =0) distribution ) are given in Table 1 for a few Higgs 
masses at some given luminosities. 

The distribution of the test statistic qo under the null background only hypothesis, for m# = 120 GeV 
with an integrated luminosity of 2 and 10 fb~\ is shown in Fig. 6. A distribution is superimposed, 
showing the validity of the asymptotic approximation. 

The median profile likelihood ratio for exclusion, — 21nA(/x) (using toy background-only Monte 
Carlo experiments and taking the median of the A(jll) distribution) is given in Table 2 for a few Higgs 
masses at several integrated luminosities and for a signal strength \x = 1, corresponding to a Standard 
Model Higgs Boson. 

The distribution of the test statistic q\ for fx < 1 under the s + b hypotheses for m# = 150 GeV with 
an integrated luminosity of 2 and 10 fb -1 is shown in Figures 7. A %\ distribution is superimposed, 
showing the validity of the asymptotic approximation. 



3.1 H->yy 




fort > -a, 
otherwise , 



(37) 
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Table 1: Median values of -21nA(/x) (evaluated at n = 0) obtained from fits to simulated data generated with 
H ^77 signal plus background (ju = 1) for several values of the Higgs mass and integrated luminosity. 



L 




m H (GeV) 




(fb- 1 ) 


115 


120 


130 


140 


1 


0.35 


0.55 


0.75 


0.67 


2 


0.75 


1.07 


1.45 


0.95 


5 


1.95 


2.95 


3.65 


2.55 


10 


3.95 


5.86 


7.35 


5.05 


30 


11.85 


17.72 


21.99 


15.05 




Table 2: Median values of — 21nA(/i) (evaluated at fi = 1) obtained from H — > 77 background-only (jU = 0) 
simulated data for several values of the Higgs mass and integrated luminosities. 



L 




m H (GeV) 




(fb- 1 ) 


115 


120 


130 


140 


1 


0.72 


0.73 


0.91 


0.67 


2 


0.97 


1.21 


1.39 


0.97 


5 


2.11 


2.59 


3.13 


2.23 


10 


3.55 


4.87 


5.71 


4.04 


30 


8.47 


10.50 


11.63 


9.00 
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Figure 7: The distribution of the test statistic q\ for jl < 1 under the s + b hypothesis (for H — > 77), for m# =120 
GeV with an integrated luminosity of (a) 2 fb -1 and (b) 10 fb -1 . A %\ distribution is superimposed. 

3.2 H-+W+W 

The H — > search is divided into two topologies, production of a Higgs with no jets (H + 0 j) and 

with two additional jets (H + 2 j), using in both cases the decay mode H —> WW —>■ e\\iv. The present 
study does not yet consider the final states evev or /JLV/JLV, nor those with hadronic W decays. Future 
inclusion of these channels is expected to improve the search sensitivity particularly for the high Higgs 
mass region. The search is described in detail in Ref. [5]. 

3.2.1 H + Oj 

The analysis of the H + Oj channel uses a two dimensional maximum-likelihood fit of the transverse 
mass and the transverse momentum of the WW system in two bins of the dilepton opening angle in the 
transverse plane. The fit includes control samples to measure the backgrounds from it and Z —>■ xx. 

The QCD WW background requires particular attention. Its distributions of Higgs-candidate trans- 
verse mass and pj are described with functions containing several adjustable (nuisance) parameters, and 
several others whose values are determined from a full Monte Carlo simulation and thereafter treated as 
fixed. The distribution of the test statistic qo under the background-only (jj, = 0) hypothesis is shown in 
Fig. 8(a) for ra# = 150 GeV for an integrated luminosity of 10 fb -1 . The same fixed QCD WW shape 
parameters are used both to generate the data and for calculating the likelihood ratio. A \x\ distribution 
is superimposed, showing the level of agreement of the asymptotic approximation. 

For this channel, further investigation of the systematic uncertainties was carried out. For the fixed 
shape parameters related to pj and transverse mass distributions for the QCD WW background, the val- 
ues used to generate the data were varied relative to what was used when determining the likelihood ratio. 
This was done in a manner that minimized the sensitivity of the resulting qo distribution to variations in 
other fixed parameters such as the QCD Q 2 scale. The resulting distributions of q Q are thus no longer 
expected to follow the \%[ form, as can be seen in Fig. 8(b). 

Because the chi-square approximation is not valid in this case, the ^-values are calculated using the 
qo distribution obtained directly from the Monte Carlo. An exponential is fitted to the tail region in 
order to extrapolate to large qo values, and the median value of q 0 under the hypothesis of signal plus 
background is determined using the same variation of the background parameters. It was found that the 
median /rvalue of the background-only hypothesis, with the median computed under assumption of the 
s + b hypothesis, is very similar to the original case where the QCD shape parameters are not varied and 
the \%\ distribution is used. 
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Figure 8: The distribution of the test statistic qo for H + Oj ; — > WW + 0 j, under the background-only hypothesis, 
with the same fixed QCD WW shape parameters used at both the generator and the fit level, for m# =150 GeV 
and for an integrated luminosity of 10 fb _1 (a) with the same shape parameters for event generation and fitting; (b) 
with altered shape parameters. A jX\ distribution is superimposed. 

For the combination of results for discovery (i.e., testing }X = 0), we have used the p-values as 
described above for the case of the H + Oj channel. The same variation of QCD WW parameters was 
also investigated for the case of exclusion (i.e., testing jx > 0, and in particular /x = 1), and it was done 
as well for the H + 2 j channel. In those studies, however, the distributions, under the assumption of 
jX, of the test statistic were found to agree quite well with the expected \%\ distribution, even after 
the parameter variation. Therefore for these cases we have based the combination of results on the 
asymptotic approximations for the distributions (as done in this paper for the other Higgs channels). 

To simplify the comparison with the other channels, the median ^-values of the background-only 
(H = 0) hypothesis for the H + 0 j channel were converted into effective values of the variable qo = 
— 21nA(0) according to q 0 = Z 2 = - p)) . These are given in Table 3 for several Higgs masses 

and integrated luminosities. 

Table 3: Median values of —2 In X (ju = 0) obtained from H + Oj fits using data simulated under the assumption 
of signal plus background (ju = 1) for several values of the Higgs mass and integrated luminosity. 



L 








m H (GeV) 








(fb- 1 ) 


130 


140 


150 


160 


170 


180 


190 


1 


1.64 


4.30 


9.56 


16.52 


15.39 


7.37 


3.17 


2 


2.87 


8.60 


19.36 


34.67 


30.58 


13.73 


5.63 


5 


6.55 


20.15 


42.77 


74.39 


63.58 


31.54 


13.54 


10 


11.52 


33.27 


70.67 


113.33 


103.44 


51.06 


22.78 



The median profile likelihood ratio for exclusion, A (/i) (using background-only MC experiments and 
taking the median of the A(/x) distribution), is given in Table 4 for several Higgs masses and integrated 
luminosities for the signal strength ju = 1.0, corresponding to a SM Higgs Boson. 

The distribution of the statistic q\ for fx < 1 under the s+b hypothesis is shown in Fig. 9 for m# = 150 
GeV and for an integrated luminosity of 2 and 10 fb -1 . A distribution is superimposed, showing the 
validity of the asymptotic approximation. 
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Table 4: Median values of — 21nA(/i = 1) obtained from H + 0j fits using simulated background-only (jx = 0) 
data for several values of the Higgs mass and integrated luminosity. 



L 








m H (GeV) 








(fb- 1 ) 


130 


140 


150 


160 


170 


180 


190 


1 


1.37 


3.93 


8.69 


14.83 


14.23 


7.26 


3.26 


2 


2.57 


7.47 


15.59 


25.20 


23.65 


12.91 


5.84 


5 


5.85 


15.26 


30.05 


45.60 


41.13 


25.41 


12.02 


10 


10.01 


24.69 


45.24 


62.13 


57.69 


37.42 


20.03 




Figure 9: The distribution of the test statistic q\ for /t < 1 under the s + b hypothesis for H + Oj — > WW + 0 for 
ntH = 150 GeV with an integrated luminosity of (a) 2fb _1 and (b) lOfbr 1 . A %i distribution is superimposed. 
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3.2.2 H + 2j 

The H + 2j analysis uses a two-dimensional fit based on the transverse mass and the output of a Neural 
Network, which takes as input several kinematic variables related to the jet activity in the event. The fit is 
performed simultaneously in signal-enriched and background-enriched regions distinguished by lepton 
angular variables, which are nearly uncorrected to the jet variables used in the Neural Network. 

The median profile likelihood ratios for discovery, -21nA(/x = 0) (using toy s + b MC experiments 
and taking the median of the = 0) distribution), are given in Table 5 for several Higgs masses and 
integrated luminosities. 

Table 5: Median values of — 21nA(jLl) for jj, — 0 obtained from H + 2j fits to simulated data with signal plus 
background (/I = 1) for several values of the Higgs mass and integrated luminosity. 



L 








m H (GeV) 








(fb- 1 ) 


130 


140 


150 


160 


170 


180 


190 


1 


0.39 


0.89 


1.61 


2.73 


2.89 


1.85 


1.23 


2 


0.70 


1.75 


3.22 


5.34 


5.67 


3.66 


2.07 


5 


2.01 


5.29 


8.87 


14.20 


14.58 


9.22 


5.71 


10 


3.82 


9.14 


16.56 


26.35 


26.05 


16.68 


9.93 



The distribution of the statistic qo under the background-only hypothesis is shown in Fig. 10 for 
run = 150 GeV and for an integrated luminosity of 2 and 10 fb -1 . A distribution is superimposed, 
showing the validity of the asymptotic approximation. 



(a) 



o10 




23456789 10 



^W + W" (2 jet) 
= 1 0 fb" 1 




(b) 
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Figure 10: The distribution of the test statistic q Q (for H + 2j — > WW + 2j), under the null background only 
hypothesis, for m H = 150 GeV and for an integrated luminosity of 2 (a) and 10 (b) fb~ l . A \%\ distribution is 
superimposed. 



The median profile likelihood ratio for exclusion, — 21nA(/x) with jll = 1, where the median is com- 
puted using background-only MC data, is given in Table 6 for several Higgs masses and integrated 
luminosities. 

The distribution of the test statistic q\ for fx < 1 under the s + b hypothesis is shown in Figures 1 1 
for m H = 150 GeV with an integrated luminosity of 2 and 10 fb -1 . A %\ distribution is superimposed, 
showing the validity of the asymptotic approximation. 
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Table 6: Median values of — 21nA(jU) for jj. = 1 obtained from H + 2 j fits to simulated background-only (fi = 0) 
data for several values of the Higgs mass and integrated luminosity. 



L 








m H (GeV) 








(fb- 1 ) 


130 


140 


150 


160 


170 


180 


190 


1 


0.37 


0.75 


1.31 


2.29 


2.44 


1.99 


0.95 


2 


0.87 


2.49 


5.06 


8.18 


7.74 


3.85 


1.90 


5 


1.40 


3.13 


5.86 


9.58 


9.90 


7.89 


4.59 


10 


2.80 


6.56 


10.72 


16.14 


16.62 


13.94 


8.74 




Figure 11: The distribution of the test statistic q\ for (i < 1 under the s + b (jU — 1) hypothesis (for H + 2 j —> 
WW + 2j), for m H = 150 GeV with an integrated luminosity of (a) 2fb _1 and (b) lOfbT 1 . A %\ distribution is 
superimposed. 
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3.3 H^t+t 

The sensitivity of the ATLAS detector to a Higgs boson produced via Vector Boson Fusion and decaying 
to tau leptons has been investigated [6]. 

Two tau decay channels are considered: // and Ih. Although there are several neutrinos in the event, 
it is possible to reconstruct the x + x~ invariant mass, m TT , by making the collinear approximation, in 
which the decay products of the x are assumed to be collinear with the x direction in the laboratory 
frame. After other event selection criteria are imposed, the spectrum of m xx is used to extract the signal. 
Particular care has been given to the incorporation of uncertainty in both the rate and shape for the signal 
and backgrounds. 

A data-driven background estimation technique has been established for the major backgrounds. 
Each technique has been developed to address the aspects of the background estimation which are most 
relevant for the analysis: the shape of the m TT tail from the irreducible Z —>■ xx, the fake tau contribution 
in the Z/j-channel, and the normalization of the QCD backgrounds. 

In addition to the m tz spectrum from the Z — > xx and QCD control samples, a track multiplicity 
distribution is used to constrain the fraction of QCD events in the Z/z-channel. This likelihood term is 
denoted L track (rQ C D,r t au), where r tau (rg CZ) ) denotes the fraction of real taus (fakes from jets) in the 
sample. The track multiplicity distribution for the QCD jets is modelled from samples of QCD di-jets 
that produce tau candidates. 

The shape of the m zx distribution for signal and Z —> xx events is dictated by the resolution of $ T and 
the kinematics of the collinear approximation. The parameterization of m xz for the signal and Z — ► xx 
background are based on the kinematics of the collinear approximation and reproduces an asymmetric 
distribution with non-Gaussian tails. This distribution is dependent on the overall width width, a H / z , and 
a mean, m H / z . 

A Z —> xx control sample is used to constrain the mean, mz, and the overall width of the distribution, 
Oz, which are the only free parameters in the Z — > xx background model. The error bars in the control 
sample were scaled to 10% to account for the 10% shape uncertainty in the jx — > x rescaling method. 

The shapes for W+jets and tt are very similar and are modelled with a single distribution. A con- 
servative 50% error is applied to each bin in the combined QCD (i.e., tt and W+jets) control sample to 
reflect uncertainty in how this shape changes as the remainder of the analysis cuts are applied. 

The shape of the QCD background was parametrized with the following equation: 



The form is motivated by a competition between the parton distribution functions and the matrix element. 
In the //i-channel, the normalization of the backgrounds with fake taus can be constrained by using the 
track multiplicity method described above. We apply a conservative 50% systematic on this fraction. 

By fitting the m TT spectrum to a model that accurately describes the signal and various backgrounds 
it is possible to directly incorporate uncertainty in the background shape and take advantage of the shape 
of the signal within the mass window. We utilize the profile likelihood ratio as our test statistic. The 
likelihood function corresponding to the simultaneous fit is simply a product of the likelihoods from the 
individual measurements: 

L(data\n,m H ,v) = L track (track multiplicity \tqcd) x L z (Z + jets control \m z ,o z ) 
x Lqcd (QCD control \a\ , ai , ) 



where the a,- are the parameters used to parametrize the QCD background and v represents all nuisance 
parameters of the model: OH,m z ,o z ,rQCD,ai,a2,a3- 




(38) 



x L s+ t,( signal candidates | , m# , c# , mz, Oz, tqqd , a\ , ai , «3 ) , 



(39) 
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The data-driven background estimation methods described above have been developed so that un- 
certainty in the background shape and normalization are included directly into the significance calcu- 
lation. Because the discovery criterion is simply testing the presence or absence of the signal, it is not 
sensitive to some of the sources of systematic uncertainty. In contrast, measurement and exclusion of 
o(pp — > qqH) x BR(H — > tt) are sensitive to the uncertainty on the signal selection efficiency. Both 
experimental and theoretical sources of uncertainty on the signal efficiency have been evaluated. The jet 
energy scale uncertainty dominates in this channel, and a signal efficiency uncertainty of 18% was used 
when estimating the exclusion sensitivity. 

The median profile likelihood ratios for discovery, A(jU = 0) (using the Asimov data sets with Ha = 
1), are given in Table 7 for a few Higgs masses at some given luminosities. The distribution of the test 
statistic qo under the null background only hypothesis, for m# = 130 GeV with an integrated luminosity 
of 2 and 10fb~\ is shown in Figure 12. A j%l distribution is superimposed, showing the validity of the 
asymptotic approximation. 

Table 7: Median values of — 21nA(/X = 0) obtained from H — > T + T~ simulated data generated with signal plus 
background (p. = 1) for several values of the Higgs mass and integrated luminosity. 



L 




m H (GeV) 




(fb- 1 ) 


110 


120 130 


140 


1 


0.59 


0.88 0.72 


0.46 


2 


1.18 


1.74 1.40 


0.89 


5 


2.91 


4.43 3.34 


2.08 


10 


5.81 


8.23 6.37 


3.91 


30 


17.2 


23.6 17.6 


10.6 



The resulting median profile likelihood ratio for exclusion, (using the Asimov data sets, }Xa = 
0), is given in Table 8 for a few Higgs masses at some given luminosities and signal strength ju = 1.0 
corresponding to a Standard Model Higgs Boson. 

The distribution of the test statistic q\ for fi < 1 under the s + b hypotheses for m# = 130GeV with 
an integrated luminosity of 2 and 10 fb -1 is shown in Figures 13. A %\ distribution is superimposed, 
showing the validity of the asymptotic approximation. 

Table 8: Median values of -21nA(jU = 1) obtained from H — > t + t~ background-only (ju = 0) simulated data for 
several values of the Higgs mass and integrated luminosities. 



L 




mtf(GeV) 




(fb" 1 ) 


110 


120 130 


140 


1 


0.64 


0.71 0.52 


0.32 


2 


1.31 


1.41 1.03 


0.64 


5 


3.30 


3.42 2.52 


1.56 


10 


6.93 


7.79 7.18 


3.48 


30 


15.3 


16.6 12.8 


8.48 
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Figure 12: The distribution of the test statistic qo for H — > T + T _ under the null background-only hypothesis, 
for ma = 130GeV with an integrated luminosity of 2 (a) and 10 (b) fb _1 . A jXi distribution is superimposed. 
Figures (c) and (d) show 1 —F(qo) where F(qo) is the corresponding cumulative distribution. The small excess 
of events at high go is statistically compatible with the expected curves, as can be seen by comparison with the 
dotted histograms that show the 68.3% central confidence intervals for p = 1 — F(qo\0). The lower dotted line at 
2.87 x 10~ 7 shows the 5(7 discovery threshold. 
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3.4 H -> ZZW -> 4/ 

Details of the // — »■ ZZ**' — > 4/ channel can be found in Ref. [7]. The main challenge of this channel 
for what concerns statistical analysis is its relevance over a very wide mass range (from m# around 
120GeV up to 700GeV), over which the shapes and cross sections of both signal and background show 
considerable variations. 

While in principle it would be possible to divide the mass range in different regions and use sepa- 
rate models for each of them, using a unique background model for the whole phase space is a better 
approach, since it allows to estimate discovery and exclusion significance at any mass, without having 
to worry about boundaries between different models. This will be needed of course when the analysis is 
performed using real data, where m# is unknown. 

The main background after event filtering in this channel is the irreducible ZZ — »■ 41 process. Re- 
ducible backgrounds such as Zbb — > 4/ +X or tt give a negligible contribution to the overall shape, with 
the only exception of the m H = 120GeV case, where Zbb -^Al + X modifies the background shape in 
the low mass region, and it must therefore be taken into account. 

The irreducible background has been modelled using a combination of Fermi functions which are suit- 
able to describe both the plateau in the low mass region and the broad peak corresponding to the second 
Z coming on shell. The chosen model is described by the following function: 



The first plateau, in the region where only one of the two Z bosons is on shell, is modelled by the 
first term, and its suppression, needed for a correct description at higher masses, is controlled by the pS 
and p9 parameters. The second term in the above formula accounts for the shape of the broad peak and 
the tail at high masses. This function can describe with a negligible bias the ZZ background shape with 
good accuracy over the full mass range. 

As already mentioned, the Zbb contribution is relevant only when searching for very light Higgs 
bosons (in this study, only niH = 120GeV). In this case, an additional term is added to the ZZ continuum, 
with a functional form similar to the second part of equation 40. For what concerns signal modelling, a 
simple Gaussian shape has been used for m# < 300 GeV, while a relativistic Breit-Wigner formula was 
needed to properly describe the big tails arising at higher values of the Higgs mass. 

In the fits to determine the profile likelihood ratio, m# is fixed to the hypothesized value, while o# 
is allowed to float in a ±20% range around the value obtained from the signal Monte Carlo distributions. 
All the parameters describing the background shape are floating within sensible ranges. Given the com- 
plexity of the model involved, the fit can from time to time get trapped into local minima. While there is 
no easy way to avoid this problem, the fake measurements obtained in this case are easy to distinguish 
from the correct ones, and a repetition of the fit from a different starting point is enough to solve the 
problem. 

The resulting median profile likelihood ratios for discovery, — 21nA(jU = 0), with the median com- 
puted using toy s + bMC data (i.e., with /x = 1), are given in Table 9 for several values of the Higgs mass 
and integrated luminosity. 

The distribution of the test statistic qo under the null background-only hypothesis, for mu = 200 GeV 
with an integrated luminosity of 2 and 10fb _1 , is shown in Fig. 14. A distribution is superimposed, 
showing the validity of the asymptotic approximation. 

The resulting median profile likelihood ratio for exclusion, (using toy background-only MC 

experiments and taking the median of the A(ju) distribution), is given in Table 10 for several Higgs 
masses and luminosities using a signal strength \i = 1.0 corresponding to a Standard Model Higgs boson. 

The distribution of the test statistic q\ for fl < 1 under the s + b hypothesis for m H = 200 GeV with 




(40) 
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Table 9: Median values of -21nA(jU = 0) obtained from H — > ZZM — ► 41 simulated data generated with signal 
plus background (ju = 1) for several values of the Higgs mass and integrated luminosity. 



L 












m H (GeV) 












(fb- 1 ) 


120 


130 


140 


150 


160 


165 


180 


200 


300 


400 


500 


600 


1 


0.22 


1.20 


3.98 


5.35 


1.65 


0.49 


0.86 


6.86 


5.20 


3.55 


0.88 


0.31 


2 


0.44 


2.40 


7.96 


10.7 


3.30 


0.98 


1.71 


13.7 


10.4 


7.68 


1.74 


0.62 


5 


1.05 


5.97 


19.9 


26.7 


8.26 


2.46 


4.27 


34.3 


25.8 


17.8 


4.34 


1.54 


10 


2.18 


12.0 


39.8 


53.5 


16.5 


4.9 


8.55 


68.7 


51.6 


35.5 


8.47 


3.11 


30 


6.56 


35.8 


120 


160 


48.9 


14.8 


25.6 


206 


162 


108 


27.9 


10.9 


60 


13.1 


71.6 


239 


321 


99.1 


29.5 


51.3 


407 


310 


213 


52.6 


18.6 




10 0 2 4 6 8 10 12 14 16 18 10 0 2 4 6 8 10 12 14 16 

% % 
Figure 14: The distribution of the test statistic qo (for H — > 4-1), under the null background only hypothesis, 
for niH = 200 GeV with an integrated luminosity of 2 (a) and 10 (b) fb -1 . A jXi distribution is superimposed. 
Figures (c) and (d) show 1 ~F(qo) where F(qo) is the corresponding cumulative distribution. The small excess 
of events at high q 0 is statistically compatible with the expected curves, as can be seen by comparison with the 
dotted histograms showing the 68.3% central confidence intervals for p = 1 — F(qo\0). The lower dotted line at 
2.87 x 10~ 7 shows the 5 a discovery threshold. 
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Table 10: Median values of — 2\nX(fX = 1) obtained from H — > ZZ^*> — > Al simulated data generated with back- 
ground only (ji = 0) for several values of the Higgs mass and integrated luminosity. 



L 












m H (GeV) 












(fb- 1 ) 


120 


130 


140 


150 


160 


165 


180 


200 


300 


400 


500 


600 


1 


0.16 


0.93 


2.25 


3.12 


1.23 


0.39 


0.51 


4.86 


2.86 


2.36 


0.87 


0.28 


2 


0.33 


1.85 


4.50 


6.22 


2.4 


0.75 


1.90 


9.64 


5.68 


4.69 


1.74 


0.56 


5 


0.83 


4.60 


11.2 


15.4 


6.09 


2.28 


4.74 


23.5 


14.0 


11.6 


4.32 


1.39 


10 


1.60 


9.14 


22.0 


30.3 


12.1 


3.86 


5.05 


45.2 


27.3 


22.7 


8.57 


2.77 


30 


4.78 


26.8 


63.0 


85.3 


35.1 


11.4 


14.7 


105 


74.1 


63.0 


24.9 


8.22 


60 


8.90 


51.7 


117 


155 


66.9 


22.3 


28.0 


174 


129 


113 


47.6 


16.1 



an integrated luminosity of 2 and lOfb 1 is shown in Figures 15. A Xi distribution is superimposed, 
showing the validity of the asymptotic approximation. 




0 2 4 6 8 10 12 0 2 4 6 8 10 12 14 



9i "1 

Figure 15: The distribution of the test statistic q x for ji < 1 under the s + b hypothesis (for// — > Al), for m H = 200 
GeV with an integrated luminosity of (a) 2fb _1 and (b) lOfb -1 . A%\ distribution is superimposed. 



3.5 Limitations of the approximations used 

The distributions shown in Sections 3.1 through 3.4 show varying levels of agreement between the 
asymptotic chi-square form and the results of Monte Carlo simulations. For the WW (0 jet) channel 
(Fig. 8), the discrepancy in the distribution of qo is very large, and this is understood to arise from the 
special manner in which the systematic uncertainties for this channel were treated. The distribution of qo 
for the WW (0 jet) channel therefore does not use the asymptotic formula. This is the only channel for 
which the approximation was not applied. 

For other cases such as the distribution of q\ for the H — > 77 channel shown in Fig. 7, the Monte Carlo 
distribution falls off significantly faster than the chi-square curve. This means that the significance with 
which one excludes the tested hypothesis will be less when estimated from the chi-square curve, leading 
to conservative limits. As the integrated luminosity increases, one expects to the asymptotic formula to 
become more accurate. 

In some of the distributions such as that of go for the H —> ZZ^*> — »■ 41 channel shown in Fig. 14, the 
Monte Carlo simulation indicates a slight excess over the chi-square curve in the tail region. The level of 
the excess is not statistically significant in the part of the distribution that can be meaningfully assessed 



1505 



Higgs - Statistical Combination of Several Important Standard Model Higgs . 



given the amount of Monte Carlo data available (out to go between around 9 to 16, i.e., to the level of a 

3 to 4a discovery). At present it is not practical to verify directly that the chi-square formula remains 
valid to the 5a level (i.e., out to qo = 25). Thus the results on discovery significance presented here rest 
on the assumption that the asymptotic distribution is a valid approximation to at least the 5 a level. 

The validation exercises carried here out indicate that the methods used should be valid, or in some 
cases conservative, for an integrated luminosity of at least 2 fb _1 . At earlier stages of the data taking, 
one will be interested primarily in exclusion limits at the 95% confidence level. For this the distributions 
of the test statistic at different values of }X can be determined with a manageably small number of 
events. It is therefore anticipated that we will rely on Monte Carlo methods for the initial phase of the 
experiment. 

4 Results of the combination 

4.1 Combined discovery sensitivity 

The full discovery likelihood ratio for all channels combined, X s+ b{0), is calculated using Eq. 33. This 
uses the median likelihood ratio of each channel, X s+ bj(0), found either by generating toy experiments 
under the s + b hypothesis and calculating the median of the X s+ bj distribution or by approximating the 
median likelihood ratio using the Asimov data sets with Haj = 1- Both approaches were validated to 
agree with each other. The discovery significance is calculated using Eq. 36, i.e., Z rj y 7 — 21nA(0), 
where A (0) is the combined median likelihood ratio. 

The resulting significances per channel and the combined one are shown in Fig. 16 for an integrated 
luminosity of 10 fb~ ! . 




Figure 16: The median discovery significance for the various channels and the combination with an integrated 
luminosity of 10 fb _1 for (a) the lower mass range (b) for masses up to 600 GeV. 

The median discovery significance as a function of the integrated luminosity and Higgs mass is shown 
colour coded in Fig. 17. The full line indicates the 5a contour. Note that the approximations used do 
not hold for very low luminosities (where the expected number of events is low) and therefore the results 
below about 2fb~ 1 should be taken as indications only. In most cases, however, the approximations tend 
to underestimate the true median significance. 

4.2 Combined exclusion sensitivity 

The full likelihood ratio of all channels used for exclusion for a signal strength /i, Af,(ju), is calculated 
using Eq. 34 with the median likelihood ratios of each channel, Xbj(jJ.), calculated, either by generating 
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significance 




120 140 160 180 200 220 240 260 280 300 
m H [GeV] 



Figure 17: Significance contours for different Standard Model Higgs masses and integrated luminosities. The 
thick curve represents the 5a discovery contour. The median significance is shown with a colour according to the 
legend. The hatched area below 2 fb _1 indicates the region where the approximations used in the combination are 
not accurate, although they are expected to be conservative. 

toy experiments under the ft-only hypothesis and calculating the median of the Xbj distribution or ap- 
proximating the median likelihood ratio using the Asimov data sets with Haj = 0. Both approaches were 
checked to agree with each other. A signal strength }X = 1 corresponds to the Standard Model Higgs 
boson. 

Any exclusion of ju(m^) smaller than 1 corresponds to an exclusion of a Standard Model Higgs 
boson with a mass m#. To probe the median sensitivity for excluding a Standard Model Higgs boson we 
follow Eq. 35 and calculate the corresponding p- value for ji = 1, p\ for a given luminosity at a given 
Higgs mass. A /rvalue of 0.05 corresponds to a significance (Eq. 36) of 1.64. The resulting p\ for the 
various channels as well as for the combination, for a luminosity of 2fb~ 1 , are shown in Fig. 18. Note 
that any /rvalue below 0.05 indicates an exclusion. We therefore conclude that with a luminosity of 2 
fb _1 ATLAS has the median sensitivity to exclude a Standard Model Higgs boson heavier than 115 GeV 
at the 95% Confidence Level. This can also be seen from Fig. 19, which shows the luminosity required 
to exclude a Higgs boson with a mass nin at a given confidence level from the combination of the four 
channels explored in this note. 

The sharp increase in the required luminosity for lower m# seen in Fig. 17 reflects the decrease in 
sensitivity to the Higgs when using only the set of channels considered here. Further developments will 
increase the sensitivity in this region. For example, improved analysis methods for the H — > 77 channel 
are described in Ref. [4], including a separation of the events into those with zero or two accompanying 
jets. Additional final states such as ttH with H — > bb will help somewhat, although the contribution to 
the sensitivity will be small because of the large uncertainties in the background. 

For the WW channel, the present study includes only the <?v/iv decay mode, but it is planned to 
include evev, jXVjXV and qqlv as well. The ZZ^ channel here only includes Z decays to ee and jUjU, but 
in future analyses qqVV will be included. The additional WW and ZZ^ modes have been found to have 
sensitivity for a high-mass Higgs. Finally, combination with the results from ATLAS with those of CMS 
will of course result in an overall increase in sensitivity. 
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Figure 18: The median p-value obtained for excluding a Standard Model Higgs Boson for the various channels 
as well as the combination for (a) the lower mass range (b) for masses up to 600 GeV. 




110 115 120 125 130 135 140 

m H [GeV] 



Figure 19: The expected luminosity required to exclude a Higgs boson with a mass m# at a confidence level given 
by the corresponding colour. The hatched area below 2 fb -1 indicates the region where the approximations used 
in the combination are not accurate, although they are expected to be conservative. 
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5 Conclusions 

The procedure for combination of search results based on the profile likelihood ratio has been applied 
to a study of the search for the Standard Model Higgs boson using four search channels: H — > T + T~, 
H — > — > evjiv, H — > 77 and // — > ZZM — »■ 4 leptons. The combination method is very general 

and can be applied to essentially any search that will be carried out at the LHC. 

The study here has not exploited all of the search channels that will be investigated and therefore the 
current estimates of the sensitivity can be regarded as conservative. For example, using further decay 
modes in the ZZ and WW channels will provide additional sensitivity especially for a Higgs boson in the 
higher mass range. 

The studies have exploited a series of useful approximations that allow one to determine the median 
discovery and exclusion sensitivities from a combined fit in a manner that only requires separate input 
ingredients from the individual channels. The determination of the significance for a given (e.g., real) 
data set, however, will require a simultaneous fit of all of the channels. 

It is not practical at present to generate enough Monte Carlo data to verify directly that the tail of 
the profile likelihood distribution is well described to the level required for discovery at the 5a level, 
corresponding to an upper tail area of 2.87 x 10~ 7 . The estimates of discovery significance presented 
here therefore rely on the assumption that the large-sample approximation used remains valid out to this 
level. 

The validation studies shown in Section 3 indicate that the approximations used should be reasonably 
accurate or lead to conservative limits for an integrated luminosity of at least 2 fb _1 . For the earlier 
stages of the experiment it is expected that one will need to rely on Monte Carlo methods, which should 
be feasible for exclusion limits at the 95% confidence level. 

The profile likelihood ratio treats systematic errors by associating the uncertainties with adjustable 
(nuisance) parameters. Other methods for treating systematic uncertainties can also be considered. Us- 
ing Bayesian methods, for example, one would associate a prior probability density with the nuisance 
parameters. We plan to develop and use this and other approaches in parallel with the profile likelihood 
method for searches at the LHC. 

The study presented in this paper provides the discovery significance for a Higgs boson of a specific 
mass. That is, the traditional discovery threshold /rvalue of 2.87 x 10~ 7 corresponding to a 5a effect 
for a given hypothesized Higgs mass refers to the false discovery rate for a Higgs of that mass. The false 
discovery rate for a Higgs of any mass is higher, and several approaches are being pursued to quantify this 
(the so-called 'look-elsewhere effect'). The most mature of these methods involves using a simultaneous 
fit of the Higgs mass wth and the strength parameter jx (or equivalently the Higgs production rate), as has 
been discussed in the studies of H — > 77 [4] and H — > [5]. 

To summarize, the studies based on the four channels considered in this note confirm the good dis- 
covery and exclusion sensitivities already shown in the ATLAS Technical Design Report (TDR) [10]. 
Furthermore the results here are based on better knowledge and a more realistic simulation of the detec- 
tor than what is described in the TDR. Because of the approximations used, the present studies are valid 
only for luminosities above 2fb _1 . With a luminosity of 2fb~' the expected (median) sensitivity is at the 
5a level or greater for discovery of a Higgs boson in the mass range between 143 and 179 GeV, and the 
expected upper limit at 95% confidence level on the Higgs mass is 1 15 GeV. 

A Comparison with procedures used at LEP 

In this appendix we compare the procedures described in the present analysis with those used in searches 
carried out at LER More details on these methods be found in [8]. The important differences involve 
the definition of the test statistic used and the treatment of systematic uncertainties. In addition, the LEP 
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analyses adopted a special procedure to prevent spurious exclusion due to a downward fluctuation of the 
number of events (the CL S method, see below). 

In the LEP Higgs searches, a hypothesized value of the Higgs mass m H was tested by constructing 
the statistic 

L f± > = L(n = l) 
W U L(M = 0)' ^ ^ 

where as before b {ji = 0) represents the background-only hypothesis and s + b {ji = 1) refers to back- 
ground plus signal at the rate predicted by the Standard Model. For convenience the equivalent logarith- 
mic variable q = -2\nQ was used. 

The sampling distribution of Q was determined by Monte Carlo simulation. The Monte Carlo was 
also used to incorporate systematic errors by sampling values of the corresponding nuisance parameters 
from pdfs that reflected their uncertainties. That is, one effectively integrated the product of the likelihood 
and prior pdfs for the nuisance parameters. 

For a give observed value g 0 bs = -2 In gobs, the ^-values for the s and s + b hypotheses were deter- 
mined as 

Ps+b = r f{q\s + b)dq = CU+b, (42) 
Pb = J**f(q\b)dq=l-CL b . (43) 

Having determine the ^-values, the LEP analyses then based exclusion of the s + b hypothesis not on the 
p- value of s + b but rather on the ratio CL S , defined as 

CL S = ^ = -^. (44) 
CL b 1 - p h 

The signal-plus-background hypothesis was said to be excluded at confidence level CL = 1 - a = 0.95 
if one finds 

CL. S < a . (45) 

Since CL/, < 1, one has CL S > CL s+ b. Therefore the CL. V method will not exclude as large a region of 
parameter space as that based on the signal-plus-background /rvalue (CL s+h method). As the CL s+b 
method was designed to provide an interval that brackets the true value of the parameter with a proba- 
bility of at least 1 — a, the CL S limit must cover the true parameter with a greater probability; it is in 
this sense conservative. The CL^ method was devised so as to avoid the problem where a downward 
fluctuation in the number of background events can lead to exclusion of the Higgs mass considered, even 
for hypothesized mass values where one does not expect to be sensitive to Higgs production [9]. 
In contrast, in the present analysis we test a hypothesized value of the strength parameter }X using 

as described in Section 2. With this definition, the sampling distribution of the test statistic f(qn\n) 
approaches a well defined form related to the chi-squared distribution for a sufficiently large data sample. 
The ability to exploit this approximate form is very useful as the relevant /rvalue for a 5a discovery 
is 2.87 x 10~ 7 , and therefore to determine this from Monte Carlo would require an extremely large 
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number of simulated experiments. At LEP this was not a crucial issue as the statistical treatment focused 
primarily on exclusion limits at 95% CL, not on discovery at the 5a level. 

Systematic uncertainties here have been incorporated using the profile likelihood, rather than with 
the integrated likelihoods used at LEP. For the uncertainties most relevant to the analyses at the LHC, the 
broadening of the likelihood function obtained by both procedures is similar. It is the profile likelihood 
ratio and not the ratio of integrated likelihoods, however, that approaches the chi-squared form in the 
large sample limit in accordance with Wilks' theorem. 

The CL S method has not been applied in the present analysis but studies of its application to searches 
in ATLAS are ongoing. 

A final difference with the LEP procedures concerns the definition of significance Z. Here we have 
defined its relation to the /rvalue as the number of standard deviations of a Gaussian variable that would 
give a one-sided tail area of p, as described in Section 2.1. A significance of Z = 5 corresponds to 
p = 2.87 x 10~ 7 . The LEP Higgs group defined this relation using a two-sided fluctuation of a Gaussian 
variable, i.e., a 5a significance corresponded to p = 5.7 x 10~ 7 . 
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Abstract 

This chapter serves as an introduction to a collection of six articles that detail 
the strategy foreseen to search for Supersymmetry with the ATLAS detector 
at the Large Hadron Collider, concentrating on the initial data taking period 
with an expected integrated luminosity of about 1 fb 1 . We review here the 
phenomenology of Supersymmetry with ATLAS and discuss how events are 
simulated and reconstructed, concentrating on aspects related to Supersymme- 
try searches. We also introduce many of the experimental variables that are 
used throughout this collection. 

1 Introduction 

Supersymmetry (SUSY) is one of the theoretically favoured candidates for physics beyond the Standard 
Model. The main motivation is to protect the Higgs boson mass from quadratically diverging radiative 
corrections, in a theory where the Standard Model is valid only up to a high scale A. The proposed 
solution postulates the invariance of the theory under a symmetry which transforms fermions into bosons 
and vice-versa. 

The basic prediction of SUSY is thus the existence, for each Standard Model particle degree of 
freedom of a corresponding sparticle, with spin different by half a unit. The SUSY generators commute 
with the SU(2) x[/(l) x SU(3) symmetries of the Standard Model, and with the Poincare group. It 
follows that with unbroken SUSY the partner particles would have the same quantum numbers and 
masses as the Standard Model particles. Since no superpartner has been observed to date, SUSY must be 
broken. A common approach to the phenomenological study of SUSY is to assume the minimal possible 
particle content, and to parametrise the SUSY-breaking Lagrangian as the sum of all the terms which 
do not reintroduce quadratic divergences into the theory. The model thus obtained is called Minimal 
Supersymmetric Standard Model (MSSM) and is characterised by a large number of parameters (~ 100). 
In order to warrant the conservation of baryonic and leptonic quantum numbers, a new multiplicative 
quantum number, /^-parity, is introduced, which is 1 for particles and -1 for the SUSY partners. Models 
where /^-parity is violated can be formulated, but in the current volume we concentrate on models with 
/^-parity conservation. 

The consequences of /^-parity conservation are that sparticles must be produced in pairs, and that 
each will decay to the lightest SUSY particle (LSP) which must be stable. Cosmological arguments 
suggest that stable LSPs should be weakly interacting and so would escape direct detection at ATLAS, 
resulting in the characteristic feature expected for SUSY events - an imbalance of the transverse energy 
measured in the detector, abbreviated here as £"™ ss . The associated signatures will provide sensitivity 
to a large class of models. The aim of the simulation studies is to ensure that the ATLAS experiment 
will have rapid sensitivity to a large ensemble of SUSY models, and to develop a general search strategy. 
It is not possible to explore in full the 100-dimensional parameter space of the MSSM. It is therefore 
necessary to adopt some specific assumptions for the SUSY breaking, resulting in models defined by a 
small number of parameters at the SUSY breaking scale. Two models will be studied in detail: 

• mSUGRA, where SUSY breaking is mediated by gravitational interaction; 

• GMSB, where SUSY breaking is mediated by a gauge interaction through messenger gauge fields. 

These two models give quite different topologies, due to the different nature of the lightest SUSY particle, 
which is the lightest neutralino for the mSUGRA case and the gravitino for the GMSB case. For each 
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of these models a set of benchmark points has been defined, on which full simulation studies have been 
performed. 

The analysis work presented in the different SUSY notes is a coherent body of work based on an 
agreement among the many groups performing the analyses on many subjects both theoretical and ex- 
perimental. All of the analyses are based on a common implementation of the SUSY model, common 
background datasets have been used, and all of the analyses apply a common definition of the physics 
objects in the detectors, optimised for the specific environment of the SUSY searches. In the following 
sections a quick overview of these common issues will be given, which should be read before approach- 
ing the detailed studies documented in the different notes. 

2 Signal and Background generation 

The simulated data used for the studies documented here have been produced by Monte Carlo simulation 
inside the official ATLAS software and production frameworks, and for all of the samples a detailed 
simulation of the detector has been performed. 

2.1 Signal samples 

For the detailed SUSY analysis a set of benchmark points in the mSUGRA and GMSB frameworks were 
chosen, with the aim of exploring sensitivity to a wide class of of final-state signatures. 

For the mSUGRA points, the principle that the predicted cosmological relic density of neutralinos 
should be consistent with the observed density of cold dark matter was used for guidance. In order 
to reproduce the observed relic density, the model parameters must result in a spectrum which ensures 
efficient annihilation of the neutralinos in the early universe. In the mSUGRA scenario this is possible 
only in restricted regions of the parameter space where annihilation is enhanced either by a significant 
higgsino components in the lightest neutralino or through mass relationships. The points chosen are 
defined in terms of the mSUGRA parameters at the unification scale: 

SU1 m 0 = 70 GeV, m 1/2 = 350 GeV, A 0 = 0, tan/3 = 10, ju > 0. Coannihilation region where 

Xi annihilate with near-degenerate I. 
SU2 m 0 = 3550 GeV, m 1/2 = 300 GeV, A 0 = 0, tan/3 = 10, ]X > 0. Focus point region near 

the boundary where \i 2 < 0. This is the only region in mSUGRA where the %° has a high 

higgsino component, thereby enhancing the annihilation cross-section for processes such 

as Xifi ww - 

SU3 m 0 = 100 GeV, m 1/2 = 300 GeV, A 0 = -300 GeV, tan/3 = 6, ju > 0. Bulk region: LSP 

annihilation happens through the exchange of light sleptons. 
SU4 m 0 = 200 GeV, m 1/2 = 160 GeV, A Q = -400 GeV, tan/3 = 10, n > 0. Low mass point 

close to Tevatron bound. 
SU6 m 0 = 320 GeV, m 1/2 = 375 GeV, A 0 = 0, tan/3 = 50, n > 0. The funnel region where 

2m^o « niA- Since tan/3 » 1, the width of the pseudoscalar Higgs boson A is large and T 

decays dominate. 

SU8.1 m 0 = 210 GeV, m 1/2 = 360 GeV, A 0 = 0, tan/3 = 40, ju > 0. Variant of coannihilation 
region with tan /3 » 1 , so that only m?, — m^o is small. 
SU9 m 0 = 300 GeV, m 1/2 = 425 GeV, A 0 = 20, tan j3 = 20, /J, > 0. Point in the bulk region with 
enhanced Higgs production 
The SUSY particle mass spectra for each of these points are listed in Table 2. 

The leading-order and next-to-leading-order cross-sections at 14 TeV center of mass for the chosen 
points are given in Table 1. These have been calculated with the program PROSPINO 2.0.6 [1-3], using 
the default settings, and the parton distribution set CTEQ6M [4]. 
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Table 1: Production cross-sections at leading order (a LO ) and next-to-leading order (a NLO ), numbers 
of Monte Carlo events simulated (N) and corresponding integrated luminosity for the SUSY benchmark 
points used by ATLAS. 



Label 


a LO (pb) 


a NLO (pb) 


N 


L (fb" 1 ) 


SU1 


8.15 


10.86 


200 K 


18.4 


SU2 


5.17 


7.18 


50 K 


7.0 


SU3 


20.85 


27.68 


500 K 


18.1 


SU4 


294.46 


402.19 


200 K 


0.50 


SU6 


4.47 


6.07 


30 K 


4.9 


SU8.1 


6.48 


8.70 


50 K 


5.7 


SU9 


2.46 


3.28 


40 K 


12.2 



Table 2: Particle mass spectrum (in GeV) for the SUSY benchmark points. 



Particle 


SU1 


SU2 


SU3 


SU4 


SU6 


SU8.1 


SU9 


d L 


764.90 


3564.13 


636.27 


419.84 


870.79 


801.16 


956.07 


u L 


760.42 


3563.24 


631.51 


412.25 


866.84 


797.09 


952.47 


hi 


697.90 


2924.80 


575.23 


358.49 


716.83 


690.31 


868.06 


h 


572.96 


2131.11 


424.12 


206.04 


641.61 


603.65 


725.03 


d R 


733.53 


3576.13 


610.69 


406.22 


840.21 


771.91 


920.83 


U R 


735.41 


3574.18 


611.81 


404.92 


842.16 


773.69 


923.49 


h 


722.87 


3500.55 


610.73 


399.18 


779.42 


743.09 


910.76 


h 


749.46 


2935.36 


650.50 


445.00 


797.99 


766.21 


911.20 




255.13 


3547.50 


230.45 


231.94 


411.89 


325.44 


417.21 


v e 


238.31 


3546.32 


216.96 


217.92 


401.89 


315.29 


407.91 


Tl 


146.50 


3519.62 


149.99 


200.50 


181.31 


151.90 


320.22 


V T 


237.56 


3532.27 


216.29 


215.53 


358.26 


296.98 


401.08 


Sr 


154.06 


3547.46 


155.45 


212.88 


351.10 


253.35 


340.86 




256.98 


3533.69 


232.17 


236.04 


392.58 


331.34 


416.43 


8 


832.33 


856.59 


717.46 


413.37 


894.70 


856.45 


999.30 


Xi 


136.98 


103.35 


117.91 


59.84 


149.57 


142.45 


173.31 


Xi 


263.64 


160.37 


218.60 


113.48 


287.97 


273.95 


325.39 


?I 


466.44 


179.76 


463.99 


308.94 


477.23 


463.55 


520.62 




483.30 


294.90 


480.59 


327.76 


492.23 


479.01 


536.89 


xi 


262.06 


149.42 


218.33 


113.22 


288.29 


274.30 


326.00 


xi 


483.62 


286.81 


480.16 


326.59 


492.42 


479.22 


536.81 


hP 


115.81 


119.01 


114.83 


113.98 


116.85 


116.69 


114.45 


H° 


515.99 


3529.74 


512.86 


370.47 


388.92 


430.49 


632.77 


A Q 


512.39 


3506.62 


511.53 


368.18 


386.47 


427.74 


628.60 


H+ 


521.90 


3530.61 


518.15 


378.90 


401.15 


440.23 


638.88 


t 


175.00 


175.00 


175.00 


175.00 


175.00 


175.00 


175.00 
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Even though the SUSY benchmark points are all based on the mSUGRA scenario, they provide a 
rather wide range of possible decay topologies. They share some common features, for example for all 
these points the gluino mass is less than 1 TeV, and the ratio ntg/m^o = 6-8. For all points except SU2, 
the squark and gluino masses are comparable. Hence gluinos and squarks would be copiously produced 
and would decay giving relatively high pj jets, possibly leptons, and £"™ 1SS . These features are relatively 
general - although not guaranteed - but the cold dark matter density predictions are very specific to 
mSUGRA. 

The signatures from GMSB models are often very different because the X\ is no longer the lightest 
supersymmetric particle, so it would be expected to decay, and (depending on its lifetime) may do so 
within the detector. Given its particular experimental signature, a dedicated article of this chapter is 
devoted to the relevant signatures [5], where the GMSB benchmarks are discussed in detail. 

2.2 Backgrounds 

The Standard Model background processes most relevant to SUSY searches are tt, W +jets, Z+jets, jet 
production from QCD processes and diboson production. 

Different Monte Carlo generators were used for different processes, in the attempt of optimising 
the reliability of the estimate for the Standard Model backgrounds. See Ref. [6] for a more detailed 
description. 

For tt production, which is the dominant background for many of the signatures, the MC@NLO [7,8] 
generator was used. It includes full next-to-leading order QCD corrections, affording a quite stable 
absolute cross-section prediction and a good description of the final state kinematics for events with up 
to one additional QCD jet. QCD showering and fragmentation are performed using the HERWIG [9, 10] 
program. Single top production is expected to add a very small contribution [11], but has also been 
investigated. 

Typically SUSY analyses require large jet multiplicities. It is very important to simulate correctly 
the kinematics of the additional jets for processes like W +jets and Z + jets. For these processes we 
therefore used the ALPGEN [12] generator, which at leading order in QCD and electroweak interactions, 
calculates the exact matrix elements for multiparton hard processes in hadronic collisions. Showering 
and hadronisation are provided through the HERWIG program. In order to achieve a correct description 
of the jet multiplicities, it is necessary to perform a correct match between the jets produced by the 
matrix-element generator and the ones produced by parton showering. This topic is the subject of active 
theoretical work. Background samples of W and Z containing at least four jets were produced with 
ALPGEN, using MLM matching [13] and with the jet matching cut set at 40 GeV. Contributions from 
processes with matrix-element parton multiplicities between one and five were summed to produce the 
multi-jet sample. A filter was applied at the generator level requiring at least four jets with transverse 
momentum above 40 GeV, with at least one of these having a transverse momentum above 80 GeV, and 
with missing transverse energy above 80 GeV. 

The leading-order cross-sections were normalised to the results from next-to-next-leading-order cal- 
culations by applying a k factor of 1.15 (1.27) for W (Z) production respectively. The k factor was 
calculated by comparing the inclusive leading-order cross-section to the NNLO calculation of [14]. For 
topologies involving fewer than four jets, the PYTHIA generator [15] was used for the generation of the 
W and Z boson backgrounds. 

For the simulation of QCD multi-jet samples, ALPGEN would also be an appropriate choice. For 
practical reasons, however, it was impossible to generate ALPGEN samples with sufficiently large num- 
bers of events to simulate the backgrounds for these studies. As a backup solution we have used a shower 
Monte Carlo, PYTHIA, for which adequate statistics could be generated by producing samples in slices 
the of pj of the hard scattering. A filter at generation level is applied to the PYTHIA events, requiring 
for the hardest jet a transverse momentum above 80 GeV, for the second jet transverse momentum above 
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Table 3: Kinematic boundaries, number of events and integrated luminosity of the PYTHIA QCD back- 
ground samples. 



Label 


Pt range (GeV) 


N 


Int. Lumi (pb 1 ) 


J4 


140-280 


18k 


19.6 


J5 


280-560 


60 k 


168.4 


J6 


560-1120 


20 k 


296.6 


J7 


1129-2240 


4k 


754.7 


J8 


>2240 


4k 


181000 



40 GeV and missing transverse energy above 100 GeV. The numbers of events which were used for 
detailed simulation, and the corresponding integrated luminosity for each pj slice are shown in Table 3. 

The contributions of the diboson processes WW, ZZ and WZ are almost negligible for multi-jet 
analyses as they are strongly suppressed by typical SUSY selections requiring a large number of jets 
with high transverse momenta and large missing transverse energy. However they are very important 
when searching for direct gaugino production [16]. The corresponding data samples were generated at 
leading order with the HERWIG Monte Carlo, including the full off-shell structure for Z/y. The cross- 
sections were then normalised to the next-to-leading-order cross-sections calculated with the MCFM 
code [17]. 

For all the samples except QCD jet production a sample corresponding to at least 1 fb _1 of integrated 
luminnosity was simulated. 

Pile-up and cavern background simulations were generally not included in the signal and background 
samples except in a few cases where this is specifically indicated. 

3 Object Identification for SUSY analysis 

Supersymmetric events are expected to be characterised by several high-momentum jets and missing 
transverse energy. Leptons 1 and taus are also present in a large fraction of the events for the benchmark 
points considered. The analyses documented in this Chapter use common particle identification criteria, 
which are briefly described in this section. 

3.1 Jets 

Because of the relatively large multiplicity of jets in SUSY events, a narrow cone is preferable in the 
reconstruction of jets. The algorithm used to reconstruct jets in the analysis documented here is the cone 
algorithm [18] with a cone size of 0.4. Most analyses presented do not rely on secondary-vertex tagging 
("b tagging") but when it is applied, standard ATLAS algorithms [19] are used. 

3.2 Missing transverse energy 

The measurement of the transverse energy imbalance in the detector plays a crucial role in the searches 
for Supersymmetry with R-parity conservation, and the requirement of a large value of E™ 1SS is a com- 
mon feature of all the analyses presented in this chapter. 

'Except where explicitly indicated, in this Chapter the word "lepton" and the corresponding symbol I should be understood 
to mean I e {e, jj.}, i.e. they do not include taus. 
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For the studies presented here, the E™ ss is calculated from the calorimeter cells, with calibration 
weights derived separately for cells associated to different objects (jets, electrons, photons, taus, and 
non-associated clusters due to the soft part of the event) [20]. Sources of fake missing energy, such as 
dead or noisy parts of the calorimeter, fake muons, beam-gas and beam-halo events, cosmic rays and 
electronics problems are not considered here. A detailed discussion of the strategies to remove sources 
of fake E™ 1SS in early data and to measure the Ej ls * resolution and scale can be found in [20]. 

The contribution from non-gaussian tails in the £"™ ss measurement can be strongly suppressed by 
requiring a minimum angular separation between the £"™ ss vector and the jets in the event. This cut also 
suppresses the contributions from jets containing hard neutrinos from the leptonic decays of charmed 
and beauty mesons. This issue is discussed in detail in Ref. [21]. 

3.3 Electrons 

In SUSY searches, the requirement of a high p T electron is normally associated with additional require- 
ments on jets and £"™ 1SS . For typical SUSY analyses the background from the production of QCD jets 
can be reduced by cuts other than those requiring any lepton(s). Therefore stringent rejection against jets 
is not needed in SUSY studies, and relatively mild electron identification cuts can be applied, leading to 
a significant gain in efficiency especially for searches involving many leptons. Jets reconstructed within 
a cone 2 of AR = 0.2 of an identified electron are discarded from the jet list. This procedure prevents the 
same object being reconstructed both as a jet and as an electron. 

A standard algorithm called "eGamma" [22] was used for the electron identification and reconstruc- 
tion, using the "medium" purity cuts. 

The transverse isolation energy in a cone of AR < 0.2 around the electron, computed using the calori- 
metric information, is used to select isolated electrons. This quantity is required to be smaller than 
10 GeV. In the available data-sets this variable was incorrectly calculated, but a significant bias is intro- 
duced by this problem only in the crack region 1 .37 < | T] | < 1 .52. In this region the electron identification 
and measurement are also degraded because of the large amount of material in front of the calorimeter 
and the crack between the barrel and extended barrel of the calorimeters [23]. Events with an electron 
reconstructed in this region are therefore rejected. 

Finally, an electron is rejected if it is found within a distance 0.2 < AR < 0.4 of a jet, since such 
candidates are likely to be associated with the decay of a particle within that jet. 

3.4 Muons 

Muons were reconstructed using an algorithm (STACO), which performs a statistical combination of a 
track reconstructed in the muon spectrometer with its corresponding track in the inner detector [24]. A 
reasonable quality of combination was guaranteed with a loose requirement that the tracks should match 
with x 2 < 100. If more than one Inner Detector track matched a track from the Muon Spectrometer, only 
the one with best match (smallest distance AR) was kept. The total calorimeter energy deposited in a 
cone of AR < 0.2 around the muon was required to be less than 10 GeV. Finally, muons found within a 
distance AR < 0.4 of a jet were discarded. 

3.5 Taus 

As will be described in subsequent analyses, final states containing taus are one of the possible signatures 
to be expected within the SUSY rich phenomenology. Taus are challenging objects to identify. When taus 
undergo leptonic decays, their products can be detected in electron- or muon-specific analyses. However, 
in approximately 65% of cases, a tau decays hadronically and the resultant jet needs to be disentangled 

2 The cone is denned by AR 2 = Ar/ 2 + A0 2 , where rj and (j) are the pseudorapidity and azimuthal angle respectively. 
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from the overwhelming jet backgrounds expected in ATLAS. In addition, tau signatures always contain 
an important source of missing energy in the form of neutrinos which usually make it impossible to fully 
reconstruct the tau-jet energy. 

Elsewhere in this volume [25] two tau reconstruction algorithms, one calorimeter-based and the 
other one track-based, are described. Unless otherwise stated, taus being used in this Chapter follow the 
recommendations from [25], with a likelihood discriminant of 4, within the fiducial region | tj | < 2.5 and 
Pt > 15 GeV. In addition, although not crucial for the results, an overlap removal procedure has been 
applied. Since the electron identification efficiency is higher than the tau identification efficiency, a tau 
object found in a vicinity (AR < 0.4) of an electron object (following the prescriptions defined in the 
electron performance section of the present note) is considered to be a fake and is removed. Also, since 
hadronically decaying taus are also usually reconstructed as jets, when a calorimeter jet is found within 
AR < 0.4 of a reconstructed tau-jet, then the non-tau jet is disregarded. 

3.6 Treatment of systematic uncertainties 
3.6.1 Detector uncertainties 

Supersymmetry searches and measurements will inevitably be subject to uncertainties due to the im- 
perfect understanding of the behaviour of the detector. Many of the performance characteristics of the 
detector will be constrained from the data themselves, with an accuracy which will depend on the size 
of the data-set available. In this chapter, we have used estimations of these uncertainties based on the 
precision which is likely to be achievable with 1 fb _1 of integrated luminosity. The detector systematics 
- energy scale, resolution, or efficiency variations - are applied both to the "true" backgrounds and to the 
control samples; data-driven methods will therefore be less sensitive to detector systematics than purely 
Monte Carlo-driven estimates. 

Jets and £"™ ss We assume an overall uncertainty on the jet energy scale of 5%. We apply this as a 
global uncertainty, independently of jet pj and tj, and identically for light-quark jets and jets from b 
quarks. 

The jet energy resolution will be studied on di-jet samples and jets from W boson decay in tt events. 
After such measurements we expect a residual uncertainty of 10% on the resolution. 

The missing transverse energy of the event, £"^ 1SS , is calculated from the transverse vector sum of 
high pj objects like leptons and jets with a further component from unclustered energy. Part of the 
uncertainty in £"™ ss is thus correlated with jet energy scale uncertainties, but also a wrong calibration of 
unclustered energy can affect £"™ ss . When jets are rescaled or smeared, the £"™ 1SS is recalculated with 
the new jets. This takes into account the part of the £"™ ss uncertainty that is correlated with the jets. For 
the low pj part related to the unclustered energy, we first subtract leptons and jets with pt > 20 GeV 
from the £™ ss , apply a 10% uncertainty on the remainder, and then add the leptons and jets back in. 

Electrons For electrons we estimate an uncertainty on the identification efficiency (including the trig- 
ger) of 0.5%. Furthermore, we assume an uncertainty on the electron energy scale of 0.2%, and on the 
energy resolution of 1%. All these uncertainties are assumed to be independent of p T and tj. 

Muons For muons we estimate an uncertainty on the identification efficiency of 1% for muons with 
Pt < 100 GeV, plus a 3% extrapolation uncertainty to a pt of 1 TeV. Furthermore, we assume an uncer- 
tainty on the muon pt scale of 0.2%, and on the pt resolution of 4% below 100 GeV, whereas the pt 
resolution at 1 TeV is assumed to be 10 ± 1%. All these uncertainties are assumed to be independent of 
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Heavy-flavour tagging Where b tagging is used, we assume a relative uncertainty of 5% on the b- 
tagging efficiency, and 10% on the light-quark rejection. 

3.6.2 Monte Carlo uncertainties 

Although data-driven background estimation methods try to use Monte Carlo simulations as little as 
possible by construction, some dependence remains for example when trying to understand the compo- 
sition of control samples when Monte Carlo-based corrections, distribution shapes or redecay methods 
are used. 

We estimate uncertainties from the use of Monte Carlo samples by comparing different event gen- 
erators, particularly Alpgen and MC@NLO, and by variation of generator parameters. For the vector 
boson (V £W,Z) + jets Alpgen samples used in this note, the standard value for the renormalization and 
factorization scales was Q 2 = Mj e + p T (V) where the two leptons are the vector boson decay products. 
The pr threshold of the partons in the matrix-element calculation was 40 GeV, the minimum distance 
ARjj between partons was 0.7, and the partem distribution function used was CTEQ6L. 

For systematic studies we vary: 

• The renormalization and factorization scales between 0.52 an d Q as defined above. We have also 
used a sample with a different scale definition Q 2 = Y*p\ (partons). 

• The p T threshold of partons in the matrix-element calculation, between 15 and 40 GeV. 

• The distance in AR between partons in the matrix-element calculation, between 0.35 and 0.7. 

• The parton distribution function, comparing CTEQ6L to MRST2001J. 



4 Global Variables 

The SUSY analyses use some global event variables, built out the momenta of jets, leptons and p™ ss , 
which have a good signal discriminating power. We give here a detailed definition of these variables as 
used in the work described in the SUSY notes. 



4.1 Effective mass 

The effective mass (M eff ) is a measure of the total activity in the event. It is defined as: 

4 

Meff^£y r eM + £y r etV +^T iSS 

i=l (=1 

where the sums run respectively over the four highest p T jets within |tj| < 2.5, and over all of the 
identified leptons. This variable is useful in discriminating SUSY from Standard Model events. It has 
also the interesting properties that for SUSY events it the M e g distribution peaks at a value which is 
strongly correlated with the mass of the pair of SUSY particles produced in the proton-proton interaction. 
It can therefore be used to quantify the mass-scale of SUSY events [26]. 



4.2 Transverse sphericity 

The transverse sphericity (St) is defined as: 



2A 2 
: (Ai + A 2 ) 
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where X\ and A2 are the eigenvalues of the 2 x 2 sphericity tensor 5,y = £^ PkiP k * - The tensor is computed 
using all jets with \r\ | < 2.5 and pj > 20 GeV, and all selected leptons. 

SUSY events tend to be relatively spherical (St ~ 1) since the initial heavy particles are usually 
produced approximately at rest in the detector and their cascade decays emit particles in many different 
directions. QCD events are dominated by back-to-back configurations (St ~ 0). 

4.3 Transverse mass 

The transverse mass M T is defined by: 

M T (p<*,pf ss ,m a ,m x ) =m 2 a +m\ + 2 (E? E™ ss - p% ■ pf™) (2) 

where 

m a and p" are the mass and transverse momentum of some visible particle and p™ ss is the missing- 
transverse-energy two-vector. The parameter m x is the mass of the invisible particle, which is usually 
assumed to be zero. 

This variable is useful when one parent particle decays to one visible and one invisible daughter 
particle, for example W — ► ev where it is clear that the mass of the invisible particle (neutrino) can 
indeed be safely neglected. 

4.4 Stransverse mass 

The stranverse mass mn variable can be defined in terms of the transverse mass (Eq. (2)) by: 

m T2 (PT^T^T^ m a,mp,m x )= min \max.iM%(p%,iP;m a ,m x ), M£(pf ,qf ^m^m^)}] 
4¥'+W=Pf ssL 

(4) 

where m x is the trial mass for the lightest SUSY particle and p" are the transverse momenta of two 
visible particles (each of which is a canididate decay product of one of the two SUSY parent particles). 
The vector sum of the dummy variables q^ and q^ is constrained to equal the total p™ ss 2- vector, so 
the missing transverse momentum is required as an input to the mj2 calculation. One can consider mj2 
to be a variable formed from dividing p™ ss into two parts in all possible combinations that satisfy the 
kinematics of the event (for some m x , which is here taken to be zero) and calculating the transverse mass 
for each decay branch. The resulting value is the best lower limit on the mass of a pair-produced SUSY 
particle that could have decayed to the observed final state with the given p", p^ and p™ lss . 

The original purpose for mx2 was to provide information on the masses of pair-produced SUSY 
particles, decaying semi-invisibly [27,28]. The variable was first proposed to determine SUSY particle 
masses in "simple" two-body decays, such as two-jet or two-lepton final states, but it can also be used 
in more complicated cases, especially if it is possible to unambiguously determine which particles came 
from which branch of the decay. 

5 SUSY Studies in ATLAS 

In this collection of papers we review the techniques that will be used to search for SUSY with ATLAS 
at the LHC turn-on. Most of the studies presented are based on a total integrated luminosity of lfb" 1 that 
may be collected during the first year of operation of the LHC. Larger datasets are assumed for studies 
that investigate SUSY models with a lower cross-section as an illustration of studies that can be carried 
out once the LHC is in full operation. 



1522 



Supers ymmetry - Supersymmetry Searches 



The first two articles in this collection concentrate on the background estimation for SUSY searches: 

• "Data-driven determinations of W, Z, and top backgrounds to Supersymmetry" [11], 

• "Estimation of QCD backgrounds to Searches for Supersymmetry" [21]. 
The following two articles discuss SUSY searches and measurements: 

• "Prospects for Supersymmetry Discovery Based on Inclusive Searches" [29], 

• "Measurements from Supersymmetric events" [30]. 

The last two papers focus on searches for specific signatures: 

• "Multi-lepton Supersymmetry searches" [16], 

• "Supersymmetry signatures with high-p^ photons or long-lived heavy particles" [5]. 
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Data-Driven Determinations of W, Z and Top Backgrounds to 
Supersymmetry 

Abstract 

The Standard Model processes of W boson, Z boson and top quark production 
each in association with jets constitute major backgrounds to searches for Su- 
persymmetry at the LHC. In this note, we estimate the contribution of these 
backgrounds for a basic SUSY selection, and discuss methods to derive them 
from the initial 1 fb 1 of integrated luminosity at ATLAS. 

1 Introduction 

1.1 Motivation 

The Large Hadron Collider (LHC) will provide excellent opportunities to search for new physics beyond 
the Standard Model, and the ATLAS detector [1] is a general purpose experiment to explore such new 
physics. Supersymmetry (SUSY) is a theoretically attractive model for new physics beyond the Standard 
Model, and searching for Supersymmetry is one of the main objectives of ATLAS. The actual search 
strategy is described elsewhere in this volume [2]. 

It is clear, however, that any discovery of new physics can only be claimed when the Standard Model 
backgrounds are understood and are under control. It is expected that at the LHC, Monte Carlo pre- 
dictions will not be sufficient to achieve this: the backgrounds will have to be derived from the data 
themselves, possibly helped by Monte Carlo. The development and description of such data-driven 
background estimation is the topic of this note. We note that for a complete understanding of the back- 
grounds, multiple, independent methods are desired. Each of these may be sensitive to a specific back- 
ground source, and affected by specific systematic effects. Only their consistency in combination allows 
for sufficient confidence in the control of the background to claim a discovery when a signal appears to 
be present. 

1.2 Data-driven methods: scope of this note 

The general aim of data-driven methods is to estimate from the data the Standard Model backgrounds 
and their uncertainties in a "signal" region, in which new physics may be present. Such a signal region is 
typically obtained after applying selection cuts, or multivariate methods, and the new physics is searched 
for as an excess in the number of selected events over background, or as an excess in certain regions of 
certain distributions. 

The background estimation is performed by selection of "control samples", from which predictions 
in the signal region are derived. Good control samples should be as close as possible to the signal 
region, yet free of SUSY signal, give an unbiased estimate of background in the signal region, have 
sufficient statistics, and small theoretical uncertainties. This note intends to describe a number of ideas 
on selection of such control samples for SUSY searches. Good control of the composition of control 
samples is important for a correct extrapolation into the signal region. 

The methods described in this note should not be regarded as the final word on these procedures, but 
rather present a number of ideas. Each of these ideas will have to be pursued further, and the effect of 
other systematic uncertainties will need to be studied. Furthermore, SUSY selection cuts will evolve, and 
so the methods will need to evolve too. We do believe, however, that a first indication of the uncertainties 
that can be expected can be given. 
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This note deals with top, W and Z backgrounds to SUSY searches with primary squark or gluino 
production, and assuming /^-parity conservation. The initial priority is to simulate results for 1 fb _1 of 
integrated luminosity, and for the understanding of the detector expected. The other important QCD 
background of quark (other than top) and gluon jet production is treated elsewhere in this volume [3]. 
Backgrounds to alternative production models are also described elsewhere: direct gaugino produc- 
tion [4], and photonic and long-lived particle (such as R hadron) signatures [5]. 

1.3 SUSY contamination 

If SUSY is discoverable, it is likely that SUSY events will creep into the control samples, thereby affect- 
ing the background estimates. In general, SUSY events, mistakenly regarded as Standard Model physics, 
will lead to an overestimation of the background, and thus to a reduced SUSY event excess. The extent 
to which this happens will be analysis- and SUSY model-dependent. 

Since we do not know whether SUSY exists, we quote the SUSY contamination effects separately 
from the other systematics. We will do so by running each data-driven estimation method not only over 
background samples, but also on a number of SUSY signal samples [6]. The samples represent various 
regions of mSUGRA parameter space, and together give an impression of the effects. The SU1 sample is 
a point in the stau coannihilation region, the SU2 sample in the focus point region, and the SU3 sample 
in the bulk region. The SU4 point is a low-mass point, just above the Tevatron limits. It has a very large 
cross-section, and kinematic distributions that are typically only slightly harder than the Standard Model 
background. As will be shown, this model has the largest SUSY contamination effect on the background 
estimates. 

There are a number of ways that the data-driven methods can take the presence of SUSY into account: 

1. Iteration. The Standard Model background is evaluated under the assumption that there is no 
SUSY. This will overestimate the background if there is SUSY, and reduce any excess. Neverthe- 
less, if an excess is seen, the underlying assumption in the background estimation has been proven 
wrong, and a correction can be applied. This correction can be derived from the properties of the 
observed excess, and will lead to a new background estimate. An example of such a procedure is 
the "new MT method" described in section 3.3.3. However, other implementations are possible, 
and perhaps necessary, as well. 

2. A combined fit determining the composition of the control sample, allowing for a possible SUSY 
contribution. 

Both methods are investigated in this note. Nevertheless it is clear that these are preliminary ideas 
that require further investigation. Most likely, some form of iteration on the background determinations 
will be necessary. 

1.4 Layout of this note 

A number of important prerequisites for the studies presented here are described in an introductory 
note [6]: 

• the physics processes that form the background to SUSY searches and how they are simulated, as 
well as a few SUSY event samples (SU1-SU8) that serve to estimate the effect of SUSY on our 
background estimates; 

• the definition of objects like electrons, muon, taus, jets and missing transverse energy, and common 
variables like the effective mass M e ff ; 
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• the origin and common treatment of various systematic uncertainties, both from the simulation and 
from the performance of the detector. 

Furthermore, the trigger menu that was used is described elsewhere [7]. In this note we then discuss the 
W, Z, and top-quark backgrounds and their data-driven estimation for two different SUSY search modes: 

1. the mode with one isolated electron or muon (section 2); 

2. the no-lepton mode, with a veto against isolated leptons (section 3). 

2 One-lepton search mode 

2.1 Selection 

The one-lepton search mode is expected to play a major role in the SUSY search, since the requirement 
of an isolated lepton will be effective in suppressing QCD background. In this search mode, we require 
one isolated electron or muon, with a pj of more than 20 GeV. We veto events with a second identified 
lepton with a pj of more than 10 GeV, so that we have no overlap with the di-lepton search mode. 

We demand at least four jets with |tj| < 2.5 and p T > 50 GeV, at least one of which must have 
p T > 100 GeV. The transverse sphericity S T should be larger than 0.2, and the missing transverse energy 
Z?™ ss should be larger than 100 GeV and larger than 0.2M e ff, where M e g- is the effective mass 1 . The 
transverse mass Mj reconstructed from the lepton and Ej lss should be larger than 100 GeV. 

2.2 Backgrounds in Monte Carlo 

In many SUSY models after the selection cuts have been applied clear excesses will be observed in the 
high £"™ ss and high effective mass regions, as shown in Figure 1. The dominant background process 
for the one-lepton mode is tt (90%), with W ± + jets (10%) being the subdominant process. The neutrino 
emitted from the W ± decays produces the £"™ ss in the both processes. Smaller contributions come from 
Z+jets, diboson and single top events and from QCD processes. It is interesting to note that the major tt 
background does not come from the semileptonic (tt — > bblvqq 1 ) top pair events which are reduced by 
the M T and Ef 1SS cuts, but rather from the double leptonic (tt — >• bbiviv) top decay where one lepton is 
not identified. 

2.3 Data-driven estimation strategies 

We discuss a variety of different methods to estimate the background from data. These methods differ 
in their approach and therefore are influenced by different systematic uncertainties, and they focus on 
different aspects of the background: 

1. estimation of W and tt background from a control sample formed by reversing one of the selection 
cuts (on Mj) (section 2.3.1); 

2. estimation of the semileptonic it background by explicit kinematic reconstruction and selection on 
top mass ("top box") (section 2.3.2); 

3. estimation of the double leptonic tt background, where one lepton is missed, by explicit kinematic 
reconstruction of a control sample of the same process with both leptons identified (section 2.3.3); 

'The variables St, M eS and M T are defined elsewhere in this volume [6]. 
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Missing E ? [GeV] Effective Mass [GeV] 



Figure 1 : The E™ 1SS and effective mass distributions for the background processes and for an example 
SUSY benchmark point (SU3) in the one-lepton mode for an integrated luminosity of 1 fb _1 . The black 
circles show the SUSY signal. The hatched histogram show the sum of all Standard Model backgrounds; 
also shown in different colours are the various components of the background. 

4. estimation of that same double leptonic tt background from a control sample derived by a cut on a 
new variable HT2 (section 2.3.4); 

5. estimation of tt background by Monte Carlo redecay methods (section 2.3.5); 

6. estimation of W and tt background using a combined fit to control samples (section 2.3.6). 

2.3.1 Creating a control sample by reversing the Mj cut 

The transverse mass Mj is constructed from the identified lepton and the missing transverse energy. In 
the narrow-width limit Mj is constrained to be less than mw for the semileptonic tt and the processes. 
Figure 2 shows that Mj is only weakly dependent on £™ 1SS . This variable is therefore suitable for the 
estimation of the background distribution itself. Events with small Mj (< 100 GeV) are selected as the 
control sample, in which the it (~ 84%) and W ± (~ 16%) processes are enhanced over the SUSY and 
the other background processes. The large Mj (> 100 GeV) region is referred to as the signal region. 
Since, for the control sample, the other selection criteria are identical to those for events in the signal 
region, the same kinematic distributions including E™ ms can be obtained. The number of events for the 
various processes in signal region and control sample is summarized in the Table 1 . 

Table 1: Number of background events and estimated numbers for tt, W ± and QCD processes without 
SUSY signal, normalized to 1 fb -1 . 





Signal Region 


Control Sample 


tt(evqq) 

tf(Mv) 
W ± (£v) 


51 (25%) 
140 (70%) 
10 (5%) 


1505 (77%) 
132 (7%) 
305 (16%) 


SUSY(SU3) 


450 


317 



The normalization factor is obtained from the event numbers of the signal region and the control 
sample (100 < Ej lss < 200 GeV), in which the SUSY signal contribution is expected to be relatively 
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Figure 2: The E™ iss distribution for tt (left) and SUSY (SU3, right) signal. In both figures, the solid and 
dashed histograms show the £'™ 1SS distribution for Mr > 100 GeV and < 100 GeV, respectively. The 
numbers are normalized to 1 fb _1 . 

small. Figure 3 shows the £"™ ss and M e ff distributions which are obtained using this method to estimate 
the size of these backgrounds, and, for comparison, the true background distributions. The numbers of 
events with E^ iss > 100 GeV and > 300 GeV are listed in Table 2. The prediction and the true values 
agree within the uncertainties, although somewhat less well for high Ej lss . 

The tt event composition of the control sample differs from that of the signal sample, since the Mj 
cut removes a much larger proportion of the semileptonic tt events. The control sample is still able to 
predict the background in the signal sample within statistical uncertainties. Nevertheless, the resulting 
systematic shift needs to be investigated, and would be desirable to obtain independent estimates of the 
fully-leptonic and semileptonic tt backgrounds separately. 




Figure 3: The E™ 1SS and effective mass distributions of the background processes for the one-lepton 
mode with an integrated luminosity of 1 fb -1 . The open circles show the estimated distributions with the 
Mj method. The hatched histogram shows the true sum of all Standard Model backgrounds; different 
symbols show the various contributions to the background. 



SUSY signal contamination If supersymmetric particles are produced they are also likely to contribute 
to the control samples. The estimated £"™ ss distribution with the presence of a SUSY signal (SU3 point) 
is shown in Figure 4 (left), and the numbers are listed in Table 3. The background is overestimated due 
to the SUSY contamination, and the inferred Ej lss distribution is biased towards larger values. However, 
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Table 2: Numbers of background events and estimated numbers for the sum of all background processes 
without SUSY signal, normalized to 1 fb~ ! 





£™ 1SS > 100 GeV 


£™ ss > 300 GeV 


True BG 
Estimated BG 
Ratio(Est./True) 


203 ±6 
190 ±8 
0.93 ± 0.05 


12.4 ± 1.6 
9.4 ± 0.7 
0.76 ±0.11 



the amount of the over-estimation is smaller than the SUSY signal itself, and a clear excess can still be 
observed, as shown in the figure. The same exercise was repeated for other SUSY signal points, as also 
shown in Table 3. 




Missing E T [GeV] Transverse Mass [GeV] 



Figure 4: Left: the E™ 1SS distribution of the background processes for the one-lepton mode with an 
integrated luminosity of 1 fb _1 . The red dots show the estimated distributions with the Mj method, with 
SUSY signal (SU3) present. The hatched histogram shows the sum of all Standard Model backgrounds, 
and the OPEN histogram shows the SUSY signal (SU3). Right: the transverse mass distributions of 
the various SUSY signals (SU1, SU2 and SU3) with an integrated luminosity of 1 ftr 1 . Background 
processes are superimposed for comparison. The hatched histogram shows the sum of all Standard 
Model backgrounds. 



Correcting for SUSY signal: "New MT method" If, even for overestimated backgrounds, the pres- 
ence of a concrete SUSY excess is observed in data, we can try to correct the background estimates. 

One possible procedure is described here, referred to as the "new MT method". More advanced 
implementations of such a correction procedure are possible and should be studied. 

The new MT method makes use of the observation that in the one-lepton search mode, the Mj 
distribution of backgrounds falls off steeply beyond ~ 100 GeV, whereas for many SUSY signal models 
this distribution falls only slowly. This is illustrated in Figure 4 (right). By making a general ansatz for 
the shape of the SUSY Mj distribution, and neglecting to first order the Standard Model background 
at high Mj, the SUSY contamination can be subtracted from the control sample. Obviously, remaining 
Standard Model background in the high Mj region and variations in the Mj shape for various SUSY 
signals are to be treated as systematic uncertainties on the method. Nevertheless, the data itself will tell 
what the Mj shape is. 
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Table 3: Number of background events and estimated numbers for all background processes with SUSY 
signal, normalized to 1 fb _1 . Also the total number of events (SUSY + background) is shown. 



Estimated BG 
True BG+SUSY 



Estimated BG 
True BG+SUSY 



Estimated BG 
True BG+SUSY 



£™ ss > 100 GeV | £™' ss > 300 GeV 



203 ±6 



12.4 ± 1.6 



225 ±9 21.6 ±1.1 

463 ±7 194 ± 4 



200 ± 9 10.9 ± 0.7 

249 ± 7 34 ± 2 



296 ± 10 I 33.3 ± 1.4 
653 ± 8 245 ± 4 



£™ ss > 100 GeV | £™' ss > 300 GeV 



203 ±6 



12.4 ± 1.6 



2366 ± 102 165 ± 12.7 

3177 ±79 415 ±29 



213 ±9 16.3 ±0.9 

365 ±9 129 ± 5 



206 ±9 I 13.7 ±0.8 

354 ±8 115 ±5 



In the simplest ansatz used here, the ratio of SUSY signal between the control sample Mr < 100 
GeV and signal region Mj > 100 GeV is assumed to be constant for all SUSY signal samples. The 
normalization factor is obtained from the number of events in the signal region and the corrected control 
sample in the interval 100 GeV < E^ iss < 150 GeV (instead of 100 - 200 GeV) to suppress the SUSY 
contribution in the normalization region. The statistical error becomes relatively larger when the narrow 
band is used for normalization, but the over-estimation of the normalization factor due to the SUSY 
signal can be suppressed. A lower Ej lss region, such as £"™ ss = 70 — 100 GeV, could be used for the 
normalization in future studies. 

Figure 5 shows the E™ 1SS and the effective mass distributions of the estimated background processes. 
The true distributions of the background processes are also superimposed. The numbers in regions of 
^rniss > GeV and 300 GeV are listed in Table 4. A reasonable agreement between the prediction 
and the true values is observed. For high values of £"™ ss , the method tends to subtract too much SUSY 
contamination and underestimates the background. More study is needed. The SU4 benchmark point is 
a special case because it has a particularly light SUSY particle spectrum. 




Figure 5: The £'™ 1SS and effective mass distributions of the background processes for one lepton mode 
with an integrated luminosity of 1 fb~ 1 . The red dots show the estimated distributions with the "new 
Mj" method. The hatched histogram show the sum of all Standard Model backgrounds. The open 
circles indicate the SUSY (SU3) signal. 
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Table 4: Numbers of background events and estimated numbers for all background processes in the 
presence of various SUSY signals, using the new MT method. The numbers are normalized to 1 fb _1 . 





£™ ss > 100 GeV 


£™ lss > 300 GeV 


£™ ss > 100 GeV 


£™ ss > 300 GeV 


True BG 


203 ±6 


12.4 ± 1.6 


203 ±6 


12.4 ± 1.6 


Estimated BG 
True BG+SUSY 


SU1 


SU4 


186 ± 11 
463 ±7 


8.9 ±0.8 
194 ±4 


1382 ±98 
3177 ±79 


48.3 ± 12.7 
415 ±29 


Estimated BG 
True BG+SUSY 


SU2 


SU6 


183 ±11 
249 ±7 


8.8 ±0.8 
34 ±2 


185 ± 11 
365 ±9 


8.1 ±0.9 
129 ±5 


Estimated BG 
True BG+SUSY 


SU3 


SU8 


212 ± 11 
653 ±8 


12.3 ± 1.0 
245 ±4 


180 ± 11 
354 ±8 


6.6 ±0.8 
115 ±5 



The systematic uncertainties 2 for the MT method are summarized in Table 5. As well as variation 
of jet energy scale and lepton identification efficiency, the ALPGEN Monte Carlo was compared to 
MC@NLO, and parameters in ALPGEN (minimum pj of partons and minimum AR between partons) 
were varied. This method is stable against these systematic uncertainties at the ~ 15% level. More work 
is needed to estimate the SUSY contamination effects. 

Table 5: Systematic uncertainties of the one-lepton background estimations with the MT method, ex- 
cluding those related to SUSY signal contamination. Numbers are normalized to 1 fb -1 





Syst. error 


Jet energy scale 
Lepton ID efficiency 


<5% 
7% 


MC@NLO vs ALPGEN 

Monte Carlo parameter variation (ALPGEN) 


8% 
<5% 



2.3.2 Topbox: a control sample for semileptonic top-pair background 

Top mass reconstruction and "topbox" cuts This section describes a data-driven method, denoted 
the "topbox method", for estimating the tt background where one top decays leptonically, and the other 
hadronically. 

For semileptonic tt events, the invariant mass of the leptonically decaying W boson can usually 
be reconstructed by assuming that the neutrino from the W decay is responsible for all missing energy. 
This is a fair assumption; after removal of fake £"™ 1SS (noisy/dead calorimeter cells etc.) in the event- 
cleaning procedure, the resolution on £"™ 1SS is expected to be approximately equal to 0.55^/L^r UL 
which is much smaller in a typical tt event than the £"™ ss from the escaping neutrino. The fact that the 
mass of the leptonically decaying top can be reconstructed satisfactorily (see below) further justifies the 
assumption. 

The core of the method is to construct both the semileptonic and the hadronic top decays in a tt event 
following the procedure below: 

2 Throughout this note systematic uncertainties have been calculated according to the procedures outlined in the introduction 
to this chapter [6]. 



1532 



SUPERSYMMETRY - DATA-DRIVEN DETERMINATIONS OF W, Z AND TOP BACKGROUNDS . 



• The leptonic W is assumed to decay into the observed lepton and a neutrino which is responsible 
for all missing energy. The p x and p y components of the neutrino momentum are hence taken to 
be the x and y components of E™ 1SS . The p z component of the neutrino can be calculated using 
a W mass (m w ) constraint. The four- vector of the leptonic W is the sum of the four- vectors of 
the lepton and the reconstructed neutrino. For events with transverse mass Mj less than mw , two 
solutions can be found. In the case of Mj > mw no real solution is possible and, in such cases, 
the momentum of the leptonic W is taken from the transverse components of the lepton and £™ ss . 

• The leptonic top is then reconstructed by taking the solution with the best reconstructed top mass 
( m top-lep ) f rom combinations of a jet and one of the above leptonic W solutions. The jet is taken 
from the pool of the four highest-/^ jets in the event. The best reconstructed top mass is defined 
to the one that is closest to the nominal top mass m t . 

• The hadronic W is then taken to be formed from the best reconstructed W mass (m-^-had ) among 
the two-jet combinations from the remaining three jets in the pool. The best reconstructed W mass 
is defined to be the invariant closest to m w . 

• Finally, the hadronic top is taken to be the one with the best reconstructed top mass (m top .h a d ) 
among combinations of the hadronic W and one of the remaining jets. 

The plots in Figure 6 show the distributions of the reconstructed masses m top _ lep , m w _ had , and 
m top-had a f ter the mass reconstruction procedure described above. The distributions are made for tt , 
SU3 and W + jets event samples with standard one-lepton cuts, except for a modified Mj requirement 
(see below in the control sample section). As expected, the topbox mass reconstruction procedure offers 
a very good separating power between tt and other processes. 

The topbox cuts are then defined as follows: |m top . lep —m t \ < 25 GeV, |m w _h a d ~m w \ < 15 GeV, 
and |m top . had - m t \ < 25 GeV. 

Topbox control sample To make the topbox control sample, events are selected with the standard 
SUSY search cuts in the one-lepton mode, with the exception that M T > 100 GeV is replaced by M T < 
mw- In addition, the above topbox cuts are applied. 

Table 6 shows the number of events of various processes in the topbox control sample. The tt + jets 
process makes up more than 95% of the topbox control sample if no SUSY signal is present. 

Table 6: Composition of the topbox control sample. Numbers shown correspond to an integrated lumi- 
nosity of 1 fb _1 . The last five columns show the numer of SUSY events which would enter into the 
topbox control sample. 



Process 


tt +jets 


W + Jets 


SU1 


SU2 


SU3 


SU4 


SU6 


Events 


340.9 


6.8 


1.8 


0.4 


4.9 


243.6 


0.4 



SUSY signal contamination Table 6 also shows the number of SUSY events, for various signal sam- 
ples, in the topbox control sample, for 1 ftr 1 . In this method, SUSY contamination is in general small. 
This fact makes the topbox method a good supplement to the other methods (e.g. the MT method). The 
exception is the SU4 benchmark point, which has a larger contribution because its light spectrum makes 
it rather similar to the tt background. 
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Estimation of the tt background in the signal region The tt contamination in the signal region is 
estimated by multiplying the number of events in the data topbox by a scaling factor R tt . R tt is defined 
as the ratio of the number of Monte Carlo tt events in the signal region (those that pass the one-lepton 
cuts) to that in the topbox control sample. The procedure is summarized by the following equations: 

^ignal-region (data) = ^opbox^^ . R { (1) 



R n = A£ gnal " region (MC) /A£ pbox (MC) (2) 

With fully simulated Monte Carlo samples, R tt is determined to be 0.386 . The model dependence 
(variation of Monte Carlo generator and generator parameters) of this number is treated as a systematic 
uncertainty. 

Systematics The systematic uncertainties of the topbox method are summarized in Table 7. The largest 
source of uncertainty is from the jet energy scale uncertainty; this is expected since the method relies 
heavily on the reconstruction of top and W masses. The Monte Carlo model dependency of R„ is 
estimated by comparing MC@NLO and ALPGEN, and by variation of the ALPGEN parameters, and 
amounts to 8%. Finally, it is expected that extra jets due to event pile-up may affect the mass recon- 
struction resolution. However, this is relevant only in high luminosity scenarios, beyond the scope of this 
note. The statistical uncertainty on the topbox control sample normalization is estimated to be 5% for 
1 fb -1 given that the effective cross-section of tt in the topbox is about 400 fb. 
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Table 7: Systematic uncertainties of the topbox method for 1 fb l . 



Source 


Contribution [%] 


Jet energy scale 
Ej lss scale 

Monte Carlo Model dependence of R t , 


20 
2 
8 


Total 


22 



2.3.3 Di-leptonic top with one lepton missed: kinematic reconstruction 

Introduction Fully leptonic tt events may contribute to the one-lepton SUSY search sample if one 
of the two leptons originating from the W decay is not identified. Such events can be classified as: (1) 
events with one tau (51%); (2) events where one lepton is misidentified due to inefficiency of the lepton 
identification algorithms (20%); (3) events where one lepton is lost inside a jet (17%); (4) events where 
one lepton is not in the pj or Y\ acceptance (9%); and (5) events with two tau leptons (3%). 

The method discussed here is based on the selection of a sample enhanced in tt — > bbivlv events 
by requiring that the events satisfy a set of kinematic constraints particular to the tt — >• bbivlv process. 
This sample, denoted as the control sample, with two isolated identified leptons, is used to estimate the 
contribution from the first two categories of events listed above. The contribution from category (1) is 
estimated by replacing one of the leptons in the control sample with a tau, and category (2) is estimated 
by removing one of the two leptons. The contribution from the categories (3)-(5) is not estimated from 
the control sample. Events were required to fire either the 4j50 multi-jet trigger or the j80_xE50 jet plus 
E™ ss trigger [7]. 

Selection of the control sample The following requirements are imposed to select events in the control 
sample: two isolated oppositely-charged leptons (electron or muon), with pj > 10 GeV and at least one 
with pi > 20 GeV; at least three jets with |tj| < 2.5 and pj > 50 GeV at least one of which must have 
pi > 100 GeV. Note that in contrast to the SUSY one-lepton search selection given in Sec. 2.1 only 
three jets are required, since the misidentified lepton or tau can produce the fourth jet. 

For tt — > bbivlv events the two leptons, two b jets and the x- and j-components of the £"™ 1SS -vector 
satisfy the following kinematic constraints: 



{Pv+Pi+f =mlr, 

(Pv + Pe-f = mw, 
(p v +Pt^+Pb) 2 = mj, 
(Pv+Pe- +p- h f = m^ 
Pv x +Pv x =E™* s , 

Pv y +Pv y =E™* s , (3) 

where p£±, p v / v , p b j- b are the lepton, neutrino and &-quark momenta respectively and m w and m t are the 
W boson and top quark masses. We assume that the only source of £"™ 1SS is a pair of neutrinos, which is 
a fair assumption as shown in the previous section. 

The final state contains two unknown neutrino momenta and the above system of equations has a two- 
or four-fold ambiguity, as the solution is given by a quartic equation which can be solved with standard 
analytical techniques [8]. Since there are at least three jets in each event, all possible combinations of 
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jet pairs made from the three highest pj jets are considered. Jet pairs for which the above system of 
equations has real solutions are denoted as ft-jet pairs 3 . Figure 7 (left) shows the number of b-]et pairs 
for the various processes contributing to the control sample. 




Figure 7: Left: distribution of number of Z?-jet pairs for events passing the control sample requirements in 
the kinematic reconstruction method. The fraction of it — ► bbiviv events with no &-jet pairs is dominated 
by events with at least one b jet which is not among the three highest-/^ jets. Right: distribution of E™ 1SS 
for it —> bbiviv events with one tau lepton and events with a misidentified lepton compared to the 
estimation from resimulated events with an integrated luminosity of 1 fb _1 . The requirement on the 
number of ft-jet pairs is not applied to the resimulated events. The distribution of all it — > bbiviv events 
is also shown. 



Replacement procedure Each event in the control sample is used as a seed for producing a series of 
resimulated events. One of the two identified leptons in the seed event is replaced by tau lepton and a set 
of 1000 tau decays are simulated using the TAUOLA package [9]. The same procedure is repeated for the 
second lepton in the seed event, yielding a total of 2000 events for every seed event. Each resimulated 
event is weighted by a factor of 1/e, where £, the identification efficiency for the replaced lepton, is 
estimated from simulations. 

The contribution of events where one lepton evades identification is estimated as follows. If the 
replaced lepton is an electron then a jet with the same momentum is substituted instead of it. If the lepton 
is a muon it is replaced by a so-called stand-alone muon (defined as a track in the muon spectrometer 
with no match to a track in the inner detector) justified by the fact that most muons not passing the muon 
definition are stand-alone muons. This procedure is applied to each of the two leptons in the seed events, 
resulting in two resimulated events for each seed event. The resimulated events are re-weighted with 

For both kinds of resimulated events, the SUSY one-lepton search selection are subsequently applied. 

As a closure test of the replacement procedures described above, the Ej lss distribution for resim- 
ulated it —> bbiviv events passing the control sample selection apart from the requirement of b-jet 
pairs, is compared to the Monte Carlo prediction. The result is shown in Fig. 7 (right) and shows good 
agreement. 

Normalization The number of it —> bbiviv events in the signal region is estimated by scaling of 
the sum of described above contributions with two scaling factors. The first factor takes into account 
the other categories of it —>■ bbiviv events that are not estimated by this method. This first factor is 

3 Note that within this section only kinematical conditions have used to identify these fc-jet pairs - no secondary-vertex 
requirement is used. 
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Table 8: Estimated background corresponding to an integrated luminosity of 1 fb _1 for different 
mSUGRA benchmark points. The second column shows the relative increase of the estimated back- 
ground with respect to the estimation without contamination from the SUSY signal. The third column 
shows the number of SUSY events. The Monte Carlo prediction of it — > bblvlv background in the one 
lepton search mode is 136 events. The errors in the first column are statistical only. 



SUSY point 


Estimated 


Relative change 


True Signal 




Background 


[%] 


Events 


No signal 


120±14 






SU1 


137±15 


15 


260 


SU2 


127 ±15 


5.9 


45 


SU3 


176±18 


47 


454 


SU4 


604 ±38 


405 


2960 


SU6 


129 ±16 


7.8 


162 


SU8 


124 ±14 


3.8 


100 



estimated from Monte Carlo to be R MC = 1.4 ±0.1. The second normalization factor, R b -j^p^ t takes 
into account the efficiency of it — > bblvlv events to pass the requirement on the number of b-]et pairs; 
it is defined as the ratio of resimulated events before and after the b-]et pair selection in a normalization 
region, 80 < £^ iss < 120 GeV, and found to have the value flHetpair = i.4±o.l(stat) ±0.1(syst). 



Presence of SUSY A possible SUSY signal could have an effect on the background estimation in two 
ways: 1) by satisfying the kinematic constraints in Eq. 3 and therefore enter the control sample and 2) by 
entering the normalization region giving a systematic contribution to the scale factor /?H et P air . j n pjg g 
the estimated it — ► bblvlv background is shown with and without the contamination of a SUSY signal 
(SU3) while Tab. 8 gives the estimated number of it —>■ bblvlv events in the presence of different SUSY 
signals. 




Figure 8: The Ef 1SS (left) and M eff (right) distributions for the estimated and true it — > bblvlv contri- 
bution for the one-lepton SUSY search. Black points (red area) represent the estimation without (with) 
the presence of a signal from SUSY (SU3). 



Systematic Uncertainties The systematic uncertainties for this method are summarized in Tab. 9. The 
uncertainty from the replacement procedure is estimated by comparing number of resimulated events 
to the Monte Carlo prediction, see Fig. 7(right). The uncertainty of R MC is estimated by comparing 
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Table 9: Breakdown of systematic uncertainties in the kinematic reconstruction method. 



Source 


Contribution [%] 


Replacement 


10 


R MC 


10 


Jet Energy Scale 


9 


^-jetpair stat 


9 


flHetpair syst 


8 


Background Subtraction 


3 


Jet Energy Resolution 


1 


£™ ss scale 


1 


Total 


21 



MC@NLO and ALPGEN. The statistical uncertainty of flMetpair j s calculated using binomial errors. 
The systematic uncertainty of this factor takes into account the difference in the shapes between £"™ 1SS 
distribution of the resimulated samples with and without applying the kinematical constraints in Eq. 3. 
The uncertainty due to background subtraction is dominated by the presence of tt — > bbqqiv events 
in the control sample. The systematic effects resulting from uncertainties in the lepton identification 
efficiency, the trigger efficiencies and the energy scale and resolution are expected to be much smaller. 

2.3.4 Dileptonic top with one lepton missed: HT2 

Introduction In this section we describe a method, denoted the "HT2 method", to estimate background 
from dileptonic tt production where one of the leptons is not identified. It relies on the (near) indepen- 
dence of Ej lss and the variable HT2. This variable is defined as: 

4 

HT2^£/^ + // r epton . (4) 

(=2 

In the HT2 method, the shape of the £"™ 1SS distribution is estimated from dileptonic tt events with low 
HT2. This distribution is then normalized to the number of events at large HT2, but with low missing Ej 
, and can then be used to estimate the remaining backgrounds in the signal region of large HT2 and large 

For this method to work, the shape of the £"™ ss distribution needs to be independent of HT2. Note 
that in Equation 4, the leading jet pj was excluded from the sum in order to reduce the correlation with 
Ej lss . The correlation between the hightest-p T jet and £™ ss is likely to be due to simple kinematics, i.e. 
to first approximation, the rest of the event recoils against this leading jet. This is illustrated in Figure 9 
which shows the £"™ 1SS distribution (at Monte Carlo "truth level") in slices of leading and sub-leading jet 
pj . The reduced dependence of the £"™ 1SS shape on the jet pj in the second-leading jet case is apparent, 
and will be further diminished by detector resolution effects. 

To further reduce the correlation between HT2 and £"™ ss , the £"™ 1SS significance was used. This is 
to remove the correlation which arises from the fact that the £"™ 1SS resolution depends on ££r> where 
££t is clearly related to HT2. A simple form of £"™ 1SS significance was used here, defined as £"™ ss 
significance = £"™ 1SS /[0.49 • \/Y^Ej]. 

The results shown here are from a data sample consisting of the sum of tt (semi-leptonic and dilep- 
tonic decay modes) plus W(/v)+jets (where I = e,jJ,,T). The trigger used in this analysis was the logical 
OR of the 4j50 multi-jet, the e22i single electron and the mu20 single muon triggers [7]. 

A control sample defined by HT2 < 300 GeV was used to estimate the shape of the £"™ 1SS significance. 
The assumption is that this shape is independent of HT2 so it can be used to predict the shape of the £"™ 1SS 
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Figure 9: Missing Z?t distribution in "lepton+jet" it events with Mj > 100 GeV at Monte Carlo "truth" 
level. Left: as a function of truth leading-jet pj . Right: as a function of truth second-leading jet pj . 



Table 10: Predicted and actual background levels as a function of £"™ ss significance cut for an integrated 
luminosity of 1 fb -1 in the HT2 analysis. A rough equivalent E™ 1SS cut is listed, but the £"™ ss cut is not 
sharp. 



Ej lss sig. cut 


Rough equivalent £™ ss cut [GeV] 


Predicted BG 


Actual BG 


14 


180 


57.3 ± 5.5 


60.6 ± 3.2 


16 


200 


34.8 ± 4.5 


39.2 ±2.6 


18 


220 


19.1 ±3.1 


23.6 ±2.0 


20 


240 


10.1 ±2.1 


15.1 ± 1.5 


22 


260 


6.2 ± 1.8 


9.8 ± 1.2 


24 


280 


3.8 ± 1.5 


6.2 ±0.9 


26 


300 


1.3 ±0.7 


3.5 ±0.6 



significance in the signal "band" defined by HT2 > 300 GeV. The normalization of the prediction in the 
signal band was obtainined by the number of events with HT2 > 300 GeV, but at low £"™ ss , specifically 
8 <£'™ 1SS significance < 14. A comparison of this predicted background with the correct background is 
shown in Figure 10 (left). The agreement between the predicted background and the actual background 
in Fig. 10 is reasonable, indicating that the correlation between HT2 and Ej lss significance is small. A 
numerical comparison of predicted and actual background levels can be seen in Table 10. For each value 
of the Ej lss significance cut, a rough equivalent in E™ 1SS is listed as a guide, but it should be emphasized 
that the cut in E^ 1SS is not sharp. The number of events is for HT2 > 300 GeV, which corresponds 
approximately to a cut on the effective mass of M eff > 600 GeV. 

The ratio of observed to predicted backgrounds for a £"™ 1SS significance cut of 14 is 1 .06 ± 0. 12; while 
the ratio is consistent with unity, we take the uncertainty on the ratio (12%) as a systematic uncertainty 
due to possible correlations between HT2 and £"™ 1SS significance. Monte Carlo samples with larger 
numbers of events would provide one possible way to further study the potential for correlations. 

The distribution of the "orthogonal" variable, namely HT2, was predicted in a similar way. The HT2 
distribution was measured in a control region defined by 8 < £'™ ss significance < 14. This distribution 
was then normalized to the number of events at large £"™ 1SS significance and low HT2, specifically, £"™ 1SS 
significance > 14, and 150 GeV < HT2 < 300 GeV. The results are shown in Fig. 10 (right). 

The near independence of HT2 and £"™ 1SS significance should provide an important tool in under- 
standing jet energy and £"™ 1SS performance in the complex events that make up the background to SUSY 
searches. After all the SUSY selection cuts have been applied, the jet energy performance can be studied 
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Figure 10: Left: Points: Predicted £"™ ss significance distribution in a tt plus W + jets sample. His- 
togram: actual i?™ ss significance distribution. Right: Predicted HT2 distribution in the same sample. 
Histogram: actual HT2 distribution. 



Table 11: Predicted and actual background levels (for 1 fb l , HT2 method) for £"™ ss significance > 14 
as a function of systematic effects applied to the reconstructed objects. 



Modification 


Predicted BG 


Actual BG 


Actual/predicted 


Baseline 

Energy scaled up 
Energy scaled down 
Jet resolution smearing 


57.3 ± 5.5 
64.1 ±5.5 
45.5 ± 4.5 
55.5 ±5.1 


60.6 ± 3.2 
79.3 ± 3.7 
47.3 ±2.7 
65.3 ±3.4 


1.05 ±0.12 
1.24 ±0.12 
1.04 ±0.12 
1.18 ±0.12 



by looking at the HT2 distribution for low £"™ 1SS events; conversely, the £"™ ss distribution can be studied 
by selecting events with low HT2. Events in the tails of these distributions can be examined for signs of 
detector problems. 

Systematic uncertainties due to detector miscalibrations The results of systematic uncertainties due 
to detector performance are summarized in Table 1 1. The energy scale variations change the background 
level by about 30% while the worsening jet energy resolution results in about a 10% increase in back- 
ground. However the predictions tend to change in the same direction as the actual backgrounds, and 
generally continue to provide reasonable determinations. We assign a 20% systematic uncertainty due to 
detector effects. 

Systematic uncertainties due to event generation parameters The systematic uncertainties in the 
method due to changes in Monte Carlo event generation parameters were studied with ALPGEN. The 
parton pj cut in ALPGEN was changed from 40 to 15 GeV and the renormalization scale was reduced 
by a factor of 2. The results of the studies are summarized in Table 12. We assign a 20% systematic 
uncertainty due to event generation uncertainties. 

Background estimation in the presence of SUSY In this section, we repeat the background estimation 
in the presence of SUSY signal. Figure 1 1 (left) shows the £"™ 1SS significance distributions for the true 
background, true signal, and the estimated background, as well as the observed distribution of signal plus 
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Table 12: Predicted and actual background levels (for 1 fb l , HT2 method) for E™ 1SS significance > 14 
as a function of changes in the Monte Carlo generation parameters. 



Modification 


Predicted BG 


Actual BG 


Actual/predicted 


tt 


W+jets 


PT40, scale 1.0 
PT40, scale 0.5 
PT15, scale 1.0 


PT40, scale 0.5 
PT40, scale 0.5 
PT40, scale 0.5 


73.3 ±5.8 
133.8 ±7.2 
91.1 ± 12.6 


63.9 ± 3.2 
109.2 ±3.6 
72.5 ± 6.0 


0.87 ±0.11 
0.82 ± 0.05 
0.80 ±0.13 




Missing Et significance Missing Et significance 

Figure 11: Left: Histogram: observed £"™ ss significance distribution for the sum of tt plus W±jets 
background plus SUSY signal. Open circles: SUSY signal. Blue triangles: true tt plus W + jets back- 
ground. Black filled circles: estimated background. The SUSY signal shown here is the "1 TeV SUSY" 
point (see text). Right: Open circles: true SUSY signal as a function of £"™ ss significance. Black: es- 
timated SUSY yield, obtained from the difference of the observed £"™ ss significance distribution minus 
the estimated background distribution. 

background. The SUSY signal here is the so-called "1 TeV SUSY" point (m 0 = mi = 400 GeV, tan/3 = 
10, A=0, ^ > 0). 

Because of the signal contamination in the control region, the background level is overestimated, 
leading to an underestimation in the excess of signal over background. Nevertheless, it is clear that by 
cutting harder on £"™ ss significance, for example, the signal can still be clearly seen over the estimated 
background. A comparison of the estimated signal yield to the true signal is shown in Figure 1 1 (right). 

The results for all the tested SUSY points are summarized in Table 13. 

2.3.5 Top background estimation with top redecay simulation 

Introduction It is possible to isolate a pure biased sample of fully-leptonic tt events by selecting low 
E™ ss (to reduce SUSY signal) opposite sign dilepton events where one and only one pair of invariant 
mass combinations mgj between the two leptons and two hardest jets (b jets if tagging available) gives 
values below the expected endpoint from t — > Wb — > £vb decays: m™ ax = J mj — (neglecting mt). 

A possible use of such a sample is to estimate the background of fully-leptonic tt events to SUSY 
searches. One can reconstruct the kinematics of the decaying particles (W's or top quarks), remove 
their inferred decay products from the reconstructed event (including the event E™ ms ), redecay the re- 
constructed f's or top quarks using an event generator (e.g. PYTHIA) and then merge the simulated 
re-decay products back into the parent ('seed') event. By redecaying particles earlier in the decay chain 
(i.e. the top rather than the W) the kinematic bias obtained from the event selection can be minimised. 
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Table 13: True and estimated background and signal, using the HT2 method, when the background 
estimation is performed in the presence of SUSY signal. The numbers are for an integrated luminosity 
of 1 fb _1 except for the SU4 point where 100 pb _1 is used. 



SUSY 


£™ ss sig. 










True 


Estimated 


point 


cut 


True BG 


Est. BG 


True signal 


Est. signal 


s/Vb 


s/Vb 


SUl 


16 


39.2 ±2.6 


100.5 ± 10.4 


219.7 ± 8.7 


158.4 ± 13.8 


35.1 


15.8 




20 


15.1 ± 1.5 


53.1 ±7.8 


167.0 ±7.6 


128.9 ± 11.0 


43.0 


17.7 




24 


6.2 ±0.92 


33.1 ±6.5 


120.8 ± 6.4 


93.8 ± 9.2 


48.6 


16.3 


SU2 


14 


60.6 ±3.2 


69.1 ±6.4 


30.4 ± 2.3 


21.9 ±7.5 


3.9 


2.6 




16 


39.2 ± 2.6 


43.1 ±5.3 


24.0 ±2.1 


20.2 ± 6.2 


3.8 


3.1 




18 


23.6 ± 2.0 


24.1 ± 3.7 


18.3 ± 1.8 


17.9 ± 4.6 


3.8 


3.6 




20 


15.1 ± 1.5 


13.9 ±2.7 


13.5 ± 1.6 


14.7 ±3.5 


3.5 


3.9 


SU3 


16 


39.2 ±2.6 


198.1 ±22.5 


328.1 ± 14.9 


169.2 ±27.2 


52.4 


12.0 




20 


15.1 ± 1.5 


119.9 ± 18.5 


228.9 ± 12.5 


124.1 ±22.4 


59.0 


11.3 




24 


6.2 ±0.92 


62.9 ± 13.7 


144.7 ± 9.9 


88.0 ± 16.9 


58.3 


11.1 


SU4 


16 


3.92 ± 0.26 


120.7 ± 8.7 


76.4 ± 4.0 


-40.4 ± 9.6 


38.6 


-3.7 




20 


1.51 ±0.15 


47.4 ± 5.5 


37.4 ±2.8 


-8.5 ±6.1 


30.4 


-1.2 




24 


0.62 ± 0.09 


17.8 ±3.3 


18.8 ±2.0 


1.6 ±3.9 


23.9 


0.4 


SU6 


16 


39.2 ±2.6 


71.5 ±7.2 


140.5 ± 5.3 


108.2 ±9.3 


22.4 


12.8 




20 


15.1 ± 1.5 


36.5 ±5.0 


108.8 ±4.7 


87.4 ±7.0 


28.0 


14.5 




24 


6.2 ±0.92 


25.1 ±4.3 


79.3 ±4.0 


60.3 ± 6.0 


31.9 


12.0 


1 TeV 


16 


39.2 ±2.6 


61.1 ±6.8 


155.0 ±5.7 


133.1 ±9.2 


24.7 


17.0 




20 


15.1 ± 1.5 


27.6 ± 4.4 


118.1 ±5.0 


105.6 ±6.8 


30.4 


20.1 




24 


6.2 ±0.92 


15.6 ±3.5 


84.5 ± 4.2 


75.1 ±5.6 


34.0 


19.0 



This technique has a number of advantages over conventional Monte Carlo techniques. In particular the 
event generator is used purely for modelling relatively well-understood decay and hadronisation pro- 
cesses - initially poorly understood aspects of process generation, such as parton distributions and the 
underlying event model, are effectively obtained from the data. In principle this technique is applicable 
also to other background processes such as Z —> t + t~, which could be modelled by replacing identified 
electrons or muons in Z —> £ + £~ control sample events with redecayed taus. 

It should be noted that this technique is at best an approximation, assuming as it does the factorisation 
of each it event into two independent tops, and hence neglecting effects such as colour connection and 
spin correlations between the tops and other partons in the event. It is therefore unlikely to be competitive 
with a detailed Monte Carlo study using a fully tuned generator and validated parton distribution func- 
tions. In the early days of data-taking however it potentially provides a route to a rapid direct estimate of 
it background from data complementary to, and independent from, more conventional estimates. 

Seed event selection Seed events were selected from the 'data' with cuts designed to maximise the 
number of fully leptonic ttCH-tF) events while minimising the number of Standard Model backgrounds 
or SUSY signal events. Events were required to pass the j45_xE50 jet + Ej lss trigger [7]. Single 
and dilepton triggers were not included in this study, but are planned to be added in future analyses. 
Subsequently, the following criteria were applied: Afj et > 2, prlj^) > 20 GeV, two Opposite Sign (OS) 
isolated leptons should be present, pr^) > 10 GeV, if the two leptons are of the same flavour \mu — mz\ 
> 15 GeV and mu > 10 GeV is required, \m xx — mz\ > 15 GeV, where m TT is calculated assuming the 
neutrinos travel parallel to their parents, and E™ 1SS < j(pt{£\) +Pt(,@2))- The upper limit on E™ 1SS as 
a function of lepton p T rejects SUSY signal events. For the purposes of this early-data study ^-tagging 
was assumed to be either not available or not well-understood. 
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m y (GeV) 

Figure 12: Distributions of mgj values for various different Standard Model backgrounds and SUSY 
signal. The histograms show distributions of 'data' events selected with the 2£-tt selection. The data- 
points show the equivalent distribution estimated with redecay simulation, normalised to the peak. 



Kinematic Reconstruction and Redecay Simulation The mtj distribution of selected events (the 
histogram in Figure 12) contains a prominent edge at the expected position ra™ ax = 155.4 GeV {m, = 
175 GeV). Events were further selected in which one and only one of the two possible pairs of £j com- 
binations obtained from the two leptons and two hardest jets gave ray values which were both less than 

m TT- 

For 2£-tt signal events the two (b) jets, two leptons and E™ ss components satisfy the constraints of 
Eq. 3 given in section 2.3.3. These constraints, assuming massless neutrinos, leptons and jets, may be 
solved for the 8 unknown 4-momentum components of the neutrinos. The constraints together give a 
quartic equation which can be solved with standard analytical techniques. If no solution was obtained 
then to maximise statistics the real part of the least imaginary solution was taken. If multiple solutions 
were obtained the solution with the smallest mean \p z \ of the reconstructed top was used. 

This selection results in 2207 dileptonic it events for 1 fb _1 . The contamination by other Standard 
Model processes and SUSY signal events was 912 events, dominated by semileptonic tt events, as shown 
in Figure 12. The SUSY contamination in the sample is small, due to the tight selection cuts. 

Four-vectors of the two reconstructed top quarks from each event were passed to a modified version 
of PYTHIA 6.4 [10]. 1000 redecayed tops were produced from each reconstructed seed top, with each W 
forced to decay to e, /I or %. This 'recycling' of seed events increases the statistics of decay resimulated 
events for the final Ej lss estimation process but leads to correlations between resimulated events derived 
from the same seed event. These correlations were taken into account in the final uncertainties quoted 
below. Decay products were passed to the ATLAS fast simulation program, and then merged back into 
their parent seed events. 

As a cross-check of the estimation procedure redecayed events were passed through the same selec- 
tion as their parent seed events and the distribution of ray constructed and normalised to the seed m^j 
distribution. This is shown in Fig. 12 and indicates good agreement below ra™ ax . 

Use in one-lepton search background estimate Decay resimulated events were subjected to the stan- 
dard one-lepton SUSY search selection described in section 2.1, with one modification: Mt(£,Ej 1ss ) > 
150 GeV. 
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Figure 13: £'™ 1SS (left) and M e g (right) distributions of events passing the basic one-lepton SUSY 
selection cuts described in the text. Note that Z+jet and W + jet backgrounds are under-represented in 
these plots for £"™ ss < 80 GeV or M e g < 350 GeV due to filter requirements applied to the respective 
Monte Carlo samples. 



The remaining background at high i?™ ss following such cuts is dominated by semi-leptonic it events, 
and to a lesser extent leptonic W + jets, Z + jets, single-top and di -boson events. For all these backgrounds 
one expects primarily a Jacobian peak in the event Mt(£,E™ 1ss ) distribution near il% (Mz for Z+ jets). 
This \t- Jacobian background was estimated with the E™ 1SS distribution of events selected with the same 
cuts, with the exception of the Mj{£,E™^) cut, which was reversed to require Mt(£,E™ ss ) < 100 GeV. 

The 11- Jacobian and the 2£-ti background estimates were compared to 'data' events subjected to the 
one-lepton selection criteria described above. The two estimates were simultaneously normalised to the 
'data' E^ ss distribution in two bins: 40 < E^ iss < 100 GeV and 100 < ££" ss < 140 GeV. The total (2£-tt 
+ l£-Jacobian) normalised estimate is plotted in Fig. 13(left) together with the 'data' E™ 1SS distribution. 
For Ej lss > 200 GeV the agreement between the estimate and the 'data' is good: 30.7 ±9.8 (30%) 
estimated, versus 39 'observed'. 

In Fig. 13(right), the M e g distribution is shown, with the estimate normalised with the same factors 
as used in Fig. 13(left). The semi-leptonic it forms a larger fraction of the background at large M e ff 
compared to at large £"™ ss because it produces a larger number of jets than fully-leptonic it. 

SUSY contamination The shape of the estimated distribution is effectively insensitive to the presence 
of SUSY signal, primarily due to the low Ej lss requirement in the 2£-ti selection. 

However, there may be SUSY signal in the normalisation region. The bias in the estimate will be 
proportional to the amount of SUSY in the normalization region, which is largest for the samples with 
the highest SUSY cross-section: SU3 and SU4. The effect of admixture of SU3 or SU4 signal events is 
shown in Table 14. In case of SU3, the background estimate is 60% higher, in case of SU4 as large as a 
factor 15. For both samples, however, the excess of signal events is still significant. 

In principle the contamination effect could be reduced by normalization to more signal-free re- 
gions. The 2£-ti and 1^-Jacobian estimates could be normalized for example to the tail and peak of 
the Mj{t,Ef 1 ^) distribution at low E™ ms . 

2.3.6 Combined fit method 

The fit-based method for measuring the background, as described in this section, aims to improve upon 
the plain Mr sideband subtraction method for the one-lepton SUSY search mode. By analysing data in 
a L-shaped region at both low-/?™ 155 in the full Mj range and at low Mj in the full £"™ ss range, and 
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Table 14: Estimated and true background with the redecay method, in case of no SUSY signal, or 
presence of SU3 or SU4 SUSY signals. For the latter, also the amount of observed signal plus background 
events is shown, proving that although the background is overestimated, the excess is still present. 



Sample 


Estimated BG 


True BG 


Observed signal + BG 


No SUSY 


30.7 ±9.8 


39 


39 


SU3 


50.8 ±13.6 


39 


392 


SU4 


456 ±102 


39 


1230 



performing a two-dimensional extrapolation into the SUSY signal region we hope to enhance the back- 
ground estimation. In a fit, correlations between £"™ 1SS and Mj can be taken into account. Furthermore, 
an explicit assumption can be put in that there is a finite SUSY contamination in the control sample. 

For our purposes, we define in this analysis a sideband (SB) region and a SUSY signal (SIG) region. 
For both regions, we apply the standard SUSY one-lepton selection cuts defined in section 2.1, with the 
exception of the cut on M T . The SB region is defined by the following additional cut: Ej lss < 200 GeV 
or M T < 150 GeV. In this analysis, the SIG region is defined by: £™ iss > 200 GeV and M T > 150 GeV. 
This classification assumes that the mass scale of SUSY is much higher than 200 GeV, which is true for 
all SUSY points considered in this note except SU4. 

The analysis described here has been applied to electron + jets events only, but we expect the muon 
+ jets analysis to be completely analogous. 

The main Standard Model backgrounds we expect are single-leptonic tt decays, double-leptonic 
tt decays, and W ±jets events. Other backgrounds, such as Z± jets, diboson production or single top 
are negligible for 1 fb _1 . The trigger used was an OR of the e22i single electron trigger, and the 4j50 
multi-jet trigger [7]. 

Shape of the backgrounds We will try to fit the contributions of the backgrounds in three observables: 
Mj, Ej lss and m top . Here m top is the invariant mass of the three jets in the event with the largest vector- 
summed pr [11]. 

We construct probability density functions (p.d.f.'s) that model the major contributing processes in 
the three observables after the event selection. We have done this both with and without explicitly taking 
into account correlations between the three observables. Most of these correlations are in any case 
consistent with zero. 

Figure 14 shows the distribution of the three observables for each type of background we consider, as 
well as for one SUSY signal sample (SU3). The empirical model that we use to describe each background 
type is overlaid on the data. This comparison of shapes demonstrates that there is sufficient information 
in these three observables to be able to measure each of them in a combined fit. 

We perform the procedure of constructing an empirical model from Monte Carlo for multiple SUSY 
data points. A striking feature of a comparison of £™ ss and M T distributions of these SUSY points in 
the SB region is that, with the exception of lower mass SUSY point SU4, they are all quite similar in 
shape. We can thus construct a model-independent 'Ansatz' shape to describe the SUSY contamination 
at low energy. 

To validate this procedure, we perform fits to a "data" sample consisting of either background only, 
or background plus SU3 SUSY signal. The yields we find in a fit to 1 fb _1 are listed in Table 15 and are 
in agreement within errors with the truth values of the fitted event mix. If, in contrast, a SU3 SUSY signal 
would be present in data, but the fit would not allow for SUSY (see table), the fit would overestimate 
dileptonic tt and W + jets, and underestimate semileptonic tt . This is as expected, as the SUSY signal 
has a long tail in M T , but no substantial peak in the top mass. 
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Figure 14: Distributions in missing Ej (left), Mj (middle) and m top (right) of single lepton it (top row), 
double lepton tt (second row), W + jet events (third row) and SUSY SU3 (last row). Each distribution 
is overlaid with a projection of the three-dimensional model that is fitted to that sample. 

The same table also lists the yields obtained from a fit in which all components are taken as a simple 
uncorrelated product of three one-dimensional p.d.f.'s. The yields and their uncertainties are very similar 
to those from the fit with models that include correlations and indicates that the effect of correlations in 
the description of the background components is minor. 

Fitting the data The fit with fixed shapes relies on simulated events to determine the shapes of the 
various background components, while all yields are fitted from the data. The next step is to release as 
many of the shape parameters in the fit to the data; in total there are 15 of these parameters. In the limit 
that all parameters can be floated, with the exception of the SUSY ansatz shape, the method becomes 
almost independent on simulation input and fully data driven. It turns out that on a 1 fb _1 sample we can 
float all but two of the W + jets and 1-lepton tt shape parameters. These two are the fraction of events 
that give the correct top quark mass in m top for single lepton tt and the fraction of events with a correctly 
constructed W boson in Mj in single lepton it . 

Floating the 2-lepton it shape parameters in addition causes the fit to become unstable because the 
shapes of the dilepton component and that of the SUSY ansatz model are very similar. We are currently 
investigating the possibility of introducing additional constraints on the shape of the di-lepton tt events 
in order to solve this. 

The result of the 1 fb _1 fit with floating parameters is shown in Figure 15. 

The final step in the analysis is to extrapolate the yields of the standard model background compo- 
nents from the sideband region to the signal region. Table 16 shows the yields from the combined fit with 
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Table 15: Yields from the combined fit with fixed shapes for the various modeled physics processes. 
The leftmost column lists the yield fitted with models that include correlations in the model structure. 
The middle column shows these yields for models in which correlation terms have been disabled. The 
rightmost column gives the true yields in the fitted event mix. 



Component 


Fitted Yield 


Fitted Yield w/o correlations 


True Yield 


no SUSY in data, no SUSY component in fit 


W+jets 


186 ±33 


200 ±27 


173 


1-lepton ti 


507 ±32 


482 ±32 


502 


2-lepton ti 


53±13 


55 ±16 


70 


SUSY 






0 


no SUSY in data, SUSY component in fit 


W+jets 


185 ±33 


201 ±27 


173 


1-lepton ti 


507 ±32 


482 ±32 


502 


2-lepton ti 


52 ±15 


56 ±16 


70 


SUSY 


4.4±3.0 


-7.1 ±16.0 


0 


SU3 in data, no SUSY component in fit 


W+jets 


292 ±35 


261 ±34 


173 


1-lepton ti 


386 ±30 


356±31 


502 


2-lepton ti 


338 ±25 


389 ±34 


70 


SUSY 






271 


SU3 in data, SUSY component in fit 


W+jets 


181 ±40 


194 ±35 


173 


1-lepton ti 


521 ±36 


509 ±35 


502 


2-lepton ti 


35 ±23 


15 ±30 


70 


SUSY 


280 ±22 


293 ±24 


271 



floating shapes extrapolated to the signal region while propagating all (correlated) parameter uncertain- 
ties; for comparison the same table also shows the results with shapes kept fixed. The fits describe the 
data within the statistical uncertainties. 

Table 16: Yields from the combined fit with either fixed or floating shapes in the sideband region extrap- 
olated to the full parameter space, the truth yields in full parameter space, the extrapolated yields into 
the signal region and the truth yields in the signal region. 



Component 


Extrap. Yield in FULL 


True 


Extrap. Yield in SIG 


True 




Shape Fixed 


Shape Floating 


FULL 


Shape Fixed 


Shape Floating 


SIG 


W + jets 


205 ±45 


227 ±68 


173 


0.5±0.4 


-1.2±2.7 


2 


1-lepton ti 


476 ±35 


485 ±59 


502 


0.4±0.2 


-1.1 ±3.9 


0 


2-lepton ti 


62 ±38 


17 ±54 


70 


4.5±2.9 


4.7±7.9 


5 


SUSY SU3 


273 ±33 


287 ±38 


271 


92.7 ±2.8 


95.6±4.0 


91 



Effect of SUSY signal While table 16 shows the results of the fit for the various backgrounds and the 
SUSY signal in the case of the SU3 sample, Table 17 shows results, for fits with floating shapes, for other 
SUSY signal samples. 

As has been noted earlier, SU4 is a special case. It is too close to the background for the Ansatz of 
the shape to be valid, and it can not be fitted very well. 
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Figure 15: Distribution of missing Ej (left), Mj (center) and m top (right) of a 1 fb~ ! mix of it ,W + 
jets standard model events and SUSY SU3 events overlaid with projections of the combined model on 
these observables that was fitted to this mix of events with floating yield parameters and floating shaped 
parameters. For each projection the contributions of the 1-lepton tt ttbar contribution (dark blue), 2- 
lepton tt ttbar contribution (light blue), W + jet contribution (red) and ansatz SUSY constribution (black) 
are shown. 



Table 17: Yields from the combined fit with floating shapes in the sideband region extrapolated to the full 
parameter space, the truth yields in full parameter space, the extrapolated yields into the signal region 
and the truth yields in the signal region, for various SUSY samples. 



| Component | Extrap. Yield in FULL | True in FULL | Extrapolated Yield in SIG | True in SIG~| 



SU1 


W+jets 


215±56 


173 


1.8±2.0 


2 


1-lepton tt 


486 ±54 


502 


0.6±0.6 


0 


2-lepton tt 


19 ±35 


70 


0.5±1.7 


5 


SUSY SU1 


154 ±30 


129 


50.1 ±2.6 


46 


SU2 


W+jets 


226±51 


173 


1.3 ± 1.0 


2 


1-lepton tt 


452 ±49 


502 


0.7±0.5 


0 


2-lepton tt 


81 ±32 


70 


8.0±5.0 


5 


SUSY SU2 


0±10 


14 


1.0±6.1 


4 


SU6 


W+jets 


215±54 


173 


0.5±0.8 


2 


1-lepton tt 


469 ±52 


502 


0.2±0.6 


0 


2-lepton tt 


29 ±34 


70 


2.6±2.7 


5 


SUSY SU6 


117±29 


86 


38.8±2.5 


35 


SU8 


W+jets 


239 ±53 


173 


2.5±2.4 


2 


1-lepton tt 


485±51 


502 


1.0± 1.2 


0 


2-lepton ft 


66 ±45 


70 


15.1 ±12.0 


5 


SUSY SU8 


34 ±26 


79 


34.5 ±13.0 


46 



Systematics Table 18 summarizes the results of a series of systematic studies. These studies are quan- 
tified in terms of relative variations of the measured SUSY cross-section, which we define as the counted 
number of events in the data in the SIG region minus the fitted and extrapolated number of SM model 
events expected in that same region, and divided by the efficiency for the selection of SUSY events, 
including the SIG region cuts, as measured from a pure sample of simulated SU3 SUSY events. 
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The second column of Table 18 lists the relative variation under the influence of each systematic 
variation if no extrapolation is applied in the fit and the analysis is performed without the SIG region cut. 
This column thus quantifies the effect on the shape fit stability only. The third columns shows the effect 
of each systematic study on the fitted yield in the signal region using extrapolation. Most effects are here 
of the order of 5-7%. The uncertainties due to Monte Carlo statistics are explicitly listed in the table. 



Table 18: List of studied systematic uncertainties for the combined-fit method. 



Systematic variation 


w/o extrapolation ± MC-error [%] 


in SIG ± MC-error [%] 


Jet energy scale 


1.9±0.7 


3.1 ± 1.1 


Jet energy resolution 


3.7±0.5 


0.5±0.1 


Electron energy scale 


4.8±0.6 


5.6±1.4 


Electron energy resolution 


9.0±1.2 


7.2 ±1.4 


Electron identification efficiency 


0.5±0.2 


3.6±2.3 


SoftZs™ 88 scale 


8.1 ± 1.8 


7.4±3.4 



3 No-lepton search mode 

3.1 Selection 

In the no-lepton search mode, we veto all events with an identified electron or muon with a pj of more 
than 20 GeV. We demand at least 4 jets with \r\\ < 2.5 and p T > 50 GeV, one of which must have 
Pt > 100 GeV. The transverse sphericity St should be larger than 0.2, and the missing transverse energy 
£"™ ss should be larger than 100 GeV and larger than 0.2 M e g , where M e g- is the effective mass. We 
add one more cut against the QCD background: the minimum value of the difference in azimuthal angle 
between the £™ ss vector and the three highest-p r jets should be larger than 0.2. This cut is futher 
discussed in a dedicated note on QCD background estimation [3]. 

3.2 Backgrounds in Monte Carlo 

Figure 16 shows the distributions of £"™ 1SS and M eS after all the selections are applied. 

3.3 Data-driven estimation strategies 

In this section we discuss data-driven estimation strategies for the no-lepton search mode. The strategies 
we have studied are: 

1. estimation of Z (— >• vv) plus jets from Z (— >• plus jets, purely from data (replace method, 
section 3.3.1); 

2. estimation of Z and W in an analogous way, but also helped by Monte Carlo (section 3.3.2); 

3. estimation of W and tt background from a one-lepton control sample derived by reversing one of 
the selection cuts (on Mj) (section 3.3.3); 

4. estimation of the cross-section of the tt — »■ bbqq'xv process with hadronic tau decay (section 3.3.4). 
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Figure 16: The i?™ ss and effective mass distributions of the SUSY signal and background processes 
for the no-lepton mode with an integrated luminosity of 1 fb _1 . The open circles show the SUSY signal 
(SU3 point). The shaded histogram shows the sum of all Standard Model backgrounds; different symbols 
show the various components. 

3.3.1 Replace method: Z -> vv from Z l + T 

Introduction The Z — > vv background is one of the main background process in the no-lepton channel. 
In order to estimate and reproduce the number of expected background events, as well as the shape of 
the £™ ss and M e ff distributions, Z —> £ + £~ events are selected, and the charged leptons are replaced 
by neutrinos. However, as the ratio of branching-ratios Br(Z — > £ + £~)/Br(Z —>■ vv) is small, statistical 
uncertainties will tend to be relatively large. Two solutions are proposed : 

1. Taking the distribution shape from Z — > £ + £~ data but constraining it via a fit plus the assumption 
of a smooth evolution of the fitting parameters when relaxing the cuts. This is the method described 
in this section. 

2. Taking the distribution shape from Monte Carlo simulation as described in the next section ( Sec- 
tion 3.3.2). 

The Monte Carlo method is more sensitive to generator-level and detector systematic uncertainties, but 
does not suffer from the larger statistical uncertainties, whereas the replace method precision is limited 
by the number of events in the control sample, but less sensitive to systematic uncertainties from the 
detector. Both methods have to account for the fact that the detected charged lepton pairs will not cover 
the full phase space of the neutrinos. 

Control Sample Selection The control sample selection is identical to the no-lepton SUSY search 
selection, except that two electrons or two muons are required, and that the missing Ej (£"™ ss ) is replaced 
by pr(£ + £~) ^ Pt{Z). Thus it is assumed that neutrinos are the main contribution to £"™ ss when the Z 
boson decays into two neutrinos, such that i?™ ss is roughly equivalent to pj (Z) for this physics process. 
The Ej lss resolution of ATLAS is sufficient for this to be a good approximation. In addition to pairs of 
isolated charged leptons, a sample composed of Z — > e ± X is added, where X is a non-isolated electron 
or an electron-like object with very loose cuts. This additional sample is used to increase the statistics 
and measure the electron identification efficiency via the "tag-and-probe" method. The goal of the tag- 
and-probe method is to select on one side a good electron (tag) and look at the other side to the nature 
of the object (the probe) which matches the constraint on the Z mass. Two cuts are added to reject the 



1550 



SUPERSYMMETRY - DATA-DRIVEN DETERMINATIONS OF W, Z AND TOP BACKGROUNDS . 



remaining backgrounds : 81 < M z {i + i~) < 101 GeV, and Missing E T < 30 GeV. After all cuts the 
number of selected events is summarized in Table 19. In particular, Table 19 shows the effect of an 
upper cut on £™ ss in order to reject tt background in the Z — >• e ± X channel. It has been verified that 
the efficiencies measured with the tag-and-probe method agree with efficiencies obtained directly from 
Monte Carlo, confirming that X is indeed dominated by electrons rather than hadronic jets. 

Table 19: Number of selected Z — »■ £ + £~ + > 4 jets events in 1 fb _1 in the replace-method analysis. 



Process 


NoEj lss cut 


£™ ss < 30 GeV 


Z— > e + e~ +njets 


18.2 


14.1 


e ± X + njets 


25.6 


19.6 


Z^ jU + ;U~ +njets 


33.2 


26.1 


Z— > T + T~ +njets 


1.6 


0. 


«-> bblvtv + nyete 


56.2 


0.3 


ti-> bb£vqq + n]ets 


506.6 


2.5 



Lepton identification and acceptance corrections A number of correlations must be applied in order 
to derive Z — ► vv distributions from Z — >• £ + £~: (1) a fiducial correction, since we cannot detect e and 
jU leptons beyond \ r\\= 2.5; (2) a kinematics correction for the additional cuts used to select Z —> £ + £~, 
including the Z invariant-mass window, the pj cut on the leptons, and the E™ 1SS cut; and (3) a correction 
for the lepton identification efficiency. The first two effects have to be computed from simulation whereas 
the lepton identification efficiency can be measured from collision data using the tag-and-probe method. 
After all corrections, the distribution can be summarized by the following formula: 

A^vv(£r s ) =iW<-(M^O) x ckUpt(Z)) x c Fida ( PT (Z)) x ^fzStH-y (5) 

where N z ^ V v{Ej lss ) is the corrected number of events per bin of missing Ej, N z ^£+(-(pt{£ + £~)) is 
the raw number of control sample events as a function of pr(Z), CKin and cpidu are the kinematic and 
fiducial corrections. The £"™ 1SS and M e g- distributions of Z — > e + e~ +e ± X and Z — > jJ. + jJ.^ events after 
all corrections are compared to Z — > vv distributions in Figure 17. 

For very high values of £"™ 1SS and M e g , statistics is low. In order to present a smooth prediction of 
the background, for example as a function of M e g- , a fit of the shape has been performed. Of course, the 
fit is also affected by the low statistics in the tail, but by relaxing the jet pj and £"™ ss cuts and observing 
how the fit parameters evolve with the cuts, a smooth prediction can be made. 

Systematic uncertainties The effects of various Monte Carlo generator and detector systematic uncer- 
tainties are summarized in Table 20. The main generator systematic uncertainty is due to the variation of 
the renormalization scale and affects the acceptance correction, whereas the principal detector systematic 
uncertainty is related to the soft part of the missing transverse energy which is not taken into account 
when replacing neutrinos by charged leptons. 

SUSY contamination in the control sample Due to the tight control sample selection cuts, in partic- 
ular Ej lss < 30 GeV, the SUSY contamination in the control sample in 1 ftr 1 is negligible: 0.1 events 
for SU1, 0.4 events for SU2, 0.9 events for SU3, and < 0.07 events for SU6. 
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Table 20: Summary of systematic uncertainties for the replace method. 



Description | Relative uncertainty AN /N [%] 


MC generator 


systematics 




ALPGEN parameter variation 




6.3 


Detector systematics 


Electron energy scale 




0.05 


Electron energy resolution 




0.03 


Electron id efficiency 




0.50 


Muon energy scale 




0.30 


Muon energy resolution 




0.39 


Muon id efficiency 




1.00 


£™ ss scale (soft part) 




4.5 


Tola! syslemalics (quadratic sum) 
Total statistics 
Total uncertainties 




-8 
~ 13 
~ 15 



3.3.2 Z and W background estimates from Z — ► £ + £ plus Monte Carlo shape 

A modification of the method of the previous section is described here. Denoted the "MC method", it 
uses only the number of events in the Z(— > £ + £~) +jets control sample for normalization, but otherwise 
relies on Monte Carlo simulation of kinematical distributions of events. The same normalization factor is 
used for both the Z(— > vv) background and the W(— > £v) background since the production mechanism 
is very simliar. 

Control sample The event selection of the control sample demands: two opposite-sign same-flavour 
leptons with pj > 20 GeV; Ej lss < 40 GeV; a di-lepton mass Ma within ± 10 GeV of mz . Subsequently, 
the standard no-lepton SUSY cuts are applied after replacing £ + £~ with vv. 

The number of events selected with 1 fb _1 is summarized in Table 21. The contamination from it 
events is about 10 -2 and therefore not significant. The number of events estimated by a full simulation 
sample (with event filter pf > 80 GeV) is 72 ± 3 for 1 fb _1 , which leads to a statistical uncertainty on 
the estimation of 12%. 

The method has been tested with a pseudo data sample prepared with Monte Carlo parameters set 
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Table 21: Number of events in the MC method control sample (for 1 fb l ). 



process 


/i mode 


e mode 


H + e (sum) 


Z(-» £ + r) 

ff 


51 ± 1 
<0.1 
1.0±0.3 


34 ±1 
<0.1 
0.5±0.2 


85 ±1 
<0.1 
1.5 ±0.4 



Table 22: Systematic uncertainty from variation of Monte Carlo generator parameters (ALPGEN) for the 
MC method. The pseudo data sample is discussed in the text. 



Z(- vv) 




> 300 GeV 


M cff > 800 GeV 


pseudo data sample 




5% 


3% 


half renormalization scale 




2% 


0% 


lower parton pj 




9% 


4% 






> 300 GeV 


M cff > 800 GeV 


pseudo data sample 




16% 


7% 


half renormalization scale 




4% 


2% 


lower parton pj 




12% 


15% 



Table 23: Systematic uncertainty from detector performance in the MC method. 





£™ ss > 300 GeV 


M cff > 800 GeV 


Jet energy scale 


6% 


6% 


Jet energy resolution 




1% 


Ef lss soft component scale 


1% 


< 1% 


Lepton energy scale 


< 1% 


< 1% 


Lepton identification efficiency 


2% 


2% 



differently from the standard Monte Carlo sample 4 . In this pseudo data sample, the shape of £"™ ss and 
M e ff distributions is not affected; only the normalization changes significantly. The MC method is able 
to recover such a change and predict the background correctly. 

Systematic uncertainty The MC method relies on Monte Carlo for the shapes of the background 
distributions. The systematic uncertainty from the Monte Carlo is estimated by using samples with 
different Monte Carlo parameters (pseudo real data, half renormalization scale sample, lower parton 
threshold sample). The results are summarized in Table 22; the deviation of the samples is 16% at most. 

Other potential sources of systematic error in the background estimation from uncertainties in detec- 
tor performance are summarized in Table 23. Since Monte Carlo is used, an uncertainty in the experi- 
mental jet energy scale affects the MC method prediction significantly more than for the replace method 
(Section 3.3.1). 

SUSY contamination of the control sample The tight selection cuts of the control sample makes any 
SUSY contamination negligible (as was also the case for the replace method discussed in Section 3.3.1). 

4 renormalisation and factorisation scale 0.8 times the default, a minimum parton p T of 30 GeV (rather than 40 GeV), and 
separation between partons ARjj = 0.6 (rather than 0.7). 
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3.3.3 Control sample constructed from an M^-selected one-lepton sample 

W and top backgrounds The tt and W ± + jets processes contribute to the background in the no- 
lepton mode, when the lepton emitted in the —> iv process is not identified. The reasons why the 
emitted lepton is missed are summarized in Table 24. Hadronic decay of t's and the lepton being out 
of acceptance (the pj of the lepton is required to be larger than 20 GeV) are the dominant reasons, and 
similar reasons/ratio's are observed in both tt and W ± + jets processes. 

Table 24: Numbers of events in which leptons are missed for various reasons in the semi-leptonic and 
leptonic decays of top and W ± . The numbers are normalized to 1 fb _1 and listed after the SUSY no- 
lepton selection is applied. 





tt 


W ± +jets 


W ± — > T — ► hadron 
Out of acceptance 
Isolation (close to jet) 


1993 (43%) 
1805 (39%) 
807 (18%) 


773 (42%) 
762 (41%) 
322(17%) 



The production processes tt and W ± with —> tv where the lepton is identified constitute good 
control samples from which to estimate these background processes in the no-lepton mode, since similar 
kinematic distributions are expected except for the presence of the lepton. For the control sample, the 
same kinematic selections as for the signal in the no-lepton mode are applied. In addition, exactly one 
isolated lepton (e or \i with pj larger than 20 GeV) is required and the transverse mass between this 
lepton and the £"™ 1SS is required to be smaller than 100 GeV to enhance the tt and processes. After 
these selections, this identified lepton is treated as if had been missed and all kinematic variables are 
recalculated. 

The distributions of tt and W ± + jets events differ after the SUSY selections are applied. The £"^ 1SS 
for tends to be larger than for tt events, since the boost factor of the W ± is larger for W ± + jets after 
several high pj jets are required. Therefore the reproduced distributions are sensitive to the mixture of 
the tt and W ± + jets events in the control sample. The fractions of the tt and W ± + jets processes in 
the control sample are 81% and 19%, respectively, close to the actual backgrounds in the signal region, 
which are 73% and 27% respectively. The systematic errors due to the uncertainties in the cross-section 
of tt and W ± + jets will be discussed later. 

The estimated distribution is normalized with the data at 100 GeV < £"™ ss < 200 GeV, where the con- 
tribution of the SUSY signal is expected to be small. The estimated distributions are slightly harder than 
the true distributions of the background processes, but similar distributions are obtained. The number of 
estimated background events is summarized in the two top rows in Table 25. Reasonable agreement is 
observed at high £"™ ss . 

QCD, W and top background without and with SUSY signal In a dedicated note within this vol- 
ume [3] various methods for the estimation of QCD background from data are discussed. 

In this section, we include QCD background, and estimate it as follows. It has been shown that 
after the removal of events with mismeasured £"™ ss (e.g. noisy or dead calorimeter cells), as discussed 
elsewhere in this volume [12], semi-leptonic heavy quark (b,c) decays are the dominant contribution to 
large Ej lss in the QCD background. A function is derived to represent the momentum fraction taken 
by the neutrino in b and c quark decays. This function is then applied to a control sample taken from 
data (at least four jets with pj larger than 50 GeV, pj of the leading jet larger than 100 GeV, £"™ 1SS 
smaller than 100 GeV) dominated by light-quark QCD events. The resulting distributions are normalized 
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to data in the region of A0 m i n < 0.2, where A0 m i n is the minimum value of the difference in azimuthal 
angle between the £"™ ss vector and any of the three highest pj jets, as discussed in section 3.1 and the 
dedicated note [3]. 

The W ± and tt background processes can also be estimated from the data, as discussed in the previous 
paragraph. Since all these processes are present in the data simultaneously, the background estimations 
should be done together. The same holds true for the presence of SUSY signal, which would contaminate 
the control samples. 

Figure 18 (top row) shows the estimated and true distributions of Z?™ 1SS and the effective mass for 
the combined background processes, without SUSY signal. The background distributions are reproduced 
well and the correct normalizations are obtained for all the background processes. The numbers of events 
are also summarized in Table 25. 

Table 25: Number of true and estimated (MT method) background events in the no-lepton mode, for tt, 
W ± and QCD processes without SUSY signal, normalized to 1 fb _1 . 





£™ ss > 100 GeV 


£™ ss > 300 GeV 


tt and W ± only 


True BG (top and W*) 


6894 ± 83 


276 ± 17 


estimated BG (top and W*) 


7018 ±269 


311 ±28 


QCD, ft and W ± 


True BG (QCD, top and W±) 


8077 ± 90 


300 ± 17 


Estimated BG 


8158 ±273 


327 ± 28 


Ratio(EstVTrue) 


1.01 ±0.04 


1.09 ±0.11 



If SUSY exists, SUSY signal can contaminate the background estimations. This is illustrated in 
Figure 18 (middle row) and Table 26. In the figure, the SUSY SU3 signal point is used; the table shows 
the variation over a number of SUSY samples. The figure and the table show that SUSY contamination 
causes a decrease of SUSY event excess by typically 30%, but that the considered points, with the 
exception of SU2, are still observable with 1 fb _1 . 

Table 26: Number of background events and estimated (MT method) numbers for tt, W ± and QCD 
processes, as well as various SUSY signals, in the no-lepton mode, normalized to 1 fb -1 . 





£™ ss > 100 GeV 


£™ lss > 300 GeV 


£™ ss > 100 GeV 


£™ 1SS > 300 GeV 


True BG (QCD,top and W*) 


8077 ± 90 


300 ± 17 


8077 ± 90 


300 ± 17 


Estimated BG 

True BG + SUSY signal 


SUl 


SU4 


8493 ± 283 
9152 ±96 


510 ±39 
1078 ± 33 


27527 ± 588 
34209 ± 185 


1409 ± 83 
3535 ± 59 


Estimated BG 

True BG + SUSY signal 


SU2 


SU6 


8198 ±274 
8193 ±91 


329 ± 30 
351 ± 19 


8362 ± 279 
8930 ± 95 


431 ±35 
924 ± 30 


Estimated BG 

True BG + SUSY signal 


SU3 




9188 ±299 
11333 ± 106 


633 ± 44 
2113 ±46 
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Figure 18: The estimated distributions of E™ 1SS and the effective mass of the tt, W ± and QCD back- 
grounds in the no-lepton mode with a luminosity of 1 fb~ ! . In the plots in the top row, no SUSY signal is 
present in the data, and black/red histograms show the true/estimated (MT method) background distribu- 
tions. In the plots in the middle and bottom rows, a SUSY SU3 signal is present in the data, and the blue 
histograms show the background plus the SUSY signal. In the two plots in the middle row, no correction 
was applied. In the two bottom plots, a correction with the "new MT method" was performed. 

New MT method The "new MT method", discussed in section 2.3.1, provides a first rough method to 
correct for the presence of SUSY signal in the control sample, once a SUSY excess has been observed 
in data. With this method, the distributions shown in the bottom row of Figure 1 8 are obtained. The 
figure shows the estimated background distributions with the new MT method, compared to the true 
background distributions, when SU3 SUSY signal is present in data. 
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Systematic uncertainties The systematic errors of this method are summarized in Table 27. The 
ALPGEN parameter variation includes a variation of the relative fraction of tt and W + jets in the 
sample. Although the actual number of background events varies with jet energy scale and Monte Carlo 
generator parameters by some 25%, the data-driven MT method is able to predict the background in the 
signal region to within an assigned systematic error of 15%. 

Table 27: Systematic uncertainties of the MT-method background estimations in the no-lepton mode 
without SUSY signal. Also changes of the absolute background numbers are listed. Numbers are nor- 
malized to 1 fb _1 . 





Syst. error 


Change in background level 


Jet energy scale 


<5% 


25% 


Lepton energy scale 


<5% 


0% 


Lepton Efficiency 


<5% 


< 1% 


MC@NLO vs ALPGEN 
ALPGEN parameter variation 


<5% 
<5% 


16% 

8% 



3.3.4 Top pairs with t decay 

Introduction The precise determination of the cross-section for the process of top-pair production with 
one tau that decays hadronically, tt —>■ W (qq')W (% a aV t )bb, is relevant because it constitutes an important 
background to SUSY searches with no leptons and significant £"™ ss . In fact, if no tau veto is applied, 
about 65% of the total tt background in the no-lepton mode corresponds to events containing one tau. 

Event reconstruction and selection The topology of tt —> W(qq')W(XhadV)bb events consists of two 
light-quark jets, one tau, and two b jets. 

The control of the tau fake rate is very important in a busy environment like tt where the purity 
of the reconstructed tau sample is low due to the large jet multiplicity. A high tau purity is needed 
in order to reduce the internal combinatorial background and the background from the semileptonic 
(tt — \V((v, )\V {qq')bb where £ G {e,jJ.}) decays of tt. In this analysis, we use a calorimeter-based tau 
reconstruction algorithm [1], and require a minimum visible pj of the identified tau of 25 GeV. 

The event is built independently on the hadronic side and the leptonic side, and topology variables 
useful to identify it events and reject QCD and W +jet background are extracted. On the hadronic side, a 
hadronic W invariant mass is built choosing, among all the combinatorial possibilities of di-jets, the pair 
of jets with closest invariant mass to its PDG value. The hadronic top is built combining this hadronic W 
with the closest identified b jet in AR. The b-jet identification is loose, with a 75% efficiency for b jets 
from top decay. 

The Ej lss is combined with the identified tau in order to build a leptonic W transverse mass. We 
assume a collinear approximation for the decay of the tau (the visible products of the hadronic decay of 
the tau and the associated v T are collinear), and determine the invariant transverse mass of the leptonic 
W. The resulting leptonic W is then combined with the closest (in AR) b jet to constitute the leptonic top. 

For each event, a reconstructed tau will build a leptonic W (and top) in combination with £"™ 1SS , and 
will have an associated hadronic W (and top). If there is more than one reconstructed tau we select the 
one that is associated with the jet pair that gives the hadronic W invariant mass closest to its PDG value. 

Once the event is built, topology variables suitable for tt selection are computed and selection cuts are 
applied: £"™ 1SS > 35 GeV, no identified electron or muon with pj > 15 GeV should be present in the event, 
the angle A0 between the two reconstructed top quarks should be larger than 2.5, the ratio between the 



1557 



Supers ymmetry - Data-Driven Determinations of W, Z and Top Backgrounds . 



transverse momentum of the two reconstructed top quarks should be smaller than 2, the angular distance 
(AR) between the reconstructed Z? jets should be larger than 1, and the angle A(j) between the missing 
momentum vector and the hadronic b jet should be larger than 0.5. 



SUSY no lepton cuts 



• ttbar cuts 

-4- 



Figure 19: Missing Ej of tt events with (g<7',Thad) for (circles) tt selection as in this analysis (control 
sample) and (squares) for the SUSY no-lepton mode selection. 

With the loose b tagging used here, 2910 tt(qq' , Th a d) events are selected for 1 fb _1 . The background 
consists of QCD and W +jets, for which 110 and 100 events respectively are selected. Therefore, with 
loose b tagging a good signal-to-background ratio can already be reached, although the uncertainty on 
the QCD background number is large. If one were to use tighter b tagging (60% efficiency), 1650 
tt(qq', Thad) events would be selected, against 2 QCD events and no W + jet events. 

In the presence of SUSY, the numbers of SUSY signal events that would pass the event selection 
with the loose b tag for 1 ft -1 are given in Table 28. They are generally small, with the exception of the 
SU4 point. 

Table 28: The number of SUSY events remaining after tt(qq' , Thad) selection for different SUSY points, 
fori ft- 1 . 



1700 ±60 70 ± 4 45 ± 3 



Estimation of the tt with (<7<7',Th a d) background to SUSY. Figure 19 shows that the selection ap- 
plied in order to identify the tt events with (qq' , Th a d) introduces little bias in the £"™ ss distribution, as 
compared to the SUSY no-lepton mode selection. 

Applying the SUSY no-lepton mode cuts, we estimate 210 tt (qq',%^) events as remaining back- 
ground to the SUSY no-lepton mode for 1 fb _1 . 
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Systematic uncertainties In the data-driven analysis of tt decays in the (qq', Thad) final state, the most 
relevant contributions to the detector uncertainties are the jet and £"™ 1SS energy scale, ^-tagging efficiency 
and T identification efficiency. 

Table 29: Systematic variation of the tt(qq' ,% a d) cross-section due to detector-related uncertainties. 



Systematic variation 


Cross-section variation [%] 


Jet energy scale 


2.5 


^-tagging efficiency 


7.5 


Light quark rejection in b-tag 


1.3 


T-identification efficiency 


3.4 


Light quark rejection in T-identification 


4.5 



The ^-tagging efficiency plays an important role in the tt(qq' ,% a d) reconstruction, since one of the 
selection criteria is that the two b jets expected in the final state should be reconstructed and correctly 
identified 5 . 

The systematic contribution to the measurement of the tt(qq' ,% a d) cross-section due to the T iden- 
tification efficiency has been estimated by varying the T identification efficiency and the light quark 
rejection factor by 10%. 

The uncertainty on the QCD background in the tt(qq, Thad) sample is large due to the limited number 
of Monte Carlo events which could be generated. Probably tight b tagging is required in this analysis. 
Further study is needed on this topic. 

4 Multi-lepton and tau search modes 

As well as the one-lepton and no-lepton search modes described earlier, there is considerable SUSY 
discovery potential in the multi-lepton and tau search modes, as discussed elsewhere in this volume [2,4]. 

The data-driven estimation of backgrounds in these modes, particularly the opposite-sign dilepton 
and the tau modes, can use the methods that have been described earlier in the context of the one- 
lepton search mode have also proven to be useful. These include the MT method, the HT2 method, 
the kinematic reconstruction method and the redecay method. Furthermore, for the same-sign dilepton 
mode, a technique based on lepton isolation, as described in the note on QCD backgrounds [3] could be 
further developed. 

5 Discussion 

The methods presented in this note represent a number of ideas on how top, W and Z backgrounds to 
SUSY searches can be extracted from the data, with appropriately chosen control samples. The results 
indicate that we expect, with 1 fb _1 , to be able to measure in the no-lepton mode: 

• the Z — ► vv background with two different methods to 8-13% stat. error, 10-15% syst. error; 

• the tt background with hadronic tau decay to < 6% stat. error, 10-15% syst. error (but with a 
caveat for the QCD background); 

• and the sum of top, W and QCD backgrounds with the MT method to 4-8% stat. error, and 15% 
syst. error. 

5 This is in fact the only analysis in this note where b tagging is used. 
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In the one-lepton mode: 

• the MT method gives the sum of tt and W background to 4-8% stat. error, 15% syst. error; 

• the semileptonic tt background can be estimated to 5% stat. error, 22% syst. error; 

• we can determine the fully leptonic tt background to 10% stat. error, 20% syst. error in at least 
three independent ways; 

• and we have a combined fit method to extract all components. 

These methods can also be applied to the multi-lepton and tau search modes. 

The results obtained with the MT method in this note are stable with respect to systematic variations 
in detector performance and Monte Carlo parameters and cross-sections to the 15% level. However, the 
MT method measures a sum of semileptonic and fully leptonic tt and W/Z+jets background; it relies 
on a control sample with different composition than the signal sample, and there is a subtle interplay 
between the tt and W /Z+ jets components of the background. More work is needed to understand 
possible systematic effects. It is desirable to understand the individual components of the backgrounds 
as well, and the various other methods discussed in this note appear to succeed in this. 

The presence of SUSY signal would affect the background estimates, at a level that depends on the 
SUSY signal properties, as well as on the method. Methods with very tight control samples (replace 
method, topbox method) see almost no effect. For the other methods, the background is overestimated 
by typically 20-30% for samples like SU1, SU2, SU3 and SU6. If a SUSY excess is nevertheless 
observed (which is possible with 1 fb _1 ), a correction for the background overestimation can be applied. 
First ideas have been presented in this note, using the MT method, and the combined fit method. More 
work is needed in this area. The SU4 benchmark point is a special case because of its light spectrum. 
It produces events with kinematics which are similar to the Standard Model backgrounds and its cross- 
section is high, so many methods would struggle to provide background predictions. It would, however, 
not be missed [11]. 
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Estimation of QCD Backgrounds to Searches for 
Supersymmetry 

Abstract 

This note deals with techniques for estimating QCD multijet backgrounds to 
inclusive jets + £"^? 1SS + leptons searches for Supersymmetry. The note doc- 
uments how the backgrounds may be estimated using a data set with an in- 
tegrated luminosity of 1 fb _1 or less. The likely systematic and statistical 
uncertainties in such estimates are also discussed. Data-driven approaches dis- 
cussed include jet smearing with functions derived from data and techniques 
for background estimation using control samples. Monte Carlo based tech- 
niques using a variety of generators and simulation tools are also discussed 
with a view to establishing their likely precisions and hence the optimum sim- 
ulation strategy. 



1 Introduction 

In searches for supersymmetry in events containing jets and missing transverse energy (£"™ 1SS ) the biggest 
background-determination challenge will be in understanding QCD jet events. Although fast simulation 
studies [1] indicate that such backgrounds should be sub-dominant in the high £'™ 1SS and large jet mul- 
tiplicity region, the situation with real data is still uncertain, and likely to be less favourable for SUSY 
searches. In particular non-Gaussian tails to the detector jet response function, which can be caused by 
dead material, jet punch-through, pile-up of machine backgrounds and other effects, can dramatically 
increase the cross-section of high £"™ 1SS QCD background. QCD jet events in which such processes have 
occurred shall be referred to in this note as "fake" £"™ 1SS events in what follows, since in such events the 
net pj of non-interacting particles measured in a perfect detector is inherently small. Events in which 
the £'™ ss vector is dominated by contributions from non-interacting particles such as neutrinos or the 
Lightest Supersymmetric Particle (in the case of SUSY signal) shall be referred to as "real" £"™ ss events. 

Accurate estimation of QCD jet backgrounds is difficult for a number of reasons. Processes gener- 
ating fake Ej lss from event mis-measurement are expected to be poorly modeled in current GEANT4 [2] 
simulations - this situation will improve once validation has been performed with real data. Furthermore 
theoretical and experimental uncertainties will conspire to decrease further the systematic precision of 
Monte Carlo estimates while the large QCD cross-section will limit statistical precision by rendering 
production of unbiased GEANT4 samples corresponding to more than a few pb 1 unfeasible. This note 
addresses these problems, assessing the relative magnitudes of some of the uncertainties, identifying 
techniques for minimising the contribution of such backgrounds in the SUSY signal region and studying 
novel Monte Carlo- and data-driven approaches to accurate background estimation. It should be under- 
stood that the background estimates presented here are not intended to be definitive; rather the focus is on 
developing and evaluating the performance of tools for future use. This note should be read in conjunc- 
tion with the notes outlining ATLAS £"™ 1SS performance [3], data-driven estimation strategies for W, Z 
and top backgrounds [4] and inclusive SUSY searches [5]. The main variables sensitive to the presence 
of SUSY signal which have been considered in this note are E™ 1SS and the 'effective mass' defined by 
M e f[ = Ya=\ \Pi{ii) \ +Ej lss [5], where the sum runs over the four leading jets satisfying the conditions 
described below. 

Wherever possible events appearing in £"™ 1SS or M e ff distributions have been selected with a standard 
jet selection requiring at least four jets with pj > 50 GeV and \ r\ \ < 2.5, at least one of which must have 
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Pt > 100 GeV. In some sections describing studies of new simulation techniques with limited available 
simulation statistics these cuts have been relaxed - these cases are highlighted in the text. In addition 
the performance of the different background estimation techniques has been compared by assessing the 
uncertainties of the background estimates for events passing a standard baseline set of cuts for the jets + 
£"™ ss + 0-lepton (1-lepton for Section 5.2) channel described elsewhere in this volume [5]. 

2 Fake£^ ss rejection 
2.1 Jet fiducialisation in 77 

In QCD multijet events the generation of large fake £"™ 1SS from jets falling in poorly instrumented regions 
of the detector is a significant mechanism by which such events mimic SUSY signal. Conversely the large 
number of such events means they can provide a useful in-situ probe of the the ability of the detector to 
measure Ej lss . Here we consider the use of such events for defining non-fiducial regions where jets may 
be expected to be poorly measured and hence capable of generating significant fake £"™ ss . 

The £"™ ss resolution of the ATLAS detector is known to scale with y/LEj [6], where LEj is the 
scalar sum over the transverse energies of all calorimeter objects. For this reason the £"™ 1SS -significance 
defined by 5 = £"™ ss / ^/L^t is frequently used to distinguish real Ej lss from fake Ej lss . Because QCD 
multijet events typically have little energy in invisible particles the value of (S) is determined by the Ej lss 
resolution of the detector. Much of the variance in S is due to intrinsic shower fluctuations, but because 
the reconstructed £"™ ss is dominated by the transverse energy near the highest-p r jets, some is due to 
the non-uniformity in energy resolution of the detector. A measurement of (S) in a sample of events in 
which one of the highest-/^ jets points into a particular region of the calorimeter can therefore be used 
as a measure of the relative performance of that region. 

In this study, the E™ ss -significance of QCD jet events was used to calculate (S) in r\ bins across the 
calorimeter. Each event S value was used in two T7 bins - one for each of the two highest-/?r jets. The 
same techniques could be used generate a full (t],0) map of the calorimeter. A sample of PYTHIA [7] 
simulated QCD jet events with pj > 280 GeV and corresponding to an integrated luminosity of 23.8 pb _1 
was used for the study [8]. Events were required to pass the 160 GeV high-level single-jet trigger [9] 
and to possess at least two high-pr jets with pr(j\) > 100 GeV and priji) > 50 GeV. These two jets 
were required to be back-to-back in the transverse plane such that % — \ A<j>(ji, J2) I < 0.4. Events were 
vetoed if they contained an isolated lepton with pj > 10 GeV and \r\ \ < 2.5 or if £"™ 1SS was greater than 
80 GeV. The A(j) and £"™ ss cuts eliminate contamination from events with significant true £"™ 1SS , eg. Z 
(^vv ) + jets and heavy quark jets. While the E™ 1SS cut does, in principle, prohibit the identification of 
catastrophically unresponsive regions with this method, regions with consistently poor response would 
be readily identifiable via the relative deficiency of events with jets pointing into them. 

The resulting calorimeter map is shown in Fig. 1. The degraded response in the tile barrel and tile 
barrel-extended barrel gaps (r) ~ 0 and 0.6 < |t]| < 0.8) and the LAr barrel-endcap and hadronic LAr- 
forward calorimeter transition regions (1.3 < \r}\ < 1.5 and 3.1 < |t]| < 3.3) are all clearly visible. The 
principal application of these data is to define non-fiducial regions of the detector, such that events with 
jets pointing into these regions can be rejected. Defining a cut value (S) m i n on (S), a region is defined 
to be non-fiducial if (5) > (S) m i n in all of the corresponding bins in the calorimeter map (Fig. 1) and if 
(S) — (S) m i n > 3e in at least one of those bins, where e is the uncertainty on the mean (S). Fig.l also 
shows the non-fiducial regions obtained in this way with (5) m i n = 0.95. 

The jet fiducialisation process has been tested on the QCD jet sample which was used to define 
the regions and also on SUSY events from model point SU3 [10]. Events were required to satisfy the 
jet cuts listed in Section 1 and events with a jet with pj > 40 GeV pointing into one of the non-fiducial 
regions depicted in Fig.l were rejected. The efficiency of these cuts for the QCD and SUSY samples was 
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Figure 1: (£™ ss / 'y/LE T ) vs. Tfy, where is the first or second highest-/? r jet in each event (points). The 
non-fiducial regions selected with a cut at 0.95 (dashed line) are also shown (shaded areas; see text). 
These regions correspond to the LAr barrel-endcap and HEC-forward transition regions. 



(74.8 ±0.4)% and (75.4 ±0.6)% respectively for£™ iss > OGeV, and (67.5 ±3.7)% and (75.5 ±0.7)% 
for Ej lss > 100 GeV. The effect of these cuts on the simulated data-samples is clearly small however 
they may be of more use with real data acquired with an imperfect detector, and the technique could be 
used as an on-line monitor of calorimeter performance. 



2.2 Jet-Ej lss <j> correlations 

One of the main methods used to reduce the QCD multijet background in £"™ 1SS + jets inclusive SUSY 
searches at the Tevatron has been the elimination of events in which the £"™ ss is closely associated with 
one of the leading jets in the transverse plane [11, 12]. This section provides a brief exploration of this 
method of backsround elimination in ATLAS. 




A<Kjl,P T ) A<Kj1,!E T ) 



Figure 2: The \A(j)(j\,E^ iss )\ - \A^)(j 2 ,E^)\ plane for QCD multijet (left) and SUSY SU3 events 
(right) passing SUSY jet cuts and £™ iss > 100 GeV. 

Fig. 2 shows the \A(j)(ji,Ej lss )\ — \A(j)(j2,E™ 1 ^)\ plane for events from the QCD multijet sample 
described in Section 2.1 which pass the jet cuts described in Section 1 and possess £"™ 1SS > 100 GeV 
(note that this sample has at least one jet with pj > 280 GeV - significantly higher than the SUSY 
leading-jet pj requirement of 100 GeV). Almost all events are confined to small regions around (n,0) 
and (0, n), with a small number of events in which the £"^? 1SS is not associated with one of the two highest- 
Pt (reconstructed) jets lying outside these regions. For comparison, the \A(j)(ji,Ej lss )\ — |A^»(72,£'™ SS )| 
plane for events from a SUSY (SU3 model) sample [8] passing the above cuts is shown in Fig. 2. No 
such correlation with £"™ ss is observed for this SUSY benchmark point. 
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Figure 3: The A0 min distribution for QCD multijet (solid line) and SU3 SUSY (dashed line) events 
passing the SUSY jet cuts and with £"™ ss > 100 GeV, as described in the text. Good discrimination is 
achieved with a cut at A0 m i n = 0.2 (vertical dotted line). 

For the purposes of background reduction, this technique can be generalised by considering three 
correlations between the £"™ 1SS vector and the leading three jets by defining A0 m i n as: 

A0 rain = Mn(A(j>(j u Ef ss ),A(j>(j2,Ef ss ),A(j>(h,E^)). (1) 

The A0 min distribution for the QCD and SUSY samples passing the SUSY jet cuts and with £™ iss > 100 
GeV are shown in Fig. 3. Also shown is the position of a cut at A0 min > 0.2 used in Ref. [5]. 

2.3 Calorimeter and tracking cuts 

Events in which jet reconstruction problems lead to fake £"™ 1SS can be identified with several variables [3]. 
Cuts on the fraction of the total energy of the jet (£"Totai) deposited in the outermost layers of the tile 
calorimeter E T u e2 and in the hadronic endcap calorimeter £" H ec can be defined to veto events with likely 
problems in jet containment. Additional cuts on the fraction of the jet energy deposited in the cryostat 
between the tile and the liquid argon (£"cryo - estimated from the energy in the closest calorimeter layers) 
and in the gap and crack scintillators (£"oap) can be used to reject events with large depositions in dead 
material. It is also possible to identify events in which there have been problems reconstructing the 
missing transverse momentum by comparing the value calculated from charged tracks (Ejj* k ) with the 
usual calorimetric £"£" ss . 



Table 1: Cross-section of SUSY SU3 signal and QCD jet backgrounds before and after application of the 
selection and cleaning cuts proposed in the text. Quoted statistical uncertainties in cross-sections after 
cuts derive from statistics of the Monte Carlo samples used. 



Sample 


No cuts [pb] 


Selection cuts [pb] 


Cleaning cuts [pb] 


SUSY SU3 

QCD (280 GeV < p T < 560 GeV) 
QCD (560 GeV < p T < 1120 GeV) 
QCD (1120 GeV < p T < 2240 GeV) 


27.51 
1.25 xlO 4 
360.0 
5.71 


9.654 ±0.019 
17.2± 1.3 
4.22±0.13 
0.434 ±0.003 


6.792 ±0.016 
9.7 ±1.0 
1.805 ±0.082 
0.130 ±0.002 



Table 1 shows the cross-section of SU3 signal and multijet background following application of 
the SUSY jet cuts described in Section 1 together with a cut requiring £"™ 1SS MOO GeV, and in addition 
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applying the fake £"™ 1SS cleaning cuts discussed above. The latter cuts were applied to jets with pj > 150 
GeV with the following acceptance criteria : -ETik^/^Totai < 0.1, -Ecryo/^Totai < 0.2, -Ebap/^Totai < 0.2, 
^HEc/^Totai < 0.5 and Ej^ k — £"™ ss < 50 GeV. Note that the specific cut values are analysis dependent 
and should be adapted for other cases. The table shows that the cleaning cuts are effective at removing 
between 40% (low pj) and 70% (high pj) of the multijet background, while rejecting ~30% of the 
remaining signal after selection cuts. 

2.4 Cosmic backgrounds and rejection cuts 

Searches for supersymmetry must contend with backgrounds in which fake £"™ 1SS arises from a variety 
of sources. Examples in which the fake £"™ 1SS is generated independently from a hard scatter include 
dead or noisy calorimeter channels, accelerator-related background and cosmic rays in which a muon 
undergoes a hard bremsstrahlung. A range of data-cleaning tools is being prepared to reject fake Ej lss in 
ATLAS; here we focus on timing information from the hadronic tile calorimeter (TileCal). As one gains 
a better understanding of the first data, this will be combined with information from other subdetectors. 

Calorimeter timing can be a powerful tool to remove fake £"™ 1SS backgrounds from cosmic rays [13]. 
Timing in ATLAS is defined such that particles from the nominal interaction point (x = 0, y = 0, z = 0) 
will arrive in the calorimeter cells at time t = 0. Cosmic ray events, however, will arrive at random times 
with no correlation to the LHC beam structure. One of the methods to reject cosmic rays is to calculate 
the "up-minus-down" time, in which one divides the TileCal into two segments, an upper segment, with 
<j) > 0 and a lower segment with <j) <0, where 0 is the angle in the transverse plane. One then calculates 
the average time of cells in each segment, weighted by the energy of each cell i: 

^p(down) = £(^ UP(d ° Wn) X ti )/ZEf d -K (2) 

The difference between the two quantities t up and ?down, should reflect the average time-of-flight. The 
resulting "up-minus-down" time should be centered at t = 0 for particles from the interaction point, as 
demonstrated in Fig. 4. In this figure, the "up-minus-down" time is calculated for a simulated Monte 
Carlo QCD dijet sample. As expected, the signal distribution peaks near zero. Note that the simulation 
includes electronic noise, and so the timing distribution has a spread of several ns. 

Depending on their trajectory, cosmic ray muons traveling from the top to the bottom of ATLAS 
at near the speed of light will have a time-of-flight of typically 1 8-20 ns over their travel distance of 
approximately 6-7 m [14] in the TileCal. This is demonstrated in Fig. 5 for a simulated Monte Carlo 
sample of cosmic ray muons. In this case, the distribution peaks near -18 ns (the time of flight is negative 
as expected for a particle traveling from the top to the bottom). The width of the simulated distribution is 
3 ns, allowing for a good separation of the background from the signal peaking at t = 0. Further details 
of calorimeter timing studies with cosmic rays in ATLAS can be found in Ref. [14]. 

3 Monte Carlo systematics 

3.1 Generator systematics: proton PDF and underlying event 

In this section, uncertainties on the Monte Carlo generation of QCD events arising from proton parton 
distribution functions (PDFs) and modeling of the underlying event are briefly investigated. These un- 
certainties provide a limit to the precision with which the novel detector simulation strategies described 
in Sections 4.1 and 4.2 can model QCD backgrounds to SUSY. Due to the large number of Monte Carlo 
samples required for this study, each with different PDF or underlying event parameters, it was unfeasible 
to work with GEANT4 detector simulation samples. Therefore dedicated samples of events were gener- 
ated in the same pj ranges as the GEANT4 samples used elsewhere in this note [8] and passed through 
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Figure 4: "Up-minus-down" timing distribu- 
tion in TileCal for a simulated Monte Carlo di- 
jet sample. As expected, the distribution peaks 
near 0. 
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Figure 5: "Up-minus-down" timing distribu- 
tion in TileCal for a simulated Monte Carlo 
cosmic ray sample. As expected, the distribu- 
tion peaks near -18 ns. 



the ATLAS fast detector simulation [15]. For all distributions shown ii 
selected with the common jet cuts described in Section 1. 



this section, QCD events were 




Figure 6: TheM e ff distribution for simulated QCD events satisfying SUSY jet cuts described in Section 1. 
The left plot shows the number of events per bin, scaled to lfb 1 and the right plot shows the fractional 
difference with respect to CTEQ6. 1M. In each case, the solid histogram and shaded band shows the 
results from the CTEQ6. 1M proton PDF. The dashed and dotted histograms show the results from the 
MRST2004 and CTEQ6L1 PDFs, respectively. The error bars reflect the Monte Carlo statistics. 



Proton Parton Distribution Functions All calculations of cross-sections at the LHC rely on a knowl- 
edge of the proton parton distribution functions (PDFs). High-pr QCD events will be particularly sensi- 
tive to the gluon PDF at large x which, in current global fits [16-19], is relatively poorly known. In this 
section, the impact of the PDF uncertainties on distributions of observables sensitive to the presence of 
SUSY signal is estimated. 

Events were generated using PYTHIA 6 . 403 [7], with the following PDFs: CTEQ6L1 [20], CTEQ6.1M [17] 
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and MRST2004nlo [21]. Note that CTEQ6L1 is a leading order (LO) PDF, while CTEQ6.1M and 
MRST2004nlo sets are both next-to-leading-order (NLO) sets. CTEQ6.1M provides a set of error PDFs 
(Sf), corresponding to the N eigenvector directions. Note that the Sf only reflect the experimental un- 
certainties on the data, and do not include any information on theoretical assumptions used in the fit. 
The total PDF uncertainty arising from experimental sources, on a given observable, L, can then be 
constructed according to: 

AE± = </fj (max (±I(S+) T£(S 0 ),£(±Sr) T^ 0 ),0)) 2 ; (3) 

where A£ + (A£~) are the upper (lower) uncertainties on the observable and So represents the central (best 
fit) PDF value. For CTEQ6.1M, there are 40 error PDF sets, Sf, corresponding to N = 20 eigenvector 
directions. Since the uncertainty band obtained from CTEQ6. 1M only reflects the uncertainties of the 
experimental data, the predictions of this PDF are also compared here to those from MRST2004nlo (an 
alternative, independent NLO PDF) and to CTEQ6L1 1 . 

The distribution of M e ff for the different PDFs considered, and the fractional uncertainty with respect 
to the central prediction of CTEQ6. 1M, is shown in Fig. 6. The CTEQ6. 1M uncertainty band ranges from 
~ 20 — 50%, as M e ff increases. The largest contribution to this band comes from eigenvector 15, which is 
mainly sensitive to the gluon at large x. The results from MRST2004nlo lie well within the CTEQ6.1M 
uncertainty band across the whole range of the distribution. The prediction from CTEQ6L1, a LO PDF, 
lies below those of the NLO PDFs. For M e g- > 1 TeV, the difference is approximately constant and at 
the level of ~ 20%. The validity of using NLO versus LO PDFs, with LO matrix-element generators, 
is discussed in a recent publication [22]. However, these results indicate that the difference between the 
results from LO and NLO PDFs is smaller than the CTEQ6.1M uncertainty band. 



■ PYTHIA- 'ATLAS default' 
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■ PYTHIA- 'no MPI' 
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Figure 7: TheM e ff distribution for simulated QCD events satisfying SUSY jet cuts described in Section 1. 
The left plot shows the number of events per bin and the right plot shows the fractional uncertainty 
with respect to the "PYTHIA - ATLAS default" model for the underlying event. In each plot, the 
solid histogram shows the results from the "PYTHIA - ATLAS default", the dotted histogram shows 
"PYTHIA - Tune A", the dashed histogram shows "PYTHIA - no MPI" and the dot-dashed histogram 
shows "JIMMY - ATLAS default". The error bars reflect the Monte Carlo statistics. 



'Note that the PYTHIA parameters used by default within ATLAS are tuned to the CTEQ6L1 proton PDF. Strictly, if a 
different PDF is used, a retuning of the Monte Carlo parameters to data should be performed. This has not been done in the 
current study, and this should be bourne in mind when interpreting the results. 
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Underlying Event Monte Carlo generation of QCD events is also sensitive to the underlying event. 
Multi-parton interactions are modelled in several Monte Carlo generators. For this study, the PYTHIA 
6 . 403 [7], and HERWIG 6 . 507 [23] (used in conjunction with JIMMY v4 [24]) are used. Further details 
of the underlying-event models compared are summarised below. Note that, unless otherwise stated, 
CTEQ6L1 is used as the proton PDF. 

The underlying event models considered are briefly summarised below: 

• "PYTHIA - ATLAS default" : the ATLAS default underlying-event tune for PYTHIA 6 . 4 [8]. 

• "PYTHIA - Tune A" : a model based on the "PYTHIA-TuneA" [25] to Tevatron data. Note that 
since underlying-event tunes also depend on the PDFs, CTEQ5L [26], appropriate to Tune A, has 
been used here. 

• "PYTHIA - no MPI" : a model in which multi-parton interactions have been switched off. All 
other parameters were left at the PYTHIA 6 . 403 defaults. This model does not describe the Teva- 
tron data, but is included for comparison. 

• "JIMMY - ATLAS default" - the ATLAS default underlying event tune for HERWIG 6.5 + JIMMY 
4 [8]. 

The distribution of M e g is shown in Fig. 7. The left plot shows the number of events per bin (scaled 
to 1 fb _1 ) while the right plot shows the fractional ratio with respect to "PYTHIA - ATLAS default". 
As expected, the model with no underlying event contribution, "PYTHIA - no MPI", lies below the 
predictions of all other PYTHIA models. The highest model prediction is given by the "PYTHIA - Tune 
A" model, based on "PYTHIA-TuneA" to Tevatron data. The ATLAS default tunes, "PYTHIA - ATLAS 
default" and "JIMMY - ATLAS default", show differences of ~ 20%. However, some of this difference 
may come from the different normalisations of PYTHIA and HERWIG and not totally due to the underlying 
event model. The overall spread in predictions from these models, is ~ 40% and is approximately 
constant across the range of M eS shown. 

3.2 Jet energy scale uncertainty 

A significant source of irreducible experimental uncertainty in Monte Carlo QCD background estimates 
will arise from the uncertainty in the jet-energy-scale (JES). In this section, a simple estimate of the 
impact of the JES uncertainty is performed. For each event, the energy and momentum of each jet has 
been scaled by a constant factor, corresponding to a 10%, 5%, 3% and 1% uncertainty on the JES. The 
£'^ 11SS has also been adjusted accordingly, by an amount corresponding to the change in the sum of the 
jet momenta in the x and y directions. All such scaling is performed prior to any selection cuts. For this 
study, the QCD dijet events generated with PYTHIA have been used, with a fake £"™ 1SS filter applied to 
enhance the rate of potentially mismeasured events [3, 8]. 

Figure 8 shows the resulting M e g distribution. The shaded bands show the uncertainties on the distri- 
bution for the four different values of the JES uncertainty considered. For a 10% value, the uncertainty 
on the M e ff distribution ranges from ~ 50 — 150%. An improved understanding of the JES to a level of 
5% reduces the uncertainty on M e ff by more than a factor of two, while a 3% value shows an improved 
uncertainty on M eS of between ~ 10 - 30%. If the challenging goal of a 1% JES uncertainty is achiev- 
able, the results indicate that the uncertainty on high-p T events could be much reduced, to a level of 
~ 5 - 10%. 

3.3 Generator comparison: PS vs. ALPGEN 

The recent development of event generators in which regions of phase-space populated by a multi- 
parton matrix-element calculation are matched to those populated by a partem shower algorithm offer 
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Figure 8: TheM e ff distribution for simulated QCD events satisfying SUSY jet cuts described in Section 1. 
The left plot shows the number of events per bin and the right plot shows the fractional uncertainty with 
respect to the central prediction. The shaded bands show the estimated uncertainty on the observable for 
assumed JES uncertainties of 10% (light), 5% (medium-light), 3% (medium) and 1% (dark). 



the prospect of improved simulation of high pj QCD jet events. In this section we compare the physics 
performance of such a generator, namely the leading-order ALPGEN [27] code, with that of a conventional 
stand-alone parton-shower generator (PYTHIA 6 .403 [7]). 

Multijet QCD events were generated with ALPGEN 2 . 05+ JIMMY and with PYTHIA 6 . 403 and the re- 
sulting distributions of observables sensitive to SUSY signal events compared using fast simulation [15]. 
PYTHIA events were generated in different pj ranges of the two leading partons to study the high energy 
tails with sufficient statistics (see Ref. [8]). Such sliced-sample production is also possible in ALPGEN. 
Only events satisfying the standard SUSY jet cuts described in Section 1 were studied. The number of 
PYTHIA events passing the jet selection cuts is 2. 1 times larger than the number of ALPGEN events for the 
same integrated luminosity 2 . To facilitate comparison of the two samples, they were both normalized 
to 1 fb~ ! and ALPGEN samples were further multiplied by a factor of 2.1. Error bars on each histogram 
below are based on the Monte Carlo statistics used in this study. 

Figure 9 shows the pj distribution of the leading four jets for ALPGEN and PYTHIA events. The new 
parton shower scheme available in PYTHIA 6 . 403 can produce hard subleading jets with similar pj to 
those produced by ALPGEN while the two leading jets are somewhat softer. Even though PYTHIA also 
generates 2^2 scattering using matrix-element calculations, the two highest-pr jets are softer because 
additional partons are emitted from the leading partons using splitting functions. 

The use of fast detector simulation for the comparison of event generators described in this section 
prevents accurate study of the impact of generator differences on 'fake' Ej lss distributions. Neverthe- 
less, 'real' £"™ ss events in which heavy quark decays generate neutrinos can potentially dominate in 
the tail of the £"™ ss distribution following application of the cuts described in Section 2 and Ref. [3]. 
Fast detector simulation can legitimately be used to study this background permitting comparison of 
PYTHIA and ALPGEN predictions of E™ ms and M e ff distributions. Figure 10(a) shows the £"™ ss distri- 
butions obtained from PYTHIA and ALPGEN while Figure 10(a) shows the relative difference between the 
distributions of the £™ ss . 

2 No bb or cc pair production events where the heavy quark pair is produced in the hard scatter rather than through gluon 
splitting were generated in PYTHIA. This contribution is 10 % of the total bb cross-section and negligible compared to the total 
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Figure 9: The pj distributions of the four highest-/?/- jets; (a) the leading, (b) the second (c) the 
third and (d) the fourth jet for PYTHIA (open circles) and ALPGEN (open histogram) events. 



PYTHIA predicts somewhat larger i?™ ss than ALPGEN, although limited Monte Carlo statistics prevent 
firm conclusions from being drawn at large £"™ ss . This effect likely originates from the fact that multi-jet 
events containing gluon splitting are not fully included in ALPGEN because of the requirements of matrix 
element - parton shower matching. 3 The equivalent M e ff distributions and the relative differences 
between generators are shown in Figure 1 1 . The M e ff distribution of ALPGEN events is harder than that of 
PYTHIA events, as expected from Figure 9(a) and (b). 



4 Monte Carlo estimates 

4.1 Jet smearing with Transfer Function 

Introduction In this section, a Monte Carlo based detector simulation technique is described which 
models the response function of the ATLAS calorimeter to jets as a function of energy and rj using 
Monte Carlo "truth" information, and then uses this response function to smear the energies of jets and 
other objects in Monte Carlo QCD jet events. The truth-derived response function used here is referred 
to as the particle-jet transfer function (PJTF) and in principle may be measured also from data using 
techniques such as those described in Section 5.1. 

3 To avoid double counting of partons generated by the matrix element calculation and parton shower algorithm, the MLM 
matching scheme employed by ALPGEN requires that matrix element partons satisfy cuts on pj (>40 GeV) and AR between 
partons (> 0.7). As a result of these requirements, processes such as ggbb generating multiple partons with ARm < 0.7 are 
rejected during matching. Events with gluon splitting generated in the parton shower algorithm are however kept. 
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Figure 10: (a) ATLFAST £™ ss distribution for PYTHIA(circles) and ALPGEN (histogram) and (b) 
the relative difference (N PY thia-N A lpgen)/Nalpgen of the Ef 1SS distributions. The shaded band is 
the Monte Carlo statistical error. 




Figure 11: (a) ATLFAST M eff distribution for PYTHIA(circles) and ALPGEN (histogram) and (b) 
the relative difference (N PY thia-Nalpgen)/Nalpgen of the M e g distributions. The shaded band is the 
Monte Carlo statistical error. 



Modeling jets - reconstruction of the PJTF In this section we use the following terminology for 
two different types of jets formed from Monte Carlo events - "particle jets", made from all the generated 
particles in the event except muons and neutrinos, and "calorimeter jets" formed from topological clusters 
in the calorimeter. 

In general there is a high efficiency for reconstructing jets in the calorimeter, even though their energy 
can potentially be mis-measured by a large amount. Jet fragmentation to particles of different types and 
the response of the detector material to these particles in different r\ regions can produce tails in the 
PJTF. It is possible for there not to be a one-to-one match between generator-level jets and calorimeter 
jets even though the same jet clustering algorithms are used. 

To reconstruct the PJTF it is necessary to measure the response of the calorimeter to generator-level 
jets. This is accomplished by matching these to calorimeter jets, and requiring a good jet isolation in 
both jets (separation from nearest jet AR > 0.8), to ensure that accidental jet overlaps do not contribute 
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to the tails in the PJTF. Jet energy responses are modeled with double Gaussians. The ratio of the 
1st and 2nd Gaussian components is empirically known to follow the ratio of energy deposits in the 
electromagnetic and hadron calorimeter and hence the lst/2nd Gaussian ratio is fixed to the mean value 
of £had/(£em + £had) with a fitting procedure. GEANT4 simulated PYTHIA dijet samples with p T > 17 
GeV [8] are used to estimate the PJTF. Reconstructed jet energies are corrected with an energy-density 
based weighting scheme [28]. The fits are performed before the jet energy corrections have been applied, 
i.e. the PJTF is obtained at the calorimeter cell energy calibration level. To use the PJTF on reconstructed 
jets, cell-to-jet energy corrections are applied after the PJTF. For the Ej lss calculation the cell-level 
correction is used directly to retain consistency with full GEANT4 simulation £"™ ss . 

Modeling Ej lss All truth jets (formed from stable truth particles expected to shower in the calorimeter) 
with energy greater than 10 GeV are smeared with the PJTF. This avoids double counting of interact- 
ing particles (electron, photon, tau) due to the potentially different treatment of merging and splitting 
of objects, for example when the photon is close to a jet. This simplification of reconstructing jets 
from all stable showering particles mis-measures the energy scale of isolated electrons, photons and 
taus by a small amount, however the ATLAS procedure of identifying and calibrating electromagnetic 
and hadronic showers separately ensures that the resulting bias is minimised. This study is additionally 
focussed on QCD jet events and hence any such bias is still less important. Muons in the event are 
simulated with the ATLFAST fast simulation code [15] due to the prohibitive time overhead associated 
with full simulation of the ATLAS tracking systems. Finally in order to account for the potentially large 
fluctuation in the underlying event, we vectorially sum all the particles that are not part of jets above 
10 GeV to form a "soft jet". This soft jet is added to the above jet sum to compensate for the contribution 
from soft particles outside of the jet cones. Once all the objects in the event are defined, the £™ ss in the 
event is calculated by summing their four-vectors, enabling comparison of the performance of the PJTF 
technique with full GEANT4 simulation. 

Performance The performance of the PJTF technique was assessed by comparing PJTF predictions 
with those obtained from full GEANT4 simulation using the pj > 17 GeV dijet samples described above. 
The event-by-event comparison was performed using both ATLFAST and fully reconstructed objects in 
the GEANT4 simulation samples. 

The jet multiplicity difference between full GEANT4 simulation and PJTF, and between full GEANT4 
simulation and ATLFAST are shown in Fig. 12. Only jets with pj > 50 GeV and \ r\ \ < 2.5 were used, but 
no event selection based on the jet multiplicity was performed. 




J4 ANjets 2 (= Full -%ast oAt)) J6 ANjets 2 (= Full -Vast orV)) 



Figure 12: Event by event difference in number of jets between full GEANT4 simulation and PJTF 
(blue,solid) and between full GEANT4 simulation and ATLFAST (red,dashed). The left plot is for 
140 GeV < p T < 280 GeV events, and the right one is for 560 GeV < p T < 1 120 GeV events. 
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There is good agreement in jet multiplicity between the full GEANT4 simulation and the PJTF events. 
The ATLFAST events have a somewhat higher multiplicity due to the simplified calorimeter map used for 
jet reconstruction. For the PJTF technique to be effective, it is important that the multiplicity at recon- 
struction level is not biased. This is clearly demonstrated in Fig. 12. Good agreement was found for jet 
Pt > 40 GeV, which can be explained as follows. The topological clustering jet algorithm has a ~100% 
efficiency so if the transfer function is correctly scaling and smearing the energy the jet multiplicity could 
only be biased by jets splitting and/or merging. 

Another performance check is the comparison of the scalar pj sum of the four leading jets obtained 
from full GEANT4 simulation, PJTF, and ATLFAST events. Events passing the typical SUSY jet cuts listed 
in Section 1 are used. Fig. 13 shows the comparisons for 140 GeV < pj < 280 GeV and 560 GeV < 
Pt < 1120 GeV events. Good agreement is observed between the full GEANT4 simulation and PJTF 
events, while ATLFAST events have significant deviations for the same reasons as for Fig. 12. 





Figure 13: Top: distributions of the scalar pj sum of the four leading jets in PJTF (black, solid), ATLFAST 
(red, solid) and fully GEANT4 simulated jets (blue, points). The left plot is for 140 GeV < p T < 280 GeV 
events, and the right plot is for 560 GeV < pj < 1120 GeV. Bottom: the ratio with respect to the fully 
GEANT4 simulated jets. 

In order to check the performance of Ej lss modeling the typical SUSY jet cuts described in Section 1 
were applied, with no £"™ ss cut applied initially. We define fake i?™ lss as follows: 

£-^ = |p~|-|p™~| 

where p™ ss ieco j s the missing transverse momentum vector reconstructed using either the PJTF tech- 
nique, GEANT4 simulation, or ATLFAST, and p'™ ss true j s the transverse vector of the sum of the neutrino 
momenta. 

Fig. 14 shows the fake i?™ ss distribution obtained from the PJTF technique and from full GEANT4 
simulation. Also shown are the ATLFAST results, and the PJTF results without the soft-jet corrections. 
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Figure 14: Fake E™ 1SS distributions from ATLFAST (red hatched), PJTF technique (black, solid) and 
full GEANT4 simulation (blue, points). Also shown with the dashed green line is the PJTF distribution 
without soft-jet correction. The left plot is for 140 GeV < pj < 280 GeV events, and right plot is for 
560 GeV < pj < 1120 GeV. Bottom plots are the ratio of each distribution with respect to the full 
GEANT4 simulation. 




Figure 15: E^ iss (left) and M eff (right) distributions of 560 < p T < 2240 GeV PYTHIA QCD jet events 
simulated with ATLFAST (light/red solid), the PJTF technique (dark/black, solid) and full GEANT4 simu- 
lation (points). Events were required to pass the SUSY jet cuts described in Section 1. 
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Table 2: Numbers of 560 < Pt < 2240 GeV PYTHIA QCD jet events passing cuts for 1 fb _1 data, starting 
from an original total of 3.5 x 10 5 events. Errors are also normalized to 1 fb _1 . The cut definitions are 
described in the text. 



Method 


Cutl 


Cut 2 


Cut 3 


Full simulation 
PJTF technique 
ATLFAST 


(103.8 ±0.3) x 10 3 
(104.9 ±0.3) x 10 3 
(120.0 ±0.3) x 10 3 


(65.7 ±0.3) x 10 3 
(62.3 ±0.2) x 10 3 
(86.1 ±0.3) x 10 3 


4.3±2.1 
2.2±1.5 
2.1 ±1.5 



It can be seen from the figure that £"™ ss estimated with the PJTF technique and with full GEANT4 
simulation agree at the 20% level for fake £™ iss < 80 GeV (200 GeV) for 140-280 GeV (560-1120 
GeV) dijet events. The figures highlight the improvement in performance relative to the naive ATLFAST 
estimate of £"™ ss , with the agreement between PJTF and full GEANT4 simulation persisting down into 
the tail region a factor 10~ 3 below the peak. 

Fig. 15 shows the performance of the PJTF technique for estimating QCD backgrounds to the E™ 1SS 
and M e ff distributions obtained following application of the SUSY jet cuts described in Section 1. In 
these plots the event- weights and errors are normalized to 1 fb~ 1 . Good agreement is seen between the 
PJTF distributions and the full simulation distributions over the full pj range, while the naive ATLFAST 
distributions show larger deviations from the full simulation results. 

Additional cuts equivalent to the full SUSY cuts described in Section 1 were applied to assess further 
the consistency between the PJTF results and those obtained with full GEANT4 simulation. In Table 2 the 
remaining numbers of events after each stage in the event selection are shown. Cut 1 is the standard 
SUSY jet cut applied above. Cut 2 adds the requirement A(j) min > 0.2 (see Section 2.2) while Cut 3 
requires additionally E^ iss > max(100 GeV, 0.2M e ff) and M eS > 800 GeV. Despite limited statistics the 
results show that the PJTF approach provides reduced event selection bias in comparison with the naive 
ATLFAST simulation. 

4.2 Fast GEANT4 simulation 

Introduction The simulation of the ATLAS detector is currently based on GEANT4 v. 4. 8. 3 [2]. 
GEANT4 simulation is very time consuming however and so its use for QCD background simulation 
at moderate or low pj is not viable. The fully GEANT4-based approach (refered to as full simulation in 
the following) is especially slow in the simulation of electromagnetic cascades in the calorimeters due 
to their complicated geometry. A fast simulation tool (ATLFAST [15]) based on Gaussian smearing is 
also available, and is used for studies of systematic uncertainties elsewhere in this note, but it does not 
reproduce with sufficient accuracy the tails of distributions interesting for SUSY analysis, i.e. £™ 1SS or 
M e ff. For this reason, an intermediate approach to the simulation of the ATLAS detector, based on the 
parameterisation of the calorimeters' response, is under study [29]. In the so-called 'fast' GEANT4 sim- 
ulation, electrons entering the calorimeters may receive different treatments according to their energy. 
For high energy particles a shower parameterisation is used while for medium energy particles (below 1 
GeV) the detector response is taken from a shower library ("frozen showers"), and low energy particles 
(below 10 MeV) are "killed" by depositing their energy in a single spot in the calorimeter. 

Performance We tested fast GEANT4 simulation algorithms by simulating PYTHIA [7] dijet events with 
moderate transverse momentum (280 GeV < pj < 560 GeV). The fast GEANT4 simulation can be per- 
formed using different options, which differ in the treatment applied to electromagnetic particles (shower 
parameterization, frozen showers or "killing") in different parts of the calorimeters (for more details, 
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see [29]). In the following, the fast GEANT4 options are numbered from 1 to 3, the first one being the 
most conservative option, i.e., the closest to full simulation, and the third one the fastest option currently 
available. The measured fast GEANT4 simulation time is ~60% lower than that required for full simula- 
tion. The three fast GEANT4 options differ among themselves by only a few percent of the full simulation 
time. To study the effect of the fast GEANT4 simulation algorithms in SUSY searches, we compared the 
distributions of typical quantities, i.e. the jet multiplicity and transverse momentum, £"™ ss , transverse 
sphericity, and effective mass in 5000 fast GEANT4 simulated events with the ones obtained from full 
simulation (see Figs. 16, 17 and 18). We reconstructed jets with pj > 50 GeV and \r\\ < 2.5. Due to 
the limited number of Monte Carlo events, no requirement was applied to the jet multiplicity, except in 
the Ej lss and M e ff distributions, where the standard SUSY jet cuts described in Section 1 were applied. 
Transverse sphericity and effective mass quantities were calculated only for events with at least 2 jets. 

Whilst the largest jet transverse momentum is well reproduced in the fast simulated samples, the 
average jet multiplicity is lower than that of the fully simulated sample by 1-2%. The most conservative 
sample (option 1) is in agreement with the full simulation distribution inside the statistical uncertainty. 
The width of the jet pj resolution is consistent across the samples, however the jet energy scale is too 
low by 1-2% (see Fig. 17). Due to the limited number of events available only a qualitative comparison 
is possible in regions of interest for SUSY searches for E™ 1S& (higher than 100 GeV) and transverse 
sphericity (larger than 0.2). Within the statistical precision possible the fast simulated distributions agree 
well with those obtained from full GEANT4 simulation. The size of the available samples preclude a full 
assessment of performance for the baseline SUSY cuts described in the introduction, since no events pass 
all cuts when using any of the three options. Similarly, from an equivalent number of GEANT4-simulated 
events, none pass all the cuts. 
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Figure 16: Reconstructed jet multiplicity (left) and highest jet pj (right) for PYTHIA dijet samples simu- 
lated with different options. The statistical uncertainty only is shown for the full simulation sample. 

Overall, the fast GEANT4 simulation has been shown to be a promising faster alternative to the full 
GEANT4 based simulation. The reconstructed quantities in fast GEANT4 simulated samples are in good 
agreement with full simulation. Work is on-going to further reduce the simulation time, and to improve 
the modeling of the jet energy scale. 
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Figure 17: Leading jet pj resolution (left) and £"™ ss (right) for PYTHIA dijet samples simulated with 
different options. The statistical uncertainty only is shown for the full simulation sample. 
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Figure 18: Transverse sphericity (left) and M e g (right) for PYTHIA dijet samples simulated with different 
options. The statistical uncertainty only is shown for the full simulation sample. 

5 Data-driven estimates 

5.1 Jet smearing in the zero-lepton mode 

Introduction The inherent systematic and statistical uncertainties in Monte Carlo based QCD back- 
ground estimates will limit their use until sufficient data have been acquired to understand both the 
ATLAS detector and the underlying physics of QCD processes at 14 TeV. For this reason data-driven 
background estimates with minimal reliance on Monte Carlo simulation will be a priority for the early 
phase of data-taking. 

The method described in this section reduces dependence on Monte Carlo simulation by smearing jet 
transverse momenta in low £"™ ss QCD multijet data with a data-measured jet response function R defined 
as the distribution of event-by-event ratios of measured jet pj to true jet pj. This response function could 
potentially be used inter-changeably with the Monte Carlo-truth derived Particle Jet Transfer Function 
(PJTF) discussed in Section 4.1. Consequently the method described below provides a route to practical 
realisation of the PJTF technique for use with real data. The technique can be broken down into three 
distinct parts which are outlined below. 



1578 



SUPERSYMMETRY - ESTIMATION OF QCD BACKGROUNDS TO SEARCHES FOR . 



Step 1: Gaussian response function measurement The "£"™ 1SS projection method" [30] applied to y 
+ jet events allows a measurement of the Gaussian response R G of the ATLAS calorimeters to jets. The 
limited number of events prevent the measurement of the non-Gaussian tail with this technique (see Step 
2). The method uses transverse momentum conservation to constrain the pj of all activity associated with 
a reconstructed jet with the pj of a hard photon. R G is obtained from the distribution of the photon-jet 
pj balance R\. 

where pr(y) is the photon transverse momentum, and p™ lss is the missing-transverse-momentum two- 
vector. 

A GEANT4-simulated PYTHIA [7] sample equivalent to 23.8pb _1 of 7 + jets events was used, with 
events required to pass a 60 GeV single-photon trigger [9]. With the additional requirement that events 
should contain one and only one reconstructed jet, the jet response was measured with the distribution 
of R\ values defined above. The R\ distribution was measured in a number of photon p T bins, and each 
was fitted with a Gaussian function. The standard deviations from these fits to the response distributions 
are shown in Fig. 19 with the statistical error bars derived from the fit errors. The pj dependence of the 
widths of the R\ distributions was fitted with the parametric form 

o R =A + -^— + — , (5) 
VPt Pt 

shown in the figure to describe well the GEANT4 simulated data. The sampling (A) and stochastic (B) 
terms in the above formula were used to calculate the width of R G as a function of jet pj, while the 
constant term (C) was used to estimate the additional smearing of £™ 1SS not associated with jets (see 
Step 3 below). 




Figure 19: Standard deviations of Gaussian fits to measured response distributions vs. p\. The fit is of 
the functional form o R (p T ) = A + Bp T ~ l l 2 + Cp T ~ l ■ 



Step 2: Full (Gaussian + non-Gaussian) response function measurement In order to reproduce the 
tail of the QCD multijet £"^? 1SS distribution it is necessary to characterise the non-Gaussian response of 
the calorimeters to jets. The next stage in the technique therefore uses events in which the £"™ ss vector 
can be unambiguously associated in 0 with a single jet / to measure that response. The response of the 
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ATLAS calorimeters to jet / if its pj lies in the non-Gaussian tail can be obtained from 

p r (/)-p r (7,true) 
Rl= | Pr (/,true)| 2 ' (6) 

where pr(J) denotes the reconstructed transverse momentum of jet /. If it is assumed that the E™ ss in 
these events is dominated by fluctuations in Pt(J) then p r (/,true) can be approximated by 

p r (/,true)~p r (7)+p™ ss . (7) 

The non-Gaussian response of ATLAS, R NG , is measured with the distribution of event R2 values. 

QCD jet events from the sample described in Section 2.1 were required to pass one of the high-p-f 
multijet or £"™ 1SS triggers [9], and required to contain at least three jets with pj > 250, 50 and 25 GeV, 
respectively. Events were also required to possess E™ 1SS > 60 GeV. Events were further required to 
contain a jet unambiguously associated with the £"™ 1SS vector - i.e. events with one and only one jet 
parallel or anti-parallel to the £"™ 1SS vector. Of the remaining jets in each event, the two with the highest 
pj were required to have pj > 250, 50 GeV. In this way, the selected events predominantly had a 
'Mercedes' type configuration, with £"™ 1SS parallel or anti-parallel to the p T of one of the jets. The 
p T of this jet was used in Eqn. 6 to measure R 2 . The estimate of the jet response non-Gaussian tail 
R NG obtained from the distribution of R2 values is plotted as the histogram in Fig. 20(right) with a 
normalisation obtained with the procedure described below. Although not considered in detail here, the 
W(— ► TV) + 2 jet background could have been removed with a cut on the multiplicity of inner detector 
tracks per jet. 

Next we combine the Gaussian and non-Gaussian components of the jet response R measured pre- 
viously. The relative normalisation of the Gaussian (R G ) and non-Gaussian (R NG ) components can be 
measured using the balance of transverse momenta of jets in dijet events. The relative normalisation can 
be obtained from the ratio of the numbers of dijet events with respectively one and zero jets with pj lying 
in the non-Gaussian tail of the jet response function. We therefore need to define a variable which can 
classify dijet events according to the number of jets with pj lying in the non-Gaussian tail. A suitable 
variable is provided by Ri(j), defined as the projection of the transverse momentum of each jet / in the 
event onto the event £™ ss : 

j? f;\-u Pr 1SS -Pr(/) , R x 

which measures the response to the other jet j in the event (cf. Eqn. 4). If R3U) lies below some threshold 
then jet j can be considered to lie in the non-Gaussian tail. 

Events were selected from the QCD jet event sample used above with the requirement that they 
passed a 160 GeV single-jet trigger requirement [9] and contained two and only two jets, back-to-back in 
the transverse plane. The distribution of Rs(j) for these events is plotted as data-points in Fig. 20(right). 
The ratio of the integral of the low tail of the Rs(j) distribution to the integral of a Gaussian function fitted 
to the peak is used to obtain the relative normalisation of the non-Gaussian and Gaussian components of 
the jet response function. The full jet response function including normalised Gaussian and non-Gaussian 
components is plotted in Fig. 20 (left). 

As a 'closure test' of the reconstruction of the full response function, jets in a subset of the selected 
dijet events with low £"™ 1SS -significance (see Section 2.1) were smeared to reproduce the Rs(j) distribu- 
tion. The resulting distribution is shown in Fig. 20 (right) for comparison with 'data'. 

Step 3: Seed event selection and jet pj smearing In order to estimate the £"™ 1SS and M e g distributions 
of QCD multijet events the jet response function R measured in Step 2 was used to smear jet transverse 
momenta in multijet events with low £"™ 1SS (referred to below as 'seed events'). Seed events were 
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Figure 20: Left - smearing function for a jet of 250 GeV (thick line), with Gaussian and non-Gaussian 
components (right and left facing hatches respectively) shown separately. Right - dijet balance distribu- 
tion (points) compared with the equivalent estimated distribution obtained from the jet response function 
to provide a 'closure test' of the technique. Also shown are a Gaussian fit to the region 0.8 < Ri(j) < 1.15 
(thick line), and the non-Gaussian tail distribution (dashed histogram) measured with 'Mercedes' events 
normalised to the tail of the dijet balance distribution. 

selected from the same PYTHIA jet sample used in Step 2. Seed event candidates were required to pass 
one of the ATLAS high-/^ jet triggers [9] together with the standard SUSY jet selection cuts described 
in Section 1 with a reduced jet pj threshold (priji) > 45 GeV for i = 2, 3,4) in order to avoid biasing the 
final estimate after smearing. Both the £"™ ss significance (see Section 2.1) and the equivalent quantity 
constructed from £"™ ss derived only from in-cone jet energy were required to be less than 0.5 GeV 1 / 2 
and 0.7 GeV 1 / 2 , respectively. The latter of these two cuts ensures that the hadronic activity in the selected 
seed events is well contained in the jets that will be smeared. 

Smeared events were constructed from each selected seed event by smearing the transverse momenta 
of their constituent jets with the jet response function R determined in Step 2. The Ej lss of smeared 
events was calculated by replacing the contribution to the Ej lss from the pj of seed jets with a contri- 
bution from the pj of the equivalent smeared jets. An additional contribution p 1 ™^ to p™ ss generated 
by the constant term measured in Step 1 was also taken into account. The smeared £"™ ss was therefore 
given by the magnitude of 

P? W = Pijc -£PrU) +X>rC/>), (9) 

where primed quantities are smeared. 

Fig. 21 shows the i?™ lss andM e ff distributions for 23.8pb _1 of GEANT4 simulated 'data' and smeared 
seed events passing the SUSY jet cuts listed in Section 1. The estimate was normalised to GEANT4 
simulated PYTHIA QCD jet events ('data') with E^ iss < 50 GeV and somewhat tighter jet cuts (j>t{U) > 
60 GeV) imposed to ensure a reasonable sample of 'data' events obtained with the multi-jet triggers [9]. 

The dominant sources of systematic uncertainty in the estimate were pj bias in the selection of events 
and finite statistics in the non-Gaussian tail measurement. Uncertainties on the A, B, and C parameters in 
Eqn. 5 and the relative normalisation of the Gaussian and non-Gaussian parts of the jet response function 
were also considered. 

Good agreement can be seen between the estimated and GEANT4 'data' £"™ ss and M e ff distributions. 
Applying the full SUSY cuts described in Section 1, 2.36 ± 0.09 (stat) ±1.44 (syst) events are estimated 
versus 1 'observed'. The uncertainty in the estimate is therefore ~ 60% for 23.8 pb _1 . If these figures 
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Figure 21: £"™ 1SS (left) and M e ff (right) distributions for smeared events and GEANT4 'data' passing 0- 
lepton SUSY jet cuts. Also included for comparison are 23.8 pb~ ! of SUSY (SU3) and the summed 
contribution from Z -^vv + jets, W —> lv + jets and tt +jets. 

are extrapolated to 1 fb _1 then the equivalent calculated uncertainties are ~ 0.6% (stat.) and 12.6% 
(syst.), with the systematic uncertainty reduced through access to increased statistics at step 2. However, 
residual Standard Model (e.g. W(—> tv) +2 jets) contamination of the 'Mercedes' control sample has 
not been included in these uncertainty estimates and therefore we conservatively assume the same 60% 
systematic uncertainty for 1 fb _1 of data. Systematic bias in the background estimate caused by the 
presence of SUSY signal is expected to be small - in particular the fractional contribution of SUSY 
events to the £"™ ss normalisation region is < 10~ 4 . 

5.2 Lepton isolation in the one-lepton mode 

Introduction Although QCD multijet background is not expected to be significant after application of 
SUSY signal cuts in the 1-lepton mode [5], it still must be estimated. This will be particularly impor- 
tant when preparing control samples for data-driven background estimates of other backgrounds; it is 
important to ensure that such samples (which will have looser cuts than the final SUSY selection) are not 
significantly contaminated by QCD background. The technique studied here is also relevant for QCD 
backgrounds in other lepton+jets channels, such as W(—> Zv)+jets or tt in the semileptonic channel, al- 
though the kinematic cuts used in searches for Supersymmetry (e.g. lepton + E™ 1SS transverse mass, Mj 
> 100 GeV) might select different background mechanisms. 

The method, which has been used at the Tevatron, relies on the (near) independence of E™ 1SS and 
lepton isolation. The shape of the £"™ ss distribution is estimated from multijet events containing a 
non-isolated lepton. This distribution is then normalized to the number of events containing an isolated 
lepton, but with low £"™ ss . The method, if valid, would be complementary to the jet-smearing technique 
described in Section 5.1 applied to one-lepton events, thereby allowing an important crosscheck of the 
systematic uncertainties of the two methods. 

In evaluating this method, it is important to understand potential correlations between isolation and 
E™ ss . One possible mechanism can arise when there are multiple sources of leptons in QCD multijet 
events; if the sources each have a different Ej lss shape, and the relative mixture of the sources changes 
with lepton isolation, this will appear as a correlation between £"™ ss and isolation. A possible example 
involves electrons arising from jet mis-identification and electrons from heavy-flavour decay. Another 
example might involve leptons from charm versus leptons from bottom quark decays. Another possible 
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source of correlation arises from the kinematics of the heavy quark itself; the pj of the lepton, the 
associated neutrino, and the remainder of the heavy quark fragmentation (which will be related to the 
isolation) are all coupled. 

Preliminary indications from studies of GEANT4 simulated QCD dijet samples suggest that the pri- 
mary source of electrons is a jet faking an electron, while the muon case is dominated by the decay of 
B hadrons. Thus the issue of multiple background sources alluded to above is not a serious concern 
according to these Monte Carlo studies, but should be kept in mind for real data. 

We first study the applicability of the lepton isolation technique in a pure sample of bb + jets. We 
then move to a more realistic sample containing a mixture of tt and W + jets events in addition to the bb 
+ jets sample, focussing on the muon channel. The dominant systematic is found to be contamination of 
control samples by tt events. 
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Figure 22: Points: Ej lss distribution for the non-isolated lepton sample. Histogram: i?™ ss distribution 
for the isolated lepton sample. The two distributions have been normalized to the same area in the region 
Ef™ = [10,20] GeV. 

Lepton isolation versus £"™ 1SS for bb + jets The dependence of £'™ 1SS on lepton (e,/i) isolation was 
studied with a sample of to + jets events generated with ALPGEN [27], corresponding to an integrated 
luminosity of about 200 pb _1 . At the event generator level, one of the b quarks was forced to undergo 
semileptonic decay to e or ji. ; also applied at the event generator level were filters requiring true £™ ss 
> 30 GeV, four or more truth jets with p T > 40 GeV, a leading truth jet with p T > 80 GeV and truth 
lepton with pj > 10 GeV. In the offline analysis, the events were required to satisfy the SUSY jet cuts 
described in Section 1. The events were required to have one and only one lepton with pj greater than 20 
GeV, where the lepton could be either an electron or muon. Events with additional leptons (with pr>\0 
GeV) were rejected. 

A (non-isolated) control sample was defined by the requirement E T one = [10,20] GeV, where E T one 
is the Et inside an r\ — (j) cone of radius 0.2 centered around the lepton, excluding the lepton pj. The 
(isolated) signal sample was defined as E T one < 10 GeV. The £"™ ss distribution of the control sample was 
normalized in the region £"™ ss = [10,20] GeV to the signal sample. This normalized Ej lss distribution 
was used to estimate the i?™ ss distribution for isolated leptons. Fig. 22 shows the £"™ ss distribution for 
the isolated and non-isolated samples. The agreement is good, suggesting that the correlation between 
£"™ ss and lepton isolation in bb + jets events is small. 
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It is also apparent from Fig. 22 that the absolute level of background is low, even though typical 
SUSY selection cuts on E™ 1SS , transverse mass and effective mass (typically £"™ ss > 100 GeV, Mj 
> 100 GeV, and M eff > 800 GeV) have not yet been applied. We will return to this point later. The 
M e ff distribution after the jet and lepton selection cuts is shown in Fig. 23(left). Ideally, one would like 
to demonstrate that the lack of correlation between lepton isolation and i?™ ss still holds even after the 
other SUSY selection cuts, such as those on Mj and M e ft, have been applied. This was not possible in 
this study with fully simulated data due to the limited number of events available. 




Figure 23: Left: distribution of the effective mass in the bb +jets sample after the jet and lepton selection 
cuts. Right: £"™ ss distributions for tt isolated muon sample (black filled circle), tt non-isolated muon 
sample (red filled square), W + jets isolated muon sample (black open circle), W + jets non-isolated muon 
sample (black open square), bb isolated muon sample (blue open triangle), bb non-isolated muon sample 
(magenta filled triangle). 



Contamination from W + jets and tt An important systematic effect comes from the contamination 
of the control samples by W +jets and tt events. This was studied with fast simulation samples which had 
no generator-level cuts. The sample sizes corresponded to integrated luminosities of approximately 200 
pb -1 , 6 fb _1 and 3 fb _1 for bb + jets, tt and W + jets, respectively. The i?™ ss distribution, after the event 
selection cuts described above is shown in Fig. 23(right) for the bb, tt and W + jets samples for isolated 
and non-isolated muons defined as above. Looking first just at the bb events, there is good agreement in 
the shape of the £"™ ss distribution for the isolated and non-isolated samples; this reinforces the findings 
from the GEANT4-simulated bb+jets samples on the independence of muon isolation and Ej lss . It is also 
clear that the non-isolated sample is dominated by bb at low E™ 1SS . However, the i?™ ss tail is dominated 
by tt and W + jets events, even in the non-isolated sample. 4 This implies that the shape of the Ej lss 
distribution in the control sample will be distorted by the high £"™ ss tail. It is possible to increase the 
number of bb events in the control sample by further loosening the isolation requirement, however this 
has the disadvantage of extrapolating into the isolated signal region over a large range, which could be 
susceptible to correlations between isolation and E™ ms . Furthermore, even if the isolation requirement 
in the control sample were successfully loosened, there is the problem of normalization. In the isolated 
sample, tt and W + jets completely dominate, even at very low £"™ 1SS ; this means that the normalization 
of the i?™ ss distribution will be distorted when extrapolating from the control to the signal region. 

One might consider correcting the event yields in the control regions for the presence of tt and 
W+jets events. An early CDF tt cross-section analysis from Run2 [31] made corrections based on an 
assumed tt cross-section. DO [32] used the so-called "matrix method" where additional input on the 

4 As an added complication the shape of this tail will change after the application of the Mj cut which selects primarily 
dileptonic tt decays. 
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efficiency for tt and bb events to pass the isolation cuts was used to extract the multijet background. The 
utility of these methods for the SUSY search could be further studied. 

Despite the limitations of the technique in the presence of tt and W + jets events background it should 
be noted that this study also indicates that QCD backgrounds are unlikely to be dominant in the 1-lepton 
channel. Even with application of just the standard SUSY 4-jet and lepton selection cuts, the QCD mul- 
tijet background in the 1-lepton channel is significantly smaller than the tt and W + jets backgrounds 
and applying the remaining SUSY selection cuts further suppresses the QCD multijet background. Con- 
sequently in the baseline analysis the accuracy of QCD multijet background estimates is less important 
than that of the estimates other backgrounds described in Ref. [4]. It should be kept in mind however 
that for some background studies the jet or lepton selection cuts may be further loosened, in which case 
the the lepton isolation technique might be applicable without the problem of contamination from other 
backgrounds. 




Figure 24: Black circles: Ej lss distribution for tt plus W +jets samples where the lepton is non-isolated 
(£"cone _ [10,20] GeV). Histogram: £"™ ss distribution for tt plus W +jets samples with isolated leptons 
(E™ ne < 10 GeV). 

Upper limit on £"™ ss distribution for bb + jets If the multijet background is large compared to the 
background from tt (either because the existing Monte Carlo simulation happens to underestimate the 
contribution or because the selection cuts were further loosened), then issues of tt contamination will 
become negligible and the methods described in the previous sections should work well to estimate the 
multijet background. On the other hand, if the multijet background is small compared to tt as expected, 
one can obtain the £"™ 1SS distribution from the control region (Ef ne = [10,20] GeV) as an upper limit on 
the multijet background; the (reasonable) assumption here (confirmed in Monte Carlo) is that the Ef ne 
distribution for multijet background is flat (or falling) as Ej one approaches zero. After the Mj cut, this 
control region is expected to be dominated by tt and W + jets events containing non-isolated leptons with 
the QCD multijet contribution playing only a very small role. Thus this will be an upper limit on the 
multijet background, but the number of events so obtained will in any case be many fewer than from the 
main tt and W + jets backgrounds (with isolated leptons) and so are likely to be negligible. 

The contribution from bb+jets has been compared to GEANT4-simulated ALPGEN tt and W+jets 
samples (corresponding to integrated luminosities of about 3 fb _1 ). In addition to the event selection 
cuts listed above (jet and lepton selection), the events were required to have Mj > 100 GeV and M e s > 
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800 GeV. The Ej lss distribution for the sum of tt and W + jets samples for events with non-isolated 
(£"cone _ [10,20] GeV) leptons is shown in Fig. 24; the contribution of bb + jets cannot be seen compared 
to that from tt plus W +jets. Furthermore, all contributions are negligible compared to the tt plus W +jets 
background with isolated leptons. Counting the number of non-isolated tt plus W + jets with E™ 1SS > 
100 (200) GeV, one would find an upper limit on the multijet background in the lepton channel of 10 ± 2 
(5 ± 1) events for an integrated luminosity of 1 fb -1 . The true number of multijet background events 
would be estimated to be roughly an order of magnitude lower. 

6 Summary 

This paper has examined a wide range of techniques for estimating QCD backgrounds in searches for 
Supersymmetry at ATLAS. The focus of the paper has been on outlining the strategies which could 
be used and assessing the likely uncertainties associated with them. It is important to note that only 
by comparing uncorrected results from a number of such independent techniques can a robust QCD 
background estimate be obtained. 

Given the difficulty of obtaining accurate estimates of QCD jet backgrounds generating fake E™ 1SS it 
will be essential to reduce this source of background using the techniques described in Section 2 and also 
the more general cuts described in Ref. [3]. The remaining backgrounds can be estimated using Monte 
Carlo (Section 3 and 4) or data-driven (Section 5) techniques. 

All Monte Carlo based estimates will be subject to systematic effects arising from parton distribution 
and underlying event uncertainties (Section 3.1), likely to be of order 20% in each case for events satis- 
fying the baseline SUSY cuts described in Ref. [5]. A jet energy scale uncertainty of 5% (Section 3.2) 
will contribute a further ~ 30%. While the precision with which Monte Carlo generators model QCD jet 
physics at 14 TeV is difficult to assess prior to data-taking, the difference (Section 3.3) between a 'tra- 
ditional' parton shower dijet estimate and that obtained from one of the newer matched matrix-element 
+ parton-shower generators for the baseline SUSY cut set is of order 50% if an accurate normalisation 
to data can be obtained. To these effects must be added luminosity uncertainties ranging from 20-30% 
at start-up (from machine parameters) reducing to <3-5% (from total cross-section measurements and 
W/Z counting). 

In addition to the above systematic effects, Monte Carlo simulation-based estimates will be subject 
to detector simulation uncertainties - due to the imperfect description of the response of ATLAS to QCD 
jets, and statistical uncertainties caused by the large QCD cross-section and access to finite computing 
resources. It is impossible to assess the accuracy of current full (GEANT4) or fast (ATLFAST, 'fast G4'- 
Section 4.2 or PJTF-Section 4.1) detector simulations without recourse to data, however the results 
of Section 4.1 suggest that uncertainties ~ 100% in the understanding of the response of the ATLAS 
calorimeters to jets may lead to similar uncertainties in the background estimate. 

Data-driven background estimates have the advantage that they are less prone to input systemat- 
ics, and can in some cases benefit from large statistics in control channels used to measure detector 
performance. However, additional systematic uncertainties have to be considered due to the potential 
contamination of control samples with non-QCD events, and from relying on Monte Carlo simulation 
to extrapolate from the control into the signal region. The data-driven estimate for the jets + E™ 1SS + 
0-leptons channel described in Section 5.1 could potentially give a combined uncertainty of < 60% for 
1 fb -1 of integrated luminosity and the baseline SUSY cut set, although a precise figure is difficult to 
obtain without further Monte Carlo data. In the jets + £"™ 1SS + 1-lepton channel the QCD jet background 
is potentially far less significant, in which case the technique described in Section 5.2 is likely to generate 
a very conservative upper limit on the background a factor ~10 above the true background for 1 fb -1 . 
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Abstract 

This note describes searches for generic SUSY models with /^-parity conser- 
vation in the ATLAS detector at the CERN Large Hadron Collider. SUSY 
particles would be produced in pairs and decay to the lightest SUSY parti- 
cle, X\i which escapes the detector, giving signatures involving jets, possible 
leptons, and i?™ ss . The integrated luminosity simulated is 1 fb _1 . This arti- 
cle relies on work published elsewhere in this collection, where the Standard 
Model backgrounds for SUSY are discussed. 

1 Introduction 

This note describes the search for generic SUSY with R parity, so that SUSY particles are produced in 
pairs and decay to the lightest SUSY particle, %®, which escapes the detector, giving signatures involving 
jets, possible leptons and £'™ 1SS , an imbalance in the transverse energy measured in the detector. Most of 
the introductory information necessary to the understanding of this document is given in the introductory 
SUSY note [1], which should be read before this one. These include a brief description of the theoretical 
framework, a definition of the SUSY benchmark models SUn studied in the detailed analyses, a descrip- 
tion of the Monte Carlo samples used for signal and background. Common identification criteria for jets, 
taus and leptons have been adopted throughout the analyses in this note, and are also described in [1] as 
well as the definition of a few global variables relevant for the analysis, such as effective mass (M e ff), 
stransverse mass (mj2) and transverse sphericity (St). The background uncertainties used throughout 
this work are based on Standard Model background studies documented in [2, 3]. Special signatures 
associated, e.g., with Gauge Mediated SUSY Breaking are treated elsewhere [4]. 

Two different approaches have been used to develop the inclusive search strategy described here. 
Firstly, detailed studies have been carried out for various signatures (jets + £"™ ss + 0 leptons, jets + E™ 1SS 
+ 1 lepton, . . . ) using data-sets fully simulated with Geant 4 for specific SUSY signal parameters and for 
the relevant Standard Model backgrounds. These detailed studies are used to develop deeper understand- 
ing of how best to reconstruct these relatively complex events and to define strategies for separating the 
signal from the Standard Model backgrounds. In order to simplify the procedure of combining the results 
from the different analyses, the various leptonic signatures have been defined so that they are exclusive. 
For example the 1 -lepton signature rejects all events in which more than one lepton is present. However, 
no attempt is made to combine the different analyses in the present document. 

Secondly, the insight gained from studying specific points has been applied to several scans over 
subsets of the SUSY parameter space, Since large numbers of signal points must be studied, these scans 
are of necessity based on fast, parameterized simulation. The goal is to verify that the different sets of 
basic cuts studied on benchmark points provide sensitivity to a broad range of SUSY models. The results 
shown in this document will be used as a basis for the development of a strategy for SUSY discovery 
with early ATLAS data. 

1.1 Trigger 

The trigger efficiency for the inclusive SUSY signals at the benchmark points has been studied based on 
the complete simulation of all three trigger levels of ATLAS. For all the analyses we adopted the trigger 
thresholds defined for 2 x 10 33 cm _2 s _1 in the High Level Trigger TDR [5]. These triggers are discussed 
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futher elsewhere [6] in this volume, where a detailed explanation of the naming convention for the trigger 
menu items appearing in Table 1 can be found. 

The jet triggers, denoted by "JETS", consist of the logical "or" of the following triggers: 

• j400: 1 jet with p T > 400 GeV; 

• 3jl65: 3 jets with p T > 165 GeV; 

• 4jll0: 4 jets with p T > 110 GeV. 

The £™ iss trigger "j70_xE70", requires iss > 70 GeV accompanied by a jet with p T > 70 GeV. The 
lepton triggers are "e22i": an isolated electron efficient for pj > 25 GeV; "2el2i": two isolated electrons 
efficient for pj > 15 GeV, "mu20": a muon with pj > 20 GeV and "2mul0": two muons with pj > 
10 GeV. 

Since the goal of this note is to develop a generic SUSY search, only the basic trigger building blocks 
have been considered. More complex triggers combining different objects can be easily implemented in 
the trigger menus. Also, only triggers which are not prescaled have been used. 

The trigger efficiencies for the signal events passing the the 0, 1,2, and 3-lepton selections defined 
in the following sections are listed in Table 1 for different requirements on jet multiplicity. In general the 
j70_xE70 is highly efficient, although there is some loss for SU4, the low-mass point with a very large 
cross-section. The j70_xE70 trigger is also very efficient for the X and b modes, described in Sections 6 
and 7 below. 

The basic performance of the leptonic and j70_xE70 triggers will be determined from Standard Model 
events such as Z and It using the methods described in [6]. It may be useful to check that performance 
by comparing (Monte Carlo) samples of SUSY events selected with multiple triggers. For the 0-lepton 
selection, the efficiency for JETS trigger alone is in the range 30-70%, except for the very low mass point 
SU4. This provides a useful redundancy in the early phases, as the E™ 1SS trigger may require a longer time 
than the other triggers to be completely understood, but only j70_xE70 has an efficiency close to one. For 
the topologies involving leptons, both the single lepton triggers and j70_xE70 have typical efficiencies in 
excess of 80%, so comparing them should be quite effective. 

1.2 Systematic uncertainties and statistical procedure 

To assess the discovery potential of the different analyses it is necessary to take into account systematic 
uncertainties. SUSY searches will address very complex topologies, typically with many jets in the final 
state. The prediction of the Standard Model backgrounds to these topologies will require a complex 
inteiplay of Monte Carlo and data-driven methods. The development of these methods and the estimate 
of the corresponding uncertainties are described in detail in [2] and [3]. The approximate uncertainties 
for an integrated luminosity of 1 fb~' are estimated to be: 

• 50% for the background from QCD multijet events, 

• 20% for the background from tt, W + jets, Z+ jets, and W /Z pairs. 

The limited Monte Carlo statistics is also taken into account and all systematic uncertainties are 
added in quadrature. The background can never be known exactly. Uncertainties on the background are 
incorporated in the significance by convoluting the Poisson probability that the background fluctuates to 
the observed signal with a Gaussian background probability density function with mean Nb and standard 
deviation 8Nb (see e.g. [7,8] and references therein). Given these assumptions, the probability p that the 
background fluctuates by chance to the measured value Vdata or above is given by 
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Table 1: Average event trigger efficiency (in %) for events passing various lepton and jet selection criteria 
described in detail in the indicated sections. 



Trigger 


SU1 SU2 SU3 SU4 SU6 SU8.1 




0-lepton, 4-jet selection [Section 2.1] 


JETS 
j70_xE70 


44.6 51.0 33.8 7.7 51.7 48.2 

99.7 98.7 99.5 97.2 99.6 99.7 






JETS 
j70_xE70 


64.9 71.1 54.9 34.3 71.8 66.8 
100. 99.8 100. 99.9 100. 100. 




0-lepton, 2-jet selection [Section 2.2] 


i70 yF70 


44.1 39.9 30.1 8.8 53.6 47.6 

1 0O 1 0O 1 0O QQ Q 1 0O 1 0O 
1UU. 1UU. 1UU. yy.y 1UU. 1UU. 




1 -lepton, selection [Section 3] 


JETS 
j70_xE70 

IL,Lvr (muzU UK ezzi) 


41.8 50.5 31.7 8.1 48.4 45.6 
99.6 99.0 98.9 95.6 98.9 99.1 

010 01 A TO O OA *2 OA A TO < 

si. i oi.u /y.y ou.j ou.4 /y.j 




OS 2-lepton, selection [Section 4.1] 


JETS 
j70_xE70 

1LEP (mu20 OR e22i) 
2LEP (2mul0 OR 2el5i) 


36.7 47.3 34.0 6.7 47.2 40.8 
99.2 100.0 98.9 94.3 99.6 100.0 
87.0 90.0 87.5 84.8 79.6 86.4 
20.5 35.5 27.0 18.0 26.0 14.6 




SS 2-lepton, selection [Section 4.2] 


JETS 
j70_xE70 

1LEP (mu20 OR e22i) 
2LEP (2mulOOR 2el5i) 


39.9 48.8 29.2 1.6 46.6 34.5 
99.3 100.0 98.9 84.1 98.3 100.0 
94.2 92.7 95.9 95.2 89.7 96.6 
32.6 41.5 32.2 25.4 25.9 31.0 




3 -lepton, selection [Section 5] 


JETS 
j70_xE70 

1LEP (mu20 OR e22i) 
2LEP (2mulOOR 2el5i) 


43.7 60.2 40.1 17.6 46.4 48.3 
95.6 85.4 93.5 79.8 96.4 98.3 
95.2 94.2 95.8 94.7 94.6 96.7 
49.1 60.2 51.0 44.7 47.3 53.3 



where G(b;Nb, 5Nb) is a Gaussian and the factor 




ensures that the function is normalised to unity. If the Gaussian probability density function G is replaced 
by a Dirac delta function 8{b — Nb), the estimator p results in a usual Poisson probability. 

The probability p is transformed into "standard-deviations", denoted in this note by the symbol Z„, 
using the formula 

Z„ = v / 2err 1 (l-2p) 
The Root [9] library provides functions to calculate p and Z„. 

If many different data selections are considered, it becomes more likely that statistical fluctuations 
would be misinterpreted as new phenomena if the number of selections is not considered in the statistical 



1591 



Supers ymmetry - Prospects for Supersymmetry Discovery Based on Inclusive . 



Table 2: Number of events surviving subsequent selection cuts as defined in the text for 4-jets analysis 
normalized to lfb 1 using NLO cross-sections. 





Cut 1 


Cut 2 


Cut 3 


Cut 4 


Cut 5 


M e ff Cut 


SU3 


9600 


7563 


5600 


5277 


4311 


3349 


SU1 


3485 


2854 


2004 


1907 


1401 


1229 


SU2 


604 


369 


308 


279 


169 


131 


SU4 


79618 


57803 


46189 


42408 


34966 


8507 


SU6 


2551 


2062 


1468 


1383 


1080 


956 


SU8.1 


3118 


2540 


1778 


1686 


1448 


1284 


MC@NLO it 


12861 


8798 


6421 


5790 


4012 


305 


Pythia QCD 


29230 


7044 


4667 


848 


848 


13 


Alpgen Z 


1626 


1045 


732 


660 


644 


162 


Alpgen W 


4066 


2393 


1654 


1499 


1147 


228 


Herwig WZ 


22 


15 


9 


8 


4 


1 


Total Standard Model 


47805 


19294 


13483 


8806 


6655 


708 


SU3 S/B 


0.2 


0.4 


0.4 


0.6 


0.6 


4.7 


z„ 


0.5 


1.3 


1.4 


2.6 


2.7 


13 


SU3 eff (excl) 


35.1% 


78.8% 


74.0% 


94.2% 


81.7% 


77.9% 


SU3 eff (incl) 


35.1% 


27.7% 


20.5% 


19.3% 


15.8% 


12.3% 



procedure. This is known in statistics as the problem of "multiple comparisons". The probability values 
are therefore corrected for multiple comparisons via a Monte Carlo method. The effect is the reduction 
of approximately half a unit of Z n for Z n = 3, decreasing with increasing Z„. In the last section of the 
note the significance always corresponds to the corrected Z„. 

2 Zero-lepton mode 

A SUSY signal at the LHC is typically dominated by the production of squarks and gluinos. In the 
/^-parity conserving case, at the end of each sparticle decay chain one finds an undetected LSPs, which 
can together generate large £"™ ss . The least model-dependent SUSY signature is therefore the search 
for events with multiple jets and Ej lss . Traditionally searches have been performed requiring at least 
four jets; the high multiplicity helps to reduce the background from QCD and W/Z+jets. Both for this 
topology and for the lep tonic topologies in the following sections we adopt very simple sets of cuts, 
similar to the ones used in the ATLAS Physics TDR [10]. In addition to the four-jet signatures we have 
also addressed signatures with lower jet multiplicity. These signatures have more backgrounds, but might 
be favoured in some SUSY models, and should be more cleanly reconstructed in the detector, because of 
their less-complex topologies. This may be an advantage in the early phases of the experiment. 

2.1 Four or more jets in final state 

The basic selections applied for this channel are: 

1. At least four jets with pj > 50 GeV at least one of which must have pj > 100 GeV; and Ej lss > 
100 GeV. 

2. Ef ss > 0.2M eff . 
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Figure 1: M e ff distribution for events surviving successive selection cuts: cut 1 (top left), cut 2 (top 
right), and cuts 3-5 (bottom left). The open circles represent point SU3, and the different background 
contributions are shown according to the legend. The last plot (bottom right) show all of the SUSY 
benchmark points and the total Standard Model background after cuts 1-5. Open circles represent the 
SUSY SU3 signal as predicted by Monte Carlo simulation, while the shaded area shows the total Standard 
Model background. 



3. Transverse sphericity, S T > 0.2. 

4. A0(jetj -£™ ss ) > 0.2, A0(jet 2 - £^ iss ) > 0.2, A0(jet 3 - Ef a *) > 0.2. 

5. Reject events with an e or a [JL. 

6. M eff > 800 GeV. 

Most of the background samples have been filtered at generation level with various requirements on E™ 1SS 
and jet multiplicity. The first cut in the analysis flow applies harder requirements than any of the ones 
applied at the filter level to minimise the bias to the study from the use of filtered samples. 

The main background at this point from QCD events where £™ ss is produced either by a fluctuation 
in the measurement of the energy of one or more jets, or by a real neutrino from the decay of a B hadron 
produced in the fragmentation process. Since the statistical fluctuation on the £"™ ss measurement grow 
with increasing M e ff, the second cut above eliminates the Gaussian part of the £"™ ss measurement fluctu- 
ations. In SUSY events the jets are produced from the decay of heavy particles produced approximately 
at rest, and are thence distributed isotropically in space, whereas for the QCD events the direction of the 
two partons from the hard scattering provides a privileged direction. The cut on sphericity is intended 
to exploit this fact. Both for jet mismeasurement and for b decays, the Ej lss vector will be close the 
direction of one jets, and so the A(j) cuts are very efficient in reducing the QCD background. The lepton 
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veto is applied in order to facilitate the combination with analyses requiring leptons, but it is not expected 
to significantly modify the signal-to-background ratio. 

The number of events surviving each of the cuts for an integrated luminosity of 1 fb~ 1 is shown in 
Table 2 for all of the considered benchmark points and for the backgrounds. The number of events in the 
last column includes the effect of the j70xE70 trigger, which, as shown in Table 1 , has an efficiency in 
excess of 97% for all considered signal benchmarks. 

The distribution of the final selection variable M e s, for signal and background, is shown in Fig. 1 
for point SU3 at different stages of the analysis. The QCD background is dominant after the first cut 
but is reduced to a similar level to the backgrounds containing real neutrinos by subsequently requiring 
E™ ss > 0.2M e ff (cut 2). The cuts on event sphericity and A(j) strongly reduce the QCD background, which 
becomes concentrated in the region of low M e g . After all cuts it is the dominant background, but there 
are also significant contributions from W +jets and Z+ jets. The final cut, M e g > 800 GeV, reduces the 
background to below the level of the signal for all considered benchmark points except for SU2. For this 
point to be found in the 0-lepton channel, one would have to select larger values of M e ff to enhance the 
signal-to-background ratio, and a greater integrated luminosity would be required. 

The statistical significance Z n for 1 fb~ 1 was calculated using the prescription in Section 1 .2 including 
the systematic uncertainty on the background [2, 3]. The significance for point SU3 after each cut is 
shown in Table 2. The significances Z n after all cuts are 13 for SU3, 6.3 for SU1, 0.9 for SU2, 25 
for SU4, 6.3 for SU6, and 6.5 for SU8.1. Evidently only point SU2, for which the cross section is 
dominated by direct gaugino production (which is investigated elsewhere in this volume [11]), would not 
be accessible for the assumed set of cuts, integrated luminosity and level of background understanding. 

These numbers should be taken as indicative. The uncertainty on the background used in the calcu- 
lation is the estimate of what one would obtain using complicated procedures for background evaluation 
based on a combination of data-driven and Monte Carlo methods. The absolute value of the backgrounds 
used for this study is derived only from Monte Carlo, and the present uncertainty on this value is much 
higher. An idea of the robustness of the analysis can be obtained by studying the significance for the 
benchmark points if the background would be increased by a factor 2. In this case the significance for 
SU3 would drop to 7.8, and the one for SU6 to approximately 3.1. The significance for SU6 would be, 
in this situation, dominated by the systematic uncertainty on the background evaluation. Therefore an 
increase in integrated luminosity would result in an increased reach only if it can be used to reduce the 
uncertainty on the background evaluation. 

2.2 Inclusive two-jet and three-jet final states 

The analyses based on lower jet-multiplicities are based on very similar requirements to the 4-jet analysis 
above. The differences are: higher pj requirements on the remaining jets to cope with the increased 
QCD background, and a slightly harder £"™ ss cut. The sphericity cut is less relevant in the case of low 
jet multiplicities and is dropped. For the two (three)-jet analysis the cuts are respectively: 

1. At least two (three) jets, the hardest with pj > 150 GeV and the second (and third) with pj > 
100 GeV; £™ iss > 100 GeV 

2. Ef ss > 0.3(0.25)M eff . 

3. A0(jetj -£™ ss ) > 0.2, A0(jet 2 -£™ iss ) > 0.2, (A0(jet 3 -£™ iss ) > 0.2) 

4. Reject events with an e or a jx 

5. M eff > 800 GeV. 
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The M e ff variable is different from the one denned in [1] in that only the 2(3) highest pj jets for the 
2-(3-)jet analysis are used. 

Since the ALPGEN W +jets and Z+ jets background samples have a filter at generation level requir- 
ing 4 jets, samples produced with the PYTHIA generator were used in this case. 




Figure 2: M e ff distribution for the 0-lepton plus 2-jet analysis, after final cuts. Left: The open circles 
show the SUSY (SU3) signal Monte Carlo prediction, while the total Standard Model background is 
shown by the shaded histogram. The individual background contributions are shown by the points, as 
described in the legend. Right: The points show the distribution of the signal for a number of SUn points. 



Table 3: Number of events surviving the selection cuts denned in the text for the 2-jet analysis. Entries 
are normalized to 1 fb~ 1 using next-to-leading-order cross-sections. 



Sample 


Cut 1 


Cut 2 


Cut 3 


Cut 4 


M eff Cut 


SU3 


18660.7 


12519.8 


12217.5 


10055.2 


6432.2 


SU1 


7699.9 


5427.5 


5318.1 


3996.8 


3196.0 


SU2 


642.4 


319.7 


301.2 


185.1 


90.4 


SU4 


123219 


64502.4 


62172.9 


52108.0 


9434.4 


SU6 


4483.1 


3133.5 


3041.7 


2418.5 


1987.0 


SU8.1 


6384.7 


4482.5 


4381.8 


3804.5 


3067.7 


it 


17666.6 


6273.8 


5778.6 


3556.7 


304.8 


QCD 


124513.9 


7341.7 


1983.7 


1983.7 


107.6 


Z+jets 


3222.5 


2192.2 


2109.5 


2056.1 


391.6 


W + jets 


8887.2 


4504.5 


4072.4 


2775.5 


395.1 


Diboson 


150.4 


71.2 


66.0 


32.1 


6.8 


Standard Model 


154440.5 


20383.4 


14010.1 


10404.1 


1205.8 


SU3 S/B 


0.12 


0.61 


0.87 


0.97 


5.3 


SU3 S/y/B 


47.5 


87.7 


103.2 


98.6 


185.2 


SU3 eff (cum) 


67.4% 


45.2% 


44.1% 


36.3% 


23.2% 


SU3 eff (excl) 


67.4% 


67.1% 


97.6% 


82.3% 


64.0% 



The cut flow for the 2-jet analysis is given in Table 3, and the M e ff distributions before the M e ff cut 
for the different background contributions and for the different signal points are shown in Figure 2. The 
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number of events after all cuts includes the effect of the j70xE70 trigger, which, as shown in Table 1, 
has an efficiency in excess of 99% for all considered SUSY benchmark points. After the A(j) cuts the 
tt, W+jets, Z+jets and QCD all give comparable contributions to the background. After all cuts the 
surviving events are approximately doubled both for signal and background, as compared to the 4-jet 
analysis, except for the low-mass SU4 points for which the harder kinematic cuts reduce the signal 
efficiency. 

Assuming the estimated systematic background errors are the same as for the 4-jet case, the estimated 
significances are 13.3 for SU3, 8.0 for SU1, 17.2 for SU4, 5.5 for SU6, and 7.7 for SU8.1, for 1 fbr 1 
of integrated luminosity. The significance for SU2, for which direct gaugino production is dominant, is 
less than 1.0. The equivalent numbers for the 3— jet analysis are: 17.0 for SU3, 9.5 for SU1, 25.7 for 
SU4, 7.3 for SU6 and 9.6 for SU8. Although the signal over background ratio is equivalent or better (for 
the three-jet topology) than for the 4-jet analysis this is only partially reflected in the significances when 
the systematic uncertainty is taken into account. This is due to the increased contribution of the QCD 
background, which has an estimated uncertainty on QCD is of 50%, as compared to 20% for the other 
backgrounds. Therefore, since the uncertainties of the backgrounds were evaluated for a 4-jet analysis, 
a dedicated background study would be needed to obtain a correct estimate of the discovery potential in 
this topology. 

The 2-, 3- and 4-jet analyses are based on very similar cuts and therefore have a large overlap in the 
selected events. About 40% of all 2-jet events are also contained in the 3-jet slection and about 35% 
in the 4-jet selection. The biggest overlap is for the 3 jet events: about 59% of all 3-jet events are also 
contained in the 4-jet analysis and about 97% in the 2-jet analysis. 

An alternative strategy was explored where cuts 2 and 3 of the M e g analysis are dropped, and a cut 
on the mji variable [12, 13], mji > 400 GeV, is applied as the only discriminating observable. The 027-2 
variable, has the interesting properties that it takes low values for events where either the visible pj or 
£miss WQ small, and in the case of small A<j). It can therefore replace these topological cuts. For semi- 
invisibly decaying particles mj2 is related to the difference in mass between the particles produced in 
the interactions and their invisible decay products. It can therefore take a larger value for SUSY events 
than for top or W events. Taking the estimates of the systematic background errors into account, the 
significances for the 2-jet m T2 analysis are: 15.6 for SU3, 11.5 for SU1, 10.9 for SU4, 8.3 for SU6, and 
11.1 for SU8. 1, somewhat better than the equivalent analysis based on M e ff. The most effective strategy 
will be ultimately defined by how well the systematic uncertainty on the background evaluation can be 
controlled in the different approaches. 

3 One-lepton mode 

While the 0-lepton mode with multiple jets plus £"™ ss is probably the most generic search mode for 
SUSY with /^-parity conservation, it is sensitive to backgrounds from mismeasured QCD multijet events. 
Requiring one lepton in addition to multiple jets and Ej lss greatly reduces the potential QCD multijet 
background; the remaining backgrounds are under better control. Even if x decays of gauginos are dom- 
inant, leptonic x decays provide a significant 1-lepton rate, at least for high masses. It is not surprising, 
therefore, that the reach in the 1-lepton and 0-lepton modes are comparable. 

The cuts in this analysis are similar to those used in the ATLAS Physics TDR [10] but also include 
a cut on the transverse mass 1 , Mj, formed from the lepton and E™ 1SS which has the role of suppressing 
the W+jets and tt backgrounds. The St cut is included for historical reasons, but its effectiveness is 
questionable: 

'Note the distinction between the transverse mass, Mj, which is a function of the momentum of one visible particle and the 
missing transverse momentum and the stransverse mass which is a function of the momenta of two visible particles and the 
missing transverse momentum. 
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Table 4: Number of events surviving the selection cuts defined in the text for the 1-lepton analysis. 
Entries are normalized to 1 fb _1 using NLO cross-sections. The last column reports a simple S/VB 
calculation of the corresponding significance of an observation for the SUSY benchmark points (SUn). 





Cuts 1-4 


Cut 5 


Cut 6 


Cut 7 




SU1 


571.7 


423.0 


259.9 


232.3 


36.0 


SU2 


86.7 


75.6 


46.1 


39.6 


6.1 


SU3 


995.7 


767.9 


450.5 


363.6 


56.4 


SU4 


7523.6 


6260.4 


2974.4 


895.8 


138.9 


SU6 


342.3 


250.9 


161.9 


147.9 


22.9 


SU8.1 


296.4 


214.4 


151.4 


136.3 


21.1 


tt 


2028.5 


1546.8 


131.7 


36.0 




w 


425.2 


314.8 


9.9 


5.4 




z 


39.0 


27.3 


1.7 


0.2 




Diboson 


7.3 


5.1 


0.8 


().() 




QCD 


0.0 


0.0 


0.0 


0.0 




Standard Model BG 


2500.1 


1894.0 


144.1 


41.6 





1. Exactly one isolated lepton with pj > 20 GeV satisfying the selection criteria described earlier. 

2. No additional leptons with pj > 10 GeV. This ensures no overlap with the 0-lepton, 2-lepton, and 
3-lepton analyses. 

3. At least four jets with pj > 50 GeV at least one of which must have pj > 100 GeV. 

4. Ef ss > 100 GeV and E™ iss > 0.2M eff . 

5. Transverse sphericity, S T > 0.2. 

6. Transverse mass, Mj > 100 GeV. 

7. M eff > 800 GeV. 

Cuts 1-2 define the 1-lepton analysis, while Cuts 3-4 both reduce the Standard Model backgrounds and 
ensure compatability with the Standard Model filter cuts. Distributions without these four cuts are not 
meaningful and so are not shown. Cut 5 reduces the i?™ ss background from mismeasured dijet events; 
Cut 6 reduces the background from events in which the £'™ 1SS comes from W —>■ £v; and Cut 7 selects 
high-mass final states. 

The cut flow table for these cuts is shown in Table 4. The number of events after all cuts includes 
the effect of the j70xE70 trigger, which, as shown in Table 1, has an efficiency of around 99% for all the 
benchmark points considered other than the low mass SU4 point, for which the efficiency is still above 
95%. Note that the QCD background is reduced to a negligible level by the lepton and i?™ ss cuts as 
expected. The background after all cuts is dominated by tt and W + jets, both of which are expected to 
be better understood than the QCD background. Therefore, while the 1-lepton mode may not have better 
reach than the 0-lepton mode given the calculated backgrounds, its reach seems more robust against 
background uncertainties. 

The Af e ff distribution for point SU3 after each cut is shown in Figure 3. The Ej lss and M e ff distri- 
butions for all the SUn points after all cuts are shown in Figures 4. It is clear from these figures and 
from Table 4 that the only significant backgrounds to the 1-lepton mode are from tt and W + jets, as one 
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Figure 3: Expected M e g distributions after Cuts 1-4 (left), and Cut 6 (right) for the 1-lepton analysis. 
Compare with Table 4. 
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Figure 4: The M e g- distributions for each of the SUn benchmark points, and for the sum of the Standard 
Model backgrounds with 1 fb 1 for the 1-lepton analysis. All the cuts except onM e ff are applied. 



would expect. The estimated error on both of these backgrounds using data-driven methods is ±20% [2]. 
Given this and the calculated signal and background rates in Table 4, it is evident that all the SUSY points 
considered except SU2 could be discovered with good significance in the 1-lepton mode. For SU2, the 
production cross-section is dominated by gaugino pair production, so a different analysis [1 1] is required. 

To make this conclusion more quantitative, the significance Z n defined in Section 1.2 was calculated. 
The central value of each background is taken from the current Monte Carlo simulation; the studies 
of data-driven background estimation [2, 3] provide estimated errors of ±50% for QCD multijet back- 
grounds and ±20% for tt, W ± jets, and all other backgrounds. The results of this calculation are shown 
in Table 5 for an integrated luminosity of 1 fb -1 . Each of these points except SU2 would have Z n > 5 for 
just 100pb _1 if the same 20% background uncertainty could be obtained with that luminosity. 

The significances, Z n , (Table 5) for 1 fb 1 are much smaller than the S/y/B values in Table 4. This 
reflects the fact that, unlike S/y/B, the Z n measure of significance includes the estimated systematic 
uncertainty on the background. Table 5 also indicates that, provided the relative uncertainties in the 
background determiations did note increase, harder Af e ff cuts would lead to better significances after 
systematic uncertainties in the background are taken into account. 
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Table 5: Significance Z„ for the 1-lepton plus 4-jet analysis with lfb 1 including the systematic uncer- 
tainty in the background estimation. 



Sample 


M eff > 400 GeV 


M eff > 800 GeV 


M eft > 1200 GeV 




Events 


Zn 


Events 


Z n 


Events 


Z n 


Standard Model BG 


144 


42 


2 


SU1 


260 


7.6 


232 


12.3 


114 


18.0 


SU2 


46 


1.5 


40 


3.4 


15 


6.0 


SU3 


450 


9.5 


364 


16.7 


110 


17.7 


SU4 


2974 


33.7 


896 


29.4 


99 


16.6 


SU6 


162 


4.9 


148 


8.9 


76 


14.2 


SU8.1 


151 


4.6 


136 


8.4 


66 


13.1 
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Figure 5: M e ff distribution for events with one lepton and: 2 jets (left) or 3 jets (right) after all cuts were 
applied. 



Supersymmetry events need not contain large numbers of jets. For example, in mSUGRA the process 

& + &-tfiV + £i°9 

I > fttv 

can have a large rate and gives one lepton and just two hard jets. An alternative 1-lepton analysis has 
been performed requiring just two or three jets rather than four. Because a 4-jet selection was applied to 
the Alpgen samples at the generator level, Pythia was used for the W + jets backgrounds. The jet cuts 
employed are harder: 150 GeV for the leading jet and 100 GeV for the others. The missing energy cut 
is also harder, E^ ss > max(100 GeV,0.3M eff ) and max(100 GeV,0.25M eff ) for the 2-jet and 3-jet case 
respectively. The M e ff distributions after all cuts are shown in Figure 5. Evidently an analysis requiring 
a smaller number of jets with harder cuts also can be effective. 
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Table 6: The number of events surviving the selection cuts defined in the text for the opposite-sign 
dilepton analysis, normalized to lfb 1 using next-to-leading-order cross-sections. The last two columns 
give the S/B ratio and the Z n significance, the latter of which includes the systematic uncertainties on the 
Standard Model backgrounds. 



Sample 


Cuts 1-3 


Cut 4 


S/B 


z„ 


SU3 


200.8 


159.8 


1.88 


3.55 


SU1 


91.0 


72.6 


0.86 


1.65 


SU2 


22.5 


18.8 


0.22 


0.43 


SU4 


948.0 


809.5 


9.56 


22.5 


tt 


111.1 


81.5 






W + jets 


2.47 


1.97 






Z+jets 


1.77 


1.20 






QCD (J3-J7) 


0 


0 






Total Standard Model 


115.34 


84.67 







4 Two-lepton mode 

4.1 Opposite sign dileptons 

Supersymmetry events with two opposite-sign leptons can arise from neutralino decays, especially %® — > 
^FXi* either directly or through an intermediate slepton. Such dileptons must have the same flavour to 
avoid inducing /I — ► ey and other lepton-fiavour-violating interactions at one loop. By contrast leptons 
produced from independent decays can give either same-flavour (OSSF) or different-flavour (OSDF) 
dilepton pairs, again with i G {<?,jli}. 

The opposite-sign dilepton analysis uses the following cuts: 

1. Two isolated, opposite-sign leptons with pj > 10 GeV and \r\ \ < 2.5 which satisfy the cuts de- 
scribed in the introductory SUSY note [1]. Events containing additional leptons were vetoed. 

2. At least four jets with pj > 50 GeV at least one of which must have pj > 100 GeV. 

3. Ef ss > 100 GeV and £™ ss > 0.2M eff . 

4. Transverse sphericity, St > 0.2. 

Cut 1 defines the opposite-sign dilepton sample, while Cuts 2 and 3 both suppress the Standard Model 
backgrounds and provide consistency with the Monte Carlo generator cuts on those backgrounds. After 
Cuts 1-3 the dominant background by far is tt, as one would expect. The St cut only increases the S/B 
ratio by about 8% while reducing the signal by 20%. 

The signals and backgrounds after the cuts, and the corresponding significances, are shown in Table 6. 
The number of events after all cuts includes the effect of the j70xE70 trigger, which, as shown in Table 1, 
has an efficiency of around 99% for all considered signal benchmarks, except for the low mass SU4 
point, for which the efficiency is above 95%. The benchmark points SU3 and SU4 both have high 
discovery potential in the dilepton channel. While SU1 has fairly large dilepton branching ratios, many 
of the leptons are soft because of the small mass gaps between supersymmetric particles. An improved 
analysis based low -pj lepton reconstruction algorithms would help greatly for this point. 

It is instructive to see how the significances, Z„, vary with the cut on the leading jet and on £'™ ss . This 
is shown in Figure 6 for each of the points SU1 - SU4. It can be seen that the significance improves with 
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Figure 6: Significance of signal events for the four benchmark points, as a function of the cut on trans- 
verse missing energy (left) and the transverse momentum of the leading jet (right), for an integrated 
luminosity of 1 fb _1 . 

Table 7: The optimized cuts for each point and corresponding signal, background, and significance. 
Compare with Table 6. 



Sample 


£'™ ss cut 


Leading jet cut 


signal 


background 


Significance 


SU1 


100 GeV 


320 GeV 


37.97 


6.30 


6.94 


SU2 


140 GeV 


200 GeV 


13.74 


22.68 


1.07 


SU3 


140 GeV 


200 GeV 


125.34 


22.68 


11.45 


SU4 


1 10 GeV 


100 GeV 


772.53 


66.80 


24.70 



harder cuts than those given in the above cut list even for the low-mass point SU4. The optimal cuts for 
each point and the signal, background and significance are shown in Table 7. Systematic errors of 50% 
on on all Standard Model backgrounds are included. Of course one should not optimize an analysis for 
a single point, but the table suggests that, provided the systematic uncertainties on the Standard Model 
background determinations do not significantly increase, harder cuts would be preferred. Optimization 
for wider ranges of points is discussed in Section 8. 

Observing an non-resonant excess of OSSF dilepton events over OSDF events would be a clear 
indication of new physics. In SUSY leptonic %\ decays can produce this excess, and have a charac- 
teristic endpoint set by the masses involved. The significance of the difference, calculated as (Nossf — 
Nosdf)/VNossf +Nosdf, is shown in Table 8. This significance calculation assumes that the relative 
e and /I acceptances are well understood, which is not unreasonable given that all Standard Model pro- 
cesses satisfy e/jx/z universality. For SU1 the combined branching ratio for %\ —>■ £l R £ t is 11.7%, but 
the acceptance is reduced by the small mass gaps. For SU2 gaugino pair production dominates, so the 
jet cuts suppress the signal. 

4.2 Same sign dileptons 

In the Standard Model the rate for prompt, isolated, same-sign dileptons is small. Of course some 
leptons from hadronized heavy or light quarks can also pass the isolation cut and contribute like-sign 
backgrounds. In SUSY, on the other hand, the gluino is a self-conjugate Majorana fermion, so events 
containing like-sign dileptons can be common. Thus, same-sign dileptons are a good signature for SUSY 
and a characteristic feature of it. 

The cuts used for the same-sign dilepton analysis are: 



1601 



Supers ymmetry - Prospects for Supersymmetry Discovery Based on Inclusive . 



Table 8: The number of OSSF and OSDF dilepton events passing the optimized cuts and the correspond- 
ing statistical significance for 1 fb~ 1 . 



Sample 


£™ ss cut 


Leading jet cut 


Nossf 


NoSDF 


Significance 


SU1 


220 GeV 


100 GeV 


90.69 


58.53 


2.63 


SU2 


140 GeV 


100 GeV 


31.64 


29.95 


0.22 


SU3 


160 GeV 


160 GeV 


93.75 


38.58 


4.80 


SU4 


120 GeV 


100 GeV 


392.45 


281.55 


4.27 



Table 9: The number of events surviving the selection cuts (as defined in the text) for the same-sign dilep- 
tons analysis, normalized to lfb 1 using next-to-leading-order cross-sections. No background events 
pass the final cut; the 90% upper limit for it background is given. 



Process 


Cuts 1-3 


Cut 4 


z„ 


SU1 


30.1 


21.9 


7.2 


SU2 


13.0 


6.6 


1.9 


SU3 


37.9 


24.9 


7.7 


SU4 


251.8 


138.8 


19.9 


SU6 


18.0 


13.9 


4.5 


it 


2.1 


<2.3 




W + jets 


0.7 


().() 




Z+jets 


0.0 


0.0 





1. Exactly 2 same-sign leptons with pj > 20 GeV satisfying the usual isolation and other cuts [1]. 

2. At least four jets with pj > 50 GeV at least one of which must have pj > 100 GeV. 

3. Transverse missing energy £"™ ss > 100 GeV. 

4. Ef ss > 0.2M eff . 

The first cut defines the same-sign dilepton sample, while Cuts 2-4 suppress the Standard Model back- 
grounds. Cuts 1 and 2 are used in the Monte Carlo generator filters for some of the backgrounds. 

The cut flow table for these cuts is shown in Table 9. The number of events after all cuts includes 
the effect of the j70xE70 trigger, which, as shown in Table 1, has an efficiency of around 99% for all 
considered signal benchmarks, except for the low mass SU4 point, for which the efficiency is 84%. 

Since the W +jets and Z+jets backgrounds have been filtered at the generator level, the results 
for these are biased until after Cut 3, but evidently they are small. A number of other backgrounds 
were examined and found to be negligible compared to those listed in the table. None of the Monte 
Carlo events generated for the Standard Model background determination passed all the cuts. A 90% 
confidence upper limit of 2.3 Monte Carlo events gives the indicated upper limit on the it background 
after cut 4. This is used as the estimate of the total background since it is expected to dominate; b jets 
can produce a second lepton of the same sign and that lepton has a non-negligable probability of being 
well-isolated. The assumption of it dominance is consistent with the results after Cut 3 in Table 9. Two 
possible backgrounds, W ± W ± and tiit, are probably small but have not been studied. 

The Ej lss distributions after the other cuts are shown in Figure 7. While the rates are small, the S/B 
ratio is good and the signal is distinctive. The Ej lss cut was varied and the value in the cuts used here 



1 602 



Supers ymmetry - Prospects for Supersymmetry Discovery Based on Inclusive . . . 



found to be appropriate. 

It is clear that the Standard Model same-sign dilepton background is small and is probably dominated 
by tt. A data-driven analysis of this background has not yet been done. It is expected that it will be 
possible to measure the background from processes such as t —> £ + X, b —> ji + X as a function of the 
isolation cut and to extrapolate to the cut used here. For b —>■ e + X, where the e identification cuts 
impose an implicit isolation cut, it will be necessary to extrapolate from the fx result using Monte Carlo 
techniques. In the absence of such studies the significance for the same-sign dilepton analysis has been 
calculated using the 90% upper limit for tt given in Table 9 with the standard systematic uncertainty of 
±20%. This gives the Z„ values listed in the same table. Although the systematic error on the background 
is uncertain, certainly SU4 and very likely SU1 and SU3 would be observable with a significance greater 
than 5(7 with lfbr 1 . 

More work on the same-sign dilepton background, and more generally on estimates of leptons from 
b and c decays passing isolation cuts, is clearly needed. 

5 Three-lepton mode 

The trilepton signal from direct gaugino production [14] is perhaps the best search mode for SUSY at 
the Tevatron [15, 16]. The corresponding search with ATLAS is described elsewhere in this volume [11]. 
The analyses discussed here are aimed at trilepton production from all sources, not just from direct 
production. Two approaches have been followed. The first, the 3-leptons + jet selection makes explicit 
use of a high-^T jet, similar to the 1- and 2-lepton analyses described above. The second, the 3-leptons + 
EjP lss selection, relies on track isolation cuts to select prompt leptons and is similar to the exclusive 
analysis [11]. The 1LEP trigger typically gives an efficiency of J> 95% for these modes (see Table 1). 

5.1 Three-lepton + jet analysis 

The 3-leptons + jet selection requires: 

1. At least three leptons with pj > 10 GeV satisfying the usual identification and isolation cuts [1]. 

2. At least one jet with p T > 200 GeV. 




E T miss [GeV] 



Figure 7: E™ 1SS in the same sign dilepton events after all cuts except the £'™ 1SS cut. 
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No Zs™ ss cut is made, so this analysis could be used even if detector problems seriously degraded the 
i?™ ss performance. The jet cut is sufficient to suppress the WZ and Wy* backgrounds, so cuts on the 
invariant mass of opposite-sign same-family dilepton pairs are also not required. 

Table 10: The numbers of surviving SUSY and Standard Model events for the benchmark points SU2, 
SU3 and SU4, as the "3-leptons+jet" inclusive trilepton selection is applied. All numbers are nomalized 
to 1 fb~ 1 of integrated luminosity. 



Sample 


Cut 1 


Cut 2 


S/B 


s/Vb 


z„ 


SU2 


35 


13 


1.1 


3.7 


2.7 


SU3 


139 


94 


7.8 


27.1 


11.5 


SU4 


1284 


312 


26.0 


90.0 


24.4 


tt 


455 


11 








zz 


59 


0 








zw 


193 


1 








WW 


3 


0 








Z + y 


9 


0 








Zb 


656 


0 









The cut flow for this selection is shown in Table 10. The number of events after all cuts includes 
the effect of the 1LEP trigger, which, as shown in Table 1, has an efficiency of around 95% for all of 
the benchmark points considered. The jet cut (Cut 2) particularly reduces the ZW and Zb backgrounds 
in which the jets tend to be soft. The dominant background after all cuts is tt, but there is also a small 
remaining background from WZ. The same table shows the statistical significance S/V~B and the signif- 
icance Z n including a background uncertainty of 20%. As has already been discussed for the same-sign 
dilepton selection, the key issue for background determination is the estimation of leptons from b — > IX 
passing the isolation cut in tt events. We expect that this could be measured as a function of the isolation 
cut for jx and then applied to e using Monte Carlo simulation. Given the large S/B in Table 10, even a 
100% background uncertainty would yield Z n > 5 for points SU3 and SU4. 

Adding a cut on £"™ ss to this analysis was investigated. The surviving background events after the 
trilepton and jet cuts have a wide range of £"™ ss , so a cut to reduce them would also reduce the already 
rather small signal. 

5.2 Three-lepton + £™ 1SS analysis 

The 3-leptons + £™ ss selection does not require (or veto) jets, so it is sensitive to direct gaugino produc- 
tion as well as to trileptons produced in the decays of squarks and gluinos. The analysis cuts have been 
somewhat optimized for SU2, for which gaugino pair production dominates. Since the dominant source 
of trileptons in SUSY includes a decay %\ - *■ > at l east one OSSF lepton pair is required among 

the three leptons. 

The cuts for this analysis are: 

1. Ni > 3 leptons with pj > 10 GeV satisfying the usual identification and isolation cuts [1]. 

2. At least one OSSF dilepton pair with M > 20 GeV to suppress low-mass y*, J/\jf, T, and conver- 
sion backgrounds. 

3. Lepton track isolation: Pj 2 Tk < 1 GeV for muons and < 2 GeV for electrons, where Pj 2 trk is the 
maximum pj of any additional track within a cone R = 0.2 around the lepton. 



1 604 



Supers ymmetry - Prospects for Supersymmetry Discovery Based on Inclusive . 



Table 1 1 : Expected event numbers for the 3-leptons + E™ ss analysis for 1 fb 1 for signal and background 
processes. The WW and Zy backgrounds are small and so are not listed. 



Process 


Cuts 1-2 


Cut 3 


Cut 4 


Cut 5 


SU1 


42.2 


33.0 


32.6 


24.1 


SU2 


29.8 


24.1 


21.1 


17.6 


SU3 


130.1 


101.2 


98.6 


63.9 


SU4 


968.1 


691.5 


654.3 


544.9 


SU8.1 


10.2 


8.0 


8.0 


5.3 


WZ 


188.3 


166.2 


122.5 


22.8 


zz 


55.9 


46.4 


10.3 


1.6 


Zb 


582.5 


221 


1.3 


0 


tt 


283.2 


59.9 


56.6 


47.9 



Table 12: Number of signal (S) and background (B) events surviving the 3-leptons + £™ ss selection and 
the corresponding values for S/ a/5 and Z n . All numbers are normalized to 1 fb~' . 





SU1 SU2 SU3 SU4 SU8 


5 


24.1 17.6 63.9 544.9 5.3 


B 


73.5 


s/Vb 


2.8 2.1 7.5 63.5 0.6 


Z, 


1.3 1.0 3.5 16.4 0.3 



4. Ef ss > 30 GeV. 

5. M < Mz — 10 GeV for any OSSF dilepton pair. 

Cut 3 provides an additional rejection of leptons from b and c decays beyond the calorimeter isolation 
cut, while Cut 4 reduces Standard Model backgrounds containing a Z. 

The significances 5/ \fB and Z„ for this second analysis are shown in Table 12, where the Z„ signifi- 
cance includes the standard 20% background systematic uncertainty. A detailed study of the performance 
of the lepton isolation cuts is needed to understand the uncertainty on the it background in particular. 
Since this analysis does not require jets, one might hope that it would be sensitive to the dominant gaug- 
ino pair production for SU2, but only SU4 gives a signal with Z„ > 5 for 1 fb -1 . For all of the benchmark 
points studied the 3-leptons + jet analysis is more sensitive than the 3-leptons +£"™ 1SS one. 

6 Tau mode 

SUSY models generically violate e/jx/x universality; x decays can even be dominant, especially for 
tan/3 » 1. Hence it is worthwhile to look for signatures involving hadronic x decays even though the 
fake background from jets is much larger than that for eorfi. Leptonic x decays are indistinguishable 
from prompt leptons and are already included in the previous analyses. 
The cuts used in this analysis are: 

1. At least four jets with pj > 50 GeV and at least one with pj > 100 GeV. 
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Table 13: The number of signal (S) and background (B) events after tau selection and corresponding 
values of significance, normalised to lfb -1 . 



Sample 


S B S/B S/VB Z n 


SU3 
SU6 


259 51 5.1 36.3 12 
119 51 2.3 16.7 6.8 



2. Ef ss > 100 GeV. 

3. A(j)(ji,Ej ms ) > 0.2 for each of the three leading jets jj, i = 0, 1,2. 

4. No isolated leptons using the standard cuts [1]. 

5. At least one X with pj > 40 GeV and | r\ | < 2.5 reconstructed by the high pj x algorithm [17] with 
a likelihood, L > 4. 

6. E™ iss > 0.2M eff . 

7. Mr > 100 GeV, where Mr is calculated using the visible momentum of the hardest x and £"™ ss . 

Cuts 1, 2, and 6 are standard. Cut 3 requires a large A0 between E™^ and the leading jets, thus reducing 
the background both from mismeasured jets and from b and c decays. Cut 4 makes this analysis disjoint 
from the 1, 2, and 3-lepton analyses described above. There is still overlap with the 0-lepton analysis 
described in Section 2. Cut 5 defines the x sample; these cuts give an efficiency of ~ 50% with a purity 
of ~ 80% for the SU3 sample. Finally, if the Ef 1&s comes from one W — ► xv decay, then Mr used in 
Cut 7 should satisfy Mj < mw- The applied cuts are a superset of the basic cuts for the inclusive 4-jet 
0-lepton analysis, which does not employ a x veto and therefore the events selected for this analysis will 
have an almost complete overlap with the ones selected in the analysis described in the corresponding 
section. For the same reason, the efficiency of the j70xE70 trigger is expected to be between 97% and 
~100% for all the benchmark points, as was the case for the inclusive multi-jet analysis. 

The effect of these cuts is indicated graphically in Figure 8. The requirement of a reconstructed x 
(Cut 5) eliminates the QCD background. After the Mj cut the S/B ratio is high. The resulting signal, 
background, and significance for points SU3 and SU6 are given in Table 13 assuming the usual 20% 
systematic uncertainty for the background, which is dominated by tt with some contribution from W + 
jets. 

The data-driven uncertainty on x SUSY backgrounds has not yet been studied. Clearly x reconstruc- 
tion is difficult. However, it should be possible to simulate real x backgrounds by selecting backgrounds 
with reconstructed e and /I and replacing the leptons with simulated x decays. Fake backgrounds can be 
similarly determined using reconstructed events combined with the measured jet —>■ x fake rate. If the 
resulting uncertainty on the background is about 20%, as is assumed in Table 13, then both points SU3 
and SU6 would be observable in the x mode. 

7 Z?-jet mode 

SUSY signals are typically rich in b quarks because the b and t tend to be lighter than first- and second- 
generation squarks and because Higgsino couplings enhance heavy flavour production. In the benchmark 
points studied the fractions of events containing Z? jets range from 14.4% for SU2 to 72.8% for SU4. In 
QCD events b quarks typically occur at the percent level. Thus, requiring a b quark suppresses the QCD 
background, which may be difficult to control, just as requiring an e or jx does. 
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Figure 8: The M e s distributions for SUSY signals and Standard Model backgrounds in the % analysis 



In this section an analysis of signatures with b jets is performed for SUSY points SU1 , SU3, SU4 and 
SU6 using full simulation both for the signal and for the Standard Model backgrounds. Isolated leptons 
may also be present, and all channels with and without leptons are summed. SUSY processes almost 
always will give bb pairs, and this is taken into account. No equivalent analysis was performed in the 
Physics TDR. 

The cuts used in this analysis are as follows: 

1. At least 4 jets in the event with pi > 50 GeV. 

2. Leading jet p T > 100 GeV. 

3. Missing transverse energy, £"™ ss > 100 GeV. 

4. Missing transverse energy, £"™ ss > 0.2M e ff. 

5. Transverse sphericity, St > 0.2. 

6. At least 2 jets are tagged as b jets, as described below. 

7. M eff > 600, 800, or 1000 GeV. 

Note that Cuts 1-3 are also used in Monte Carlo generator filters for some of the background samples. 
Cut 7 is used to optimize the signal-to-background ratio in the selected events. 
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Table 14: Number of events surviving selection cuts as defined in the text for the inclusive search with 
b jets normalized to lfb 1 using NLO cross-sections. The results are shown for three different values of 
the M e ff cut (Cut 7). 



Sciniplc 


Cuts 1-3 


Cut 4 


Cut5 


Cut 6 




Cut 7 














600 GeV 


800 GeV 


1000 GeV 


SUl 


3469 


2806 


1994 


456 


442 


375 


263 


SU2 


608 


358 


299 


170 


166 


141 


87 


SU3 


9357 


7279 


5474 


1158 


1086 


818 


425 


SU4 


79761 


56697 


45661 


16478 


10204 


3186 


926 


SU6 


2557 


2049 


1467 


505 


495 


436 


340 


tt 


12864 


8273 


6117 


2182 


836 


215 


61 


QCD 


29435 


7402 


5171 


740 


259 


79 


5 


W + jets 


4068 


2309 


1600 


23 


16 


7 


2 


Z+jets 


1249 


680 


432 


5 


3 


1 


1 


Diboson 


22 


13 


8 


2 


1 


0 


0 


Bsm 


47527 


18676 


13328 


2950 


1115 


303 


69 




Figure 9: M e ff distributions for fr-jet analysis. Left: Standard Model backgrounds. Right: SUSY signals 
with total background. 

Jets with pr > 20 GeV are selected as b jets using the default tagging algorithm based on the 3- 
dimensional impact parameter and secondary vertex detection [18] with a cut weight > 6.75, giving a 
nominal efficiency of 60%. Above about pj = 100 GeV both the efficiency and the light-jet rejection 
decrease as discussed in the introductory SUSY note [1] and references therein. Naively one would 
expect that the increase of the B decay length with y = Eg /Mb S> 1 would offset the 1/y decrease of 
the angles since the multiple scattering angular errors also decrease similarly. There is also a substantial 
dependence of b tagging on the r\ of the jet. Many of the b jets in SUSY events have high pj\ the 
typical pj of the leading jet in SU3 is about 300 GeV, for which the light-jet rejection during b tagging 
is ^(100). 

Events with zero or more leptons and at least two tagged b jets were combined in a single inclusive 
analysis. Inevitably this means that there is overlap with the analyses in Sections 2-5. The cut flow is 
shown in Table 14. After Cut 6 (Nb > 2) the tt background is dominant, as one might expect, but the 
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Table 15: S/B ratio and signal significance Z n including systematic effects for the Z?-jet analysis with 
0.1 fb- 1 and 1ft- 1 withM eff > 1000 GeV. 





S/B 


Z„ for 0.1 fb" 1 


Z„for lfb 1 


SU1 


3.8 


6.0 


9.3 


SU2 


1.3 


2.3 


5.0 


SU3 


6.2 


7.5 


13.0 


SU4 


13.4 


12.6 


21.7 


SU6 


4.9 


7.1 


11.2 



QCD background remains substantial. The ^-tagging performance at high pj is clearly an important 
issue for this analysis. As the applied cuts are a superset of the basic cuts for the inclusive 4-jet analyses, 
the efficiency of the j70xE70 trigger is, as quoted in the corresponding section, between 97% and ~ 100% 
for all the considered benchmark points. 

To calculate the significance in this channel an uncertainty of 50% for the QCD background and 20% 
for the other backgrounds is assumed for 1 fb~ 1 . The uncertainty on the ^-tagging efficiency of 60% is 
5% [2, 18]. This is assumed to be included in the it uncertainty and is ignored for the other, smaller 
backgrounds. 

The hardest effective mass cut, M e ff > 1000 GeV, was found to be the most effective, so only results 
for it are shown here. The resulting significances, Z n , including the above systematic effects, are shown 
in Table 15, for two luminosities: 0.1 and fb -1 , where the same systematic uncertainty is assumed to 
be the same for both luminosities. All background uncertainties are added linearly. The low-mass point 
SU4 and perhaps also SU3 could be discovered using this analysis with only 0.1 fb -1 assuming that the 
background could be understood adequately. All points except SU2 could be discovered with lfb -1 , for 
which the background uncertainties are realistic. This analysis seems to be particularly useful compared 
to some other analyses for point SU6. 

8 Scans and optimization 

The SUSY points studied so far were chosen to give a variety of signatures, but there is no reason to 
think that they are representative of what might be found at the LHC. This section uses scans over the 
parameters of several models for SUSY breaking - all with R parity conservation - in order to sample 
a wider range of possibilities. The goal is to develop one or more search strategies covering as wide a 
subset of the scanned models as possible. Since each scan includes hundreds of points, this section must 
rely on ATLFAST [19], the fast parameterized simulation of the ATLAS detector. 

Data-driven methods [2,3] will be used to determine the Standard Model backgrounds to the possible 
SUSY signatures. For lfb 1 the estimated errors [2,3] are typically 50% for QCD jets and 20% for the 
W, Z, and t backgrounds. Several approaches were considered to look for an excess above a cut on M e ff 
or £"™ ss after basic jet and lepton selections. The significance is corrected for multiple cuts as described 
in Section 1.2. Results are shown here only for the M e s cut, that yielded best performance. A multivariate 
optimization using TMVA [20] gave a minor improvement with the available Monte Carlo statistics. This 
and other cut procedures are still being studied. 

The analyses described above in this note have used signal and background cross-sections normalized 
to next-to-leading-order calculations [1]. This was impractical for scans over many points, each involving 
many subprocesses. The goal here is not to determine the exact limit or exclusion value but rather to test 
whether the proposed approaches work for a wide range of models. It was therefore decided to normalize 
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the signal cross-sections for all scans to the leading-order HERWIG values but to use next-to-leading- 
order normalizations for the backgrounds. Since next-to-leading-order corrections generally increase 
cross-sections, the resulting reach estimates are conservative. 

8.1 SUSY signal samples 

It is impossible to scan the 105 -dimensional parameter space of the MSSM or even the 19 dimensional 
subspace with flavour and CP conservation and degeneracy of the first two generations. Hence a number 
of SUSY-breaking models with many fewer parameters were used. 

Several of these scans (e.g. the first two mSUGRA scans listed below) ignore dark matter and other 
existing constraints. Of course any true theory must obey such constraints. It is possible, however, 
to modify the SUSY breaking model to satisfy the constraints while keeping the basic phenomenology 
unchanged. One such example, the non-universal-Higgs model (NUHM), is discussed below. Since there 
is no unique model of SUSY-breaking, all these scans should be viewed only as possible patterns of LHC 
signatures, not as complete theories. 

mSUGRA fixed grid, tan/3 = 10, A 0 = 0, \x > 0: A 25 x 25 grid was made varying m 0 from 60 GeV 
to 2940 GeV in 25 steps of 120 GeV, and m 1/2 from 30 GeV to 1470 GeV in 25 steps of 60 GeV. SUSY 
spectra were generated using ISAJET 7.75 [21] with a top quark mass of 175 GeV. Out of the 625 
possible points, a spectrum could be successfully generated for 600; the other 25 failed for theoretical 
reasons. For each good point 20k events were produced using ATLFAST. Constraints other than from 
direct searches were ignored. While constraints such as the dark-matter relic density constrain specific 
SUSY-breaking models such as mSUGRA, they are much less restrictive for generic models. 

mSUGRA fixed grid: tan/3 = 50, Aq = 0, }X < 0: Large tan/3 increases the mixing of bin and Xl,r, 
leading to enhanced b and x production. A grid of 25 x 25 points with was generated with mq varied from 
200 to 3000 GeV in steps of 200 GeV and with m 1/2 varied from 100 to 1500 GeV in steps of 100 GeV. 
The top mass was fixed at 175 GeV. Constraints other than from direct searches were again ignored. 

mSUGRA random grid with constraints: In this sample all mSUGRA parameters were varied in two 
regions 2 previously found [22] to be compatible with dark-matter and other constraints with jx > 0 and 
m t = 175 GeV. 

The mSUGRA parameters were chosen randomly (with jx > 0) and their properties calculated using 
ISAJET 7.75. All selected points satisfy the LEP Higgs mass limit, m h > 114.4 GeV [23]; the WMAP 
total dark matter limit, Q.h 2 < 0.14 [24]; within 3a the branching ratio limits B(b —> sy) = (3.55 ± 
0.26) x 10~ 4 [25] within 3a and B(B S -> H + /l-) < 1.5 • 10~ 7 [26]; and with <5a M less than the 3a upper 
limit from the muon anomalous magnetic moment measurement = (11659208 ±6) x 10~ 10 [27]. 

GMSB grid: M mess = 500 TeV, N me , s = 5, C grav = 1: With N mess = 5 the NLSP is a slepton which 
decays promptly to leptons or t's. A fixed grid was made varying A was varied from 10 TeV to 80 TeV 
in steps of 10 TeV and tan/3 from 5 to 40 in steps of 5. 

NUHM grid: The NUHM model is similar to the mSUGRA model but does not assume that the Higgs 
masses unify with the squark and slepton ones at the GUT scale. This allows more gaugino/Higgsino 
mixing at the weak scale and so relaxes the mSUGRA dark matter constraints. The scan uses a step size 

2 The parameters are varied within {0 < m 0 < 2 TeV, 0.5 < m l / 2 < 1.3 TeV, -0.34 < A Q < 2.4 TeV, 39 < tan/3 < 55} and 
{!< mo < 3 TeV , my 2 < 0.5 TeV , -2.0 <A 0 < 2.0 TeV, 20 < tan/3 < 55} 
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Figure 10: Efficiencies for electrons as a function of pj for the SU3 sample (left) and r\ for the ALPGEN 
sample Z — > ee (right). The solid red line corresponds to the Geant 4 simulation, the solid circles to 
uncorrected ATLFAST, and the open circles to the corrected version of ATLFAST used for the reach 
analyses. 

of 100 GeV in both mo and raj/ 2 . For each point the values of and Ma at the weak scale are adjusted 
to give acceptable cold dark matter. 

8.2 ATLFAST corrections 

ATLFAST is a fast parameterized simulation of the ATLAS detector. The version used here is rather 
idealized. Corrections to the efficiency for e reconstruction were applied as a function of pj and r\ . An 
example of the effect of these corrections is shown in Figure 10. In addition, the ATLFAST algorithm 
finding reconstructed cone jets was missing the split-merge step, so jets matched to the same truth jet 
were combined. With these corrections the ATLFAST and full simulations agree reasonably well. All 
results shown here use ATLFAST with these corrections. 

8.3 Discovery reach 

The reach plots in this subsection are all based on analyses that require a certain number of jets and 
leptons (e or /i) and then find an optimal M e ff cut (in steps of 400 GeV) to maximize the significance Z„ 
corrected for multiple cuts of the signal over the Standard Model background, using background errors 
estimated from studies of data-driven methods [2,3]. Not all modes were studied because of limited time 
or Monte Carlo statistics. 

The analysis most similar to that in the Physics TDR [10], requires four jets with p T > { 100, 50, 50, 50} 
GeV and Ej lss > max(100 GeV,0.2M e ff). The 5a discovery reach for the analyses reqiring zero, one, 
or two opposite-sign leptons for mSUGRA with tan/3 = 10 are shown in Figure 11. The plot also shows 
the trilepton reach with just one jet. The 0-lepton mode has the best estimated reach, close to 1.5 TeV for 
the smaller of mg and m^. The 1-lepton estimated reach is somewhat less, but it is more robust against 
QCD backgrounds which might result from detector problems. Figure 1 1 also shows that the reach for 
tan/3 = 50 is similar for the zero- and one-lepton channels. Despite the enhanced X decays for tan/3 » 1, 
the one-T reach is slightly worse than the reach for zero and one leptons. This reflects the lower efficiency 
and purity for x reconstruction. Compared to T + 4jets, the reach for T + 3jets is slightly better, while 
X + 2 jets is about the same. The curves for the X + 2-jet and the X + 3-jet analyses are not shown. 

Requiring four jets is not necessarily the best choice. The 5a reach contours for the 0-lepton plus 
i?™ ss and the 1-lepton plus £"™ ss analyses for various jet multiplicities are shown in Figure 12, again for 
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m 0 [GeV] m 0 [GeV] 



Figure 11: The 1 fb _1 5a reach contours for the 4-jet plus £"™ ss analyses with various lepton require- 
ments for mSUGRA as a function of mo and mij 2 - Left: tan/3 = 10. Right: tan/3 = 50. The horizontal 
and curved grey lines indicate gluino and squark mass contours respectively in steps of 500 GeV. 
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Figure 12: The 1 fb _1 5a reach contours for the 0-lepton and 1-lepton plus £"™ ss analyses with various 
jet requirements as a function of mo and mi/ 2 for the tan/3 = 10 mSUGRA scan. The horizontal and 
curved grey lines indicate the gluino and squark masses respectively in steps of 500 GeV. 

the tan/3 = 10 mSUGRA scan. For the 0-lepton mode the choice of four jets seems best, while for the 
1-lepton mode the 2-jet, 3-jet and 4-jet reaches are all comparable. The reaches (not shown here) for the 
opposite-sign dilepton plus Ej lss signature requiring at least 2, 3, or 4 jets are comparable and in all cases 
are less than the reaches for the 0-lepton and 1-lepton modes. Observing a signal in multiple channels 
would provide further confidence that the observed excesses were evidence for new physics. 

The mSUGRA "random" scan with low-energy constraints samples only a limited range of parame- 
ters and hence of gluino and squark masses. The results of this scan are shown as a scatter plot of points 
in Figure 13 compared to those for the mSUGRA scans. The reach for those mSUGRA points which are 
compatible with low-energy constraints is comparable to that for generic points. This is not surprising 
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Figure 13: Reach for the "random with constraints" mSUGRA scan plotted in the nig,mq plane. Solid 
triangles represent points which are observable (Z„ > 5) with 1 fb~ 1 , while open triangles show points 
which are not. 
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Figure 14: The 1 fb _1 reach for NUHM models with 4 jets, 0 or 1 leptons, and ££" ss . The masses for 
NUHM are similar to those shown in Figure 1 1. 

given that the SUSY production cross-sections are mainly controlled by the gluino and squark masses, 
but it adds support to the approach used in this section. 

The mSUGRA model is only one possible mechanism for SUSY breaking. The non-universal-Higgs 
model has qualitatively similar phenomenology but different patterns of masses and decay modes. The 
reach plots with four jets, zero or one leptons, and £'™ 1SS for the NUHM are shown in Figure 14. The 
reach with zero and one leptons is virtually identical to that for mSUGRA. This is as expected: adding 
some Higgsino mixing allows %\ annihilation but has a minor effect on the other decays. 

Another alternative often considered, Anomaly Mediated SUSY Breaking (AMSB), is not examined 
here. Previous studies [28] have found an overall reach comparable to mSUGRA with similar assump- 
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Figure 15: The 1 fb 1 5a reach contours of the 2-lepton and 3-lepton analyses for the GMSB scan. 
The vertical solid and dashed grey lines indicate the gluino and squark masses respectively in steps of 
500 GeV. 

tions. The reach in the one-lepton modes is less because the lightest chargino is almost degenerate with 
the LSP and so does not give visible leptons. 

The models considered in the GMSB scan all have at least two leptons or t's at the Monte Carlo 
generator level, so the signatures are easier to distinguish from Standard Model backgrounds. The reach 
plots for this scan are shown in Figure 15. The reach for three leptons is significantly better than for two 
leptons and extends well beyond 2 TeV for gluinos for large tan/3 and is close to 2 TeV for all tan/3. 
Special signatures that can result from GMSB models are discussed elsewhere in this volume [4]. 

8.4 Summary 

The results of the scans presented in this section together with the full simulation analyses presented ear- 
lier indicate that ATLAS should discover signals for ^-parity conserving SUSY with gluino and squark 
masses less than @{\ TeV) after having accumulated and understood an integrated luminosity of about 
1 fb -1 . For favorable models the mass reach could be greater. The luminosity required to discover a given 
SUSY scenario is greater than that estimated previously [29]. The main differences in this analysis are 
that the uncertainty in the background (derived from data-driven methods) is taken into account and that 
the signal and background simulations are more realistic. Given the admittedly qualitative naturalness 
arguments about SUSY masses, it is plausible that SUSY could be found with 1 fb 1 if it exists at the 
TeV scale. Conversely, if SUSY is not found with 1 fb 1 , it might still eventually be discovered at the 
LHC, but it will be difficult to study in detail. 
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Abstract 

We review the techniques used to reconstruct the decay of supersymmetric par- 
ticles and measure their properties with ATLAS at the LHC, concentrating on 
strategies to be applied to a data set of integrated luminosity of 1 fb~ 1 that can 
be expected after the first year of operation of the LHC. These techniques are 
illustrated using several benchmark points chosen in the mSUGRA parameter 
space, but they are applicable to a broader range of supersymmetric (and other 
similar) models. The most appropriate methods will be selected and fine-tuned 
once (and if) signatures consistent with Supersymmetry are established. Su- 
persymmetric cascade decays typically have large transverse missing energy 
due to the presence of undectected neutralinos, and have characteristic edges 
and thresholds in the dilepton, dijet and lepton-jet invariant mass distributions. 
The reconstruction of such edges is the focus of the first part of the paper. The 
second part of the paper concentrates on the reconstruction of more specific de- 
cay channels, involving light stops, staus and Higgs bosons. The final section 
indicates how sparticle masses and other supersymmetric parameters could be 
constrained using such measurements. 

1 Introduction 

Supersymmetry (SUSY) can be discovered by the ATLAS experiment at the LHC during the initial run- 
ning period if some coloured sparticles have masses of the order hundreds of GeV and hence production 
cross-sections of the order of a few pb. A strategy to establish SUSY discovery is outlined in another 
paper in this collection [1], while here we concentrate on parameter measurements that can be performed 
with the early data. The same particle identification conventions and selection criteria as in [2] are used 
in this paper. 

Once a signature consistent with Supersymmetry has been established, the experimental emphasis 
will move on to measuring the sparticle mass spectrum and constraining the parameters of the model. In 
the case of R-parity-conserving models, the decay chain of sparticles cannot be completely reconstructed, 
as sparticles eventually decay into LSPs that can not be detected. For this reason edge positions, rather 
than mass peaks, are measured in the invariant mass distribution of sparticle decay products. In R-parity- 
violating models sparticles can have long lifetimes and can be detected by studying their decay in-flight 
within the detector. These types of signatures are discussed in [3]. 

A complete coverage of all allowed SUSY models is impossible, so we limit this study to a subset 
of the models where SUSY breaking is mediated by gravity (mSUGRA), and to the points in parameter 
space described in [2], however the measurement techniques and fit methods developed can be adapted 
for many models. During initial data-taking, the error on such measurements will be limited by statistics, 
making measurements possible only for models with moderate (< 1 TeV) values of the SUSY mass scale 
where enough events can be isolated. In this paper we study the cases of a total integrated luminosity 
of 0.5 fb _1 for the "Low Mass" point (SU4) and of 1 ftr 1 for the "Bulk" point (SU3), with the idea of 
developing the experimental analyses which might be performed after the first year or so of data taking. 
Some benchmark points require somewhat larger datasets in order to perform kinematic measurements; 
as an example we show a measurement of the dilepton mass edges for the "Coannihilation" point (SU1) 
with a dataset corresponding to 18 fb 

In sections 3 and 4 we study the decay chain: 

<?L -» %iq(-+ PFq) -» fii + rq (1) 
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in events containing two opposite-sign isolated electrons or muons, hard jets and missing energy. Kine- 
matic endpoints in the invariant mass spectra of lepton pairs and lepton+jet combinations are fitted and 
used to derive relations between the masses of sparticles. In the case of first- and second-generation 
squarks, it will often not be possible to experimentally determine squark flavour, so we define to be 
the average of the masses of the Ul and dt squarks, and m^ R , the average mass of ur and Jr. 

Events with tau leptons in the final state are studied in Section 5 and di-tau mass edges in the %2 
decay chain reconstructed. This signature is particularly important in the co-annihilation region where 
the decay into tau stau pairs is favoured. 

In Section 6 we analyse events with two hard jets and missing energy in order to to measure the jet 
"stransverse mass". This variable is sensitive to the mass of the right-handed squark in events where a 
pair of squarks are produced, each decaying as: 



A kinematical edge depending on the mass of the light stop is reconstructed in Section 7 by exploiting 
the decay: 



and reconstructing the tb invariant mass. 

The reconstruction of the lightest Higgs bosons, produced by the X® decay followed by the Higgs 
decay into a pair of b quarks, is investigated in Section 8. Simulations of the "Higgs" point (SU9) show 
that if these decays are allowed, then the Standard Model Higgs boson may be initially detected as a 
SUSY decay product rather than by signatures that involve its production via Standard Model processes. 

In Section 9 the parameters measured in sections 3 to 8 are combined to extract information about the 
SUSY model such as the sparticle mass spectrum and the mSUGRA parameters (under the hypothesis 
that mSUGRA is realised). 

All of these studies use a realistic detector geometry with residual misalignments, and all relevant 
Standard Model backgrounds are taken into account, as are the trigger efficiencies. The reconstruction of 
final state objects, the event selection criteria, the strategy used to simulate both signal and background 
events, and the methods for estimating systematic uncertainties are common across all SUSY analyses 
and are discussed in the introduction to this chapter [2]. 

2 Measurement of endpoints 

The decay chain in Eq. (1) is particularly suited to measure the mass of SUSY particles, as the presence in 
the final state of charged leptons, missing energy from the escaping neutralino and hadronic jets ensures 
a large signal to background ratio. Thus, fit results are not very dependent on the precise measurement of 
the Standard Model background. Although we discuss the reconstruction of edges and thresholds within 
the mSUGRA framework, the same methodology can be applied to the large variety of SUSY models 
where the decay channel in Eq. (1) is open. In the following we indicate with i only electrons and 
muons (with i being their superpartners) while x leptons are indicated explicitly. 

The endpoint in the di-lepton invariant mass distribution is a function of the masses of the particles 
involved in the decay. If the sleptons are heavier than the Xi then the decay proceeds through the three 
body channel X2 — ► X\l + i~ as in the SU4 model. In this case, the distribution of the invariant mass of 
the two leptons has a non-triangular shape described in [4, 5] with an endpoint equal to the difference of 
the mass of the two neutralinos: 




(2) 



g^ht^xtbt 



(3) 



edge _ 



(4) 
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If at least one of the sleptons is lighter than the X2 then the two-body decay channel X2 —> £~^£ T — > 
X\£ + £~ dominates. The distribution of the invariant mass of the two leptons is triangular with an endpoint 
at: 



1 - 


-fe)V 











(5) 



For the SU3 point, where the £r and the fi are lighter than the X2, such an endpoint is expected in the 
£ + £- (t+t - ) distribution for mf e ge = 100.2 GeV {mff = 98.3 GeV). For the SU1 point both £ R and 
II as well as Ti and T2 are lighter than X2, resulting in a double triangular distribution for the dilepton 
invariant mass with two edges. 

Measuring the dilepton endpoint allows us to establish a relationship between the masses of the two 
lightest neutralinos and any sleptons that are lighter than the X2. For a determination of the masses of 
all the particles involved in the decay chain Eq. (1), further mass distributions involving a jet are used: 
tnuq, m ^£ q < m i q {\ovi) an d "^(high)- Since it is not possible to identify the quark from the decay, we 
make the assumption that it generates one of the two highest pj jets in the event, as is normally the case 
if the q\ is much heavier than the x\. Hence only the two leading jets are considered. For the mu q 
distribution a maximum value of the distribution is expected so the jet giving the lowest mu q value is 
used. The mf^ q distribution is defined by the additional constraint m« > rn^ /y/l, giving a non-zero 
threshold value [6, 7]. Since a minimum is sought, the jet giving the highest mu q value is used in this 
distribution. The distributions 7w^(i ow ) and ra^(high) are formed from the lower and higher m\ q value of 
each event using the same jet as for mu q . Both distributions have well-defined endpoints. 

The theoretical values of the kinematic threshold and endpoints listed above can be calculated using 
the analytical expressions given in [6,7]. The theoretical positions of the end points for the SU1, SU3 
and SU4 models are summarised in Table 1. 

Table 1: Value of the end points of the invariant mass distributions for the three benchmark points 
considered in this section. For SU1 the two endpoints correspond to the two available decay chains of 
the X2 involving a right or left slepton. 

Mass Distribution SU1 end point (GeV) SU3 end point (GeV) SU4 end point (GeV) 



m„ ge 56.1,97.9 100.2 53.6 

mfr 77.7,49.8 98.3 53.6 

mf^ 611,611 501 340 

mf lq 133,235 249 168 

m™f ow) 180,298 325 240 

™£?Lm 604 ' 581 418 340 



Another advantage of the decay chain in 1 is the possibility of estimating both the SUSY combina- 
torial background and the Standard Model background from the data with high accuracy. The technique, 
known as, flavour subtraction, is based on the fact that the signal contains two opposite-sign same-flavour 
(OSSF) leptons, while the background leptons come from different decay chains, which can be of the 
same flavour or of different flavour with the same probability. The background thus cancels in the sub- 
traction: 

N(e + e-)/f3+f3N(n + Ll-)-N(e ± n T ) (6) 
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where /3 = 0.86 is an efficiency correction factor equal to the ratio of the electron and muon reconstruc- 
tion efficiencies. The value of /3 is taken from [8,9], and is assumed in the following to be known with 
an uncertainty of 10%. 

3 Dilepton edges 
3.1 Event Selection 

Events with two or three isolated leptons (electrons or muons) with pj > 10 GeV and |t}| < 2.5 are 
selected. If two leptons are selected, they are required to have opposite signs. If three leptons are 
present, the two opposite-sign combinations are considered and treated independently in the rest of the 
analysis. 

In order to select SUSY events and reject the Standard Model background it is necessary to require 
the presence of energetic jets and missing energy. The variables used to discriminate SUSY from the 
SM background are the transverse missing energy, the transverse momenta of the four leading jets, the 
ratio between the transverse missing energy and the effective mass, and the transverse sphericity (St). In 
order to optimise the cuts on these variables, the value of: 

S = (A^ossf - Nosdf)/ v / A'ossf + A'osdf (7) 

is maximized for each SUSY point, where jVossf and AfosDF are the number of same-flavour and different- 
flavour lepton pairs respectively. 

The S variable can be computed from collider data, since no Monte Carlo information is used. By 
maximizing the value of S we are maximizing the selection efficiency for signal events while suppressing 
the Standard Model and the SUSY combinatorial backgrounds. 

In order to improve the sensitivity to the signal, only lepton pairs with an invariant mass ma < 
m « ge + GeV are considered. Since the true value of the endpoint is a priori unknown, this choice 
implies that the edge has already been observed, and that afterwards the selection cuts are optimised as 
described here in order to improve the separation between signal and background and the measurement 
of the endpoint. We are thus focusing here on determining selection cuts that would allow a precise 
measurement of the endpoint with moderate statistics, rather than on finding the first evidence for an 
excess of different-flavour lepton pairs or the first evidence for the presence of the edge. 

In Table 2 the optimal selection resulting from the scan is shown. For all three points a 2-jet selection 
is preferred, leaving out cuts on the third and fourth jets, on Sj and on the ratio £"™ SS /M e ff. For the 
"Coannihilation" point (SU1) and the "Bulk" point (SU3) the 5- value is found to be stable in an interval 
around the maximum value. For the "Low Mass" point (SU4) the best 5-value is found for the loosest 
cut allowed by the available Monte Carlo samples. Hence even looser cuts may be preferred as far as the 
value of S is concerned. The cuts on E™ 1SS and the pj of leading jet are however required in order to 
have a high trigger efficiency 1 . 

The number of signal and background lepton pairs passing the selection cuts is shown in Table 3. All 
numbers are for 1 fb _1 . The main Standard Model background is always tt accounting for about 95% of 
the total background. The remaining background events are from W, Z and WW, WZ, ZZ production. 
The background due to QCD jets is negligible. The fraction of SUSY events in the selected sample with 
OSSF leptons is 59% for SU1, 77% for SU3, and 80% for SU4. 
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Table 2: Results of the event-selection optimisation for the S-variable Eq. (7) for signal (s) and Standard 
Model background (b) with range limit m u < mf^ e + 10 GeV, for SU1, SU3 and SU4 for 1 ftr 1 . The 
best selection is shown. 









Pt £ t 1ss 


£™ SS /M eff 5 T 


^OSSF 


SOSDF 


^OSSF 


^OSDF 


S 


SU1 


200 


150 - 


- 120 




120 


64 


69 


53 


5.1 


SU3 


180 


100 - 


- 120 




615 


149 


93 


92 


15.1 


SU4 


100 


50 - 


- 100 




3048 


1574 


411 


419 


19.9 



Table 3: Number of lepton pairs passing the selection cuts optimized for the SUSY sample SU1 (above), 
SU3 (middle) and SU4 (below), for 1 fb _1 of integrated luminosity. The contribution from it produc- 
tion is indicated separately as it constitutes most of the Standard Model background. The remaining 
background events are from W, Z and WW, WZ, ZZ production. The background due to QCD jets is 
negligible. 



Sample 


e + e 


H+H~ 


OSSF 


OSDF 


SUSYSU1 


56 


88 


144 


84 


Standard Model (it) 


35 (35) 


65 (63) 


101 (99) 


72 (68) 


SUSY SU3 


274 


371 


645 


178 


Standard Model (ft) 


76 (75) 


120(115) 


196 (190) 


172 (165) 


SUSY SU4 


1729 


2670 


4400 


2856 


Standard Model (ft) 


392 (377) 


688 (657) 


1081 (1035) 


1104(1063) 



3.2 Reconstruction of the dilepton edge 

The distribution of the invariant mass of same-flavour and different-flavour lepton pairs is shown in 
Fig. 1 for the SUSY benchmark points and backgrounds, after the selection cuts optimized for SU3 (left 
plot) and SU4 (right plot), and for an integrated luminosity of 1 fb _1 and 0.5 fb _1 respectively. It can 
be seen from regions where the signal does not contribute (i.e. for the Standard Model backgrounds 
and for mu > m e ^ ge for SUSY) that the different-flavour distributions are similar to the same-flavour 
backgrounds. 

The invariant mass distribution after flavour subtraction is shown in the left plot of Fig. 2 in the 
presence of the SU3 signal and for an integrated luminosity of 1 fb _1 . The distribution has been fitted 
with a triangle smeared with a Gaussian. The value obtained for the endpoint is (99.7 ± 1.4 ±0.3) GeV 
where the first error is due to statistics and the second is the systematic error on the lepton energy scale 
and on the /3 parameter [2]. This result is consistent with the true value of 100.2 GeV calculated from 
Eq. (5). 

The right plot of Fig. 2 shows the flavour-subtracted distribution in the presence of the SU4 signal for 
an integrated luminosity of 0.5 fb _1 . The fit was performed using the function from [5] which describes 
the theoretical distribution for the 3-body decay in the limit of large slepton masses, smeared for the 
experimental resolution. This function vanishes near the endpoint and is a better description of the true 
distribution for SU4 than the triangle with a sharp edge. The endpoint from the fit is (52.7 ± 2.4 ± 

'For this channel, both the lepton triggers and the trigger based on £™ ss may be relied upon. The latter are however less 
efficient, and they would imply different pj thresholds for electrons and muons. 
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m(ll) [GeV] [GeV] 



Figure 1: Left: distribution of the invariant mass of same-flavour and different-flavour lepton pairs for 
the SUSY benchmark points and backgrounds after the cuts optimized from data in presence of the SU3 
signal (left), and the SU4 signal (right). The integrated luminosities are 1 fb~' and 0.5 fb -1 respectively. 




Figure 2: Left: Distribution of invariant mass after flavour subtraction for the SU3 benchmark point with 
an integrated luminosity of 1 fb -1 . Right: the same distribution is shown for the SU4 benchmark point 
and an integrated luminosity of 0.5 fb -1 . The line histogram is the Standard Model contribution, while 
the points are the sum of Standard Model and SUSY contributions. The fitting function is superimposed 
and the expected position of the endpoint is indicated by a dashed line. 



0.2) GeV, consistent with the theoretical endpoint of 53.6 GeV. 

Since the true distribution will not be known for data, the distribution was also fitted with the smeared 
triangle expected for the 2-body decay chain. This also gives a good X 2 with an endpoint of (49.1 ± 
1.5 ±0.2) GeV. A larger integrated luminosity will be required to use the shape of the distribution to 
discriminate between the two-body and the three-body decays. 

In Fig. 3 the flavour-subtracted distribution of the dilepton mass is shown for the SU1 point at an 
integrated luminosity of 1 fb -1 (left) and 18 fb -1 (right) 2 . While there is already a clear excess of 
SF-OF entries at 1 fb -1 , a very convincing edge structure cannot be located. At 18 fb -1 the two 
edges are visible. A fit function consisting of a double triangle convoluted with a Gaussian, the latter 

2 Only 1 fb -1 of simulated Standard Model background was available. To scale the Standard Model contribution to higher 
luminosities a probability density function for the m(ll) distribution was constructed by fitting a Landau function to the 1 fb~' 
distribution, as-- for e + e~, and e ± ^ and normalisation according to a /3 of 0.86. The 

systematic uncertainty on the endpoint determination from this procedure was estimated to be a small fraction of the statistical 
uncertainty. 



1622 



Supers ymmetry - Measurements from Supersymmetric Events 




Figure 3: Distribution of invariant mass after flavour subtraction for the SU1 point and for an integrated 
luminosity of 1 ftr 1 (left) and 18 fb 1 (right). The points with error bars show SUSY plus Standard 
Model, the solid histogram shows the Standard Model contribution alone. The fitted function is super- 
imposed (right), the vertical lines indicate the theoretical endpoint values. 



having a fixed width of 2 GeV, returns endpoint values of 55.8 ± 1.2 ± 0.2 GeV for the lower edge and 
99.3 ± 1.3 ±0.3 GeV for the upper edge, consistent with the true values of 56.1 and 97.9 GeV. As can 
be seen from Fig. 3 (right) the mu distribution also contains a noticeable contribution from the leptonic 
decay of Z bosons present in SUSY events. Even though the upper edge is located close to the Z mass, 
adding a Z peak of fixed mass and width to the fit function only affects the endpoints at the 0.2-0.3 GeV 
level. However the Z peak changes the normalisation of the upper triangle so for considerations of 
couplings and branching ratios it should be included. 

4 Leptons+Jets edges 

In events selected for the dilepton analysis in the previous section, jets are added to construct further 
distributions as described in Sect. 2. Additional selection cuts are applied to refine the distributions: 

mu < m e ^ ge + Ai (all distributions) (8) 
muq < mf^ + A 2 (mi q distributions) (9) 

Here m e ^ ge and mf^ refer to experimental values found in this and the previous section. The value Ai 
is a small number, 10 (3.3) GeV for SU3 (SU4), to account for the fact that the edge stretches slightly 
beyond the fitted endpoint. One can see from Fig. 1 that this cut should be very effective for SU4, but 
much less so for SU3. The value A2 serves a similar purpose, but since the determination of is less 
reliable, a looser cut is used, 155 (37) GeV for SU3 (SU4). 

The invariant mass distributions mu q and rri£ q are shown in Figures 4 and 5 for both the "Bulk" point 
(SU3) and the "Low Mass" point (SU4) after efficiency-corrected flavour subtraction. 

While the Standard Model background causes considerable bin-by-bin fluctuations for integrated lu- 
minosities < 1 fb -1 , the net contribution of entries beyond the endpoints is mainly due to combinatorics 
from choosing the wrong jet in a true SUSY event. For the distributions where a clear tail is visible, a 
straight line is assumed for the background, otherwise it is set to zero. (The statistics box of the plots 
indicates which background hypothesis is used.) Since the tail is due to SUSY events, it will not be 
known beforehand and a data-driven approach (not described here) would be required. 

The lepton+jet distributions have shapes which depend on the sparticle masses [10]. Depending on 
the sparticle spectrum, the edge region may contain non-trivial features such as experimentally unde- 
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tectable 'feet' containing very few events or vertical drops. For all the relevant distributions of two-body 
scenarios, analytic formulas describing the shape in terms of the sparticle masses are known [11, 12]. 
With low statistics a straight-line fit is likely to give to give a sufficiently good description in many cases. 
All the edges and thresholds were fitted with the following formula, 

f{m) = -jL- /exp ( - (m ~^ 2 ) max{A(m / - m EP ) , 0} dm' + max{a + bm, 0} , (10) 
\/2ko J ^ 2a / 

where m EP represents the endpoint (or threshold), A is the slope of the signal distribution, while a and 
b are the background parameters. The Gaussian smearing gives a smooth transition between the two 
straight lines and mimics in a simple way the smearing of an edge due to mismeasurement of jet mo- 
menta. The smearing parameter o was fixed to 15 GeV. The fitted endpoint values were found not to be 
very sensitive to the choice of o in the range 0-20 GeV. For ma q and the m (q distributions the integration 
range is (0, m EP ). For the mf^ q distribution the lower integration limit is m EP while some 100-200 GeV 
above the upper fit range is a safe upper integration limit. The endpoints resulting from the fits to the 
distributions in Figures 4 and 5 are summarised in Table 4. 

For the "Bulk" point (SU3) the edges are found to be sufficiently well described by a straight line 
in Eq. (10) for the signal region, however describing the background region by a straight line results in 
large systematic uncertainties in the endpoint fit. In particular, the mu q distribution does not have a clear 
edge. Even though the mu q cut in Eq. (9) removes a considerable amount of background for the m eq (\ ov/ ) 
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Figure 5: Efficiency-corrected flavour-subtracted distributions of m {q ^ igh ) (top) and m tq ^ ow ) (bottom) for 
SU3 (left) with 1 ftr 1 and SU4 (right) with 0.5 ftr 1 of integrated luminosity. The points with error bars 
show SUSY plus Standard Model, the solid histogram shows the Standard Model contribution alone. 
The fitted function is superimposed, the vertical line indicates the theoretical endpoint value. 



distribution there is still some background left for the m^ high ^ distribution. A systematic uncertainty is 
assigned to account for the background estimation for both fits. In case of «J^( low ) a background-free fit 
can be made resulting in a few GeV uncertainty. The mfjf q distribution is expected to be concave, so a 
systematic uncertainty is added in the estimation when fitting the threshold by a straight line fit. 

For SU4, the m« 9 and both of the me q distributions have edges which are well described by the fit 
function Eq. (10). This is confirmed by the dominance of the statistical errors over the systematics ones. 
The mfpq fit is more problematic resulting in somewhat larger errors. This could be expected since the 
contributions from different-family and non-signal same-family peaks in this mass region, whereas they 
are close to vanishing in the edge regions of the other distributions. Another reason (although probably 
of less importance at 0.5 fb _1 ) is that the threshold edge is concave and only moderately well described 
by a straight line. 

While the fit values of m^ q e and m^ q are compatible with the theoretical values, the fitted w^ igh ) 
and 'w^ x ow j are off by 2a and 4a, respectively. This comes from the fact that in SU4 the decay of X2 is a 
three-body decay. In such scenarios the mi q distributions, and in particular m^( low ) are often so sparsely 
populated towards the high mass values that the endpoints are not experimentally deducible from the 
edges. Note, however, that there is no hint from the x 2 of the »i^( low ) fit that we are in such a situation. 
This topic is discussed further in Sect. 9. 1 where endpoint relations are inverted to give sparticle masses. 
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Table 4: Endpoint positions for SU3 and SU4, in GeV. The first error is statistical, the second and third 
are the systematic and the jet energy scale uncertainty, respectively. The theoretical values are also given 
for ease of comparison to the left of the fitted values. The integrated luminosity assumed is 1 ft -1 for 
SIBand 0.5 ft- 1 forSU4. 



Endpoint 


SU3 truth SU3 measured 


SU4 truth SU4 measured 


w edge 
m ttq 

™% 

m E ow) 
m ^(hi g h) 


501 517±30±10±13 
249 265±17±15±7 
325 333±6±6±8 
418 445±11±11±11 


340 343±12±3±9 
168 161±36±20±4 
240 201±9±3±5 
340 320±8±3±8 



5 Tau signatures 

5.1 Determination of the di-tau endpoint position 

The endpoint of the invariant mass distribution from two taus emerging from a %2 decay, 

Xi ^TiT^ZiVt^ , (11) 

depends on the masses of the Xi, the X\ and the fi, and therefore can contribute to the determination of 
SUSY parameters. 

Taus play an important role in scenarios like mSUGRA where X® is mostly wino, and therefore 
preferentially couples to L-type sfermions. The large L-R mixing in the stau sector significantly enhances 
the branching ratio for the decay X2 — > f^r T with respect to other leptons. In the scenarios SU1 and 
SU3 studied here, for example, the branching ratio for the decays into taus is a factor of 10 larger than 
for decays into electrons or muons. Since in many models (including mSUGRA) the f\ is the lightest 
slepton, for certain values of the SUSY parameters the only allowed two body decay is X2 — ► tt as 
Xi — > Xih, Xi — > XiZ, and X2 —> ee or jXjj. are kinematically forbidden. 

Finally, whereas mass information about Xi and X2 can often be obtained more precisely from X2 
decays into electrons and muons (if these decays are open), decays into taus are needed to probe the f 
mass parameters. 

In contrast to X2 decays into electrons or muons, the di-tau invariant mass spectrum does not have a 
sharp endpoint at the maximum kinematic value. Due to the presence of neutrinos from the tau decays, 
the m xx distribution (where m TT indicates the invariant mass of the visible decay products of the tau 
pair) falls off smoothly below the maximum value given by either Eq. (4) or Eq. (5). Only hadronic tau 
decays are considered for tau identification: the tracking-seeded reconstruction algorithm [13] is used 
to reconstruct taus in the "Coannihilation" point (SU1), while the calorimeter-seeded algorithm [13] is 
used for the "Bulk" point model (SU3). This choice is motivated by the higher efficiency of the former 
in reconstructing the low pj taus that are present in the SU1 model. 

The SU1 point also has a considerably lower cross-section than the SU3 point, so different selection 
procedures are used to maximize the signal significance. For the SU3 point events are selected with two 
taus, £"™ 1SS > 230 GeV, and at least four jets with pj greater than 220, 50, 50, 30 GeV respectively. For 
the SU1 point the cut on £"™ 1SS is relaxed to 100 GeV and at least two jets with pj greater than 100 and 
50 GeV respectively. In addition, an elliptical cut in the space of E™ 1SS and the sum pj(l) + pr(2) of the 
two highest pj jets is applied to SU1. The semi-axes of the ellipse are 450 GeV for Ej lss and 500 GeV 
for the sum of jet pj. This cut exploits the anticorrelation between £"™ ss and (px(l) + Pt(2)) which is 
different for the signal and the Standard Model background. 
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Figure 6: Invariant mass distribution of opposite-sign tau pairs with same-sign tau distribution subtracted, 
for the SU1 (18 fb _1 , left) and SU3 scenarios (1 fb -1 , right). The dashed histogram in the left plot shows 
the distribution at the generator level, while points show the reconstruction-level distribution. 



The invariant mass distribution emerging after the above cuts are applied is shown in Figure 6 for the 
SU1 and SU3 scenarios, where the corresponding distribution of two taus with the same sign of electric 
charge is subtracted from tau pairs with opposite sign in order to reduce combinatorial background. This 
is possible because uncorrected and fake taus should arise about as often with the same charge as with 
opposite charges. To decrease the SUSY background further, the two taus arising from the same decay 
chain are required to have a maximum separation AR < 2 in the T]-0-plane. 

The following log normal function with three parameters, inspired by [14], is used to fit the m xx 
distribution: 

/W = El . exp ("^|(lnW -Pi) 2 ) (12) 

This function does not contain the endpoint position explicitly, but approaches the x-axis asymptotically. 
The endpoint is then derived from the inflection point mjp of the fit function: 




using a Monte Carlo-based calibration procedure. 

The inflection point obtained for 14 SU3-like models is plotted against the theoretical endpoint value 
for each of these models. The SU3-like points are generated using the ATLAS fast simulation program 
[15] and varying the masses of the X® , the fi and the X\ separately, while keeping the other two masses 
fixed. In Figure 7 the inflection point is plotted as a function of the endpoint m E p and fitted with a straight 
line, yielding the following calibration function: 

m IP = (0.47 ± 0.02)m EP + (15 ±2) GeV (14) 

The covariance between the slope and the axis intercept is —0.034 GeV. 

The fit using function Eq. (12) for the SU1 and SU3 models is shown in Fig. 6 giving an inflection 
point at mip = 48±3 GeV and mjp = 62±8 GeV respectively, which translates into endpoints at m E p = 
(70 ± 6.5 stat ± 5 s y st ) GeV (SU1) and (102 ± 17 stat ± 5.5 s y st ) GeV (SU3) using the calibration relation in 
Eq. (14). The systematic uncertainty is dominated by the fitting procedure and is evaluated by changing 
the binning and fit ranges. The effects of 1% and 5% jet energy scale uncertainies have been tested and 
found to introduce an additional systematic uncertainty on the endpoint measurement well below 3%, so 
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Figure 7: Calibration curve showing the relation between the position of the inflection point (measured 
and fitted with function Eq. (12) after ATLFAST based detector simulation) and the endpoint (calculated 
with equation Eq. (5)) of the di-tau mass distribution. The SU3 point is not included. 

they are negligible compared to the systematic error introduced by the fitting procedure. The theoretical 
expectation for the SU1 (SU3) endpoint is 78 GeV (98 GeV). The difference between the theoretical 
and the extracted endpoint comes from the fitting and fit-calibration procedure rather than from detector 
effects. As Fig. 6 (left) shows, the generator-level distribution of the visible products is close to the 
reconstruction-level distribution and after fitting gives an endpoint value of 70 GeV for the SU1 scenario 
which is the same as for the reconstructed distribution. 

5.2 Impact of the tau polarization on the di-tau mass spectrum 

The method discussed in the previous section assumes a fixed polarization of the two taus from the Xi and 
therefore neglects the effect of the polarization on the invariant mass spectra. However, the polarization 
of the taus from the decay cascade can vary significantly between different SUSY models. Polarization 
effects on the di-tau mass distribution are studied by simulating samples of events where the polarization 
of the two taus from the decay Eq. (1 1) is allowed to vary. The ATLAS fast simulation program [15] is 
used to simulate a data sample equivalent to 51 fb _1 of data. 

Parity violation in weak interactions in conjunction with momentum and angular momentum conser- 
vation leads to a correlation between the visible tau energy and the polarization of the tau. In case of the 
decay T— > V T 7T~, since the pion is a scalar particle the neutrino spin is forced to be parallel to that of 
the tau and therefore the fixed neutrino helicity determines the neutrino momentum direction. Thus, to 
conserve momentum, the direction of the pion momentum is forced to be parallel or antiparallel to that of 
the tau depending on the tau polarization. This leads to the pion getting a boost parallel or antiparallel to 
the tau momentum resulting in harder and softer pions. This affects the di-tau mass spectrum, as shown 
in Fig. 8. The curves in the plot are theoretical predictions from [16]. The invariant masses of taus with 
left chirality (LL) are on average smaller than for right (RR) taus. 

For decays via the vector mesons p and a\, the momentum of the vector meson has the same (oppo- 
site) direction as in the case of pions for longitudinal (transverse) polarization of the vector meson. 

Adding all hadronic tau decay modes finally yields the invariant mass distributions shown in Figure 8 
for the chirality options LL, RR and LR/RL. The position of the trailing edge is clearly shifted for 
different polarizations whereas the shape difference calculated in [16] is barely visible after detector 



1628 



Supers ymmetry - Measurements from Supersymmetric Events 



simulation. 
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Figure 8: Left: Di-tau invariant mass spectrum for % — > 7TV T decays as obtained from Monte Carlo truth 
information together with the expectation from theory. Right: Di-tau invariant mass spectrum for all 
hadronic decays after an ATLFAST based detector simulation. Both plots show the mass distributions 
for the chirality states LL, RR and LR/RL. 



As a consequence, the inflection point of the distribution is shifted due to polarization effects. The 
maximum difference in the inflection point measurement Amipa(RR — LL) has been found to be 7 GeV 
which is comparable to the statistical error on the position of the inflection point presented in the previous 
section. Without additional information on the tau polarization we might quickly reach a point where it 
is not possible to improve the di-tau endpoint measurement. In this case the achievable precision for an 
integrated luminosity of 1 fb~ ! for the SU3 model is: 

m™ x = 102 ± 17 stat ± 5.5 syst ± 7 po1 

The uncertainty due to the polarization effects dominates over the other systematic uncertainties and 
therefore needs special attention. A study of the different polarization dependencies of the decay kine- 
matics for different decay modes of the tau might be helpful. As the emission direction of vector mesons 
is opposite for longitudinal and transversal states the net effect is determined by the branching ratio into 
the two states. It turns out that for the a\ there are as many longitudinal as transverse polarized whereas 
for the p there are more longitudinal vector mesons, leading to different polarization dependencies. 

6 <?r pair reconstruction 

Events where a pair of <7r particles is produced, and where each decays through the process 

q R -> Xiq (15) 

lead to a characteristic signature with two high-^x jets and large £™ ss from the escaping neutralinos. q R 
pair production represents about 10% (5%) of the total SUSY production cross-section for the "Bulk" 
point SU3 ("Low Mass" SU4). For both points (and more generally in most of mSUGRA parameter 
space) the <7r decays almost entirely through the process in Eq. (15). 

Events with large E™ 1SS and a pair of high-py jets are selected by requiring: 

• E™ iss > max(200 GeV, 0.25M eff ) and M eff > 500 GeV 

• Two jets with p T > max(200 GeV,0.25M eff ), |rj| < 1 and AR > 1 
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Table 5 : The total event yield and the number of Standard Model background events satisfying selection 
criteria for HI72 reconstruction, signal-to-background ratio and signal statistical significance. Only errors 
from the detector systematic uncertainties are quoted. 





Integrated luminosity (fb 1 


Event yield 


Standard Model S/B 5M 


SA/ b sm 


SU3 


1.0 


282 ± 20 


18 14.7 ±1.1 


62.2 ±4.7 


SU4 


0.5 


258 ± 65 


9 27.7 ±7.2 


83.0 ±21.7 



• No additional jet with p T > min(200 GeV, 0. 15M eff ) 

• No isolated leptons and no jets tagged as b jets 

• Transverse sphericity Sj > 0.2 

These selection cuts are tuned using the information from the event generation to select <?r pair produc- 
tion. 

The systematic uncertainty originating from the jet and £™ ss energy scale and resolution [2] change 
the event yield by 7% for the case of SU3 and by 25% for the case of SU4. The systematic effect on the 
energy scale and resolution of leptons is negligible. 

The total event yield and the number of Standard Model background events satisfying the selection 
criteria for 1 fb _1 for SU3 and 0.50 fb _1 for SU4 are given in Table 5 together with the signal-to- 
background ratio and the signal statistical significance. 

To reconstruct the <?r mass we use the the mji variable [2, 17, 18], sometimes referred to as "stran- 
verse" mass. This variable uses the kinematic features of the ^r decays to reconstruct mq R 3 making the 
assumption that m^o is known from the measurements in Section 4. The mji distributions for the SU3 
and SU4 points are shown in Figure 9. As is common for SUSY measurements, the distribution is ex- 
pected to have an edge at m^ R rather than a peak. A linear fit is applied to the right part of the distribution 
to determine the edge position at 590±9(stat)+^(sys) GeV for SU3 and 421 ± 17(stat)+^°(sys) GeV for 
SU4. This can be compared to the expected positions of mq R =611 GeV for SU3 and m^ R = 406 GeV 
for SU4. The systematic error accounts for the choice of the fit limits as well as the jet energy scale 
systematic. 

7 Light stop signature 

In the "Low Mass" benchmark point (SU4), the SUSY masses are all in the range m^o = 60 GeV < m < 
m~ h = 445 GeV. The stop t\ is light (m ( ~ = 206 GeV) and always decays by t\ —> xfb. A detailed analysis 
of the phenomenology of this point can be found in [19]. 

At this SU4 benchmark point the light stop is produced in the gluino decay Eq. (3) which has a 
branching ratio of 42%. Associated gluino production with a or ^r followed by the decay in Eq. (3) 
occurs in ~ 18% of all SU4 events. In the decay Eq. (3) the final state tb invariant mass distribution has 

3 For the benchmark point considered here, the effect of the SUSY background (i.e. events other than the q R pair production 
which pass the event selection) on the position of the edge is small. This has also been shown to be true for other benchmark 
points [10]. However, the identification of the edge as a measurement of the mass may not hold for all the SUSY parameter 
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Figure 9: Fit of the sum of the reconstructed mr 2 distributions in the selected SUSY and the remaining 
Standard Model background events with 1 fbr 1 for SU3 and 0.5 fb _1 for SU4. 



the upper kinematic endpoint: 

r 2 2 -i 1/2 

M max (tb) = mf + l im2 ^ ((,4-;H?-m f 2 ) + ^-K-m0 2 )H-K+m^2)) . (1 6 ) 

With m- g = 413 GeV, m- % ± = 113 GeVand a top mass of 175 GeV, Eq. (16) gives 

M m&x {tb) ~ 300 GeV. (17) 

The other significant decays at this benchmark point which lead to the same final state are: 

g -► hb^Xitb, (18) 

g -> bib^hWb^xtbbW, (19) 

g -► b 2 b^xftb, (20) 

g b 2 b^hWb^xfbbW. (21) 

The final states from the decays Eq. (19) and Eq. (21) are equivalent to the final state from the decay 
Eq. (3) if the bW invariant mass is close to the top mass. Associated gluino production with left or 
right squark followed by these decays occurs in 4% (Eq. (18)), 9% (Eq. (19)), 0.1% (Eq. (20)) and 0.9% 
(Eq. (21)) of all SU4 events. Due to the small mass difference between g and the b\ or b 2 , the final states 
Eq. (18) and Eq. (19) can be suppressed by imposing a minimum cut on the pj of the b jet, while the final 
states Eq. (20) and Eq. (21) are suppressed because jets originating from the gluino decay g — > b 2 b are 
on the average below the detection threshold. 

In order to extract light stop signal from the gq events where gluino decays to stop and top, the final 
state tb invariant mass distribution in Eq. (3) is reconstructed for top quark decays into hadronic final 
states only: 

t -> Wb -> qqb (22) 

making no assumptions about the %\ decay modes which dominantly produce two additional light-quark 
jets. The hardest jet in the event is assumed to be the light-quark jet originating from the decay of the 
left or right squark produced in association with the gluino. 

We select jets with pj > 20 GeVand |tj | < 2.5. In this range the ^-tagging efficiency is about 60% [1]. 
The event selection requires the following: 
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• At least 5 jets in the event with pj > 30 GeV, where 

- The hardest jet is a light-quark jet with pj > 100 GeV, 

- 2 and only 2 jets are tagged as b jets and they have pj > 50 GeV and 

- At least 2 of the light-quark jets have pj > 30 GeV 

• £™ ss > 150 GeV, M eff > 400 GeV, £™ SS /M eff > 0.2 

• S T >0.1 . 

Of the Standard Model backgrounds simulated, only events from our tt and QCD samples satisfy 
these selection criteria. No W + jets or Z + jets events pass the cuts. 

The dominant detector-performance systematic uncertainties come from the jet and Ej lss energy 
scale and resolution. Assuming that the uncertainties for 200 pb _1 are the same as for 100 pb , i.e. 
10% uncertainties on the jet and £"™ ss energy scale and resolution [20], and including 5% uncertainty 
on the ^-tagging efficiency, the resulting systematic uncertainty in the number of selected events would 
be ~ 40% for SU4 and ~ 50% for tt . 

The top-bottom invariant mass is reconstructed for events satisfying the selection criteria for the light 
stop search: 

• Excluding the hardest jet, all light-quark jets with p^ > 30 GeV are combined into dijet pairs. 

• All such pairs with invariant mass within the window \mjj — mw\ < 15 GeV are combined with 
each of the two b jets and the b jj combination with invariant mass closest to the top mass is 
selected. 

• The four-vectors of this dijet pair are rescaled such that mjj = my/ and ntbjj is recalculated and 
accepted as top candidate if \rribjj — m top \ < 30 GeV. 

• The same bjj combination is combined with the other b jet and m t \, calculated, with the require- 
ment that the angle between top and bottom be AR(t,b) < 2. 

The W sideband method [21-23] is used to estimate SUSY combinatorial background originating 
from supersymmetric processes in which jet pairs accidentally have an invariant mass within our W- 
mass window, and so fake W bosons. The sidebands used are the regions of dijet invariant mass 30 GeV 
below and 30 GeV above our IV-mass window. The fake W boson contribution to the m t b distribution is 
evaluated as the average contribution of the jet pairs from the W sidebands after they have been scaled 
linearly to the W mass zone and the procedure of m t b reconstruction has been repeated. 

The numbers of signal and the remaining Standard Model background events, together with the total 
event yield at 200 pb 1 , are listed in Table 6. The m t b distribution reconstructed in signal events without 
the subtraction of the SUSY combinatorial background and the SUSY combinatorial background itself 
are plotted in Figure 10. The contribution of fake W bosons in SU4 is higher than the number of events 
remaining after subtraction. 

The m t b distributions before and after the subtraction of the SUSY combinatorial background are 
shown in Figure 10. The background-subtracted distirbution is fitted in order to extract the endpoint. 
The resulting m t b distribution is fitted with a triangular function smeared with a Gaussian: 



where the kinematic endpoint, Af max , and the smearing, a, are two of the five fit parameters. The smear- 
ing, a, models the experimental resolution of the reconstructed m t b. The position of the upper kinematic 
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Table 6: The number of signal and remaining Standard Model background events and the total event 
yield at 200 pb~ ! . The last row gives the signal-to-background ratio. The errors quoted are from detector 
sources of systematic uncertainty. 



L = 200pb~ 1 


Initial selection 


\m bj j-m 


| < 30 GeV 


AR(t,b) < 2 






without 


with 


without 


with 






Wsub. 


Wsub. 


Wsub. 


W sub. 


SU4 


963 


537 


224 


267 


120 


tt 


99 


28 


13 


9 


4 


QCD 


6 


3 


2 


3 


2 


Total 


1068 ± 426 


568 ± 225 


239 ± 95 


279 ± 109 


126 ± 50 


SU4/(?f+QCD) 


9.2 ±4.1 


17.3 ±7.3 


14.9 ± 6.3 


22.3 ±9.1 


20.0 ± 8.3 



endpoint obtained from the 5-parameter fit (Figure 10) is M max = 297 ± 9 GeV with a = 28 ± 7 GeV 
corresponding to ~ 10% of the M max value. The position of the upper kinematic endpoint obtained from 
the 4 parameters fit is M max = 298 ± 6(stat)+4j (sys) GeV with a set to 10% of M max . The expected 
value of m t b given by Eq. (17) is 300 GeV. 




M(tb) [GeV] M(tb) [GeV] 



Figure 10: Left: Reconstructed m t b distributions in signal and SUSY combinatorial background events. 
Right: The 5 parameters fit of the sum of the reconstructed m t b distributions in signal and the remaining 
Standard Model events after the subtraction of the SUSY combinatorial background; all at 200 pb _1 . 



8 Higgs signatures in SUSY events 

In the context of supersymmetric models, Higgs bosons at the LHC can be produced in proton collisions 
either through direct interaction of Standard Model particles, such as gluon-gluon fusion, or through the 
decay of a supersymmetric particle produced in the initial interaction. 

We will consider the possibility of observing the lightest CP-even h boson via the second mechanism. 
In this case, a missing transverse energy signature, typical of R-parity conserving SUSY scenarios, can 
be reconstructed in association with the Higgs boson and exploited to reduce the background, making it 
possible to study the dominant decay channel h — > bb, which is otherwise hidden by the enormous QCD 
continuum. 
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Within mSUGRA the most promising source of Higgs production is the decay of a second-lightest 
neutralino. Indeed, X2 —> X\h, if open, dominates the X2 — >• X\Z mode, because the two lightest neu- 
tralinos are basically gauginos, so that the higgsino-gaugino-Higgs vertex is enhanced with respect to 
the higgsino-higgsino-gauge one. However, if the sleptons are lighter than the X2, the decay channels 
and VtVi open up, dominating the X2 width. As a consequence, we expect that mSUGRA points 
interesting for Higgs searches will not show a clear di-lepton signature. The benchmark point chosen for 
this analysis is SU9, at which BR{xl -► Xih) ~ 87%. 

Exploiting the capabilities of the ATLAS detector in missing transverse energy measurement and b 
tagging, the passage of weakly interacting particles may be revealed and a bb pair with invariant mass 
peaking around the Higgs mass can be reconstructed. 

Standard Model events with similar signatures which are backgrounds for this analysis, include 
events with neutrino production causing a genuine £"™ ss signal and QCD events with fake £"™ ss gen- 
erated by instrumental effects. SUSY events can themselves constitute a background, as they contain 
many b-]et candidates, both true and mistagged. These can be divided into two categories: SUSY cas- 
cades without and with production of a Higgs decaying to bb. In the latter case potential signal events 
may be incorrectly reconstructed as the selected bb pair is not the one coming from the Higgs decay. 
We will refer to the former type simply as "SUSY background" and to the latter as "combinatorial back- 
ground". 

The following selection cuts are applied: 

1. Ef ss > 300 GeV; 

2. two light-flavoured jets with pj > 100 GeV; 

3. two b jets with p T > 50 GeV; 

4. no leptons with pj > 10 GeV. 

The first two cuts are typical of SUSY analyses, while the pupose of the last cut is to suppress back- 
grounds from tt and W production. 

When three or more b jets with transverse momentum greater than 50 GeV are found in a single event, 
the second and third leading -px b-jets are chosen as the candidate Higgs decay pair. This is because an 
important source of b jets is the decay of a bottom squark to X2 and b and since — m%o ~ 500 GeV > m/, 
the sbottom daughters get more allowed phase space than the Higgs daughters and thus, in general, higher 
p T . 

In Figure 1 1 (left) the invariant mass of the selected b jet pairs is shown assuming 10 fb _1 of collected 
luminosity. The shaded histogram corresponds to the sum of the Standard Model backgrounds, the 
dashed and dotted lines are the SUSY and combinatorial backgrounds respectively. These last two, 
together with the tt production, are the most important backgrounds. The black curve is the result of 
a least squares fit to a Gaussian function, representing the Higgs resonance, superimposed on a second 
degree polynomial background. The estimated number of signal and background events is obtained by 
counting the b pairs with invariant mass inside a ±25 GeV range around the fitted peak centre. The 
signal significance, computed in the Gaussian approximation as the number of signal events over the 
square root of the background, is about 14. 

Table 7 summarises the expected event rates after the application of the selection cuts and after the 
additional mass window request. 

SUSY mass spectrum information is reconstructed in the SU9 model by studying the decay: 

<?l — > #2 q — > Xihq. (23) 
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Figure 1 1 : Invariant mass of the selected b-jet pairs (left) and invariant mass of the system consisting of 
the Higgs plus the jet minimising mi, q (right) for 10 fb _1 of integrated luminosity. 



Table 7: Summary of the number of expected SUSY and Standard Model events after the application of 
the different selection cuts, for 10 fb _1 of integrated luminosity. 

SU9 Signal Comb BG Susy BG 

No cuts 11050 21950 

Cut 1,2, 3 356 946 908 

Cut 4 230 449 433 

±25 GeV mass window 179 76 76 



Standard Model 


tt 


Z 


W 


bb 


Cut 1, 2, 3 


133 


12 


22 


43 


Cut 4 


53 


8 


10 


21 


±25 GeV mass window 


11 


2 


4 


4 



As a consequence of two-body kinematics, the invariant mass of the Higgs-quark system shows both a 
minimum and a maximum value, related to different combinations of the masses of the SUSY particles 
involved. 

The events passing the previous selection cuts, including the mass window cut, are also required to 
have at least one b jet with pj > 100 GeV. Furthermore, a veto is imposed on additional ^-tagged jets 
with pi > 50 GeV. This will result in fewer signal events, but also in a reduced background contamina- 
tion yielding a clear distribution shape, albeit with lower signal significance. As in Section 4 the quark 
from the q\ is expected to produce one of the two highest-/;^ jets and the two distributions m^ in and 
m hq X ^ reconstructed using the jet that maximises and minimises the mj iq value, respectively. Since 
the background events will tend to concentrate toward low mass values, the m^ m (Figure 11, right) is 
used to determine the mass upper limit m hq e .^. The wz/ !l? ,threshold value can determined from the m^ m 
distribution. 

The mass edge value is obtained by fitting a triangular shape convolved with a Gaussian: 
'w/ i9 ,ed g e = 695 ± 15 (stat) ± 3 (syst) ± 35 (JES), 
to be compared to the true value of 732 GeV. The statistical uncertainty is the error on the fitted parameter, 
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while the systematic comes from the parameter dependence on the fitting boundaries. An additional 5% 
systematic error is expected from the jet energy scale (JES) uncertainty. 

The mass threshold evaluation is more challenging, since background events tend to populate the low 
mass region. For an integrated luminosity of 10 fb -1 , it was not possible to fit to the mass distribution. 
However, high-statistics studies performed with a fast simulation of the ATLAS detector show that a 
mass threshold value could be extracted from 300 fb _1 of collected data, expected after 3 years of LHC 
running at design luminosity. 

9 Mass and parameters measurement 

This section is devoted to the extraction of SUSY mass spectra and parameters from the measurements 
described in the previous sections of this note. As an example, the mSUGRA benchmark points SU3 
(with a luminosity of 1 fb _1 ) and SU4 (with 0.5 fb _1 ) are chosen to give a flavour of what might be 
expected in the initial phase of the experiment in case of a rather optimistic SUSY scenario. The small 
luminosity at this stage results in a limited number of available measurements and rather large uncertain- 
ties. In such a situation only models with few parameters can be fitted. In Section 9. 1 the masses of the 
decay chain Eq. (1) are derived from the measurement of the kinematic endpoints described in Section 3 
and 4. In Sections 9.2 to 9.4 the parameters of the mSUGRA model are derived instead. 

9.1 Measurement of masses from SUSY decays 

We use the information from the experimentally measured endpoints to extract the masses of the SUSY 
particles. As described earlier in this paper, in many cases analytic expressions have been deduced for 
the endpoints expressed in terms of the masses. Examples are shown in Eq. (4) and Eq. (5). If sufficient 
endpoints are known, then the masses can be deduced. Here we use a numerical % 2 minimization based 
on the MINUIT package to extract the SUSY particle masses from a combination of endpoints. We 
define the x 2 as 

~h °l ' 

For each of the n endpoint measurements, k, the quantities m™ ax and Ok denote the fit value and its 
uncertainty respectively. The t' k nax are the theoretical endpoint expressions [6, 10], which contain as 
parameters the masses of the two lightest neutralinos, the scalar quark , and (for the two-body decay 
chain only) the scalar lepton . As a starting point for the fit the generated masses are used. The masses 
are constrained to be positive in the fit and the mass hierarchies from the model input are enforced. 

With the statistics expected for an integrated luminosity of 1 fb _1 (0.5 fb _1 ) for SU3 (SU4) we 
observe instabilities in the fit. Depending on the fluctuation and precision of the endpoint measurements 
the fit does not converge. Some of the measured endpoints have shown possible deviations from the 
generated values of up to several standard deviations. Systematic effects to explain such discrepancies 
are identified in earlier sections of this article. Significant deviations distort the results and also negatively 
affect the fit convergence, especially in the presence of degenerate kinematic endpoint equations. We also 
note large correlations (typically larger than 95%) among the fitted parameters. This is expected, since 
the endpoints are most sensitive to mass differences. The correlations can also lead to difficulties with 
convergence and result in larger uncertainties for the fitted masses. 

To show a potential result with early data, we quote here the results from converging fits for both 
the SU3 and SU4 points. We use the endpoints from the lepton+jets edges summarized in Table 4 and 
the dilepton edge fit from Section 3 (99.7 ± 1 .4 ± 0.3 GeV for SU3 and 52.7 ± 2.4 ± 0.3 GeV for SU4). 
For the SU3 fit, all dilepton+jets edges are used. In the SU4-fit, we discard the m^^) measurement 
as it has been shown not to be very reliable and does not provide additional constraints in three-body 
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Table 8: Resulting SUSY particle masses and mass differences within SU3 and SU4 from the x 2 mini- 
mization fit using the dilepton and lepton+jets edges. Shown are the measured masses «i meas and mass 
differences Am meas followed first by the parabolic errors as returned by MIGRAD and then by the jet 
energy scale errors. When the measured parameter is anticorrelated with the jet energy scale variation, 
this is indicated by a =F sign. The input Monte Carlo masses mmc an d mass differences A/mmc ai" e a l so 
shown. The integrated luminosity assumed is 1 fb _1 for SU3 and 0.5 fb _1 for SU4. 



Observable 


SU3 m meas 


SU3 /jimc 


SU4 m meas 


SU4 m M c 




[GeV] 


[GeV] 


[GeV] 


[GeV] 


m tl 


88±60^2 


118 


62 ± 126 T 0.4 


60 




189±60t2 


219 


115±126t0.4 


114 


rriq 


614±91±11 


634 


406 ±180 ±9 


416 


m l 


122±61^2 


155 






Observable 


SU3 Am meas 


SU3 AniMc 


SU4 Am meas 


SU4 Aotmc 




[GeV] 


[GeV] 


[GeV] 


[GeV] 


m^o — m^o 


100.6 ±1.9^0.0 


100.7 


52.7 ± 2.4 =F 0.0 


53.6 


rriq — m^o 


526 ±34 ±13 


516.0 


344±53±9 


356 




34.2 ±3.8^ 0.1 


37.6 







decay scenarios such as SU4. The rii^^g^ endpoint can be measured more reliably and its kinematic 
expression only differs from the one of wi^( low j by a constant factor. The di-tau edges are not used here. 

The masses resulting from the % 2 fit are shown in Table 8 (upper part). The parabolic errors are the 
first errors shown in the table and the jet energy scale errors are the second. Asymmetric errors show 
a large uncertainty on the positive side. The jet energy scale errors are determined by varying all the 
endpoints along with their fully correlated jet energy scale uncertainties and refitting the masses. The 
difference in the central values of the fit is taken as the jet energy scale uncertainty for Table 8. One can 
see that the jet energy scale errors are small compared to the error on the masses, but might be relevant 
to the mass difference measurements. 

We note that a fit with SU3 kinematic assumptions applied to the SU4 endpoints also returns consis- 
tent masses. The decision about the mass hierarchy would thus have to be based on additional information 
from collider data. A possible source is the shape of the dilepton edge as discussed in Section 3. 

Besides the masses we can also extract differences of SUSY particle masses. These are more directly 
related to the endpoints and we expect to be able to determine them more reliably than the masses of 
individual sparticles. For mass differences, a % 2 similar to Eq. Eq. (24) is used, where the parameters are 
written in terms of mass differences to the neutralino X\ . We obtain the results shown in Table 8 (bottom 
part). 

We conclude that a first look at sparticle masses is possible with early data, although with large 
uncertainties. Appropriate model assumptions and additional information will probably have to be used 
to constrain the fits. 

9.2 Observables and fit assumptions 

To demonstrate the feasibility of parameter determination with initial data, we show the constraints one 
would obtain for our benchmark points if one assumed an mSUGRA framework. 

The SUSY parameter-fitting package Fittino version 1.4.1 [24] is used, interfaced to a beta version 
of SPheno3 [25] to perform the theoretical calculations for a given set of parameters. 



1637 



Supers ymmetry - Measurements from Supersymmetric Events 




280 290 300 310 320 330 340 U U -1500-1000-500 0 500 10001500 2000 25003000 



M 1/2 [GeV] A 0 [GeV] 



Figure 12: Two-dimensional Markov chain likelihood maps for mSUGRA parameters Mq andM^ (left) 
as well as tan/3 and Aq (right) for sign /x = +1, for benchmark point SU3, with integrated luminosity of 
1 fb _1 . The crosses indicate the actual values of the parameters for that benchmark point. 

The fit is given the measurements presented in sections 3, 4 and 6. The lepton and the jet energy 
scale uncertainties are each considered to be 100% correlated between measurements. Uncertainties on 
the theoretical predictions are not taken into account. For illustration purposes an additional parame- 
ter determination is performed where - following a prescription used in [26] - 1% (0.5%) uncertainty 
on the theoretical calculation of the pole masses of coloured (un-coloured) sparticles is assumed. No 
correlations between the theoretical uncertainties on the pole masses are considered. 

9.3 Markov chain analysis 

To obtain a first glimpse of the possible parameter space a Markov chain analysis is performed. With this 
technique it is possible to efficiently sample from a large-dimensional parameter spaces. This allows us 
to check whether there are several topologically disconnected parameter regions which are favoured by 
the given measurements. 

Figure 12 shows two-dimensional likelihood maps for Mq and My/ 2 (left) as well as tan/3 and Aq 
(right) for sign jx = + 1 obtained for the given set of measurements. The plots demonstrate that for a 
given sign fi preferred parameters are found around the true parameter points independent of the starting 
point. No further preferred regions occur. For Mq and My j 2 a clearly preferred region is found around 
the SU3 values of 100 GeV and 300 GeV, respectively. As expected, given the measurements used, the 
determination of tan/3 and Aq is more difficult. Nevertheless, here too the region around the nominal 
SU3 values is the preferred one. 

9.4 Parameter determination 

In order to determine the derived central values of the parameters and their uncertainties, for each as- 
sumption of the sign of }X a set of 500 toy fits are performed. For each fit the observables are smeared 
using the full correlation matrix. Simulated annealing followed by a Minuit fit started with the best 
parameter estimates from simulated annealing is subsequently run using the smeared observables. The 
four-dimensional distribution of parameters obtained from the toy fits is used to derive the parameter 
uncertainties and their correlations. Figure 13 shows the one-dimensional projections of parameter dis- 
tributions for Mq, My j 2 , tan/3 and Aq. The mean and RMS values of the results of the fit are reported in 
Table 9. As already indicated by the Markov chain analysis Mq and My / 2 can be derived reliably with 
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uncertainties ± 9.3 GeV and ± 6.9 GeV (RMS of the toy fit results), respectively whereas for tan/3 and 
Aq only the order of magnitude can be derived from these measurements. The x 2 distribution of the toy 
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Figure 13: Distributions of the mSUGRA parameters obtained with the fits to pseudo-experiment results. 

fits can be used to evaluate the toy fit performance. The observed mean % 2 = 12.6 ±0.2 for sign jx = 
+1 is compatible with the expected value of Ndof =11- The solutions for the wrong assumption sign 
jU = — 1, also reported in Table 9, cannot however be ruled out as the observed mean x 2 = 15.4 ±0.3 is 
also acceptable. 

10 Conclusions 

If the supersymmetric partners of quarks and gluons exist at a moderate mass scale (< 1 TeV) they will 
be abundantly produced in pp collisions at the LHC centre-of-mass energy of 14 TeV. In this scenario, 
a few fb _1 of ATLAS data will allow the discovery of the new particles [1], once the commissioning of 
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Table 9: Results of a fit of the mSUGRA parameters to the observables listed in Sections 3, 4 and 6 for 
the SU3 point. The mean and RMS of the distribution of the results from the toy fits is reported. The two 
possible assumptions for the digital parameter sign(ju) = +1 sign(/i) = ±1 have been used, resulting in 
different preferred regions for the other parameters. The effect of different assumptions on theoretical 
uncertainties is also shown. 



Parameter 


SU3 value fitted value 


exp. unc. 


sign(M) = +1 


tan/3 


6 7.4 


4.6 


M 0 


100 GeV 98.5 GeV 


±9.3 GeV 


M 1/2 


300 GeV 317.7 GeV 


±6.9 GeV 


Ao 


-300 GeV 445 GeV 


±408 GeV 


sigm» = -l 


tan/3 


13.9 


±2.8 


M 0 


104 GeV 


±18 GeV 


M, 12 


309.6 GeV 


±5.9 GeV 


A 0 


489 GeV 


±189 GeV 



the detector has been completed and the Standard Model backgrounds have been well understood. 

The next step after discovery will be to select specific supersymmetric decay chains to measure the 
properties of the new particles. Here we have focused on those measurements that will be possible using 
1 fb" 1 of integrated luminosity. Specific benchmarks in parameter space have been used to demonstrate 
the precision that can be expected from these measurements, but the same (or similar) techniques can be 
applied to much of the SUSY parameter space accessible with early LHC data. 

For the benchmark points considered, the most promising decay chain involves the leptonic decay 
of the next-to-lightest neutralino (Z2 — > Xi£ + £~). The invariant mass of the two leptons shows a clear 
kinematic maximum (Section 3) which could already be measured with a precision of a few per cent with 
the limited data set considered. The combination of one or both leptons with the hardest jets in the event 
would allow observation of several other kinematic minima and maxima (Section 4). 

For high values of tan /3 the decays into taus will be far more abundant than those involving electrons 
or muons; the excellent performance expected for the identification and measurement of hadronic X 
decays in ATLAS will also allow observation of the dilepton edge in the T + T~ invariant mass distribution 
(Section 5). 

The leptonic decays will not be the only channel for early measurements with supersymmetric de- 
cays. The <7r — > qX\ decay can be used to determine the <7r mass (Section 6). The combination of 
hadronically decaying top quarks and ft-jets in supersymmetric events is also a promising possibility for 
low-scale Supersymmetry with decay chains involving scalar top and bottom quarks, as shown by the 
reconstruction of the edge of the tb invariant mass discussed in Section 7. 

If the %2 — > Xih decay is open, it will provide a substantial source of Higgs bosons. Since the 
Standard Model backgrounds can be suppressed by the usual SUSY cuts, it will then become possible to 
observe the h — > bb decay with moderate (5 fb _1 ) integrated luminosity (Section 8). 

The different channels will provide complementary information about the SUSY mass phenomenol- 
ogy. The various measurements will have to be combined to reconstruct the SUSY mass spectrum and 
attempt to understand the SUSY-breaking mechanism. In Section 9 it is discussed how a selected set 
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of early studies can be combined to obtain the first measurements of supersymmetric masses and of the 
parameters of the mSUGRA model. With 1 fb _1 the reconstruction of part of the supersymmetric mass 
spectrum will only be possible for favourable SUSY scenarios and with some assumptions about the 
decay chains involved. Larger integrated luminosity will help to overcome these limitations, as more 
measurements become possible and the precision of each increases. 
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Abstract 

We investigate the potential of the ATLAS detector to discover new physics 
events containing three leptons and missing transverse momentum. Such final 
states are predicted in a variety of extensions to the Standard Model. In the 
context of supersymmetric models, they could result from direct production of 
gaugino pairs. Using Monte Carlo simulations we present the discovery poten- 
tial for several benchmark Supersymmetry points. We pay particular attention 
to the case where all strongly interacting sparticles are heavy. We investigate 
trigger and reconstruction efficiencies and discuss methods for measuring var- 
ious systematic uncertainties. A solid discovery is expected with an integrated 
luminosity of the order of several inverse fb. If coloured particles are heavy 
direct production of gauginos dominates. In such scenarios, discovery would 
require about an order of magnitude larger luminosity. 

1 Introduction 

If supersymmetric (or other partner) particle production at the LHC is dominated by particles without 
colour charge, then one of the most promising discovery channels is in multi-lepton + missing transverse 
momentum (£"™ 1SS ) final states with little hadronic activity. In this section we investigate the ability of the 
ATLAS experiment to discover new physics in events containing three (or more) leptons - either electrons 
or muons - of which two must have opposite signs but the same flavour (OSSF). We determine the 
sensitivity which would be obtained for discovery of five benchmark points with an integrated luminosity 
of 10 ft- 1 . 

While an analysis of this channel is clearly sensitive to other models, in this section we use Su- 
persymmetry as our example signature, we assume R-parity conservation, and that the lightest SUSY 
particle (LSP) is the weakly interacting %\, which provides the missing transverse energy signal. 

In the supersymmetric case, the final states of interest could come from leptonic decay of pairs of 
heavy gauginos (such as an d X\) through real or virtual W ± , Z° or sleptons to leptons and a pair of 
LSPs. The heavy gauginos may be produced directly, or in the decay of heavier partner particles. 

The primary aim is to make the most realistic determination which is possible at this time of the 
discovery potential in this channel. We also wish to identify the most important Standard Model back- 
grounds, so that analyses can be prepared to measure them in control regions with the ATLAS data. 

Throughout this study we use the inclusive Supersymmetry production Monte Carlo samples de- 
scribed in [1]. The points lie in various regions of mSUGRA parameter space in which the LSP relic 
density is broadly consistent with the observed cold dark matter density. 

The Standard Model backgrounds simulated are listed in Table 1 . The most important backgrounds 
are found to be it , Zb and ZW. Fully leptonic ZW events represent one significant source of events 
containing three leptons and missing energy. Their contribution can be reduced by rejecting OSSF lepton 
pairs with invariant masses consistent with the Z mass. Leptonic decays of tt , and Zb are expected to 
produce two leptons but can generate a third from leptonic b quark decay. These backgrounds have large 
cross-sections, but can be reduced by the introduction of stringent cuts on the isolation of the lepton 
tracks - as we will discuss in Section 3. 

A particularly important benchmark point for the trilepton analysis is the point SU2 [1]. This point 
lies within the 'focus point' region of mSUGRA parameter space which is characterised by very large 
masses for squarks and sleptons and relatively light gauginos. The heavy squarks and sleptons (see 
Figure 1 and Table 2 for the SUSY mass hierarchy at SU2), will have very small production cross-sections 
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Table 1: List of the background samples used, with Monte Carlo generator cross-sections (a), next-to-leading- 
order to leading-order k factors (where known for leading-order generators), average weights ((w)) and corre- 
sponding integrated luminosities. The cross-sections for WW, WZ, ZZ, Zy and Zb are quoted after a filter is 
applied on the generator output requiring at least one lepton with pseudorapidity, | T7 1 < 2.8 and transverse momen- 
tum, pr > 10 GeV. The diboson (WW, WZ, ZZ) samples have a different cross-section than the diboson MC@NLO 
samples (found elsewhere in this volume) because they also include contributions from (and interference with) the 
photon pole. 



Process 


am 


k factor 


<w> 


fdtS? [ftr 1 ] 


WW 


24.5 


1.67 




1.22 


WZ 


7.8 


2.05 




2.98 


ZZ 


2.1 


1.88 


1 


12.7 


Zy 


2.6 


1.30 




2.98 


Zb 


154 




0.66 


0.75 




450 




0.73 


0.92 



at the LHC and make it a difficult region in which to discover SUSY using the analyses described in [2] 
based on the selection of hadronic jets and missing transverse momentum [3]. However, gaugino and 
gluino production will still be abundant, so we expect good discovery potential in multi-lepton events. 
The branching ratios of gauginos for SU2 can be found in Table 3 1 . 



x % g 3 T h - 

0 1 2 3 4 5 

Figure 1: SU2 sparticle mass spectrum. 



Table 2: Particle masses for SU2. 



Sparticle 


Mass [GeV] 


Sparticle 


Mass [GeV] 


tl 


103 




857 


X°2 


160 


«L 


3563 


X3 


180 


Ur 


3574 


Xa 


295 


d L 


3564 


Xx 


149 


d R 


3576 


xi 


287 


h 


2925 


4 


3548 




3501 


4 


3547 


h 


2131 




3546 


h 


2935 



The total cross-section times branching ratio for chargino-neutralino direct pair production and decay 
to a trilepton final state is 32.6 fb. Table 4 shows the contribution from each X ± X° pair production to a 
trilepton final state. The contribution from X^ X T and X° X° pair production to a trilepton final state 
is not tabulated, but also adds a small contribution to the signal. Exclusive trilepton signal for SU2 is 
dominated by the pair production xt Xa followed by X,f —>Xi iv and X2 ^Xi decays. 

The trilepton signal may include contributions both from direct gaugino pair events and from other 
SUSY events. The latter can lead to trilepton final states when cascades initiated by heavier sparticles 
decay via gauginos or sleptons. In a search it is not necessary to distinguish between the various contri- 
butions to new physics and so both classes of event form part of the signal. However in this study we are 
particularly interested in the scenario in which all strongly interacting particles are heavy, since in those 
cases other analyses (requiring jets) will have more difficulty making a discovery. Since we want to be 
sensitive to SUSY even in this harder case, we define the signal in two different ways: 



'Then 



•s and branching ratios were calculated using Isajet v7.71 [4] using a 



»s of 175 GeV. 
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Table 3: Some important branching 
ratios for the benchmark point SU2. 



Sparliclc 


Decay Mode 


B.R. 


%2 


x\i + i- 


7% 




Xi£+£- 


7% 




xfw* 


81% 




X3Z 


12% 


xf 


Xi£v 


22% 


xf 


X2W± 


38% 




xU 


30% 



Table 4: Leading-order cross-sections and num- 
ber of trilepton events for integrated luminosity of 
10 fbT 1 for SU2. 

Production a [fb] Trilepton events /10 fb" 1 



xfXa 1138.0 " 175 

X1X3 679.3 105 

XiXi 51.4 6 

X2X2 58.5 7 

X2X3 61.6 7 

X2X4 310.3 26_ 

TOTAL 326 



• "Inclusive SUSY" : Inclusive supersymmetric particle pair production (any sparticles) 

• "Direct gaugino" : Direct production of charginos and neutralinos only 

Inclusive SUSY represents the signal we would obtain at the benchmark points. Direct gaugino rep- 
resents a more pessimistic scenario, where coloured sparticles are heavy so have no significant LHC 
cross-section. 

2 Lepton selection 

The final selection will require three leptons - electrons or muons - which must consist of an OSSF pair 
and a further third lepton. The initial lepton selection requirements are based on the ATLAS standard 
criteria. The main definitions are summarised in Table 5, and are briefly discussed below. The relatively 
low pj threshold for electrons and muons of 10 GeV in these analyses is an attempt to increase the 
number of trilepton events, despite lower lepton reconstruction efficiencies at low pj . 

• Muons must satisfy pj > 10 GeV and \ t]\ < 2.5, and tracks found in the muon spectrometer must 
match inner detector tracks. For each muon spectrometer track only the best matching track in the 
inner detector is taken. It is required that the % 2 for the match of the muon track to the points on 
the track [5] is less than 100. A primary isolation criterion requires less than 10 GeV of transverse 
energy 2 in the calorimeter in a cone of radius AR = 0.2 around the muon 3 . 

• Electrons must satisfy pj > 10 GeV and \r] \ < 2.5. They must satisfy shower shape and isolation 
requirements as described in [6]. The entire event is rejected if any electron candidate is found 
in the barrel-endcap transition region (1.37< |tj| < 1.52) due to the lower electron identification 
performance in this region. A primary isolation criterion is that electrons must have less than 
10 GeV of transverse energy in an annulus of radius A/?=0.2 surrounding the electron. 

Electrons and muons are then subject to further vetoes as follows: electrons and muons within AR < 
0.4 of a jet are removed from the event, along with OSSF pairs with invariant mass M 0 ssf < 20 GeV, 
which are likely to have been produced from photon conversions or hadronic decays. 

2 Transverse energy, Ej = E sin 6, where E is the energy and 6 is the polar angle relative to the beam direction. 
3 AR = \/Ari 2 +A(j) 2 where A77 is the pseudorapidity difference, with 77 = -logtan(0/2), and A0 is the difference in 
azimuthal angle. 
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Table 5: Selection criteria for muons, electrons and jets. 





Muon 


Electron 


Jet 


p T cut 


> 10 GeV 


> 10 GeV 


> 10 GeV 


77 cut 


|r,|<2.5 


hi < 1.37 
or 1.52 < |r/| < 2.5 


hi < 2.5 


Calorimeter 
Isolation 


|£| < 10 GeV 
in AR = 0.2 


\E\ < 10 GeV 
in AR = 0.2 





2.1 Single lepton selection efficiencies 

Searches for SUSY final states with three leptons require both a high lepton reconstruction efficiency 
and low fake rates. There are few Standard Model processes with similar signatures, but the high cross 
section backgrounds ti and Zb can pass the trilepton requirement as there are both primary leptons in the 
event and a number of jets, in particular b jets which can introduce secondary leptons. Good isolation 
criteria are therefore important in order to reduce the fake rate and thus the background. This section 
presents a study of the lepton efficiency, fake rate and purity for the common object definition which 
includes a calorimeter based isolation criterion as well as the performance of a track-based alternative. 

When collision data are available, the best determination of these quantities will be made from the 
ATLAS data using the methods described in [7]. Since most of our events of interest will have a similar 
environment - isolated leptons and little jet activity - we may expect that the values found in those studies 
should closely match the corresponding quantities in our search. However since those measurements 
require collision data, in this section we present the efficiencies, fake rates and purities as determined 
from Monte Carlo information only. 

The following isolation criteria have been studied: 

• No isolation cut - showed as a reference; 

• Calorimeter-based isolation E^ =Q 2 < 10 GeV. E^ =0 2 is the energy deposited in a cone with 
AR = 0.2 around the lepton candidate; 

• Maximum-/^ track-based isolation p^^max W < 2 / 1 GeV for where Pra^max^) is the 

maximum pj of any track in a AR = 0.2 cone around the lepton; 

• Sum-^T track-based isolation Pj%n£t\,(fy < 4 GeV, where Pj^^\ * s tne sum °f tne Pt °f a ^ 

tracks above 1 GeV in a AR = 0.3 cone around the lepton. 

The lepton reconstruction efficiency is defined to be 

„ match 

n l 

where « match is the number of generator-level leptons matched to a reconstructed candidate and nf c is 
the total number of generator-level leptons. 

In this study generator-level leptons are defined to be charged leptons (I 6 which have come 

from decays of SUSY particles (sleptons and gauginos), Standard Model gauge bosons and tau leptons, 
but do not include leptons from other sources (such as hadronic decays, bremsstrahlung, or photon con- 
versions). No isolation cut has been applied on these generator-level leptons. Generator-level particles 
are matched to reconstructed candidates which have passed kinematics cuts and object selection accord- 
ing to the definition in Section 2. A match is required to be found within AR = 0.02. Figure 2 shows 
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3 No isolation 
k l^" 02 < 10GeV 

■ Kk° 02 {Pj^ <2GeV 



% ' ' ' ' ' i 1 1 1 ^ 



(b) 



.5 2 2.5 



(c) (d) 

Figure 2: The efficiency for electrons (a,b) and muons (c,d) to be reconstructed and to pass various 
isolation criteria, for the SU2 sample. The plots are shown as a function of the pr (a,c) and r\ (b,d) of 
the matched Monte Carlo lepton. 
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(c) (d) 

Figure 3: The fake rate for electrons (a,b) and muons (c,d) in the tt sample as function of the pr (a,c) 
and Tj (b,d) of the fake lepton. 



the reconstruction efficiency for electrons and muons from the SU2 sample using different isolation 
requirements as function of the pj and r\ of the matched Monte Carlo lepton. 

The requirement that leptons should be separated from selected jets by AR > 0.4 (referred to as the 
AR(£,jet) > 0.4 cut) is already a strong isolation requirement which reduces the efficiency by ~ 5% for 
electrons and ~ 15% for muons. For the case of tt events (not shown in the figures) the effect is even 
larger as there are more jets in the events. 

For the muons, one can see that after introducing the Ai?(^,jet) > 0.4 cut, the efficiency decreases 
with increasing pj , while the \r\ \ distribution decreases in the central region \r\ \ < 1.4. A clear drop in 
electron efficiency can be found near \r\ \ rj 1.4 which is the barrel-endcap transition region. 

After applying only the AR(£,]et) > 0.4 cut (referred to as "no isolation" in the plots), the total 
efficiency is (65.5 ±0.5)% and (75.2 ±0.5)% for electrons and muons respectively. The calorimeter- 
based isolation E^ =Q 2 < 10 GeV reduces the electron efficiency by « 3% to (63.6 ±0.5)% while leaving 
the muon efficiency almost unchanged. The track-based alternative has similar effect on both lepton 
flavours causing a loss of about 7 to 8% with respect to the "no isolation" performance. 

The fake rate is defined as 

MC,match 

^TC-' (2) 

"jet 

where w ^ C match j s th e number of reconstructed leptons which are not matched to a generator-level lepton 
and «j^ t c is the number of jets at the generator level 4 . 

Only generator-level jets with \ r\ \ < 2.5 are considered. They are rejected if there is an overlap with 
an electrons within AR < 0.2, but in general include photons, hadronic tau jets and Z? jets. The generator 

4 i.e. jets fount! b> running a jet algorithm over the Monte Carlo generator final state, before detector simulation. 
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jet must have energy, E > 7 GeV. Asymmetric pj cuts are applied to the reconstructed objects compared 
to their Monte Carlo equivalents. For the efficiency calculation p^f c > 10 compared to pj e > 5 GeV, 
while for the fake rate calculation pj MC > 5 and pj e > 10 GeV. This reduces the number of mismatches 
which would be found near the edge of the fiducial region from small mismeasurements of pj . 

Figure 3 shows the fake rate as function of the pj and tj of the fake lepton for the ti sample (one 
of the major backgrounds). The pj distribution is clearly peaked at low values typical for leptons from 
b-jet decays. The tj distribution follows the detector layout with higher fake rate in the transition region 
between the barrel and end caps. 

The AR(£,jei) > 0.4 requirement provides a reduction in the fake rate of about « 80% for muons and 
« 5% for electrons (as compared to leptons passing the standard selection with E^ =Q 2 < 10). With no 
isolation other than the A/?(/x,jet) > 0.4 requirement one obtains a fake rate of (4.1 ±0.1) x 10~ 3 and 
(1.1 ±0.1) x 10~ 3 for electrons and muons respectively. The isolation criteria provide a similar relative 
suppression of the electron and muon fake: « 20% for / c ^ =0 - 2 < 10 and « 74% for p^^ 2 ma7l (£) < 
2/1 GeV for ej\i. The effect of the isolation cuts is very similar in the case of the SU2 sample, but the 
fake rates are almost an order of magnitude lower than in it events. 

We define purity by: 



where ng is the number of reconstructed leptons, and the matching criteria are as described above. The 
lepton selection described in Section 2 provide samples with a purity of 92% (e) and 97% (/x) with very 
similar results for both the SU2 and it samples. However after requiring at least 3 leptons in the event 
there is a clear difference between the two. The purity is almost unchanged for SU2, while, in it trilepton 
events the purity drops to ~ 50% for electrons and ~ 70% for muons. 



3 Event selection 

Events are required to pass either of the two single-lepton triggers, labelled L2_e22i and L2_mu20, which 
are well-suited for the tri-lepton analysis at «£? = 10 31 ~ 32 cm _2 s _1 . A more detailed discussion of the 
motivation for using these triggers can be found in Section 5. 

Distributions of some important event variables for Standard Model backgrounds and for the bench- 
mark point SU2 (both for inclusive SUSY and direct gaugino) can be found in Figure 4. We show the pj 
distributions of reconstructed leptons (electron or muon) and jets. Requiring at least three leptons in the 
event results in mainly it and Zb Standard Model backgrounds, with larger contributions from dibosons 
in four lepton events. It can seen in Figures 4a and 4b that pj distributions of the two hardest leptons 
in three lepton events are similar in SU2 and it , with Zb adding a significant contribution in the low p T 
region. The p T distributions for the third-hardest leptons are plotted in Figure 4c where it can be seen 
that whilst it and Zb are the major backgrounds in the low pj region, there is a large contribution from 
the dibosons across the entire pj range. This is because the third leptons from ti and Zb are soft leptons 
from leptonic ft-quark decay, whereas all the leptons from dibosons are from Z/W ± boson decays. The 
pj distributions of the reconstructed jets are plotted in Figure 4d for events with three or more leptons. 

Since an OSSF lepton pair is expected from the X2 decay, a selection requiring two OSSF leptons is 
applied (i.e. we require e + e~ + £ or jJ. + jJ.^ + £, where £ G e,jj,) 5 . A further cut requires three or more 
leptons in the event. 

A stringent cut on the isolation of the tracks of the leptons is made next, using the P^ckmax variable 
described in Section 2.1. The purpose is to reduce backgrounds from bremsstrahlung, hadron decays and 

5 Trilepton events where all three leptons have the Same Sign, (SS, ^ SL l ± l ± , where (£c,/l) have also been investigated, 
however due to the very low statistics associated with the channel in the signal samples investigated, it is not considered further 
here. 
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photon conversions. The relevant distributions for electrons and muons are plotted in Figure 5a and 5b. 
The plots contain negative entries when no track is found within AR. We require p^^ct max < 1 GeV f° r 
muons and < 2 GeV for electrons. This reduces the number of background events to 23% (tt ) and 37% 
(Zb) of their previous levels, whilst keeping 82% (inclusive SUSY), 86% (direct gaugino) of the signal 
for our SU2 benchmark point. 

The diboson and Zb backgrounds will produce the two OSSF leptons mainly from Z decays, which 
will not be the case for three-body decays of neutralinos. The dilepton invariant mass distribution is 
shown in Figure 6a, (after applying the cuts already described) and shows the expected peak at the Z 
mass. A simple way to reduce the these backgrounds is to discard events which have any OSSF dilepton 
pair with invariant mass in the mass window 81.2 GeV < Mossf < 101.2 GeV. Note that this exclusion 
window also offers an excellent control region in which to measure the size of these backgrounds from 
the data. It is however somewhat model-dependent. Some points in SUSY parameter space preferentially 
decay through real Z bosons, rather than through three-body decays. At those points the OSSF dilepton 
mass distribution would also be strongly peaked at the Z mass. 

A large missing-transverse-momentum cut is used in most SUSY analyses, but in this analysis a 
smaller cut at 30 GeV is applied since in direct gaugino production the two invisible #{'s are often almost 
back-to-back in the transverse plane, resulting in a lower overall Ej lss (Figure 6b). Missing transverse 
momentum is calculated as described in [8]. 

Finally there is the possibility of adding a cut can be made on the hadronic activity in the event. This 
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could be useful in the case where direct gaugino production dominates, since it can be expected to reduce 
the tt background to a greater extent than the signal. Whether this cut is appropriate will depend on the 
SUSY scenario presented by nature, so analyses both with and without this cut will be necessary. 

Selected jets must satisfy p T > 10 GeV and |t]| < 2.5, and are reconstructed based on calorimeter 
tower signals [9] using a seeded cone algorithm with radius AR = 0.4. Jets are initially selected with 
pj > 10 GeV and \t]\ < 2.5. Jets are not considered if they overlap with a reconstructed electron within 
AR < 0.2. 

The pt distribution of the leading jet in the event is plotted in Figure 6c, where it is seen that the 
direct gaugino production has jets with lower pj than the Standard Model backgrounds. The level of 
this cut is chosen at pj > 20 GeV. 

The complete event selection is then: 

1. At least one pair of OSSF leptons (e + e~ or jU + ^t ) 

2. Ne >= 3 (£ G {e,ju}), and where all £{ satisfy the requirements in Section 2 

3. p^ k 2 max < 2 GeV for electrons, pff r =° k 2 max < 1 GeV for muons 

4. No OSSF dilepton pair has invarint mass in the range 81.2 GeV < M 0SSF < 102.2 GeV 

5. Ef ss > 30 GeV 

6. Optional - no jet with p T > 20 GeV 

In Figure 7 we show distributions of the invariant mass of the dilepton pair after all cuts. The 
distributions have been flavour subtracted; the quantity plotted is 

„OSSF_ B 05SF > (4) 

where « OSSF is the number o f eve nts in the signal selection (containing OSSF dilepton pairs) as described 
earlier in this section, and « OSSF is the number of events in which there are three leptons, but no OSSF 
pair. The non-OSSF events give an indication of the expected background size since many background 
sources do not necessarily produce OSSF pairs. 

4 Discovery potential 

The numbers of events and the resulting significance at each stage of the analysis are listed in Table 6 for 
the benchmark point SU2. We use the following definition of signal signficance, 




(5) 



where S is the number of signal events and B is the number of background events. After applying the 
event selection above (including the jet veto), 29 events remain for SU2 inclusive SUSY all of which are 
direct gaugino, with an expected background of 210 events (mainly ZW production). This corresponds 
to y of 1.87 for 10 fb _1 . This yields a 5a discovery signal after ~80 fb _1 of integrated luminsity. 
With the jet veto selection, ZW is the dominant remaining Standard Model background. Without the jet 
veto, 177 signal events (95 of them direct gaugino) remain. Statistical significances of 5.94 and 3.34 are 
found for the inclusive SUSY and direct gaugino signals respectively. 

The expected statistical significances for various mSUGRA benchmark points are summarised in 
Table 7 excluding the jet veto unless indicated otherwise by "SUx+JV". 
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Figure 7: Distributions of the OSSF dilepton invariant mass after all selections have been applied 
(a) without the jet veto and (b) including a jet veto. 



Table 6: Numbers of events and statistical significance as selection is applied for the benchmark point 
SU2, for integrated luminosity of 10 fb _1 . 



Kinematic Cut 


No Cuts 


N L >=2 


OSSF 


N L >=3 


'1 racklsol 


inn 




JetVeto 


SU2 gauginos 


64.0k 


1647 


1108 


178 


153 


120 


95 


29 


SU2 other 


7081 


776 


353 


127 


95 


85 


82 


0 




4.41M 


234k 


104k 


2812 


634 


507 


476 


42 


ZZ 


38.2k 


10.4k 


9984 


580 


476 


57 


13 


6 


zw 


156k 


17.2k 


14.5k 


1910 


1682 


322 


218 


154 


WW 


400k 


22.7k 


10.7k 


25 


8 


8 


8 


8 


Zy 


32.8k 


7184 


6970 


91 


27 


7 


3 


0 


Zb 


1.59M 


57.4k 


559k 


6523 


2409 


386 


0 


0 


inclusive SUSY 




2.60 


1.74 


2.76 


3.36 


5.31 


5.94 


1.87 


direct gaugino ./ 




1.77 


1.32 


1.61 


2.09 


3.20 


3.34 


1.87 



The discovery prospects for inclusive SUSY, reflected in columns titled "SUx", are rather encourag- 
ing: a 5a discovery can be expected with several fb _1 of integrated luminosity 6 . The direct gaugino pair 
production is highlighted in columns marked as SU2# and SU3#. Both for SU2 and SU3 we see a drop 
in significance which roughly corresponds to the fraction of the direct gaugino production compared to 
the total SUSY cross-section. In fact, the drop is somewhat higher, since the pj spectrum tends to be 
softer for direct gaugino pair production compared to strong SUSY production with its characteristic 
decay chains. Here, a discovery can be expected with several tens of fb -1 . 



5 Lepton trigger study 

Unlike most SUSY searches, in this channel we cannot rely on jet or £"™ ss triggers. We have studied a 
variety of triggers at the second level 7 , after the first level trigger has been applied. Other studies [10, 11] 

6 Taking into account statistical errors only. Systematic uncertainties are discussed in Section 6.5. 

' ATLAS lias 1 lircc trigger levels: a first level (LI) hardware trigger, a software-based second level trigger (L2) that examines 
regions of interest within the decector, and finally a software-based event filter (EF). More details about the electromagnetic [10] 
and muon [11] triggers may be found elsewhere in this volume. 
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Table 7: Discovery potential, and integrated luminosity required for 5a discovery. The jet veto is only applied in 
columns headed '+JV . For a fuller description of the notation, see the text. 





SU1 


SU2 


SU3 


SU4 


SU8 


sm x 


sm x 


SU2+JV 


SU3+JV 


y,w ft.- 1 


7.7 


5.9 


17.2 


69.3 


1.9 


3.3 


1.6 


1.9 


1.4 


JdtJf for5(J 


4.2 


7.1 


0.8 


0.1 


70.5 


22.4 


92.9 


66.9 


119.3 



Table 8: Fraction of events triggered at L2 at three selection stages of the selection for the heavy coloured 
sparton scenario in SU2 (first block), the direct gaugino production in SU3 (second block), and the 
inclusive SU3 signal (third block). "U" stands for the OR-combination of L2_e22i and L2_mu20. 



Selection 




sm x 






SU3 X 






SU3 incl. 


Stage 


L2_e22i 


L2_mu20 


u 


L2.e22i 


L2_mu20 


u 


L2.e22i 


L2_mu20 U 


OSSF pair 


41% 


54% 


89% 


42% 


54% 


92% 


51% 


51% 94% 


OSSF+3 rd ^ 


58% 


67% 


93% 


59% 


63% 


95% 


66% 


68% 98% 


after all cuts 


57% 


66% 


92% 


58% 


57% 


94% 


66% 


64% 97% 



demonstrate that objects passing L2 have a high efficiency for passing EF. 
As a figure of merit, the fractions of triggered events 

• after selecting for an OSSF pair; 

• after requiring a further third lepton; 

• after all cuts, including the jet veto; 
are studied. 

We have identified two single-lepton triggers, labelled L2_e22i and L2_mu20, that are well-suited for 
the tri-lepton analysis at Jgf = 10 31 ~ 32 cm _2 s _1 . At high luminosity it is planned to have corresponding 
triggers with additional isolation criteria (e22i_tight and mu20i) which will remain unprescaled. The 
resulting event-triggering efficiencies are listed in Table 8. OR-combined, they have «92% probability 
for direct gaugino production for the benchmark point SU2. They have sufficiently high pj thresholds 
to be easily studied with leptonic Z decays. 

The same trigger set was studied for the direct gaugino and inclusive SUSY for another bechmark 
point, SU3, that has lighter squarks than SU2. Again the OR-combination of L2_e22i and L2_mu20 
shows a good performance of ~92, 95, 94%. Their individual L2 event efficiencies are summarised in 
the middle block of Table 8. 

In addition we have studied the performance of the L2_e22i and L2_mu20 triggers for the tri-lepton 
analysis, outside of the heavy coloured sparton scenario, for inclusive SU3 production. Again in this 
case we find L2 event efficiencies of around ~94, 98, 97%. 

The efficiency is relatively high for lepton triggers even though the trigger thresholds are fairly high 
compared to the offline cuts on lepton transverse momenta. The reason for that is simple combinatorics: 
since the 3 leptons are ordered by their transverse momenta, it is likely that the leading lepton has a high 
pj if the third one passes the offline threshold. On the other hand, cases where all the three leptons are 
below 20 - 30 GeV but above 10 GeV are unlikely. 

We have investigated how the leptonic trigger efficiencies for the inclusive SUSY and direct gaugino 
for the benchmark point SU2 change as a function of the progressive event selection stages described 
in Section 3. In both cases, the trigger efficiencies reach their approximate maxima after the three- 
lepton selection stage. Beyond this cut stage, the efficiency values plateau, indicating that our event 
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selection requirements should not bias signal trigger efficiencies. It can be concluded that the L2_e22i and 
L2_mu20 single lepton triggers provide a good performance for the tri-lepton analysis in the early days 
of ATLAS running at Jz? = 10 31 cm~ 2 s _1 . Di-leptonic triggers with lower thresholds like 2el2i, 2mul0, 
and el5i_mul0 can be used to recover events where all three leptons have low transverse momenta around 
20GeV. 

6 Systematic uncertainties 

The dominant sources of systematic uncertainty for this search are different to other SUSY search chan- 
nels that focus on final states containing jets. For our multi-lepton search the main sources of uncertainty 
in the backgrounds are described below. 

6.1 Background rates 

The production rates for the majority of background processes such as diboson production and it are 
known at the parton level at better than next-to-leading order. However, it is generally better to deter- 
mine the rate of these backgrounds from the ATLAS data themselves, reducing any uncertainty from 
luminosity, PDFs and other systematics (e.g. from acceptances, efficiencies etc). The background rate 
measurements should be made in "control regions" in which they dominate. For the WZ background a 
sensible control region is the region of phase space where the OSSF lepton pair has an invariant mass 
near the ntz [12]. For the it background, one would examine single lepton and OSSF dilepton chan- 
nels [13]. Any statistical uncertainty in the background measurements forms a systematic uncertainty 
in our analysis. The expected integrated luminosities for cleanly measuring each background and the 
resulting statistical uncertainties can be found in Table 9. 

Any further systematic uncertainties in the background rates are not considered here so as not to 
double-counting their effects 8 . One must be careful that the extrapolation from the 'control' to the 
'measurement' region of parameter space is well understood. The extrapolation for the backgrounds 
involving the Z peak relies on the Z line-shape, which is theoretically well-known. The extrapolation 
from dilepton to trilepton final states for it relies on a good knowledge of the rate at which b quarks 
(from top decays) produce isolated leptons. A method of determining this rate from the ATLAS data is 
presented in Section 6.3. 

Table 9: Expected rates to cleanly select background samples in control regions, and the corresponding 
statistical uncertainties. 



One can see from Table 6 that, after event selection and jet veto, the WZ and it backgrounds are most 
significant in the signal region. The systematic uncertainty in measuring the rate of the backgrounds in 

8 In fact, uncertainties on background rates that positively correlate between the selection region and the control region will 
actually tend to cancel. 



Background Statistical uncertainty Reference 
after 10 ftT 1 [%] 
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the control regions is expected to be of the order of a few percent maximum for the WZ and even smaller 
for the tt (Table 9). Our estimate for the total systematic uncertainty due to statistical fluctuations in 
the background control sample is therefore dominated by the WZ background. When multiplied by the 
fraction of the final background that comes from WZ, this generates 1.9% (0.8%) uncertainty in the 
background rate for the analysis with (without) the jet veto. 



6.2 Trigger and reconstruction efficiencies 

Lepton trigger and reconstruction efficiencies can be determined using the "tag-and-probe" method for 
electrons and muons coming from Z — > 11 [7]. 



Table 10: Expected systematic uncertainties on background rates from other ATLAS measurements using 
the tag-and-probe method, and resultant estimated uncertainties for this analysis. The label 'reco' refers 
to combined reconstruction and selection efficiencies. 



Source Tag-and-probe [7] This analysis (10 fb l ) 
(H±, lfb" 1 ) \i± 3£ ± 



e (trigger) 
]l (trigger) 
e (reco) 
\l (reco) 



<1% 
0.4% 
0.5 % 



} 0.5 % 



The precision with which lepton trigger and reconstruction efficiency can be determined has been 
studied in [7] for an integrated luminosity scaled to 1 fb -1 . The statistical uncertainty on the trigger 
efficiency for 1 fb -1 is very small (<C 1%) and is negligible for 10 fb -1 . The systematic uncertainties 
in determining the trigger efficiencies are shown in Table 10. The trigger efficiency uncertainty for our 
analysis, for which the highest-/? T lepton has similar kinematics to the sample used in [7], is therefore 
estimated to be < 0.5% for /d/ JSf = 10 fb -1 . 

Reconstruction and selection efficiencies have also been studied using a similar method [7]. The 
estimates of the uncertainties in the efficiencies are also contained in Table 10. We have increased the 
expected uncertainty in the electron reconstruction efficiency from the "tag-and-probe" value of 0.5% to 
our own estimate of 1% to reflect a somewhat larger uncertainty for our lower pi electrons. 

In the final column of Table 10 we estimate the resulting systematic uncertainty for the trilepton 
selection efficiency of this analysis. Since we propose using single lepton triggers (Section 5), only 
the highest p T lepton contributes, and our event trigger uncertainties are the same as the single-lepton 
trigger ones. All three leptons are assumed to contribute to reconstruction and selection uncertainties, 
with equal contributions from electrons and muons. Our resulting uncertainty from combined trigger, 
reconstruction and selection efficiencies is 2.3%. 

For this study we assume that similar uncertainties will apply with and without the jet veto, but we 
note that there may be larger systematic uncertainties in lepton efficiencies if jets are permitted in the 
final state. 

6.3 Lepton fake rates 

The tt and Z + b backgrounds contribute to the trilepton final state when (along with a lepton pair 
from dileptonic tt decay or Z decay) a B hadron decays leptonically, producing a third isolated lepton. 
The third lepton requirement reduces each of these backgrounds by a large factor - about two orders 
of magnitude in each case. About one order of magnitude of reduction can be accounted for by the 
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branching ratio of B hadrons to leptons, while the other factor of about ten arises from our rejection 
of non-isolated leptons. We are therefore sensitive to the rate at which leptons from b decays pass the 
isolation criteria, which we denote Rb^e (it should be understood to include leptons both from direct 
b — ► £ and also from b — ► c — > £). 

We hereby propose a method by which Rb^e could be measured in the ATLAS data, and estimate 
the remaining systematic uncertainty after that measurement. The control sample we suggest is semi- 
leptonic tt decays in which a lepton from one (probe) b decay generates a same-sign dilepton pair. To 
cleanly select the control sample we would require events with tt kinematics {e.g. consistent invariant 
masses for t and W) and one clean vertex tag from the other (probe) b jet. This sample has the further 
advantage that the b quarks have the correct kinematical distributions for the background of interest 
(dileptonic tt ). The same-sign dilepton requirement removes contamination from normal dileptonic tt 
decays since the latter will produce opposite-sign lepton pairs. 

Dedicated studies [13] suggest that we can expect to cleanly select about = 5 x 10 4 semi- 

leptonic tt events (e and }i combined) for $&tJ£=\0 fb _1 . The fractional statistical uncertainty, 8, that 
we could expect from the proposed measurement of Rb^e is then, 

S(R^i) = , R = • (6) 

From examination of the event record in our Monte Carlo samples, we found Rb^e. to be approximately 
5 x 10~ 3 . If a similar rate were to be found from measurements of the semi-leptonic tt events, then the 
corresponding value for S(Rb^e) would be 6%. 

With a jet veto, tt forms about 20% of the background after full selection (and Z + b a very small 
contribution), and there is a resulting ps 1.2% uncertainty in the total background. Without the jet veto, 
the tt contribution is much larger (about 66%) and a 4% systematic uncertainty results. 



6.4 Jet and missing energy scales 

The global uncertainty on the jet energy scale is currently conservatively expected to be about 5%, but 
the true value is difficult to determine without collision data. In events in which the missing transverse 
momentum is dominated by hadronic activity, the fractional uncertainty in £"™ 1SS will be of a similar 
size, since the two measurements will be highly correlated. We therefore determined the effect of a 5% 
systematic uncertainty in the missing energy when no jet veto is used. 

For the analysis which includes a jet veto, we expect missing energy scale uncertainty to be rather 
smaller than 5%, since the majority of the missing energy will recoil against the (well-measured) leptons. 
We also assume that in this case the systematic uncertainties in the jet energy scale and the missing energy 
should also be largely uncorrected (for the same reason). Measurements of recoil of jets against Z bosons 
and photons allow the missing energy resolution to be well-determined as a function of jet energy [8]. 
Any residual uncertainty in the missing energy scale is estimated by us to be about 2% at low hadronic 
energies (based on systematic differences between methods in [8]). 

For the jet-veto analysis, we determined the effects of a 5% variation in the hadronic energy scale 
and (an independent) 2% variation in the missing energy scale. The results are shown along with the 
systematic uncertainties from other sources in Table 11. 

As well as the uncertainty from the energy scales, there can be some contribution to the uncertainty 
in the number of events failing the jet veto from our lack of knowledge of initial-state radiation (ISR) in 
electro-weak processes (such as WZ production). However the ISR spectrum can be readily determined 
from other electroweak production control regions (as described in [14] for single electroweak gauge 
boson production). The resulting statistical uncertainties in those control measurements will be small, 
and the systematic effects tend to cancel with those described in this section, so no additional systematic 
uncertainty is added here. 
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6.5 Summary of systematic uncertainties 

Table 11: Estimates of the dominant uncertainties in the background determination for fdt^f = 10 fb 



Source 


Uncertainty 
No jet veto With jet veto 


Background production rates 


0.8% 


1.9% 


Lepton Efficiency 


2.3% 


2.3% 


Fakes (Rb^e) 


4.0% 


1.2% 


Hadronic energy scale 




1.8% 


Missing energy scale 


1.5% 


1.0% 


Total systematic 


4.9% 


3.8% 


Statistical 


3.7% 


6.9% 


Statistical + Systematic 


6.2% 


7.9% 



Our best current estimates of the various sources of uncertainty are summarised in Table 11. One can 
see that with the full selection, including the jet veto, and if the supporting measurements can be made 
with the precision expected, then the SUSY-search analysis will be limited by statistical fluctuations in 
the background samples (about 6.9%) rather than by systematic sources of uncertainty (about 3.8%). 

Without the jet veto, the statistical uncertainty is smaller, however the systematic contribution to the 
total uncertainty increases to 4.9%, largely because we have an increased sensitivity to Rb^i since the 
top background is much larger if no jet veto is applied. 

7 Conclusions 

The ATLAS experiment will have sensitivity to new physics, including supersymmetry, in events with 
three leptons in association with missing transverse momentum. For most of the SUSY benchmark 
points studied, a discovery of new physics could be made in this channel with integrated luminosity of 
several fb -1 . Models in which all strongly interacting partner particles are heavy would have smaller 
cross-sections, but could also be discovered with integrated luminosity of the order of several tens of 
fb- 1 . 

The major sources of systematic uncertainty were indicated and methods for determining these from 
data discussed. While the actual sizes of these uncertainties can only be estimated reliably with collision 
data, estimates based on existing information were presented. In particular, knowledge of the rate at 
which which b jets lead to seemingly-isolated leptons was found to be an important element in under- 
standing Standard Model backgrounds. A method of measuring this rate from the data using tt events 
was proposed. 
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Supersymmetry Signatures with High-/? r Photons or 
Long-Lived Heavy Particles 

Abstract 

In certain Supersymmetry breaking scenarios, characteristic signatures can be 
expected which would not necessarily be found in generic SUSY searches for 
events containing high pj multi-jets and large missing transverse energy. This 
paper describes the expected response of the ATLAS detector to four signa- 
tures: high-^T photons which may or may not appear to point back to the 
primary collision vertex and long-lived charged sleptons and R hadrons. Such 
processes often have the advantage of small Standard Model backgrounds 
and their observation could provide unique constraints on the different SUSY 
breaking scenarios. Using these signatures discovery potentials are estimated 
for either Gauge-Mediated Supersymmetry Breaking or Split-Supersymmetry 
scenarios. Using Monte Carlo samples of SUSY and background processes 
corresponding to integrated luminosity of about 1 fb _1 we study all aspects 
of the analysis, including the expected trigger response and offline data recon- 
struction. 

1 Introduction 

Supersymmetry (SUSY) is one of the most widely investigated theories of physics beyond the Standard 
Model [1-4]. Searches for signatures of new physics processes predicted within SUSY models are thus 
central to the physics program of the ATLAS experiment [5], which will be sensitive to the production 
of SUSY particles with masses up to several TeV [6]. To facilitate the exploitation of early LHC data, 
a number of preparatory studies have been undertaken by the ATLAS SUSY group to estimate the re- 
sponse of the ATLAS detector to a variety of physics processes and to optimise and test the software 
which is necessary for the analysis of collider data. As a part of this exercise, the ATLAS SUSY group 
has performed a number of studies. Techniques to estimate Standard Model backgrounds have been de- 
veloped [7, 8] and calculations have been made of the discovery potential for generic inclusive SUSY 
signatures [9], based largely on the minimal SUGRA model [10-14] for which the principal observables 
are high-/?x jets and missing transverse energy. However, there exist a number of SUSY scenarios which 
predict specific final state topologies which may not be observed by generic searches, such as prompt 
photons and long-lived stable massive particles. In this work, the response of the ATLAS detector to 
such signatures is studied and, where appropriate, calculations are made of discovery potentials for early 
LHC running for which an integrated luminosity of around 1 fb~ ! is assumed. Although the work is 
performed within the framework of SUSY searches, the techniques which have been developed are more 
generally applicable to searches for new phenomena. 

The theoretical models used in this work are based on the minimal Gauge Mediated Supersymmetry 
Breaking (GMSB) model [6, 15-21] , the Split-SUSY model [22-25] and the gravitino LSP model [26]. 
A description of the models is given in Section 2. 1 while specific choices of parameters are found in the 
sections devoted to each of the studied signatures. Four signatures are investigated and are described 
below. 

Two high-;?x photons + £™ ss : In a GMSB scenario in which the next lightest supersymmetric particle 
(NLSP) is the X\ , two high-px photons are expected in each signal event each arising from the decay 
of a X\ to a G and photon. Such events with two isolated high-px photons plus large Ej lss have 
small Standard Model backgrounds and a high mass discovery reach is expected even in the early LHC 
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running. In the high mass regime, however, the production cross-section for SUSY events and the signal 
is of the same order as the instrumental background which includes misidentified photons arising from 
electrons or jets, and the irreducible background of radiation from leptons. Earlier experiments have 
exclusions limits of 93 GeV in X\ mass and 167 GeV in X,f mass [27]. 

Non-pointing photons: In certain GMSB scenarios, the Xi could be relatively long-lived. When the 
decay length is comparable to the size of ATLAS inner-detector, high-^T photons could enter the elec- 
tromagnetic calorimeter surface at large incident angles with respect to a pointing direction to the beam 
interaction point. An estimation of how the photon reconstruction and identification efficiency is de- 
graded for such non-pointing photons is essential in any measurement of the X\ lifetime, which is di- 
rectly connected to the SUSY breaking scale. Current lower limits on the mass and lifetime are 101 GeV 
and 5 ns, respectively [28]. 

Stable sleptons: Stable 1 heavy charged sleptons appear in certain regions of parameter space in GMSB 
scenarios. This signature consists of a penetrating charged track. Since interactions with detectors are 
ionizations only, the observed tracks will look more like muons, except for their higher energy deposition 
and longer time of flight than muons. The ATLAS muon system provides a time of flight measurement 
with an excellent time resolution (c to f ~ 0.7ns) [29], which allows for a precise particle mass mea- 
surements for slow particles. Owing to the very high LHC bunch crossing rate, the development of an 
appropriate triggering scheme is critical to the ensuring the detection of such particles. Previous experi- 
ments have provided a lower limit of around 105 GeV in slepton mass [30] 

Stable R hadrons: Stable massive supersymmetric hadrons (R hadrons) are predicted in Split-SUSY 
models or in the gravitino LSP scenario of SUGRA models [26]. The signature of R hadrons is sim- 
ilar to that of stable sleptons although multiple nuclear interactions before reaching the muon system 
lead to characteristic event topologies, such as the appearence of high-px tracks in the muon system 
with no matching track in the inner-detector, or the electric charge flipping between the inner-detector 
and the muon system. Lower mass limits of around 200 GeV have already been established by other 
experiments [31]. 

This paper is organized as follows. Section 2 provides an overview of the different models and a 
description of the Monte Carlo datasets which were used in this work. Sections 3-6 describe the studies 
of signatures described above. Finally, a conclusion is given in Section 7. 

2 Overview of the SUSY models and Monte Carlo datasets 

The signal Monte Carlo data samples used in this paper are based on the specific supersymetry breaking 
scenarios. These models are described in Section 2.1. Some technical details and parameter values used 
in individual datasets are summarized in Section 2.2. Common background samples are used which 
are described in the introduction to this chapter [32]. In some cases, fast simulation (ATLFAST-I [33]) 
samples are also used when estimating the background contributions. 

2.1 SUSY scenarios considered in this paper 

• GMSB (Gauge-Mediated Supersymmetry Breaking) scenarios: in GMSB, SUSY breaking which 
takes place in hidden sector is transmitted to visible MSSM fields through a messenger sector 
whose mass scale is much below the Planck scale (M m <C Mp) via the ordinary Standard Model 
gauge interactions. The gravitino is very light (in general <C 1 GeV) and is always the LSP. In 
the minimal model of GMSB, all Supersymmetry breaking interactions are determined by a few 

'The term stable is used throughout this chapter to refer to particles which, although they may possess finite lifetimes, do 
not decay during their traversal of the ATLAS detector. 
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parameters. The phenomenologies studied in this paper are based on this minimal model. Discus- 
sions of the non-minimal GMSB models are found elsewhere [34,35]. 

The squarks, the sleptons, and the gauginos obtain their masses radiatively from the gauge inter- 
actions with the massive messengers; their masses therefore depend on the number of messenger 
generations, N5 (the index 5 comes from the fact that the messenger fields form complete SU (5) 
representations). 

The gaugino masses scale like N5 while the scalar masses scale like y/Ns. Hence for N5 = \, the 
NLSP is the lightest neutralino (X\ ) which decays into photon and a gravitino (G ). For N5 > 2, 
the NLSP is a charged stau (fi). When tan/3 is not too large, the mass splitting between the %\ 
and the right-handed selectrons, smuons (£r, jUr) is small, rendering them co-NLSP's which decay 
into leptons and gravitinos. In contrast, in the large tan /3 region, the stau is the sole NLSP. 
The effective SUSY breaking order parameter F5, felt by the messengers, may not coincide with the 
intrinsic underlying SUSY breaking order parameter F, which determines the coupling strength. 
When Fs is smaller, the NLSP decay length becomes longer. The dimensionless factor Cq = 
F/F$(> 1) is introduced to control the NLSP decay length with all the other parameters fixed. The 
decay length is proportional to the square of the control parameter, i.e. scales as C 2 G . 
The effective visible sector SUSY breaking parameter, A(= Fs/M m ) sets the overall mass scale for 
all the MSSM superpartners, which scales linearly with A. On the other hand, these masses only 
depend logarithmically on the messenger scale M m . The MSSM masses are therefore predomi- 
nantly determined by the scale A. 

• Split-SUSY scenario: Stable exotic hadrons feature in a number of SUSY scenarios. Split-SUSYis 
one such model. Within this approach the hierarchy problem and the fine tuning of the Higgs mass 
is accepted, or assumed to be set by another, as yet unknown, mechanism. Phenomenologically, 
within Split-SUSY scenarios the gauginos and higgsinos have light masses of order the weak 
scale, which are protected by the chiral symmetry while the scalars have a mass scale m s which 
can be near the GUT scale. Since gluino decays proceed via internal squark lines, gluinos can be 
meta-stable. A meta-stable gluino will form a bound state, a so-called /?g-hadron. 

• A gravitino LSP and a stop NLSP scenario: in addition to stable gluinos, meta-stable stops are also 
features of some SUSY models. Here, the stops are usually the NLSP and decay to a gravitino LSP 
with gravitational strength interactions. The generic possible candidate for NLSP is the lightest 
stop t\, which, like the gluino case in split SUSY scenario, would form stable bound states, denoted 

2.2 Parameter values used in this work 

The basic feature of the samples used in this paper are similar to those introduced in [32]. However, 
our studies also investigate sparticles which decay with a measurable decay length, and new techniques 
have been introduced to allow these to be simulated. As in Ref. [32], ISAJET7.74 [36] was used to 
generate the sparticle mass spectrum in the context of minimal GMSB scenarios. The leading order (LO) 
and the next-to-leading order (NLO) cross-sections were assessed independently using PROSPINO2.0 
[37-39]. The mass spectrum tables from IS AJET were used by HERWIG/JIMMY [40,41] to generate 
the sparticle cascade decays, parton showers, hadronisations, and underlying events. The HERWIG 
option pltcut (lifetime threshold above which the Herwig does not decay the particles) was set to 3.3 x 
10~ n s, hence sparticles with finite decay length are not decayed at event generator level 2 . In the detector 

2 the value is set in order not to decay K° and A at event generator level, but at detector simulation level. 
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Table 1: Summary of the neutralino NLSP samples. Dataset GMSB1 is a prompt photon decay sample, 
while dataset GMSB2 and GMSB3 are the non-pointing photon samples. N5 = l,tanjS = 5,sgn(/x) = + 
are used at each point. 



name 


NLO (LO) 0 [pb] 


A [TeV] 


M m [TeV] 


C G 


ct [mm] 


M^o [GeV] 


GMSB1 


7.8 (5.1) 


90 


500 


1.0 


1.1 


118.8 


GMSB2 


7.8(5.1) 


90 


500 


30.0 


9.5 • 10 2 


118.8 


GMSB3 


7.8(5.1) 


90 


500 


55.0 


3.2 -10 3 


118.8 



Table 2: Summary of the slepton NLSP sample. N 5 = 3,tanj3 = 5,sgn(/x) = +, and no decay of slepton 
is assumed. 



name 


NLO (LO) a [pb] 


A [TeV] 


M m [TeV] 


M fl [GeV] 


GMSB5 


21.0(15.5) 


30 


250 


102.3 



Table 3: 7?-hadron samples. Dataset R-Hadronl - R-Hadron6 are the Rg samples, while dataset R- 
Hadron7 - R-Hadron9 are the R; samples. 



name 


NLO (LO) cross-section [pb] 


sparticle 


Mass [GeV] 


R-Hadronl 


567 (335) 


8 


300 


R-Hadron2 


12.2 (6.9) 


8 


600 


R-Hadron3 


0.43 (0.23) 


8 


1000 


R-Hadron4 


0.063 (0.033) 


8 


1300 


R-Hadron5 


0.011 (0.006) 


8 


1600 


R-Hadron6 


0.0014 (0.00075) 


8 


2000 


R-Hadron7 


11.4 (7.8) 


I 


300 


R-Hadron8 


0.27 (0.18) 


t 


600 


R-Hadron9 


0.010 (0.0064) 


t 


900 



simulation phase, the X\ decaying into G and y, decay length was passed to the ATLAS interface to 
GEANT4 and the sleptons (fi,<?R and /Ir), interaction with detector materials, i.e. ionization loss was 
implemented. 

In/Miadrons scenarios, the masses of the stable sparticles (Mg,M f ) fully determine the phenomenol- 
ogy and other supersymmetric parameters (other sparticle masses) do not enter into any calculation. The 
generated samples are therefore highly model independent, and the only parameter that varies is the mass 
of the stable sparticle. 

Tables 1-2 summarize the properties of the Monte Carlo signal samples used in this note. Tables 
1 and 2 describe the GMSB samples with a neutralino or slepton NLSP, respectively, while Table 3 
summarizes the /?-hadron samples. 

3 Discovery potential of GMSB SUSY with photon signatures 

In GMSB models with N5 = 1 and low tan/3 the lightest neutralino X\ is the NLSP and decays to a grav- 
itino G and, in scenarios where the neutralino is mainly a photino, to a photon. Therefore the standard 
SUSY decay cascade of squarks and gluinos is extended by the decay X\ jG, as shown in Figure 1. 
Depending on the branching ratios of the squarks and gluinos decaying to various types of SUSY parti- 
cles, this decay chain may also contain multiple jets. Events with two high energy photons are expected 
in pp collisions, if the NLSP lifetime is not too long (C grav ~ 1). These photons originate close to the 
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Figure 1: Typical SUSY decay chain for a neutralino NLSP decaying to a photon and a gravitino. 

primary interaction vertex ("prompt photons"). The corresponding event signatures and the discovery 
potential for these models are discussed in this section. The case of large C grav and therefore long lived 
neutralinos, resulting in non-pointing photons, is discussed in Section 3.3. For detailed reconstruction 
and trigger studies we consider the GMSB1 model point as a typical example (see Table 1). For this 
point the branching ratio of the decay of the lightest neutralino to a photon and a gravitino is ~ 97%, and 
the total SUSY production cross-section is ~ 7.8pb. 

In the following we discuss the optimisation of the signal selection, including the trigger selection, 
the expected background from Standard Model processes and a detailed study of the discovery potential 
with early data. Since for the latter a fast simulation approach is used, a comparison of fast and full 
simulation results is also presented. 

3.1 GMSB1 full simulation studies 
3.1.1 Signal trigger strategy 

The signal events studied here possess the standard SUSY event properties at the LHC: large E™ 1SS and 
multiple jets with high pj. These can be used for triggering the events. In addition, the feature of two 
high energy photons gives an additional way to trigger on these events independently of E™ 1SS and jet 
triggers. In the following we consider two different trigger menus: 

• The first menu is the ATLAS initial menu foreseen for data-taking at a luminosity of 10 31 cm~ 2 s _1 
[42]. This menu includes various combinations of jet (J) and E™ ms triggers (XE) and photon 
triggers (EM) as shown in Table 4. No prescale values are foreseen for these triggers, whereas 
the selection of the EM 100 trigger is based entirely on the Level- 1 (LI) trigger, with no selections 
envisaged at the Level-2 (L2) trigger or in the event filter. In the following, the trigger efficiency is 
defined as the ratio of the number of events without any preselection passing a trigger item divided 



p Tyl [GeV] 

Figure 2: The g55 event filter trigger efficiency (see text) for the GMSB1 sample as a function of the 
reconstructed pj of the leading photon. 
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by the total number of events. The total efficiencies for the LI trigger for the various triggers of 
this initial menu are summarised in Table 4. It can be seen, that in addition to the standard SUSY 
triggers based on jets and E™ 1SS , the photon triggers have very high efficiencies for selecting 
GMSB1 events and can be used for initial running. 

• The second menu investigated here is the standard ATLAS trigger menu foreseen for the stable 
running at a luminosity of Jz? = 10 33 cm _2 s _1 . In this menu all three trigger levels are used in the 
selection. The triggers investigated are combinations of jet (j), Ej lss (xE) and photon (g) signa- 
tures, see Table 5. Again, no prescale values are foreseen for these items. The trigger efficiencies 
for the signal at LI, L2 and event filter are summarised in Table 5, where the L2 and event filter 
efficiencies also contain the efficiencies of the previous levels. It can be seen that for both of the 
planned running phases the photon triggers are as efficient as the £"™ 1SS and jet triggers for the 
case of GMSB 1 signal events. The efficiency for the g55 photon trigger after the event filter as a 
function of the reconstructed p T of the leading photon is shown in Figure 2. As expected, after a 
steep turn on around 55 GeV a plateau is reached. The integrated efficiency above the threshold of 
55 GeV is ~ 98%. 



Table 4: Trigger efficiencies and statistical errors for the GMSB1 event sample for (J§? = 10 31 cm 2 s l ). 



Trigger item 


Efficiency 


Trigger item 


Efficiency 


2EM13 
EM 100 
EM18+XE15 
J100 


98.71 ±0.11 
83.00 ± 0.38 
98.79 ±0.12 
91.43 ±0.28 


XE70 
J70+XE30 
2J42+XE30 
4J23 


81.39 ±0.39 
93.64 ± 0.25 
93.98 ± 0.24 
92.27 ± 0.27 



Table 5: Trigger efficiencies and statistical errors for the GMSB 1 event sample for (Jzf = 10 33 cm 2 s l ). 



Trigger item 


LI 


L1+L2 


L1+L2+EF 


g55 
2gl7i 
j65+xE70 
3j65 


97.18±0.60 
71.13±1.65 
80.66±0.40 
83.63±0.37 


84.47 ±1.32 
55.07±1.81 
80.63±0.40 
83.55±0.37 


80.47±1.44 
47.91±1.81 
69.53±0.46 
83.37±0.37 



In summary it can be said that in the GMSB 1 scenario the use of photon triggers is possible for initial 
running conditions, as well as at a higher luminosity. The efficiencies are as high as for the triggers based 
on jets and £"™ 1SS and can thus provide good redundancy. 

3.1.2 Signal selection 

At the benchmark point GMSB1, 48.9% (16.4%) of the signal events have one (two) photons with 
pj > 20 GeV in the fiducial acceptance (|tj | < 2.5) used for photon identification. For the reconstruction 
of photons a standard cut-based photon selection [43] is used. This is mainly based on variables using 
information of the first and second samplings of the electromagnetic calorimeter. The photons are re- 
quired to be isolated and those located in the transition regions between barrel and endcap calorimeters 
are excluded. No track veto is applied. After a full GEANT4 simulation of the ATLAS detector, the 
selection efficiency for photons with p T > 20 GeV is about 65%. 
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Table 6: ALPGEN Background samples used in this section. The corresponding integrated luminosities 
is shown. 



Process 




Integrated luminosity (fb ^ ) 


Top 


leptonic 


~ 13.4 




semi-leptonic 


-3.5 




hadronic 


- 17.1 


Electroweak 


Z e+e- 


-4.8 


+ jets 




-8.3 




Z -► T+T- 


-21.6 




Z^vv 


-9.1 




W^ev 


-3.4 






-4.7 






-3.9 


QCD 


multiple jet production 


-0.03 



As background to the signal, events with QCD jets, single gauge boson (W and Z) production and tt 
production are simulated using the ALPGEN generator. The specific processes are listed in Table 6 and 
the corresponding integrated luminosities are given for each process. In order to separate the signal from 
the Standard Model background a standard preselection for SUSY-like signatures is first performed: 

• At least four jets must be found with pj > 50 GeV (pj > 100 GeV for the leading jet). 

• Missing transverse energy £"^ 1SS > 100 GeV and £™ 1SS > 20%-M eS , where the effective mass M eff 
is defined as the scalar sum of £"™ 1SS and the transverse momenta of the four leading jets. 




E7 s [GeV] M eff [GeV] 
Figure 3: Distributions after preselection for lfb -1 . Left: Missing transverse energy. Right: Effective 
mass for signal and Standard Model background. 

The distributions of £"™ ss and M e ff after this selection are shown in Figure 3 for the Standard Model 
background (histograms) and the signal (open symbols) for an integrated luminosity of 1 fb~ 1 . No large 
excess of events is seen over the Standard Model background. However, as shown in Figure 4, a cut 
on the number of reconstructed photons with pj > 20 GeV and \r]\ < 2.5 provides an effective way to 
further suppress the backgrounds. Figure 3.1.2 shows the pj distribution of the leading photon after 
the initial preselection. Table 7 shows the number of selected events for signal (S) and background (B) 
after requiring 0, 1 or 2 photons passing the cuts described above and either the g55 or the 2gl7i trigger. 
This combination of triggers has a combined efficiency for the signal of — 85% (— 99%) before (after) 
applying these selection cuts. The number of events is normalized to an integrated luminosity of 1 fb -1 . 
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In addition, the signal significance defined as Sig = S/\J~B is given in the table. In the calculation of Sig 
it is assumed, that there is at least one background event left. With the requirement of two high-energy 
photons the selection is mainly free from Standard Model background and the significance becomes very 
large. 

In addition to the selection criteria listed above, checks were made to see weather a better signal 
significance can be achieved by requiring an opposite sign same flavour (OSSF) lepton pair, which orig- 
inates in the squark/gluino decay cascade from a %® to X\ decay via a slepton, as depicted in Figure 1. 
Here, only electrons and muons are accepted as leptons. The requirement of at least one OSSF lepton 
pair reduces the number of selected signal and background events. The suppression factor for the signal 
for the combination of one photon and one OSSF pair is larger than for the combination of two photons. 
Hence, although the background is reduced to a very low level, just using the requirement of two photons 
gives the largest significance. Requiring an OSSF pair in addition to two photons just reduces the signal, 
because the background is already very low. 




Figure 4: Distributions after preselection for 1 fb 1 : Number of reconstructed photons with p T > 20 GeV 
and \rj\ < 2.5 (left) and transverse momentum of the leading photon for signal and Standard Model 
background, (right) 



3.2 GMSB parameter scan with fast simulation 

To investigate the discovery potential of the selection described above, over a wider range of the GMSB 
parameter space, it is necessary to make use of a fast detector simulation to obtain adequate statistics 
for signal event reconstruction at various points in the parameter space. The computing requirements of 
the full simulation make it impractical for this study, so for this part of the analysis the fast simulation 
package ATLFAST [44] has been used instead. ATLFAST performs no detailed simulation of particle 
interactions with the detector material, but instead parameterizes the detector response. It has two main 
features relevant to the analysis discussed here: 

• every generated particle is reconstructed. 

• there is no distinction between electromagnetic and hadronic calorimeter compartments and the 
energy of a particle is obtained by smearing the energy of the generated particle with a resolution 
function. No shower development is simulated. 

Note that ATLFAST does not simulate either reconstruction inefficiencies nor particle misidentification 
for any particles. 

Figure 5(a) shows the M e g- distributions of the GMSB1 event sample for full and fast simulation. In 
the low energy region a small deviation of the fast simulation with respect to the full simulation can be 
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Table 7: Number of selected signal and background events for 1 fb 1 for different cuts on the number of 
photons and opposite sign same flavour (OSSF) lepton pairs. 



N r 


Nossf 


Signal 


£ Background 


Sig 


N w 


N z 


Ntf 


0 


0 


1287.4 


929.6 


42.3 


274.4 


21.0 


632.8 


0 


1 


283.6 


73.0 


33.2 


8.7 


1.4 


63.0 


1 


0 


902.9 


51.7 


126.1 


19.5 


2.0 


30.1 


1 


1 


189.1 


1.4 


161.4 


0.2 


0.0 


1.2 


2 


0 


252.9 


0.1 


252.9 


0.0 


0.0 


0.1 


2 


1 


37.0 


0.0 


37.0 


0.0 


0.0 


0.0 




observed, which is due to a slightly higher jet momentum in ATLFAST in the low energy region. How- 
ever, of more importance for this analysis is the simulation of the photon identification and it is important 
that the fast simulation provides a reliable modelling. The transverse momentum of the leading photon is 
shown in Figure 5(b) for full and fast simulation. It can clearly be seen that some significant discrepan- 
cies between both simulation approaches exist. This is a result of the photon detection efficiency, which 
is not included in the fast simulation. The fast simulation assumes a 100% detection efficiency for all 
truth photons which pass a certain isolation criterion. This effect is taken into account in the analysis 
by imposing a realistic reconstruction probability on each photon by hand, depending on the transverse 
momentum of the photon. The corrected pj distribution is also shown in Figure 5(b). 

In Table 8 the numbers of selected signal events for full and fast simulation are shown and good 
agreement can be observed after each step of the selection. The level of agreement of the fast simulation 
with the full simulation is of the order of 10% which is considered to be sufficient for a rough estimation 
of the discovery potential via a scan of the GMSB model parameters using ATLFAST. 



Table 8: Number of selected signal events normalised to Jzf = 1 fb 1 for full and fast simulation for 
GMSB1. 





Nossf 


Signal (full) 


Signal (fast) 


0 


0 


1287.4 


1597.7 


1 


0 


902.9 


1029.5 


2 


0 


252.9 


275.3 
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In order to estimate the selection efficiency and the discovery potential of the two photon chan- 
nel in a more model independent way, a scan of some GMSB model parameters has been performed. 
The mass spectrum and branching fractions of the different GMSB model points have been calculated 
using ISASUGRA 7.74 [36]. For each point 12500 signal events have been generated using the HER- 
WIG/JIMMY [40,41]. As discussed above, ATLFAST is used for the detector simulation of the signal 
with a correction applied to the photon reconstruction efficiency. The estimates for the Standard Model 
background are taken from the full simulation as described in Section 3.1.2. 

For the scan, the SUSY breaking scale parameter A has been varied from 60 to 200 TeV in steps 
of 10 TeV and tan/3 has been varied from 2 to 50 in steps of 2. The other model parameters are fixed 
to 7Y 5 = 1, M mes = 500 TeV, sgn/x = +1 and C grav = 1, as for the GMSB1 model point. In this part of 
the parameter space the neutralino is usually the NLSP, which is most often not the case for larger N5 
or larger tan/3. For these regions, other channels need to be used to discover GMSB SUSY. These are 
discussed in Section 5. The discovery potential for the case of different C grav and hence non-pointing 
photons is briefly discussed in Section 3.3. 

Figure 6 shows the contour lines where the significance reaches Sig = 5a for the default selection cuts 
as described above for different integrated luminosities. Since it is assumed that there is 1 background 
event left, these contour lines represent the lines with 5 signal events. In the regions below and left of the 
lines a 5a discovery can be made with the corresponding amount of data. In the high tan/3 region above 
the solid line no sensitivity is quoted for the two photon channel, since in this region the % is the NLSP 
and so no significant excess of photons is expected from the SUSY decay chains. Due to the fact that the 
SUSY cross section decreases with increasing A, the significance decreases as a function of A for a given 
integrated luminosity. In general the discovery potential in most parts of the GMSB model parameter 
space is high, giving confidence that GMSB SUSY can be discovered in the two photon channel with 
early data, if it is realised in nature. 



ax 




Figure 6: 5a discovery potential contour lines for GMSB SUSY in the A - tan/3 plane for different 
integrated luminosities. 



3.3 GMSB3 (non-pointing photon) full simulation studies 

If the gravitino mass parameter C grav is larger than unity, the NLSP will not decay promptly. In the 
case where the NLSP is a light neutralino, the resulting photons in the final state will not point back 
to the interaction point and may therefore be reconstructed and triggered with lower efficiency. The 
reconstruction efficiency is discussed in greater detail in Section 4. 1 . Here, the standard photon selection 
will be used to estimate the discovery potential. 
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Table 9: Trigger efficiencies and statistical errors for the GMSB3 event sample for (Jz? = 10 33 cm 2 s l ). 



Trigger item 


LI 


L1+L2 


L1+L2+EF 


g55 
2gl7i 
j65+xE70 
3j65 


90.19±1.08 
34.13il.72 
80.38±0.56 
79.80±0.57 


46.04±1.81 
17.77±1.39 
80.24±0.56 
79.66±0.57 


36.88±1.75 
12.87±1.22 
71. 18±0.64 
79.62±0.57 



Table 10: Number of selected signal (GMSB3) and background events for 1 fb 1 for different cuts on the 
number of photons and opposite sign same flavour (OSSF) lepton pairs. 





Nossf 


Signal 


£ Background 


Sig 


N w 


N z 


N r f 


0 


0 


825.2 


929.6 


27.1 


274.4 


21.0 


632.8 


0 


1 


265.2 


73.0 


33.2 


8.7 


1.4 


63.0 


1 


0 


255.8 


51.7 


35.7 


19.5 


2.0 


30.1 


1 


1 


68.6 


1.4 


58.6 


0.2 


0.0 


1.2 


2 


0 


12.5 


0.1 


12.5 


0.0 


0.0 


0.1 


2 


1 


4.7 


0.0 


4.7 


0.0 


0.0 


0.0 



A suitable model point to study is the GMSB3 point, which has the same parameters as GMSB1, but 
with C grav = 55. The X\ decay length in this point is therefore y/kr w3m Although only 12.4% (0.6%) 
of the reconstructed events contain one (two) photons with pj > 20 GeV in the detector acceptance 
region, this well exceeds the number of background photons. This suggests that one could also use the 
above defined selection, which is based on the requirement of two hard photons. 

Table 9 shows the trigger efficiencies for the same items listed in Table 5. For the LI trigger, the 
main source of inefficiency for the g55 trigger is from neutralino decaying to photons outside the inner- 
detector volume. The larger the C grav parameter is, the greater is the number of neutralinos that will decay 
outside the inner-detector. This effect is more pronounced for the 2gl7i trigger, which is optimized for 
the production of both photons within the inner-detector volume. At L2 trigger and at the event filter, 
cuts are placed on the shape of the electromagnetic showers, which are less efficient for non-pointing 
photons due to their wider shower shape in the r\ direction compared to prompt photons. The small 
difference in jet trigger efficiencies between GMSB1 and GMSB3 is again due to the difference in the 
number of photons produced within the inner-detector volume. Photons in the event are treated as jets 
up to the event filter, so that GMSB1 has effectively a larger number of jets, compared to the GMSB3 
sample, which makes a small but significant difference in jet trigger efficiency. 

The resulting numbers of selected events for 1 fb -1 are shown in Table 10. It can clearly be seen 
that, although the significance, again defined as Sig = S/y/B, is smaller than in the GMSB1 case, there 
are enough photons to select a large number of signal events. The difference to the prompt photon case is 
that with the requirement of an OSSF lepton pair one could obtain a larger significance, which is largest 
for a combination of one hard photon and one OSSF pair. 

3.4 Conclusion of GMSB SUSY with photon signatures 

In certain regions of the GMSB parameter space the NLSP is a light neutralino, decaying to a gravitino 
via the emission of hard photons. These photons can be used to efficiently reject the Standard Model 
background and to discover GMSB SUSY, if it is realized in nature. Attention must be payed to the 
fact that the photons might not point back to the interaction point leading to losses in reconstruction and 
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Figure 7: Schematic diagram of a non-pointing photon in the barrel section of the electromagnetic 
calorimeter. A long lived neutralino can travel a significant distance before decaying into a photon 
and a gravitino. The gravitino will escape the detector without interacting. A photon produced in this 
manner can enter the calorimeter at a significantly different angle (Tj r ) than a photon produced at the 
primary vertex (T]i). 



trigger efficiencies. 



4 Prospects for neutralino lifetime determination 

If GMSB SUSY is discovered, the ATLAS calorimeter can be used to first establish whether the neu- 
tralino has a long mean lifetime, and then to quantify it. The calorimeter can be used to both measure the 
direction and the time of the electromagnetic shower. Both the capabilities can be utilised to determine 
the mean lifetime of the neutralino. 

If the neutralino has a significant decay length, a photon can be observed 3 that will not "point-back" 
to the primary interaction point. This is shown schematically in Figure 7. The neutralino (%\) travels 
a significant distance before decaying into a photon (y) and a gravitino (G). Due to the finite opening 
angle between the photon and G, the path taken by the photon does not extrapolate back to the primary 
interaction point. 

The first sampling layer can measure the r\ position (Cluster 1 in Figure 7) whereas the second 
sampling layer can measure both X] and 0 (Cluster 2 in Figure 7). A vector corresponding to the path of 
the photon can therefore be constructed in the r — z plane. Although we can not measure the exact decay 
point of the neutralino based on these two measurements, we can extrapolate the path of the photon back 
to the beam axis, and measure the distance between this point and the primary vertex (Z' in Figure 7). 

Since the ATLAS calorimeter has a pointing geometry, if a photon enters the calorimeter at a signif- 
icant angle, the resulting electromagnetic shower can be spread out over a larger number of calorimeter 
cells. This wider shower-shape can result in issues for photon reconstruction algorithms and identifica- 
tion criteria. The effects of the reconstruction and identification of the non-pointing photons are discussed 
in Section 4.1. 

If the neutralino has a mean lifetime greater than 0.05 ns 4 , the Z' value associated with the photon 
can be used to establish that the neutralino has a "long" lifetime. Once this observation has been made, 
the Z' distance can also be used to measure the mean lifetime. This is discussed in Section 4.2. 

The neutralino is a massive particle. This means that photons produced from long-lived neutralinos 
will arrive at the calorimeter later than prompt photons from the primary vertex. A method which uses 
the calorimeter timing information to calculate the mean neutralino lifetime is discussed in Section 4.3. 

3 As long as the neutralino decays before the calorimeter. 

4 For typical values assumed for the neutralino energy and mass of 200 and 100 GeV respectively, photons with a Z' of at 
least 1cm from the primary vertex were observed. 
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Figure 8: Left: The distributions of the decaylength of the neutralino in the z direction. Right: The 
transverse momentum of the photons they produce. 



4.1 Reconstruction and identification of non-pointing photons 

4.1.1 77 definitions 

Due to the nature of the non-pointing photons, two definitions of 77 are used in this section. The 'truth 
77 ' refers to the 77 from the particle vector from the Monte Carlo event record (shown as Tj r in Figure 7). 
The term 'detector 77 ' refers to the 77 as measured by constructing a vector from (0,0,0) to the barycenter 
of the electromgnetic shower(shown as 772 in Figure 7). 

4.1.2 Photon reconstruction efficiency 

The photon reconstruction efficiency is defined as the fraction of photons, produced from the decay of a 
neutralino, that are reconstructed as a photon candidate. Using information from the Monte Carlo event 
record, photons are selected to be used in the efficiency calculation if they satisfy the following criteria: 

• originate from a neutralino decay occuring inside the outer envelope of the inner detector, 

• /? r >20GeV, 

• (detector 77 1 < 2.5. 

Photons are declared as successfully reconstructed if a photon candidate is found within AR < 0.2 of the 
position of the truth photon in the calorimeter. 

The z-component of the decay length and the momentum of the photon in the GMSB samples, as 
shown in Figure 8 and 8, depend upon the C gmv parameter of the sample. 

The overall efficiency for a photon to be reconstructed in the long-lived neutralino data samples, 
GMSB2 and GMSB3, is 88.3 ±0.2% and 83.3 ±0.4% respectively. This efficiency includes the recon- 
struction of photons which have converted to an electron-position pair. This occurs approximately 30% 
of the time and is dependent on how much material the photon travels through the inner-detector. This 
means that a photon from a long-lived neutralino will have a smaller probability of converting than a pho- 
ton produced at the primary vertex. The reconstruction efficiency of a photon which does not convert, is 
89.3 ± 0.2% for the GMSB 2 sample and 84.2 ± 0.5% for the GMSB 3 sample. 

The difference in overall reconstruction efficiency measured between the GMSB2 and GMSB3 sam- 
ples is due to the distribution of neutralino lifetimes in the sample, and hence the proportion of photons 
which are significantly "non-pointing". 
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Figure 9: Reconstruction efficiency as a function of At] for the GMSB2 and GMSB3 samples. 

The reconstruction efficiency is measured independently from the sample parameters as a function 
the variable Ar) = detector r\ - truth Tj as shown in Figure 9. 

There is excellent agreement between the two cases. The efficiency is approximately 90% for 
AT7 < 0.25, and falls steadily to 75% for At] « 0.5. It is clear that the reconstruction efficiency could 
bias any measured neutralino lifetime distribution. This efficiency distribution is parameterised with an 
approximation to the top-hat function: 

/? eff (A7]) = b -r^ + d (1) 

with a = 4.7(6.9), b = 174(229), c = 0.779(1.16) and d = 0.545(0.351) for GMSB2 (GMSB3). The 
GMSB2 fit result is used in Sections 4.2 and 4.3 (due to greater statistics) to account for any bias in the 
neutralino lifetime determination due to inefficiency. 

4.1.3 Study of photon identification efficiency 

The dependence of the photon efficiency on the neutralino decay length has been studied for all of 
the standard photon selection variables [43]. The efficiency for each cut is defined as the fraction of 
reconstructed photons with pj > 20 GeV and (detector r\ | < 2.5 that pass the standard select requirement 
for the given variable. Only photons that are successfully identified with true photons that come from the 
decay of a SUSY particle are used. 

Figure 10 shows the efficiency of these cuts as a function of the component of the neutralino decay 
length parallel to the beam axis. From this figure it can be seen that the hadronic leakage (Had/Em) is 
independent of the neutralino decay length. Of the cuts forming the standard selection from the second 
sampling layer of the electromagnetic calorimeter, only the ratio of energy in the 3x3 / 3x7 cells (R33) 
is shown to be flat with respect to the neutralino decay length. The efficiency of the cuts on the ratio 
of energy in the 3x7 / 7x7 cells (R37) and the the lateral width of the shower (weta2) are shown to 
have a clear dependence on the decay length. These two cuts are removed to form an 'unbiased photon 
selection' in the second sampling layer. 

For the cuts forming the standard cut selection in the first sampling layer of the electromagnetic 
calorimeter, the fraction of energy (f 1) and the cuts on the search for a second minima in the first sampling 
layer (DeltaE and DeltaEmax2) are shown to be relatively stable with respect to the neutralino decay 
length. There is however, a significant dependence of the efficiency on the decay length for the fraction 
of energy outside the shower core (fracm), the shower width in three strips (wetal) and the total width 
of the shower (wtot). These three cuts are removed to form an 'unbiased photon selection' in the first 
sampling layer. 
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Figure 10: Photon identification cut efficiencies as a function of neutralino decay length in Z 

Table 11 shows the effect on the overall efficiency as the five cuts which have been shown to be 
dependent on the neutralino decay length cuts are excluded from the standard photon selection to form 
an 'unbiased photon selection'. For simplicity the cuts are seperated according to which calorimeter 
sampling layer they are based upon. 

Table 11: Summary of photons identification efficiencies for the three different signal samples. The 
hadronic, second sampling and first sampling cuts are applied sequentially. 





Standard photon selection 






hadronic 


2 nd sampling 


1 st sampling 


GMSB1 


(94.1 ±0.2)% 


(75.7 ±0.4)% 


(64.1 ±0.4)% 


GMSB2 


(94.2 ±0.1)% 


(56.4 ±0.3)% 


(41.9 ±0.3)% 


GMSB3 


(94.4 ±0.3)% 


(49.8 ±0.6)% 


(36.1 ±0.6)% 




Unbiased selection 






hadronic 


2 nd sampling 


1 st sampling 


GMSB1 


(94.1 ±0.2)% 


(93.4 ±0.2)% 


(85.7 ±0.3)% 


GMSB2 


(94.2 ±0.1)% 


(92.2 ±0.1)% 


(82.5 ±0.1)% 


GMSB3 


(94.4 ±0.3)% 


(92.1 ±0.3)% 


(80.7 ±0.5)% 



The relative effect of loosening cuts on the background is shown in Table 12, which shows the 
fraction of jets, from a di-jet Monte Carlo data sample, that are reconstructed as photons, that also pass 
the two different photon selections. 

Table 12: The fraction of jets reconstructed as photons and passing all photon criteria. The hadronic, 
second sampling and first sampling cuts are applied sequentially. 





Hadronic 


2 nd sampling 


1 st sampling 


Default photon selection 
Unbiased selection 


(3.4±0.1)% 
(3.4±0.1)% 


(0.57 ±0.06)% 
(2.7 ±0.1)% 


(0.19 ±0.03)% 
(0.70 ±0.07)% 



4.2 Projected impact-parameter method for neutralino mean lifetime measurement 

The distribution of the photon's projected longitudinal impact parameter Z' arising from GMSB signal 
events can be used to estimate the mean neutralino lifetime. 
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Figure 11: Fitted slope parameters of the projected intersection distributions versus mean neutralino 
lifetime from the custom-built Monte Carlo simulation for an integrated luminosity of 30 fb _1 . 



A distribution of the intersection points of the z-axis with the projected photon path is then created 
for all reconstructed photons. Each intersection point is corrected for the vertex displacement. An 
exponential function is fitted to the intersection distribution in order to extract the slope parameter which 
is sensitive to the mean neutralino lifetime. The range for the fit was chosen to be 50 to 500 mm, to 
remove any possible vertex effects, and to ensure that the decay occured within the volume of the inner- 
detector. 

Plotting the resulting slope parameters versus the mean neutralino lifetime reveals a clear correlation 
(Figure 11) between the slope parameter and the neutralino lifetime. These results were obtained using 
a custom-built Monte Carlo program which provides a detailed parameterisation of the response of the 
transition radiation tracker and the electromagnetic calorimeter. It is envisaged that a calibration curve 
such as this, created using full simulation, could be used to determine the mean neutralino lifetime from 
a measurement of the slope of the Z' distribution. In reality this slope will also be a function of the j3 of 
the neutralino. 

For the GMSB2 data set, a slope of -4.35(6) x 10~ 3 was measured and for the GMSB3 data set, 
a slope of —3.8(2) x 10~ 3 was measured. These slopes were obtained before any photon identification 
cuts were applied. To estimate the effect of bias due to the reconstruction efficiency, a weight (= R ff ( At? ) ) 
is applied to the events, see Section 4.1.2. By comparing the effect of applying this correction on the 
resultant Z' slope, a — 5 x 10~ 5 systematic error was obtained. Table 13 shows the values obtained when 
different photon identification cuts are used. 

The results from the GMSB2 and GMSB3 samples shown in Table 13 demonstrate very clearly how 
the slope of the Z' distribution can be affected by photon identification cuts which are based on width 
measurements of the electromagnetic shower. 



4.3 Calorimeter timing 

A comparison has been made of the timing of the electromagnetic shower in the calorimeter, compared 
to the lifetime of the generated neutralino (in its rest frame). A Gaussian is fitted to this distribution for 
different bins of true lifetime, and the resultant mean cluster-time per generated neutralino mean lifetime 
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Table 13: The slope of the projected-impact-parameter (Z ; ) distributions of the photon, fitted from 50 
to 500 mm and measured in two different fully simulated Monte Carlo samples with different photon 
identification cuts. 



GMSB2: Generated mean lifetime of 3.17 ns 




dataset 


hadronic 


2 nd Sampling 


1 st Sampling 


default photon selection 


-4.35(6) x 10~ 3 


-7.00(9) x 10~ 3 


-8.3(1) x 10~ 3 


unbiased selection 


-4.35(6) x 10~ 3 


-4.39(6) x 10~ 3 


_4.54(7) x 10~ 3 


GMSB3: Generated mean lifetime of 10.7ns 




dataset 


hadronic 


2 nd Sampling 


1 st Sampling 


default photon selection 


-3.8(2) x 10- 3 


-6.6(2) x 10~ 3 


-7.7(3) x 10~ 3 


unbiased selection 


-3.8(2) x 10~ 3 


-3.9(2) x 10~ 3 


-4.1(2) x 10~ 3 



Table 14: The mean lifetimes, measured using the calibrated calorimeter time, in two different full 
simulation Monte Carlo samples with different photon identification cuts. Also shown is the generated 
mean lifetime of the samples used. 



GMSB2: G 
dataset 


jnerated mean 
hadronic 


lifetime of 3.17i 
2 nd Sampling 


is 

1 st Sampling 


default photon selection 
unbiased selection 


2.9 ± 0.2 ns 
2.9 ± 0.2 ns 


1.1 ±0.07 ns 
2.9 ±0.2 ns 


1.33 ±0.05 ns 
3.0 ± 0.2 ns 


GMSB3: G 
dataset 


;nerated mean 
hadronic 


lifetime of 10.7i 
2 nd Sampling 


is 

1 st Sampling 


default photon selection 
unbiased selection 


9±4ns 
9±4ns 


3.4 ±0.7 ns 
8±3ns 


2.9 ± 0.6 ns 
19 ± 19 ns 



is plotted in Figure 12. This fit to this plot is used to calibrate the calorimeter time. It has been shown that 
this method is robust against photon reconstruction or identification efficiency biases. This is because it 
is independent of the angle of incidence of the photon on the calorimeter. The arrival time of the photon 
at the calorimeter is a function of the /3 = v/c of the neutralino as well as its lifetime. 

Using this calibrated calorimeter time, the neutralino lifetime is plotted for each photon. This dis- 
tribution has the expected exponential shape modified by acceptance and resolution effects. In order to 
remove these effects, an exponential is fitted between 0.2 and 1 ns. The mean lifetime of the sample (t) 
was calculated from T = where slope is the slope of the exponential fitted. 

To calculate the effect of bias due to the reconstruction efficiency, a weight (= R ff ( AtJ ) ) was applied 
to the events (see Section 4.1.2). The systematic error on the lifetime determination due to uncertainties 
on the reconstruction efficiency was determined to be 2%. 

To study the effect of the photon identification on this method, the mean lifetime deduced from the 
slope of the exponential is measured after different identification cuts are applied to the photon sample. 
These mean lifetimes are shown in Table 14. The large errors on the calculated lifetimes from the 
GMSB3 sample are due to lack of statistics available for the fit, over the limited range. 

In order to obtain estimates for the systematic uncertainty due to the predicted j3 distribution of the 
GMSB sample (or an error in the timing calibration), the calibration curve (Figure 12) was scaled up and 
down by 5% corresponding to the difference in mean /3 value between the GMSB2 and GMSB3 samples. 
The effect on the measured mean lifetime determination corresponds to a systematic error of 0.4(2) ns 
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gen time(ns) 

Figure 12: The measured cluster time as a function of the generated neutralino lifetime, for individual 
neutralinos in the Monte Carlo samples. 

for the GMSB2(3) samples. 

In order to obtain estimates for the systematic uncertainty in the mean lifetime determination from 
the choice of fitting region or a global shift in the timing calibration, the fit range of the exponetial 
distributions was shifted by 100 ps. A systematic error of 1(10) ns was obtained for the GMSB2(3) 
samples. 

The shape of the observed neutralino lifetime distribution is strongly affected by geometric accep- 
tance and event kinematics. To obtain a lifetime measurement unaffected by these issues, a very limited 
fitting range has been used. A more accurate fit could be achieved by fitting the entire distribution. A full 
acceptance correction, including model dependent effects, would then be required to relate this distribu- 
tion to the mean neutralino lifetime. This work, beyond the scope of this publication, should produce a 
more accurate measurement of the mean neutralino lifetime. 

4.4 Conclusion of the neutralino lifetime determination 

Two independent methods for determining the mean neutralino lifetime have been discussed. The two 
methods are independent with different issues. For the Z' method of Section 4.2, a study using a simple 
custom-built Monte Carlo program shows that there is a good correlation between the Z' parameter and 
the mean neutralino lifetime. However, full simulation of a range of lifetime samples will be required to 
get the correct parameterisation of the relationship. The calorimeter timing study (Section 4.3), shows 
there is a good correlation between the calorimeter time and lifetime of the associated neutralino. The 
largest errors from this studies are due to the limited statistics in the range of the measured neutralino 
lifetime distribution, used for the exponential fit. 

In both methods, one has to take care that the biases introduced by the reconstruction and identi- 
fication of the photons are both measured and reduced, in order to prevent a distortion of the lifetime 
distribution. 

Both of these methods are signal dependent and rely on simulation calibration, either to produce the 
Z' calibration curve, or the timing calibration. This is because both the Z' value and the arrival time of the 
photon in the calorimeter are functions of the /3 of the neutralino as well as its lifetime. One can assume 
a distribution of /3 from Monte Carlo simulation (as has been done here), but constraints on the both 
parameters can in principle be achieved by combining the methods. It is proposed that, if non-pointing 
photons are observed, a multivariate analysis method could be used to combine information from the 
calorimeter timing and Z' together with information from the primary vertex and cluster positions and 
energy to place model-independent constraints on the j3 and lifetime of the parent of the non-pointing 
photon. 
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5 Trigger and reconstruction for searches of long-lived heavy particles 

Heavy, charged, long-lived particles are predicted in many models of physics beyond the Standard Model. 
One example is GMSB where for high tan/3 the £ is the NLSP which couples weakly to the gravitino. 
The signal is a heavy long-lived charged particle with velocity significantly smaller than the speed of 
light, j8 < 1. The momentum (and therefore j3) spectrum of these particles is model dependent. Those 
which have j8 close to unity are indistinguishable from ordinary muons. Those with j3 significantly lower 
than 1 could be identified and their mass determined. 

In ATLAS event fragments from different parts of the detector are assiged to a particular bunch 
crossing (BC) using the BC identifier (BCID). The usual assumption is that the particles traverse the 
detector at nearly at the speed of light (j8 p» 1). Hits from a slower particle may be lost during data 
collection, or may be marked with the wrong BCID. The implications of low particle speed in the ATLAS 
trigger and data acquisition design are considered below. 

This note does not address the case where the decay length of heavy charged NLSP is such that a 
significant number of particles will decay inside the tracking volume. 



5.1 Datasets used 

This analysis is based on a data sample of 10,000 events from the CSC production generated with the 
characteristics of GMSB point 5: A = 30 TeV , M m = 250 TeV, N 5 = 3, tan/3 = 5, sgn(ju) = +, C grav = 
5000. At this point the squarks and gluinos have masses around 700 GeV, the neutralino has a mass of 
1 14 GeV and the f and i have masses of 102 and 100 GeV respectively. The cross-section for this point 
is 23 pb and the f , e and p. are co-NLSPs and are produced in the decay X° —> l^i^ . Because of the 
small mass difference between the neutralino and the slepton, the £ and lepton are nearly collinear. The 
pi and /3 spectra of the sleptons and accompanying leptons are shown in Figure 13. 
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Figure 13: Transverse momentum and velocity spectra for sleptons and accompanying leptons from the 
GMSB 5 sample. 



GMSB5 is a single benchmark point and cannot be taken to represent all the possibilities of long- 
lived particle production. Some issues that impact our ability to discover long-lived new particles, if 
they exist, depend on the mass and j8 spectrum of these particles. In order to make our study less model 
dependent we also used for this study additional samples of events containing a single f each, generated 
at different j8 with a uniform tj distribution between tj = — 3 and tj = +3. 

Split-SUSY events containing long-lived gluinos with masses of 300 GeV and 1000 GeV were also 
used to assess the efficiency of the slow particle trigger, as discussed below. The generation and simula- 
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tion of these Split-SUSY events is described in Section 6. 

For the background study we used single muon events from the CSC production. They were produced 
at constant p T with a uniform T] distribution, like the single z samples. We used cross-section estimates 
of 1000 pb (200 pb) for muons with p T > 40 GeV (> 100 GeV) inside the ATLAS acceptance of | tj | < 2.5 
[45]. 

The simulation of a long-lived heavy sleptons in the ATLAS detector required a special patch to 
GEANT4 [46]. 

5.2 Trigger and DAQ issues 

When trying to identify slow particles [47, 48] one must pay special attention to the dimensions of AT- 
LAS. Since the detector extends over 20m in length from the interaction point to each detector side and 
the bunch crossing period is 25ns, this means that particles from three separate bunch crossings can 
co-exist in the detector at the same time. 

As described above, the matching of event fragments from different subdetectors is achieved using 
the BCID. This is calibrated so that particles originating together at the interaction point and traveling at 
the speed of light will have the same BCID assigned to them in all detector elements. 

When j6 is sufficiently small, the particle will take longer to reach the detectors (especially those far 
from the interaction point) and hits may be assigned a wrong BCID and thus not be read-out. Figure 14 
shows the efficiency with which slepton hits in the muon trigger chambers are associated to the correct 
bunch crossing as a function of j3. This figure was produced using the single z events described above. 
It can be seen that efficiency drops sharply below jS = 0.8(0.7) in the endcap (barrel). In order to find 
particles with j8 < 0.7 (/3 < 0.6) in the endcap (barrel), ATLAS must collect hits from the following 
bunch crossing (BC). Fortunately the MDT chambers collect data over a 700 ns interval, and thus hits 
from many BCs will be present. The RPC and TGC data acquisition can be set up to read out data from 
±7 and ±1 BCs around the triggered BC respectively. The option to read out the information about extra 
BC, which was originally intended for debugging, must be switched on during routine ATLAS operation 
if we want to increase our efficiency for long-lived charged particles. 




Figure 14: The efficiency, as a function of j8, for all slepton hits in the muon trigger chambers to be 
included in the same BC with fast particles, for the barrel (right)and the endcap (left). 

The hits from a slow particle may fall outside the correct BC, either for all trigger stations (e.g. if the 
particle is very slow and the trigger was produced by another feature in the event) or hits in the low-pj 
stations in the barrel may arrive in the correct BC, but the hits in the outer station may be late. In such a 
case, even if the pi of the particle was high, it would produce a low pj trigger. 
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Table 15: The LI trigger efficiencies for GMSB5 simulated events for items from the trigger menu for 
££ = 10 33 cm~2s~l. The description of the trigger items is in [42] 



Trigger label 


Description & Efficiency in GMSB5 


MU40 
xE200 
EM25i 


95% 
63% 
46% 



In GMSB5 the two sleptons are produced with different j3's, and since most of the sleptons have high 
j6, at least one of the two produced sleptons will have j8 > 0.7 in 99% of the events. The level- 1 [49] 
muon trigger efficiency in the correct BC is very high, either from a slepton with high /3, or from one of 
the accompanying leptons. Table 15 shows the level- 1 trigger efficiencies for GMSB5 events, based on 
the Level- 1 thresholds defined in the standard ATLAS menu for a luminosity of 10 33 . 

Nevertheless, a low j8 slepton, one with good potential for a mass measurement, could arrive to the 
muon spectrometer, or more likely the outer muon station, in the next BC. In such a case the slow slepton 
will not be found by the trigger, or be identified as a low pj muon. In order to identify such a slow 
particle muon trigger chamber data from the next BC has to be collected. 

In Split SUSY, the gluinos are produced directly, and there are few other features in the event. There- 
fore the R hadrons themselves must trigger the event. Since both of the R hadrons may be slow, the muon 
trigger may correspond to the wrong BC for the central parts of the detector. As a result, other event 
information such as that from the inner detector may be lost in the previous BC. This problem may be 
solved by also collecting data from the previous BC in the inner detector, but the feasibility of doing this, 
from the point of view of increased data volume, has not been investigated in this work. Additional data 
from the calorimeter is not required in order to find long lived heavy charged particles. 

5.3 A L2 trigger for heavy sleptons 



1 ATLAS 

0.6 0.65 0.7 0.75 0.8 0.85 0.9 0.95 



Figure 15: The j3 measured for sleptons in the barrel at L2 for different values of true /?. The error bar 
represents the fitted sigma of the measured j3 distribution. 

At L2 [50], algorithms are activated based on the Region of Interest (Rol) identified at LI. Each 
algorithm has a reconstruction stage which processes the data from the relevant parts of the subdetectors, 
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and a hypothesis stage which makes the decision to keep or reject the reconstructed feature. L2 and the 
Event Filter run in "trigger chains" which define the order in which algorithms are called and the input 
and output of each processing stage. 

The role of the L2 muon trigger is to confirm muon candidates flagged by the LI and to give more 
precise physics quantities of the muon candidate. 

The L2 muon selection is performed in two stages. The first stage is performed by the muFast 
algorithm [51], which starts from a LI muon Rol and reconstructs the muon in the spectrometer, giving 
a new p T estimate. The hypothesis cuts on the estimated pj are set so that 90% of the muons with p T 
above the nominal threshold would pass the selection. The resulting trigger element is then passed to the 
next algorithm. 

Track finding in the inner-detector is performed based on the region of interest found by muFast. The 
muFast candidate and inner-detector tracks then pass to the next algorithm, muComb, which matches an 
inner-detector track to the muon spectrometer track and refines the pj estimate [50]. 

In the muon barrel, the excellent time resolution (about 3 ns) of the RPC allows measurements of the 
time of flight (TOF). A method for finding the slepton and measuring its mass at L2 has been developed. 
We will show that, in the barrel, a slow particle may already be selected effectively at L2. Figure 15 
shows the mean value and error of the j3 reconstructed at the L2 as a function of generated j3. This figure 
was produced using the single f events described above. 

A selection based only on the /3 measurement would leave too many muons in the sample. At the hy- 
pothesis stage, we select using /? T (candidate)> 40 GeV, j8 (candidate) < 0.97 and m(candidate)> 40 GeV. 
The mass is calculated from the measured j3 and the candidate's pr and tj estimated by muFast. Fig- 
ure 16 show the mass distribution of signal and background resulting from the selection for an integrated 
luminosity of 500 pb _1 . It can be seen that the signal to background ratio is already good at the L2. 

At a luminosity of 10 33 , a slow particle trigger without further trigger selection would produce a rate 
lower than 1 Hz coming from muons. Further refinement of the slow particle selection using subsequent 
muon trigger stages can reduce this rate to 0.2 Hz. The measurement of j3 for high p T muon candidates is 




Figure 16: Mass distribution of signal and background resulting from the L2 selection for an integrated 
luminosity of 500 pb _1 . The shaded area is the GMSB5 signal, the dashed line is the muon background, 
and the full line is the sum. 

already part of the standard ATLAS L2 program MuFast, and a program to make the selection described 
above is part of the ATLAS L2 trigger. 

Measuring /3 in the muon spectrometer at the second level trigger could be particularly useful for R 
hadrons, which have a L2 trigger efficiency of about 50% for a mass of 300 GeV. The efficiency loss 
comes mainly from events where the R hadron is neutral in the inner detector (and undergoes charge 
exchange before the muon spectrometer), which then fail the second stage of the L2 trigger requiring 
matching between the candidate found in the muon spectrometer and a track in the inner detector. An- 
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Figure 17: The efficiency to reconstruct sleptons as muons as a function of j3 for two ATLAS muon 
reconstruction packages. 

other source of loss (probably less well simulated) is the assignment of inner detector hits from the 
charged R hadron to the incorrect bunch crossing. 

If the R hadron is identified in the muon spectrometer as a slow particle candidate, it could be ac- 
cepted without the requirement of a matching inner detector track. The slow particle selection at L2 
results in an efficiency of 92% to select events with R hadrons in the barrel. The corresponding muon 
rate is expected to be completely negligible since all muons have an inner detector track. 

The limitations of the L2 slow particle trigger should be noted. Firstly, this selection is performed 
only in the barrel of the muon spectrometer, where the RPCs are the trigger chambers. The timing 
information from the endcap is in BC granularity and cannot be used to measure j8. Secondly, a slow 
particle which does not produce a Rol in the correct BC will not cause the muFast algorithm to be called, 
and the selection will not be performed. Therefore, slow particles in the endcap, as well as very slow 
sleptons in events triggered by other objects such as high pj leptons, can only be identified offline. 

The final trigger decision in ATLAS is made in the event filter, which uses algorithms adapted from 
the offline reconstruction. As will be shown in the next section, the standard muon reconstruction is 
not efficient for slow particles; therefore many would be rejected at this stage. The combined trigger 
efficiency of L2 and the event filter for sleptons with velocity j8 = 0.6 is 39%. 

To avoid the efficiency loss at the event filter, a specific reconstruction algorithm for slow particles, 
like the one we describe below for reconstruction, is being implemented at the event filter. 

Slow particle candidates found at L2 that do not have a matching inner-detector track (such as charge 
flipped /?-hadron candidates) should be accepted without further event filter selection. This will increase 
the combined trigger efficiency of the L2 and the event filter for R hadrons with a mass of 300 GeV from 
25% to 92% in the barrel. The muon rate for this selection is completely negligible since all muons have 
an inner-detector track, but the effect of cavern background on this selection has not been studied yet. 

5.4 Reconstruction of heavy sleptons with the current muon reconstruction packages 

In the standard ATLAS reconstruction [52], stable sleptons are expected to be reconstructed as muons. 
The efficiency to reconstruct slow sleptons is compromised due to the following issues: muons are 
reconstructed by forming track segments in the three stations of the Muon Spectrometer. Segments in 
the 0 direction are found using the trigger chamber data. The data may not be collected if the hits from 
a slepton are in the next bunch crossing instead of the collision BC. The lack of a 0 segment hinders the 
reconstruction of the precision segment in r\ using the MDT data. Furthermore, the radii of MDT hits 
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are distorted by late arrival of the slepton and may not fit well to a segment. 

The efficiency of reconstructing the slepton as a muon depends on the reconstruction technique. 
Figure 17 shows the reconstruction efficiency for sleptons with the two ATLAS combined reconstruction 
packages, Staco and Muid which are based on the muon standalone packages MuonBoy and MOORE 
respectively. It can be seen that reconstruction efficiency starts dropping sharply for /3 < 0.8. This 
indicates that special reconstruction techniques will be required for the slow particles. This is discussed 
in the next subsection. 

5.5 Identifying heavy long-lived particles at reconstruction 

Estimating the velocity and mass in the event reconstruction allows us to identify heavy long lived 
charged particles, as well as to avoid the efficiency losses suffered when reconstructing them as muons. 
We do this with the MuGirl package [53], which enables us to select candidates also when the segment 
reconstruction is imperfect. Offline, the velocity can be also determined using the MDTs, the ATLAS 
precision muon chambers. In the MDT detectors, the hit position is obtained from the particle drift time. 
The drift distance is calculated assuming the particle passed the chamber with j8 = 1, which is wrong 
for the slow slepton. Minimizing the % 2 of fit with respect to the time of arrival to the muon detectors 
yields an estimate of /3 and of the particle mass. This information is combined with estimates of /3 from 
the muon trigger chambers. Figure 18 shows the /3 resolution and reconstructed mass of sleptons from 
GMSB5 obtained from the reconstruction with MuGirl. 




B - true B~ reconstructed mass (GeV) 

Figure 18: jS resolution and reconstructed mass for sleptons from the GMSB5 sample. 



5.6 Conclusion of strategies for the long-lived heavy particle search 

Heavy, long-lived charged particles can be discovered in ATLAS, should they exist. However, this can- 
not be done effectively using the standard ATLAS muon reconstruction tools. Furthermore, much of 
the discovery must be done before the analysis stage, in the data acquisition, high level trigger and re- 
construction. We have added to the standard ATLAS software components which can identify sleptons 
effectively. 

The efficiency to discover slow long-lived charged particles depends on collecting extra data from 
the bunch crossing following the one in which the interaction occurred. This is most important for data 
from the muon trigger chambers, where this possibility is included in the data acquisition design. 

For the GMSB5 model, discovery would assured with low integrated luminosity, once the MDT and 
RPC time calibrations are established. The discovery methods are largely independent of the model 
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characteristics, with discovery for a given integrated luminosity depending primarily on the production 
cross-section of the slow particles. 

6 Search strategies for R hadrons at ATLAS 

6.1 Introduction 

The presence (or absence) of massive exotic stable hadrons will be an important observable in the search 
for and quantification of any new physics processes seen at the LHC. Stable exotic coloured particles are 
predicted in a range of SUSY scenarios (see, for example, Refs. [22-26, 31]). Such particles could be 
copiously produced at the LHC and sensitivity to particle masses substantially beyond those excluded by 
earlier collider searches (<200 GeV) [31] could be achieved at ATLAS even with rather modest amounts 
of integrated luminosity (~ lfb x ). This section outlines a strategy for the detection of exotic massive, 
long-lived hadrons (so-called R hadrons) formed from either stable gluinos or stops (Rg and Rf hadrons, 
respectively). As described in section 2.1, the Rg hadrons (Rf hadrons) are considered in the context of a 
Split-SUSY (stop NLSP/gravitino LSP) scenario. Although this work is performed in the framework of 
SUSY, the techniques presented here may be used in generic searches for stable heavy exotic hadrons. 
As in the heavy lepton studies presented in section 4, this work relies on a signature of high-/^ muon- 
like track, although the distributions presented in this section provide a means of discriminating between 
lepton and hadron hypotheses for any observed stable massive particle. 

This section is organised as follows. First a description is given of the simulation of R hadrons at 
ATLAS, including both the event generation and scattering of R hadrons in matter. Final state observ- 
ables associated with Rg and Rf hadrons are then presented and it is shown how it may be possible to 
experimentally distinguish between these two types of particles should a discovery be made of stable 
massive exotic hadrons. Finally, the discovery potential for Rg and Rf hadrons is presented. 

6.2 Physics and detector simulation 
6.2.1 Event generation 

The leading-order event generator PYTHIA [54] was used to produce samples of pair-produced gluino 
and stop-antistop events for a range of gluino and stop masses, as summarised in Table 16. Production 
mechanisms of stops and gluinos are illustrated in Figure 19, which shows leading-order Feynman dia- 
grams. The effects of higher orders, which are important for the jet selection used later in section 6.3.3, 
are computed within PYTHIA using the partem shower technique. 

The processes studied were selected such that they provide conservative estimates of likely rates at 
the LHC, which depend principally on the mass of the heavy object under study and not other free SUSY 
parameters. 

For the gluino generation a Split-SUSY scenario was used in which squarks masses were set to 
4 TeV. To ensure the results presented here are as model-independent as possible, only the PYTHIA 
sub-process gg — > gg was considered whilst neglecting the quark annihilation process qq — »■ gg, which is 
sensitive to the squark mass. The former process is anyway the dominant production channel for gluino 
masses up to ~ 1 .5 TeV. 

For the stable stop sample, the diagonal production of pairs of the lighter stop state (fi) were assumed 
and the following sub-processes modelled: gg — > t\t\ qq -^t\t\. All sparticle masses except that of the t\ 
quark were set to 4 TeV although, at leading-order, the masses of other sparticles are not relevant for the 
cross-section calculations [55]. 
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Table 16: Expected number of gluino and stop pair-production events for lfb 1 of integrated luminosity 
and the equivalent luminosity of signal samples. 



sparticle 


Mass (GeV) 


Events/fb 1 




8 


300 


2.69 x 10 5 


3.72 x 10~ 2 


8 


600 


4.84 x 10 3 


2.07 


8 


1000 


138 


72.5 


8 


1300 


16.4 


610 


8 


1600 


2.12 


4.72 x 10 3 


8 


2000 


0.230 


4.35 x 10 4 


I 


300 


7.82 x 10 3 


1.12 


t 


600 


1.76 x 10 2 


35.2 


t 


1000 


6.4 


1.5 x 10 3 



9 




Figure 19: Selection of leading-order processes illustrating the production of gluino and stop particles. 



String fragmentation [56] was used to model the momenta of the R hadrons. A Peterson fragmenta- 
tion function [57], 

D( z )ocI(l-I_-^L)- 2 , (2) 

for which the e parameter for the heavy coloured object (£qg) has a value extrapolated from its value for 
^-quarks (£qg/£b = 'njj'njja) [58] was used to model the momentum distribution of the R hadron. 

Following hadronisation, and based on calculations of the mass hierarchy of the lowest-lying R- 
hadron states [59], around 55% (40%) of the stable Rg (ft ; ~) hadrons are predicted to have zero electric 
charge. This difference arises principally due to the possible existence of gluino-gluon states, for which 
the production probability is set to 10% here, and which are treated as mesons when propagating through 
matter (Section 6.2.2). 

To complement the signal samples, various background samples were used, each corresponding to 
an integrated luminosity of at least ~ 1 fb _1 . The generated and reconstructed number of events, and 
the equivalent luminosity of each sample is given in Table 17. As the simulated trigger used for this 
analysis requires a hard muon-like track, only events which could give rise to a high ^r-muon (pj > 150 
GeV) were simulated. The following processes were considered. Leading-order 2-to-2 QCD processes, 
which include all quark flavours except top, and which differ in the values of the internal matrix el- 
ement cut-offs (Pt) were generated with PYTHIA, the predictions of which are denoted QCD when 
discussed in section 6.3.2. In addition, backgrounds arising from diboson (HERWIG [40]) and single 
boson (PYTHIA) production, denoted electroweak, were produced, again with matrix element cut-offs 
in order to produce hard muons. A sample of tt pair-production events, termed top, was also prepared 
using the MC@NLO [60] program. 
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Table 17: Background samples used in this work. The number of generated and reconstructed events 
are shown along with the equivalent integrated luminosity.Event weighting accounts for the non-integer 
number of generated and reconstructed tt events. 



Sample 


Dataset ID 


Gen. Events 


Rec. Events 


J^fb- 1 ) 


QCD: (PYTHIA) 






(140 GeV < Pt < 280GeV) 


5013 


3.125 x 


10 s 


2572 


0.98 


(280GeV < pt < 560 GeV) 


5014 


2.5 x 


10 7 


4800 


1.12 


(poiAjev <. pj <^ i izu oev ) 


5015 


3.5 x 


10 5 


738 


1.01 


(1120GeV <p T < 2240GeV) 


5016 


5x 


10 4 


241 


9.46 


(2240GeV < p T 


5017 


1 X 


10 4 


42 


442.29 


Electroweak 






ZZ (HERWIG) 


5985 


2.5 x 


10 4 


53 


9.82 


WW (HERWIG) 


5986 


2x 


10 4 


50 


1.21 


WZ (HERWIG) 


5987 


1.5 x 


10 4 


29 


2.32 


Z -> (PYTHIA) 


5145 


1.3 x 


10 4 


600 


1.29 


Z — > XT (PYTHIA) 


5106 


3x 


10 3 


108 


9.94 


W -> liV (PYTHIA) 


5105 


3x 


10 4 


600 


0.94 


W — > TV (PYTHIA) 


5146 


3x 


10 4 


120 


7.82 


Top 






tt: (MC@NLO) 


5200 


1 X 


10 6 


4065.08 


0.98 



6.2.2 Simulation of 7?-hadron scattering in matter 

A model of /?-hadron scattering [61,62] recently implemented in Geant4 [63] is used in this work. This is 
an update of earlier work [59] which, in view of the inherent uncertainties associated with modelling such 
processes, adopts a pragmatic approach in which the scattering rate is estimated with the geometric cross- 
section and phase space arguments are used to predict the different 2-to-2 and 2-to-3 reactions. Other 
approaches to modelling /?-hadron scattering have been proposed, based on Regge phenomenology [24, 
25, 64]. These yield predictions of energy loss and scattering cross-sections which are qualitatively 
similar to those given by the model used here and any differences would not be expected to change the 
conclusions of this paper. 

The typical energy loss per interaction is predicted to be low (around several GeV [61]) since only the 
light quarks within the R hadron should participate in interactions with matter, leaving the heavy squark 
or gluino as a spectator. This implies that the fraction of R hadrons which would be triggered (j3>0.6, 
see Section 6.3.1) and which would then be stopped during their traversal of the detector is negligible 5 . 

In addition to energy loss, another feature of /?-hadron scattering, which has implications for experi- 
mental searches, is the possibility of charge and baryon number exchange. Following repeated scattering 
Rg hadrons and R, hadrons not containing an anti-stop should enter the muon system predominantly as 
baryons. This is due to the occurence of meson-to-baryon conversion processes for which the inverse 
reaction is suppressed [59]. Anti-baryons, however, would be expected to quickly annihilate in matter 
and Rj hadrons containing anti-stops would thus largely remain as mesons. 

The material budget of the part of the ATLAS detector enclosed by the muon system varies as a 
function of pseudorapidity between 1 1 and 21 interaction lengths [5,29], with the calorimeters providing 
the largest contribution. It is estimated that a Rg hadron (Rf hadron) will typically undergo 10-20 (7-15) 

5 Although it does not form a part of this work, the possibility of observing the decay of R hadrons which would be stopped 
offers a promising and complementary means of searching for R hadrons at the LHC [65, 66]. 
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nuclear interactions before reaching the muon system [62]. The difference in scattering rates is due to the 
smaller number of light valence quarks present in Rf hadrons. Owing to the large number of interactions 
a substantial rate of events is thus expected in which a R hadron appears to possess different values of 
the electric charge in the inner detector and muon system. 

While such topologies represent a challenge for track reconstruction software, they also provide 
observables useful for the discovery and characterisation of R hadrons, something which is explored in 
section 6.3. For example, a Rg hadron can reverse the sign of its charge. Rf hadrons [31] can only show 
this behaviour in the case where an intermediate neutral state oscillates into its anti-particle [67,68]. Since 
the expected rates of such are processes are essentially unconstrained by experimental limits on SUSY 
scenarios, they are not included in the simulation used here. Instead, a conservative, zero oscillation 
scenario is considered. 

6.3 Event selection 

6.3.1 Trigger 

The selected level 1 trigger is the mu6 trigger [29], which has been considered in previous studies of 
R hadrons at ATLAS as the most promising trigger for this type of work [69]. This trigger is sensitive 
to the 'classic' stable massive particle signature of a high transverse momentum muon-like track. The 
trigger efficiency after the level 2 selection for both Rg and Rf hadrons falls from around 30% at masses 
of several hundred GeV to around 20% at 2 TeV. The variation of efficiency with mass is due to the 
slowness of the R hadron. Here, we consider events in which a /?-hadron track in the muon system 
must be associated with the correct bunch crossing 6 . This leads to a rapid fall in efficiency for /3<0.6. 
A gradual decrease in efficiency would therefore naively be expected for increasing /?-hadron mass. 
After including requirements that the event filter is passed and the muon-like track is well-reconstructed, 
there is little mass-dependence in the overall efficiency for Rg hadrons (around 10-15%) or Rf hadrons 
(20-30%). This difference arises due to the stringent track cuts in the event filter which, at low masses 
(~ 300 GeV) reject a substantial proportion of tracks which have reversed the sign of their charge, which, 
as explained in section 6.2.2, occurs only for Rg hadrons. At higher masses, corresponding to higher 
transverse momentum, the poorer momentum resolution allows more 'charge flippers' to pass. As shown 
in section 6.3.3 this source of inefficiency has little effect on the discovery potential owing to the large 
predicted cross-section for low mass Rg hadrons. However, future work could involve the development 
of triggers which do not rely on linked inner detector-muon chamber tracks. Should such a trigger 
configuration be introduced which is based on the mu40 [29] trigger at level 1, this could potentially 
improve the overall efficiency by a factor of 2 — 3 for j8>0.6. 

6.3.2 /?-hadron final state observables 

Following the trigger selection, reconstructed final state quantities were used to select /?-hadron events 
and suppress background. A number of variables are presented which were found to provide discrimi- 
nation between /?-hadron and Standard Model background processes. Since observables associated with 
Rg hadrons and Rf hadrons are mostly very similar, generally only the /?|-hadron spectra are presented 
in this section. Where there is a substantial difference in the spectra of the two particle species, separate 
distributions are shown. 

Figure 20 shows the expected transverse momenta distributions, j^, of muon-like tracks in Rg and 
/?f-hadron events, for an integrated luminosity of lfb 1 . Distributions from background events are also 
shown. As expected, the /?-hadron spectra become harder with larger mass, extending up to ~ 1 TeV at 
the largest mass values, while the background events are comparatively softer. 

6 Section 5 explores possibilities of probing the lower |3 region 
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Figure 20: Distributions of transverse momenta ^ of hard tracks (pr > 50 GeV) as reconstructed in 
the ID (left) and muon (right) system. The top, middle, and bottom plots show tracks from Rg, R;, and 
background events, respectively. As labelled, /Miadron spectra are scaled according to /?-hadron mass. 
The spectra correspond to an equivalent integrated luminosity of lfb 1 . 

The ratio of high and low threshold HT/LT TRT hit multiplicities is shown in Fig 21 (top left) for 
Rg hadrons of mass 1000 GeV and muon candidates from background events. The different thresholds 
correspond to low (~ 200 eV) and high (~ 6.5 keV) amounts recorded of ionisation energy for a hit. 
Owing to the large mass and restricted /3 range the simulated /?-hadron data peak at lower values of 
HT/LT than the background tracks. 

Since R hadrons will typically suffer only several GeV energy loss per interaction through scatter- 
ing in the calorimeter, it is unlikely they will be associated with a hard calorimeter jet. Figure 21 (top 
right) shows the distance R = (Atj 2 + A0 2 ) 1 / 2 between a Rg-hadron track and a jet (defined with the kj 
algorithm) with pj > 100 GeV. Clearly for the R hadrons, the spectrum is typically larger than around 
1 , unlike the background sources which peak at lower values. The QCD background peaks around zero 
since a large proportion of muons in this sample are produced in the decay of heavy quarks. The top 
distribution peaks at values around 0.4 reflecting the higher jet multiplicity in such events compared to 
/?-hadron events. The distribution is not shown for the electroweak backgrounds owing to the statisti- 
cal imprecision of the Monte Sample sample (very few events with high pj jets arise in the selected 
kinematic region under study here). 

In a leading-order picture, R hadrons will be produced approximately back-to-back, unlike a number 
of background sources, as can be seen in Figure 21 (bottom left), which shows the cosine of the angle 
between two /?|-hadron candidates which both leave hard tracks in the muon system (cos AO^ the 
/?|-hadron sample peaks strongly at (cosA^ M ) ~ — 1. Figure 21 (bottom right) shows the distribution of 
the cosine of the angle between hard tracks in the inner detector and muon system (cosA<J>/£ which, 
for pair production events, would be expected to display peaks at ~ ±1. 

As described in section 6.2.2, charge exchange processes can give rise to events in which a linked 
track is assigned different values of electric charge in the inner detector and muon systems. This is 
shown in Figure 22 which shows the variable q '^p T '° , where q ID ,q^, ptjd, and p Tll are the charge as 
reconstructed in the inner detector and muon system, and the reconstructed transverse momentum in 
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Figure 21: Ratio of the number of high to low threshold hits in the TRT (top left); distance between a 
/?-hadron candidate and a jet (top right); cosine of the angle between two high pj tracks in the muon 
system (bottom left); and cosine of the angle between high pj tracks in the ID and muon systems (bottom 
right). Distributions are shown for Rg hadrons of mass 1000 GeV and three background sources. 

the inner detector and muon system, respectively. The gluino distributions show a two peak structure, 
with the peak at negative values of q 'q P p T ' D arising from charge 'flipping' processes. Rj-hadvon spectra 
indicate a very small rate (several per cent) of candidates with oppositely signed charge in the muon 
and ID systems which is due to charge misidentification in the muon and ID systems. Both the stop 
and gluino distributions become broader with increasing mass; this reflects the commensurate increase 
in pj and hence degraded resolution of the tracking systems, which has the effect of allowing a greater 
proportion of charge- 'flipping' Rg hadrons to satisfy the event filter. 




% P T.A P T, ' 1, D P T,A P T» 



Figure 22: Distributions of for Rg (left) and Rf hadrons (right). Predictions for a range of /?-hadron 

masses are shown. 

6.3.3 /?-hadron selection criteria 

Using the information presented in section 6.3.2 /?-hadron selection criteria were developed following an 
optimisation procedure [62]. First, no hard muon-like track (p T > 250 GeV) can come within a distance 
R < 0.36 of a hard jet (jpj > 100 GeV). Furthermore a candidate R hadron must satisfy at least one of 
the following conditions listed below. For consistency the same selection is applied both for Rg and 
R~ t hadrons though criteria 3-4 are only relevant for Rg hadrons. However, these have a negligible impact 
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on the study of the stop discovery potential. 

1. The event contains at least one hard muon track with no linked inner detector track. A linked track 
is defined such that the distance R = (Atj 2 + A0 2 ) 1 / 2 between the measurements in the ID and 
muon systems is less than 0.1. 

2. The event contains two hard back-to-back ID tracks with the TRT hit distribution satisfying HT/LT < 
0.05. A back-to-back configuration is defined such that the cosine of the angle between the two 
muon tracks is less than -0.85. 

3. The event contains two hard back-to-back (as defined above) like-sign muon tracks. 

4. The event contains at least one hard muon track with a hard matching ID track of opposite charge 
fulfilling the condition ptjd > 0-5prn- 

Table 18 shows the acceptance numbers and rates for the various samples. It can be seen that for 
/?-hadron masses below 1 TeV ATLAS opens up a discovery window with integrated luminosity of the 
order of 1 fb _1 . For masses above 1 TeV the rate of signal events is small, and is comparable to the 
expected background rate, so even discovery would be challenging even with larger data-sets. 

Table 18: Number of events selected for the given samples. Background samples not mentioned here are 
rejected by the selection. 



Sample 


Accepted events 


Rate (Events /fb ) 


300 GeV gluino 


235 


6.44 x 


10 a 


600 GeV gluino 


551 


2.70 x 


10 3 


1000 GeV gluino 


774 


10.7 




1300 GeV gluino 


732 


1.20 




1600 GeV gluino 


685 


0.147 




2000 GeV gluino 


546 


1.26 x 


io- 2 


300 GeV stop 


78 


70.0 




600 GeV stop 


134 


3.9 




1000 GeV stop 


170 


0.1 




J5 


1 


0.893 




J8 


1 


2.26 x 


io- 3 


Z^nn 


1 


0.776 





6.4 Conclusion of 7?-hadron search strategies 

Stable massive exotic hadrons (R hadrons) are predicted in a number of SUSY scenarios. By exploiting 
the signature of a hard penetrating particle which may undergo charge exchange in the calorimeter and 
seemingly does not fall within a jet, ATLAS will be able to discover R hadrons for masses below 1 TeV 
with relatively low amounts of integrated luminosity (~ lfb 1 ). 

7 Conclusion 

Search strategies at ATLAS have been developed for a range of signatures of new physics processes 
expected within SUSY models. Studies were made of high transverse momentum photons, which may or 
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may not have been produced at the primary collision point, and of slow moving stable massive interacting 
particles (sleptons and /?-hadrons). 

It was shown that with early LHC data, corresponding to an integrated luminosity of around lfb~\ 
ATLAS opens up a discovery window for those SUSY scenarios giving rise to the aforementioned sig- 
natures. Although the studies were performed within the framework of SUSY, the techniques used can 
be applied to generic searches for physics beyond the Standard Model. 
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Abstract 

We present the discovery potential of a heavy new resonance decaying into a 
pair of leptons with early LHC data with the ATLAS detector. The dilepton 
final states are robust channels to analyze because of the simplicity of the event 
topology. The unprecedented available center-of-mass energy will allow one 
to probe regions that are inaccessible at previous experiments even with mod- 
est amounts of data. After studying the Standard Model predictions and the 
associated uncertainties one can then look for significant deviations as indica- 
tion of beyond the Standard Model physics (BSM). The focus of the note is 
to study the prospects for discovering BSM physics in the dilepton final states 
with an integrated luminosity ranging from 100 pb _1 to 10 fb _1 . 

1 Introduction 

New heavy states forming a narrow resonance decaying into opposite sign dileptons are predicted in 
many extensions of the Standard Model: grand unified theories, Technicolor, little Higgs models, and 
models including extra dimensions [1-4]. The discovery of a new heavy resonance would open a new era 
in our understanding of elementary particles and their interactions. Because of the historic importance 
of the dilepton channel as a discovery channel and the simplicity of the final state, these channels will 
be very important to study with early ATLAS data. The strictest direct limits on the existence of heavy 
neutral particles are from direct searches at the Tevatron [5-7]; the highest excluded mass is currently 
almost 1 TeV. The LHC will have a center-of-mass energy of 14 TeV which should ultimately increase 
the search reach for new heavy particles to the 5 - 6 TeV range. Many exotic models can be tested at the 
LHC, but analyzing all the existing models is impossible. Instead we choose to take a different approach, 
grouping the early-data analysis by their final state topologies. There have been several other ATLAS 
studies evaluating the potential for discovery of a heavy resonance [8]. However, this is the first study 
to include full trigger simulation, misalignments, and data driven methods. Including these experimental 
issues is important to realistically estimate the analysis potential. We focus on the early data phase of the 
experiment, defined roughly to include the accumulation of up to 10 fb -1 of ATLAS data. 

In the remaining of this introduction, the investigated models are reviewed. In sections 2 and 3, 
we explore the detector performance concerning the electron, muon and tau reconstruction abilities at 
high energies and the corresponding trigger efficiencies. In section 4 we investigate the Standard Model 
predictions and associated uncertainties, as well as the signal cross-section. In section 5, we proceed to 
search for Exotic resonances. 

1.1 Models Predicting a Z' 

Several models [1,3] predict the existence of additional neutral gauge bosons. In particular, grand unified 
theories, as well as "little Higgs" models, predict their existence as a manifestation of an extended sym- 
metry group. Generically, there are no predictions for the mass of these particles. Since the experimental 
consequences are very similar in the dilepton final state, we examine only some representative models: 
the Sequential Standard Model (SSM) 1 , the and the Left-Right Symmetric models [9]. The partial 
width of the Z' boson is given by T(Z' — >• « [(gf ) 2 + (gf) 2 ] ^ where gf and g\ are the right and 

left handed couplings of the charged leptons to the Z' boson and m z < is the mass of the Z' boson. For 

'The Sequential Standard Model includes a new heavy gauge boson with exactly the same couplings to the quarks and 
leptons as the Standard Model Z boson. 
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Z' Model 


Indirect Searches (GeV) 


Direct Searches (GeV) 




Colliders 


p + p Colliders 


4 


680 


781 




864 




481 


366 




853 


5 


619 


515 




933 




804 


518 






Z SSM 


1787 


1018 




966 



Table 1: 95% C.L. limits on various Z' models. 



the masses and couplings considered here the natural width is typically around 1% of the mass of the 
resonance. 

The strictest limits from direct searches come from the DO and CDF experiments at the Tevatron [5- 
7]. Indirect searches have also been undertaken by the LEP experiments [10]. The direct limits range 
from several hundred GeV to approximately 1 TeV and are shown in Table 1. These limits are not 
expected to improve much beyond 1 TeV [11]. It should be noted that for models where the Z' couples 
preferentially to the third generation the limits are lower, therefore we consider it important to look at a 
lower invariant mass region in this channel. 

1.2 Randall- Sundrum Graviton 

The Randall-Sundrum model [4] addresses the hierarchy problem by adding one extra-dimension linking 
two branes, the Standard Model brane and the Planck brane. The hierarchy is solved by assuming for the 
fifth dimension a warped geometry in which the size of the ordinary coordinates decreases exponentially 
from the Planck scale to the TeV scale. The Randall-Sundrum model predicts the existence of a tower 
of Kaluza- Klein excitations of the graviton. These should be observable as resonances which decay into 
lepton pairs at the LHC. The current limits depend on the parameters of the model, and range from several 
hundred GeV to one TeV [5]. We consider the observability of a Randall-Sundrum graviton decaying into 
electron pairs. The width of the graviton resonance would be very small. For the parameters considered 
here it ranges from 10~ 4 to a few 10~ 3 times the mass. 

1.3 Technicolor 

Strongly interacting theories, like Technicolor and Extended Technicolor, provide a dynamical solution 
to the problem of Electroweak Symmetry Breaking. Many new technifermions which are bound together 
by a QCD-like force are predicted. One of the most promising search channels is the dilepton decay 
of the pre and (Ore- We study the "Technicolor Strawman Model" or TCSM [12, 13] as a benchmark 
model for generic strongly interacting theories. The most stringent limits on technihadrons in the TCSM 
framework come from the CDF collaboration, who rules out pre and (Ore with masses below 280 GeV 
for a particular choice of the TCSM parameters [14]. The width of the techni-mesons depends on the 
number of technicolors, but is generally assumed to be small, of the order of a few percent of their mass. 
More details on the exact values of the parameters considered are discussed in a later section. 

2 Object Identification and Performance 

This section describes the requirements used to select objects for the analyses and summarizes findings on 
the performance using Monte Carlo simulations of the production and decay of new dilepton resonances. 
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2.1 Electron Identification 

The electron identification and performance is described in detail elsewhere [15]; here we summarize 
the results concerning very high transverse momentum 2 (pr)- electron identification and reconstruction. 
The background to very high pj electron pairs is expected to be low, therefore only minimal selection 
criteria need to be applied, in order to maximize efficiency. These minimal criteria are called loose. On 
the other hand, when trying to select very high p T % lepton pairs, where one % decays hadronically, a 
tighter selection on the electron from the other z decay is needed. 

On top of the minimal requirements that the reconstructed clusters should have an absolute pseudo- 
rapidity (77) less than 2.5 and should be associated with a track reconstructed in the inner detector, two 
electron selections were studied (both described in detail in [15]): 

• A loose selection based on hadronic leakage and shower shape variables. This selection achieves 
very high efficiency while maintaining rejection against highly energetic pions with wide showers. 

• A medium selection, which makes further requirements to obtain better rejection against n° — > 77 
background by exploiting the very fine granularity of the first compartment of the electromagnetic 
calorimeter, and tighter requirements on the associated track. 

Figure 1 shows the reconstruction efficiency together with the efficiency of the two selections in a 
sample Z' —> e + e~ events with mz> = 1 TeV as a function of transverse momentum and pseudo-rapidity, 
normalized to truth electrons with pj greater than 50 GeV and | Tj | smaller than 2.5. The efficiency of the 
reconstruction is dominated by the efficiency of the cluster to track association and is on average slightly 
below 80%. It must be noted that this efficiency has improved in the more recent software version used 
in [15]. The loose selection criteria have a relative efficiency close to 1, whereas the medium selection 
leads to an overall average efficiency between 65% and 70%. 

The energy resolution for electrons at high pj is about 1 % except in the crack region between the 
forward and central calorimeters where the resolution is about 5% . The probability to assign the wrong 
charge to an electron ranges from 1% to at most 5% as the transverse momentum goes from 100 GeV to 
1 TeV [16]. For a 1 TeV Z' a dielectron mass resolution of (0.80 ±0.02)% is obtained. 

2.2 Muon Identification 

Here we discuss the requirements used to select muons, as well as a method to extract the identification 
efficiency from data. The ATLAS detector has an excellent standalone muon spectrometer: muon tracks 
can be found both in the inner detector and the muon spectrometer. A "combined" muon track consists 
in matched tracks from both the muon spectrometer and inner detector. We require that a muon, with 
p T > 30 GeV, 

• forms a combined track (inner detector and muon spectrometer) with \rf\ < 2.5, 

• has a match % 2 < 100.0 (5 D.O.F) between the parameters of the inner detector and muon spec- 
trometer tracks. 

The muons in the 1 TeV Z' sample have a most probable p T of about 500 GeV. An efficiency of (95 ± 
0.2)% with a resolution of approximately 5% is found with this selection. The results are consistent with 
previous studies [17, 18]. 

The muon identification efficiency as a function of pj has been determined using two methods. The 
first method is the 'tag and probe' method, which has been used successfully at the Tevatron. In this 
method one uses a 'standard candle' as an in situ calibration point. It involves selecting Z — >• \x\x events 

2 The transverse momentum is denned as the momentum projected on the plane transverse to the beam axis. 
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Figure 1: Efficiency of the loose and medium selection criteria in Z' — » e + e~ events with m z i — 1 TeV as 
a function of p T (left) and rj (right). The reconstruction efficiency is normalized to truth electrons inside the 
geometrical acceptance |r/| < 2.5 and with pj > 50 GeV. 



and evaluating the reconstruction efficiency from data on these events. One combined muon is used as 
the tag while an inner detector track is used as a probe track. One can then study how often the probe 
muon also has a combined track to get an unbiased measurement of the combined muon reconstruction 
efficiency. The reconstruction efficiency was measured by fitting to the dimuon invariant mass spectrum 
and finding the fraction of events where the probe track was found as a combined track. A comparison 
between this tag and probe method and Monte Carlo truth is shown in Fig. 2. The Monte Carlo truth 
efficiency is determined by counting the number of generated muons with successfully reconstructed 
inner detector tracks that also have a combined muon track. This study demonstrates that we should be 
able to use this method to extrapolate into the very high pr range. 

2.3 Tau Identification 

The algorithm to reconstruct hadronically decaying x lepton candidates is described in [19]. It is 
calorimeter based; it starts from a reconstructed cluster with a transverse energy 3 Ej > 15 GeV and 
then builds identification variables based on information both from the electromagnetic and hadronic 
calorimeters, as well as the inner tracker. Finally, an electron and a muon veto are applied, which means 
that hadronically decaying x candidates which are matched (which "overlap") with an identified electron 
or muon are removed. 

The reconstruction efficiency, defined as the probability of a true hadronically decaying x to be 
reconstructed as a cluster, and normalized to all true hadronically decaying x leptons with Ej >15 GeV 
inside the tj acceptance, is flat as a function of rj and (j). The average efficiencies are summarized in 
Table 2. Efficiencies for electron and muon vetoes are given with respect to all reconstructed x leptons. 

A likelihood is computed for each x candidate. The x likelihood combines information from the 
calorimeter describing the shower shape and tracking information in a multivariate likelihood to maxi- 
mize the discrimination from background. Detailed studies were done to optimize the x lepton efficiency 

3 The transverse energy is defined as the energy multiplied by sin£*, where 6 is the angle between the beam axis and the 
direction from the interaction point to the cluster. 
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Figure 2: Efficiency of muon reconstruction and identification as a function of pj from two different 
methods (see text). 



Events in tj < 2.5 

Events in 7] < 2.5 AND E T >15 GeV 


(87.1 ±0.1)% 
(85.6 ± 0.2) % 


Reconstruction 
Electron veto 
Muon veto 


(98.8 ±0.1)% 
(99.3 ±0.1)% 
(99.9 ± 0.0) % 



Table 2: Reconstruction efficiency, efficiency of e//i-T-jet overlap removal for hadronically decaying % 
leptons from Z' boson decays. The efficiencies for kinematic requirements are also given. 
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Requirement 


Efficiency (%) 


E T > 60 GeV 

AND l<N trk <3 

AND likelihood requirement 


89.8 ± 0.2 
79.2 ± 0.3 
51.0 ±0.3 



Table 3: Preselection and identification efficiency for Z' — ► TT (m — 600 GeV). Efficiency is given with respect to 
reconstructed hadronically decaying T leptons (after removal of overlap with electrons or muons). 



and jet rejection for the Z' boson search. The result (shown in Table 3) was to impose a /^-dependent 
likelihood requirement, a requirement on the number of tracks, and a requirement on the transverse 
energy. 

3 Trigger 

The aim of the trigger system is to reduce the rate of events flowing through the data acquisition to 200 
Hz while maintaining a highly efficient selection for rare signal processes. Even at the initial luminosity 
of ££ = 10 31 cm~ 2 s~ l , it will be a challenge to keep the trigger highly efficient for all important final 
states. Several detailed trigger studies were undertaken for the dilepton final state. In this section we 
summarize those results. 

3.1 Electron Triggers 

There are several proposed triggers which in principle can be used for the dielectron analysis. We studied 
four triggers: e55 - requiring one electron with pj > 60 GeV, e22i - requiring one isolated electron with 
p T > 25 GeV, 2el2 - requiring two electrons with p T > 15 GeV, and 2el2i - requiring two isolated 
electrons with p T > 15 GeV. 

Table 4 shows the efficiency at the three ATLAS trigger levels [20]: level 1 (LI), level 2 (L2), and 
the event filter (EF) for a sample of graviton events. As can be seen in this table, the most efficient 
triggers are the high p T triggers that do not require isolation. The low p T triggers (2el2, 2el2i) will 
not be considered any further. 



Signature 


Efficiency (L1/L2/EF) (%) 


Total Trigger Efficiency (%) 


e55 


99.9 ±0.0 


95.9 ±0.2 


94.6 ±0.3 


90.8 ±0.3 


e22i 


85.9 ±0.3 


96.4 ±0.4 


83.9±0.3 


80.9 ±0.4 


2el2 


99.9±0.1 


84.9 ±0.5 


85.5 ±0.3 


72.6±0.6 


2el2i 


59.1 ±0.7 


86.1 ±0.7 


86.2±0.3 


43.9±0.7 



Table 4: Trigger level efficiencies on G^e + e (m = 500GeV) events with respect to loose electron offline selec- 
tion. The last column shows the overall trigger efficiency after all levels. 



3.2 Muon Triggers 

For the dimuon channel we investigated the trigger efficiency for dimuon events using the single muon 
20 GeV pr trigger mu20 [20] . Results for various signal samples are shown in Table 5. Detailed studies 
on ways to estimate the trigger efficiency were carried out and presented in [21]. It was found that a tag 
and probe method, similar to the method described for the offline muon reconstruction, could be used to 
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Sample 


mu20 Efficiency (L1/L2/EF) (%) 


Total Trigger Efficiency (%) 


m = 400GeV p T /(Or 


97.6 ±0.1 


98.8 ±0.1 


99.5 ±0.1 


96.0 ±0.1 


m = 600 GeV p T /(0 T 


98.1 ±0.1 


98.5 ±0.1 


99.2 ±0.1 


95.9 ±0.1 


m = 800 GeV p T /(Or 


97.6 ±0.1 


98.7 ±0.1 


99.2 ±0.1 


95.6 ±0.1 


m= 1 TeV Pt/(O t 


97.6 ±0.1 


98.7 ±0.1 


99.2 ±0.1 


95.6 ±0.1 


m = 1 TeV Z' x 


97.8 ±0.1 


98.9 ±0.1 


99.5 ± 0.0 


96.3 ±0.1 


m = 2TeVZ' SSM 


97.6 ±0.1 


98.7 ±0.1 


98.9 ±0.1 


95.3 ± 0.2 



Table 5: Simulated trigger level efficiencies of dimuon resonance samples with respect to offline selec- 
tion. The last column shows the overall trigger efficiency after all levels. 



extrapolate Z — > results to high pj. In addition, efficiencies were obtained using orthogonal triggers, 
giving a sample which was minimally biased with respect to the muon triggers. It was found in [21] that 
these estimates agreed with both the tag and probe method and the results from the simulated samples 
shown here. As can be seen from Table 5 the single muon triggers are highly efficient for any of our 
signal samples with a total trigger efficiency around 95%. 

3.3 Triggers for Taus 

The T lepton decays to hadronic states in 65% of the cases, and the rest of the time to lighter leptons (e or 
ji). In our studies of ditau final states we select events triggered with a single lepton (e/jj.) trigger. Thus, 
we consider two true final states, which we denote e%h and /XT/,. 

For the e%h channel we consider two triggers: e22i and e55, as studied in section 3.1. The \iXh 
events are selected using the mu20 trigger already used in section 3.2. Note that the efficiencies shown 
here are lower than for the dimuon or dielectron channel. This arises because there is only one electron or 
muon in the final state considered here while in the dielectron or dimuon final state either electron/muon 
can satisfy trigger requirements. Table 6 summarizes the trigger efficiencies. 



Signature 


Efficiency (L1/L2/EF) (%) 


Total Trigger Efficiency (%) 


e22i 


85.2 ±0.5 


89.8 ±0.4 


90.2 ±0.3 


69.1 ±0.6 


e55 


90.0 ±0.3 


74.2 ±0.4 


78.7 ±0.6 


52.6 ±0.8 


e22i or e55 


96.7 ±0.1 


88.7 ±0.4 


88.9 ±0.3 


75.5 ±0.5 


mu20 


79.8 ±0.6 


90.7 ±0.4 


97.5 ±0.4 


70.6 ±0.5 



Table 6: Trigger level efficiencies for different triggers for z leptons from m = 600 GeV Z' bosons 
decaying to ez h and jix h final states with respect to offline selection. The last column shows the overall 
trigger efficiency after all levels. 

4 Standard Model Predictions and Other Sources of Systematic Uncer- 
tainties 

In this section, we investigate the main background sources and we show the dominance of the neutral 
Drell-Yan process. Then we investigate the uncertainties in the Standard Model predictions for Drell-Yan 
production, as well as for an extra neutral gauge boson. Finally, we discuss the experimental sources of 
uncertainties, including a dedicated study of the effect of the muon spectrometer alignment. 
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4.1 Background Sources 

The neutral Drell-Yan (DY) process constitutes the irreducible background in the search for new heavy 
dilepton resonances. 

The dielectron reducible background results from events in which one or two electrons come from 
the jet— ^electron or photon^electron contamination. In addition, true isolated electrons can produced 
by W — ► ev or Z — ► ee decays. By combining these effects in the dielectron case, one can list the re- 
ducible background sources: inclusive jets, W+jets, VK+photon, Z+jets, Z+photon, photon+jet and pho- 
ton+photon. For a first estimation of these backgrounds, we have used the event generator Pythia [22] 
to compute the differential cross-sections as a function of the invariant mass of the object pair. The re- 
sults are shown on the left of Fig. 3. The neutral Drell-Yan process has a much lower cross-section than 
most of the backgrounds. For each electron-candidate leg originating from a jet (photon), we then apply 
a rejection factor 4 of R e -j et = 4 x 10 3 (R e - r = 10) [15]. We apply an additional requirement to take into 
account the geometrical acceptance in which the electrons are identified, i.e. |t]| < 2.5, and require at 
least one object with pj > 65 GeV. The resulting differential cross-sections are shown on the right of 
Fig. 3. One can see that each contribution represents at most 25% of the neutral Drell-Yan process. The 
sum of all contributions does not exceed 30%. Both the transverse momentum and the rapidity require- 
ments play an important role in reducing the QCD-jet background because it is produced mainly in the 
r-channel resulting in jets with high rapidities. Further reduction of these backgrounds may be obtained 
by requiring opposite electric charges. 

The WW, WZ, ZZ and top pair processes can also produce two opposite sign electrons. Whereas 
the cross-sections of the diboson processes are of the same order as the smallest backgrounds above, 
the top pair cross-section is not negligible. As the topology of the events is not as simple as in the 
above backgrounds, no conclusion can be drawn without a full simulation study. Using a sample of fully 
leptonic and semi-leptonic tt events, it was checked that the tt background was of the order of 10% of 
the Drell-Yan contribution for di-object masses above 500 GeV after applying electron identification and 
the same rejection factor as above to the most energetic jet. 




M (GeV) M (GeV) 



Figure 3: Background contribution to the e + e invariant mass spectrum: before selection requirements (left) and 
after selection requirements (right). 



4 This number is given for the medium selection while the loose selection was used here. However, this corresponds to an 
efficiency of (80.6 ±0.2)%, which is higher than our average efficiency using loose criteria, due to the better performance of 
the more recent release used in [15]. 
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The rejection factors /x-jet and ji -photon are higher than the ones corresponding to the electrons and 
the resulting reducible backgrounds are lower. 

All these assumptions will have to be checked with the real data. One can use for instance electron- 
muon or same charge samples, in which no signal is expected. 

In the following, only the neutral Drell-Yan is considered as a source of background in the dilepton 
channel. The ditau case is treated later. 

4.2 Controlling the Dilepton Cross-Section 

The background estimations in the last section were performed with the Pythia event generator which 
uses tree-level calculations of the cross-sections. The tree-level dilepton cross-sections are subject to 
large higher order electroweak and QCD corrections. These are known at least to next-to-leading order 
(NLO) of perturbation theory, not only for the Standard Model Drell-Yan process, but also for a number 
of new physics processes. They have the additional benefit of reducing the uncertainty induced by the a 
priori unknown renormalization and factorization scales IXr^f- In the following, we discuss in detail the 
various known radiative corrections and the remaining theoretical uncertainties, focusing on the Standard 
Model Drell-Yan process and the corrections to the tree-level cross-section. 

4.2.1 NLO Electroweak Corrections 

The electroweak corrections to the Drell-Yan process are known to NLO in the fine-structure constant 
a [23,24]. Initial-state photon radiation must be factorized into the partem density functions (PDFs), 
which in principle modifies the DGLAP evolution of quarks and gluons, but has in practice little effect 
on the quality of the global fit [25]. Only at very large x and can the correction become of the order 
of 1%. Multiple initial-state photon emission can also be resummed, leading to a 0.3% modification of 
the cross-section [26], or matched to partem showers [27]. The remaining initial-state QED contributions 
are also small, whereas the photon radiation emitted by the final state leptons can have a significant 
impact on their mass (M) and transverse momentum spectra as well as the forward-backward asymmetry 
A FB [28]. 

In the vector-boson resonance region(s) these and the universal parts of the weak corrections, which 
can amount to +80 (+40) % for muon (electron) pairs below and —18 (—10) % above the resonance, 
can be taken into account by using a running value of a(M 2 ) or, more generally, effective vector and 
axial vector couplings in the Effective Born Approximation. The corrections are then reduced to +6 
(+2) % for muon (electron) pairs below and +1 (< +1) % above the resonance. While the presence of 
new physics can modify the running of the weak parameters, the QED corrections remain unaffected. 

The electroweak corrections coming from non-factorisable box diagrams with double-boson ex- 
change are small in the Z (and Z') resonance region(s), but they can be quite large away from these 
resonances (-4 to -16 % for electron pairs, -12 to -38 % for muon pairs of invariant mass 300 GeV 
to 2 TeV at the LHC, see Fig. 4). 

4.2.2 NLO QCD Corrections 

The QCD corrections to the Standard Model Drell-Yan process are known at NLO [29] and next-to- 
next-to-leading order (NNLO) [30,31] in the strong coupling constant a s . The latter include in principle 
non-factorisable corrections through qq and gg initial states, which remain, however, smaller than 1 % in 
practice, even at small values of x, where the gluon density is large. The effects of multiple soft-gluon 
radiation have been resummed simultaneously in the low-p T and high-mass (above 500 GeV) regions 
at next-to-leading logarithmic (NLL) accuracy not only for Standard Model Z-bosons [32], but also for 
Z' bosons [33]. They were shown to be in good agreement with the NNLO result as well as the one 
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Figure 4: NLO electroweak corrections in the high-mass region for Standard Model electron and muon 
pair production at the LHC [23]. In the presence of a new resonance, these relative corrections would be 
largely reduced (see text). 



obtained by matching NLO QCD to parton showers in MC@NLO [33, 34]. In contrast, the match- 
ing of tree-level matrix elements to parton showers in Pythia [22] requires the ad hoc application of 
a (slightly) mass-dependent correction (K) factor and leads to an unsatisfactory description of the pr 
spectrum. For resonant spin-2 graviton production, which involves not only color-triplet quark, but also 
color-octet gluon initial states, the NLO QCD corrections are substantially larger (K ~ 1.6) [35,36] than 
those for Standard Model or extra neutral gauge bosons (K ~ 1.26, see Fig. 5). In this case, the matching 
of matrix elements to parton showers has only been performed at the tree-level [37] , and resummation 
has only been performed in the low-pr region [36]. 

While the NLO total cross-sections for vector bosons and gravitons still change substantially when 
the renormalization and factorization scales are varied simultaneously around the resonance mass M by 
a factor of two (±9%) [33,36], the scale uncertainty is reduced to the percent-level at NNLO [30,31] or, 
alternatively, to +6 and -3 % after joint resummation at the NLL order [33]. 

The theoretical uncertainty coming from different parameterizations of parton densities is estimated 
in Fig. 5 [33] for invariant masses above 500 GeV. Since the invariant mass of the lepton pair is corre- 
lated with the momentum fractions of the partons in the external protons, the normalized mass spectra 
(left) are indicative of the different shapes of the quark and gluon densities in the CTEQ6M 5 parameter- 
ization [38]. The latter also influence the transverse-momentum spectra (right). The shaded bands show 
the uncertainty induced by variations, added in quadrature, along the 20 independent directions that span 
the 90% confidence level of the data sets entering the CTEQ6 global fit [39]. With about ±5% at 1 TeV 
(±11% at 3 TeV), the PDF uncertainty is slightly larger than the scale uncertainty [33,40]. 



5 CTEQ collaboration has recently proposed new sets of PDFs. Using them, both the central values and uncertainties may 
change by several percents. 
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2500 3000 0 10 20 30 40 50 60 70 

M (GeV) p T (GeV) 



Figure 5: Mass (left) and transverse-momentum (right) spectra after matching the NLO QCD corrections 
to joint resummation with CTEQ6M parton densities. The mass spectra have been normalized to the LO 
QCD prediction using CTEQ6L parton densities. The shaded bands indicate the deviations allowed by 
the up and down variations along the 20 independent directions that span the 90% confidence level of the 
data sets entering the CTEQ6 global fit. 

The uncertainty at low transverse momenta coming from non-perturbative effects in the PDFs is usu- 
ally parameterized with a Gaussian form factor describing the intrinsic transverse momentum of partons 
in the proton. Three different parameterizations of this form factor have been proposed [41-43]. In 
all three cases the transverse-momentum distribution is changed by less than +3 and —6 % for pj > 
5 GeV [33]. 

Combining the three contributions (from the scales, the PDFs and the non-perturbative form factor), 
the total theoretical QCD uncertainty is ±8.5% at 1 TeV, ±14% at 3 TeV. It must be noted that these 
uncertainties are common to the signal (heavy resonance) and background (Standard Model Drell-Yan). 



4.3 Effect of Muon Spectrometer Misalignment 

At large pj (> 100 GeV), an important contribution to the muon momentum resolution is the alignment 
of the muon spectrometer. In the early data period, the resolution is expected to be dominated by the 
alignment. The ultimate goal of the alignment system is to determine the position of the chambers in the 
muon spectrometer to about 40 \im and o rot {mrad) = 0.5a, ra „ s (mm). 

A detailed study was carried out in order to determine the effect of possible larger uncertainties in the 
position of the chambers to the Z' search. For the analysis, in addition to the ideal case of no misalignment 
at all, we have chosen 7 different hypotheses of misalignment: (40jxm,20jj,rad), corresponding to the 
target value of the alignment system, (100 \lm, 50 \lrad), (200 \im, 100 pirad,), (300 \lm, 150 \lrad), 
(500 jum, 250 pirad), (700 \lm, 350 \lrad) and (1000 \lm, 500 \lrad). In the last two cases, the alignment 
resolution is of the same order or greater than the track sagitta we want to measure. 

As shown in Fig. 6, the dominant effect leading to a wash out of the signal is the resolution loss 6 . The 

6 The slight excess of events around 600 GeV is due to mis-reconstructed Z bosons. In later versions of the software, this 
effect is not present anymore. 
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loss of resolution due to misalignment will deteriorate our ability to determine the charge of the muon. 
This was also studied as a function of the misalignment and is summarized in Table 7. 



Drell-Yan invariant it 
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Figure 6: Left: reconstructed invariant mass distribution of Drell-Yan events for different misalignment 
hypotheses. The numbers corresponds to an integrated luminosity of 100 pb _1 . Right: reconstructed 
invariant mass of the Z' model for the seven misalignment scenarios. 



Misalignment (/im) 


Ideal 


40 


100 200 


300 


500 


700 


1000 


Relative efficiency 


0.984 


0.984 


0.984 0.98 


0.973 


0.948 


0.918 


0.877 



Table 7: Loss in signal efficiency due to the charge misidentification for seven misalignment hypotheses. 



4.4 Other Systematic Uncertainties 

Additional experimental systematic uncertainties must be taken into account, listed as follows: 

• the uncertainty in the efficiency of object identification was assumed to be 5% for muons, 1% for 
electrons, and 5% for T leptons; 

• the uncertainty in the energy scale was assumed to be 1% for muons, 1% for electrons, and 5% for 
T leptons; 



L 00.00017 for muons, 



• the uncertainty in the resolution of the objects is as follows: o{-^ 
20 % for electrons, and 45% for % leptons. 

• the uncertainty in the luminosity was assumed to be 20% with an integrated luminosity of 100 pb _1 
of data and 3% for 10 fb" 1 . 

The effect of all the above on the discovery potential is discussed in the next section for individual 
channels. 
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5 Search for Exotic Physics 

In this section we present the discovery potential for several resonant signatures in the early running of 
ATLAS. We focus on the reach with an integrated luminosity of up to 10 fb _1 of data. 

The statistical significance of an expected signal can be evaluated in several ways. The simplest 
approach, "number counting" is based on the expected rate of events for the signal and background 
processes. From these rates, and assuming Poisson statistics, one can determine the probability that 
background fluctuations produce a signal-like result according to some estimator; e.g. the likelihood 
ratio. In the "shape analysis" approach, a detailed knowledge of the expected spectrum of the signal and 
background for one observable (like the invariant mass distribution for example) can be used to improve 
the sensitivity of the search by treating each mass bin as an independent search channel, and combining 
them accordingly. 

The resulting sensitivity is in general higher in the shape analysis than the estimation given in the 
number counting approach. In the shape analysis, the data is fitted or compared to two models: a 
background-only model and a signal-plus-background model. These are also called "null hypothesis", 
noted HO and "test hypothesis", noted HI, respectively. The input signal and background shapes are 
given to the fitting algorithms either as histograms in the non-parameterized approach [44] or as func- 
tions in the parameterized approach. For each of the models, a likelihood or a x 2 distribution is computed 
and the log of the ratio of the two likelihoods (LLR) or the difference of two # 2 s are estimated and used 
to compute the confidence levels. Either CL h = CL H0 alone, or CL S = CL m jCL m (in the "modified 
frequentist approach" [44]) can then be used to compute the significance S: 

S=V2xErf-\l-CL h ) or S = V2 x Erf~ \l - ^-) (1) 

in the double tail convention 7 . 

A convenient way to compute the LLR is to use the Fast Fourier Transform (FFT) method presented 
in [45]. The advantage of this method is that it does not require the generation of millions of pseudo- 
experiments needed for high significances and which can be time consuming. The sources of systematic 
uncertainties can then be incorporated as nuisance parameters. 

The above methods have been used to investigate the discovery potential of the Z' boson in the 
dilepton (e, jj,, t) channels, of the graviton in the dielectron channel, and of Technicolor in the dimuon 
channel. This is presented in the following sections. 

5.1 Background Estimation 

As discussed in the previous section, neutral Drell-Yan production of lepton pairs is expected to be the 
dominant background for all the analyses (but T + T~) and other contribution will be neglected here. Since 
different techniques are used to estimate the signal significance we also treat the Drell-Yan background 
in a few different but entirely consistent ways: 

• In the "number counting" approach, we simply count the expected number of events under the 
resonance peak from various background sources, including the Drell-Yan process. 

• In the non-parameterized CL S method, we use the number and shape of the mass distribution by 
producing a histogram for the background. 

• For several analyses, we perform a fit to the Drell-Yan background parameterizing the shape which 
allows to estimate the number of background events and extrapolate it to higher masses. 

7 In this convention, 1 - CL h has to be lower than 2.87 x 10~ 7 to correspond to a 5a significance. 
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Each of these methods produces a complementary and consistent approach to estimating the main 
background. When the Drell-Yan fit is needed, we parametrize the shape of the background by the for- 
mula ae~ hM ' \ where M is the invariant mass of the lepton pair and a,b,c are parameters of the fit. Fits to 
the Drell-Yan spectrum presented in section 4.2 suggest that the parameterization exp(— 2.2M 03 ) used 
by [46] describes the background shape well. It is this one which is used in the Z' — > fiji, G — > ee and 
technicolor analyses. In the Z' — > ee analysis, these parameters are allowed to vary in the individual en- 
semble tests. The fit to the entire spectrum letting all parameters float is consistent with this prescription. 

5.2 Z' — > ee Using a Parameterized Fit Approach 
5.2.1 Event Selection 

The selection of events with two electrons coming from a Z' has been studied in samples of fully simu- 
lated Z' x —> e + e~ events with Z' boson masses of 1 , 2 and 3 TeV, corresponding respectively to integrated 
luminosities of 21 fb _1 , 204 ftr 1 and 2392 fb _1 . 

The first requirement is that the two highest pj clusters in the event be in the geometrical acceptance. 
The next requirement is that these clusters be associated with a track; its efficiency is 67% at 1 TeV and 
decreases for higher masses. The third requirement is that these two reconstructed electron candidates 
be identified as loose electrons. The relative efficiency of such a selection is at least 94% and increases 
with invariant mass. The trigger studies have been normalized to events with two loose electrons. As 
shown in section 3.1, the highest trigger efficiency is obtained with a non-isolated single electron trigger 
(e55). Its efficiency is 90.8% per event. The last requirement is that the two electrons have opposite 
electric charges. The requirement flow is presented in Table 8, where the events are counted in a window 
of ±4 Tz> around the center of the resonance. Although the opposite charge requirement is optional in 
the absence of a large background, especially at very high invariant mass, it allows to have a control 
sample (made of same sign dielectrons) for the background. The resulting overall efficiency is 48% at 
m = 1 TeV, 42% at m = 2 TeV and about 34% at m = 3 TeV. 



Selection 


Signal 


DY 


Signal 


DY 


Signal 


DY 




at 1 TeV 


at 1 TeV 


at 2 TeV 


at 2 TeV 


at 3 TeV 


at 3 TeV 




347. 


3.56 


14.7 


0.16 


1.22 


0.015 


2 generated e ± , \r\\ < 2.5 


299. 


3.07 


13.7 


0.15 


1.16 


0.013 


2 clusters with a track 


201. 


2.06 


8.0 


0.09 


0.62 


0.009 


2 loose electrons 


190. 


1.96 


7.2 


0.08 


0.52 


0.008 


At least one p T > 65 GeV 


190. 


1.96 


7.2 


0.08 


0.52 


0.008 


Event triggered 


173. 


1.77 


6.6 


0.07 


0.47 


0.007 


2 opposite charges 


166. 


1.70 


6.2 


0.07 


0.43 


0.007 



Table 8: Requirement flow table for the Z' — »■ e + e analysis: cross-sections in fb. The events are counted 
in a window of ±4 F Z ' around the resonance. 

The above efficiencies, normalized to events in the geometrical acceptance (|tj| < 2.5), are shown in 
Fig. 7 (left) as a function of the invariant mass of the electrons. They do not depend on the model used 
to generate the Z' samples. Only the requirement that the two electrons be in the geometrical acceptance 
depend on the model. Indeed, the relative proportions of initial quark flavors depend on the couplings of 
the Z' to the quarks. The PDF of the up quarks being harder than that of the down quarks, Z' produced 
by a uu pair tend to be slightly more boosted, and therefore the electrons stemming from their decay 
tend to be produced at slightly higher pseudo-rapidities. This effect is visible in Fig. 7 (right) showing 
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the efficiency of the | T] | selection for uu and dd events separately, and for a number of benchmark Z' 
models: the Sequential Standard Model (Z' SSM ), the E6 models Z' v , Z' x , Z'^, and the left-right symmetric 
model (Z' LR ). It is therefore possible to generalize the efficiencies that have been measured in the fully 
simulated samples to models which haven't been simulated as well as to intermediate masses. 

5.2.2 Discovery Potential 

Modeling of the dilepton invariant mass spectrum. In order to compute the significance for several 
Z' models, a parameterization of the mass spectrum of the signal and of the background has been used. 
The differential cross-section can be factorized with a good precision in a parton-level term ^ and a 
PDF-dependent term Gpcftra): 

^(m) = ^(m)xG PDF (m) (2) 
dm dm 

Using this factorization, one can write: 



da_\ 
dm | , 



do\ , x 1 „ , . 

(m) = —^xGp DF {m) 



< G PDF (m) (3) 



u 1 1 Signal 

r 2 , m 2 

m\, (m 2 -m 2 ,) 2 + m 2 ,r 2 , 
Fy, m 2 — ml, . x 

+ Mnterf X ~\ — 2 2 2 X G PDP (m) 

m L z , (m z -m z ,) z + m|,r|, 

where iSpeak is the amplitude of the Z' process and ^ nte rf is the amplitude of the interference Z'/Z and 
Z' 1 7, both normalized to the Drell-Yan process. This parameterization only depends on four parameters: 
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mz', Tz<, s^peak and Mnteri- The differential cross-section is then multiplied by the appropriate ^-factor 
(see section 4.2). The detector performance is accounted for as follows: the differential cross-section 
is multiplied by the efficiency computed above and convoluted by the invariant mass resolution (see 
section 2.1). The agreement between this parameterization and the full simulation is shown in Fig. 8 
(left) for a Z' x at 1 TeV. 



Significance = 6.45 





Figure 8: Left: mass spectrum for a m = 1 TeV Z' x — > e + e~ obtained with ATLAS full simulation (histogram) 
and the parameterization (solid line). The dashed line corresponds to the parameterization of the Drell-Yan process 
(irreducible background). Right: Log-likelihood ratio densities with 1 fb _1 for am = 2 TeV Z' % for the signal and 
background hypotheses. The vertical line is the median experiment in the HI hypothesis. 



Results Using the parameterization presented above to generate mass spectra for signal (y/Z/Z' — >• 
e + e~) and background (y/Z — > e + e~), one can compute the distributions of the log-likelihood ratio of 
the signal (HI) and background (HO) hypotheses. 

Figure 8 (right) shows the LLR distributions obtained for a 2 TeV Z' x with 1 fb~ 1 as well as the median 
signal experiment used to calculate CL S . The FFT method [45] was used in the computation of the LLR 
distributions. It is important to note that the mass window used to perform the analysis does not affect 
the result. 

Figure 9 (left) shows the integrated luminosity needed for a 5a discovery of the usual benchmark 
Z 1 models as a function of the Z 1 mass. Only statistical uncertainties were taken into account. The 
systematic uncertainties are discussed in the next paragraph. A fixed mass window of [500 GeV — 4 TeV] 
was used to compute the significance. Roughly speaking, less than 100 pb _1 are needed to discover a 

1 TeV Z', about 1 fb _1 are needed to discover a 2 TeV Z' , and about 10 fb _1 are needed to discover a 
3 TeV Z'. 

Systematic Uncertainties The sources of systematic uncertainties were listed in section 4. Since the 
main background is the Drell-Yan process, the systematic uncertainties from both the efficiencies and 
the theoretical predictions on the cross-section will affect the number of signal and background events 
in the same way, and can be added in quadrature. The uncertainties in the event selection efficiency 
mainly come from the electron identification and the geometrical acceptance. The former amounts to 

2 x ±1% = ±2% for two electrons. Taking the extreme efficiencies for pure uu and dd events as a 
conservative estimate, the latter goes from ±3 to ±0.5%. Overall, this represents a systematic uncertainty 
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Figure 9: Integrated luminosity needed for a 5(7 discovery of Z' — > e + e~ as a function of the Z' mass. Left: 
for various benchmark models with statistical uncertainties only; right: for the Z' x with systematic uncertainties 
included. 



of ±3.6% to ±0.6% from the event selection. This is small as compared to the theoretical uncertainties, 
which range from ±8.5% to ±14%. The effect of these combined uncertainties on the luminosity needed 
to discover 1, 2 and 3 TeV Z's is t\ Q %, t\ 5 3 % (respectively). 

The uncertainty in backgrounds other than the Drell-Yan process is another type of uncertainty. How- 
ever, given that the Drell-Yan contribution is at the level of about 1% of the signal, any variation of the 
level of non-Drell-Yan background, which is more than ten times smaller, is negligible. 

The uncertainty in the electron energy resolution is another type of uncertainty. In addition to the 
expected uncertainties in the energy resolution as measured in the calorimeter (see section 4), we have 
conservatively assumed that there was no increase in precision on the measured dielectron invariant mass 
coming from the angle measurement provided by the tracker. In this case, the resolution of invariant 
mass increases from about 1% (see section 2.1) to about 1.5%. The effect of these uncertainties on the 
luminosity needed for a discovery is +\%, independent of the Z' mass. 

The last type of uncertainty which has been considered is the electron energy scale. When varied 
within the expected uncertainties, the discovery luminosity varies by +g 5 %, independent of the Z! mass. 

Combining all the above systematic uncertainties, the luminosity needed to discover, for example, a 
Z' x is shown in Fig. 9 (right). It must be noted that the systematic effect coming from the fact that we 
do not know a priori the mass of the signal was not taken into account. This is adressed separately in 
appendix A. 

5.3 Z 1 ' — > /I /I Using a Parameterized Fit Approach 

The dimuon channel represents an important complement to the dielectron channel. Although the reso- 
lution is expected to be up to an order of magnitude worse in the kinematic regime of interest, reducible 
backgrounds are expected to be considerably lower as discussed in Section 4.1. This feature makes the 
dimuon channel competitive, especially with early data where the design background rejection may not 
be achieved. In this section we consider two signal models decaying into dimuons - the Z' SSM and the Z' x 
boson. 
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5.3.1 Event Selection 

To select events from the Z' — > jit/i process we require two muons of opposite charge. The muons are 
required to fulfill the muon identification criteria studied in Section 2.2, including pj > 30 GeV and 
|t]| < 2.5 . Events are triggered using the mu20 trigger described in Section 3.2. As seen in Section 4.1, 
this should select a sample which consists mainly of Z/y — > jit/i with limited contamination from other 
sources of the order of a few percent. Table 9 indicates the effects of the various requirements on both 
the signal and background samples. 



Sample 


Z£ SM (lTeV) 


Z' x (1 TeV) 


Drell-Yan 


Generated 


508.6 


380.6 


13.5 


< 2.5 


366.8 


271.5 


10.8 


p T > 30 GeV 


364.0 


270.1 


10.7 


Muon identification 


342.3 


256.0 


10.0 


Trigger 


325.2 


243.2 


9.5 


Opposite charge 


324.8 


243.0 


9.5 



Table 9: Selection requirement flow for the Z' — > jujU analysis - cross-sections in fb. Events are counted 
in a mass window of ±50 GeV of the resonance mass (signal) and for m w > 800 GeV (background). 



5.3.2 Discovery Potential 

To evaluate the discovery potential, we use the FFT method [45], as in section 5.2. The amount of 
data required to discover a Z' boson is computed from the log-likelihood ratio (LLR) of the signal (HI) 
and background (HO) hypotheses. Figure 10 shows the 1 —CL/, obtained as a function of the integrated 
luminosity for the two studied Z' boson models at m = 1 TeV. The largest expected systematic uncertainty 
(from misalignment of the muon spectrometer) is shown separately. One can see that the amount of 
luminosity needed for a 5a discovery ranges from 20 to 40 pb _1 , which is competitive with the dielectron 
channel. 




Figure 10: Results of the FFT computation of 1 — CLt for m = 1 TeV Z' SSM (left) and Z' x (right) bosons. 
The horizontal line indicates the 1 — CL\, value corresponding to 5a. 



1713 



Exotics - Dilepton Resonances at High Mass 



Systematic Uncertainties Section 4 describes the systematic uncertainties that were considered. As 
can be seen from Fig. 10 the effect of the nominal systematic uncertainties is modest in this channel. 
The largest theoretical uncertainty entering this study is the knowledge of the Standard Model Drell- 
Yan cross-section. In the dimuon channel, the largest experimental uncertainty is the resolution for high 
Pt muons which will be initially dominated by the alignment of the muon spectrometer. As already dis- 
cussed, the nominal alignment precision may not be achievable with the integrated luminosities presented 
here and hence could significantly alter the conclusions. Figure 10 shows that the integrated luminosity 
needed to reach 5a increases from 13 to 20 pb _1 if the muon spectrometer is aligned with a precision of 
300 ]lm. This takes into account an uncertainty of 150 jlm on the alignment precision estimate, which 
will have to be measured in data (e.g. from the Z — »■ /x/J, sample) and which is treated as a nuisance 
parameter in the sensitivity computation. 

5.4 Z' — > xx Using a Number Counting Approach 

The ditau signature is an important component to the high mass resonance search. In particular, there 
are models in which a hypothetical new resonance couples preferentially to the third generation [47]. 
For these models the branching ratios are such that the dielectron and dimuon channels are not viable - 
hence it is critical that we consider all possible channels including ditaus. In this section we discuss the 
discovery potential for such a resonance. Because of finite resources we restrict ourselves to the process 
Z' — > xx with a single mass point m = 600 GeV although much of the discussion generalizes to a generic 
ditau resonance search. The ditau final state can be divided into three final states: hadron-hadron (where 
both x leptons decay hadronically), hadron-lepton (where one x lepton decays hadronically and one 
decays leptonically), and lepton-lepton (where both decay leptonically). Here we consider the hadron- 
lepton Qi — i) final state. The possibility of observing the hadron-hadron final state using a hadronic x 
trigger will be examined later. 

5.4.1 Event Selection 

To select events in the hadron-lepton final state, we select events with a "hadronic x" candidate, a charged 
lepton (muon or electron), and missing transverse energy 8 ($ T ). As opposed to the dielectron or dimuon 
channel, the backgrounds to the ditau channel are considerably larger and include Drell-Yan production, 
W+jets, tt and dijet events. After the initial object selection several additional requirements are needed 
to maximize the expected signal significance. 

We consider hadronic x candidates with pj > 60 GeV and impose a requirement on the likelihood as 
a function the x transverse energy as described in Section 2.3. Candidates which overlap with an electron 
or muon are removed. 

For electron candidates we require medium electron selection criteria in this analysis (see Section 
2.1). The initial muon selection is the same as described in Section 2.2. Since this channel only requires 
one high pj lepton the backgrounds are considerably higher than for the dielectron or dimuon final 
states. To address this we impose additional requirements on the isolation of the lepton. The isolation 
requirement imposed on electron candidates is Y,E^^ 02 / pr < 0.1 where L£^ <0 ' 2 is the sum of the 
energy deposits in the electromagnetic calorimeter within a cone of AR = 0.2 from the location in r\-§ of 
the electron, less the electron candidate energy. Isolated electrons are required to have p T > 27 GeV. We 
impose an isolation requirement similar to that of electrons on muon candidates: L-E^ 0 2 < 0.1. For 
isolated muons we require that the X 2 lie between 0 and 20 and to be considered by the analysis muons 
must have p T > 22 GeV. 

8 The missing transverse energy was reconstructed using the cell based algorithm described in [48]. 
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Selection || Signal || tt | Drell-Yan | Multijet | W+jet j 



Trigger 


1356. 


213600. 


2.3950 10 7 


4.19000 10 6 


6.69400 10 6 


Lepton 


905. 


150900. 


1.2600 10 7 


1.08230 10 6 


120400. 


x selection 


368. 


7818. 


145680 


40080 


4587. 


Opposite charge 


315. 


2498. 


5306 


23240 


771. 


# r >30 GeV 


270. 


2040. 


2562 


835 


162. 


m T < 35 GeV 


203.2 


302.4 


388.0 


436.4 


83.8 


pf < 70 GeV 


155.0 


106.7 


331.5 


221.6 


28.4 


m vis > 300 GeV 


132.5 


26.2 


105.6 


33.8 


15.0 


cosA0 tt > -0.99 


13.3 


2.1 


5.5 


2.3 


2.7 



Table 10: Requirement flow table for the m = 600 GeV Z' SSM — ► %% — ► Ih analysis - cross-sections given 
in fb. The Drell-Yan process includes all flavors of leptons {e + e~, T + T~) with an invariant mass 

of at least 60 GeV. 

After making the T candidate selection we make several further requirements to maximize the signal 
significance. First, we require that #t> 30 GeV. To greatly help with the rejection of the tt backgrounds 
we employ a requirement on the total event pj which is defined as the sum of ^and the vector sum of 
the hadronic z with the lepton transverse momentum. We require p j ' < 70 GeV. 

The transverse mass of the event is determined by using the lepton kinematics and the event Tf.j. 
Defining a four-vector for the missing energy: p'r = ($T x ,#T y ,0, \#t\), the transverse mass is calculated 
as: 

m T = ^/2p rif > r (l-cosA^ r ). 

We require that m T < 35 GeV. 

In the case of the lepton-hadron channel one cannot simply reconstruct the invariant mass of the 
resonance as energy is taken away from the event by the neutrinos. However, two quantities can be 
constructed 

• A visible mass variable is calculated as defined by CDF [49] using the hadronic % and the lepton 
four-vector information: 

m vis = ^(pi + Ph + pV) 2 

• The collinear approximation is used to build up the event-by-event invariant mass. The fraction 
of the T momentum carried by the visible decay daughters, xe and Xh, are calculated with the 
following formulas: 

_ PxjPy,h ~ Px,hPy,£ _ PxjPyM ~ Px,hPy,i 

Py,hPx,£ + Py,h 4x ~Px,hPy,e ~ Px,h^y ' Py,hPx,£ + Px,£ p"y ~Px,hPy,e ~ Py/rfx 

The reconstructed mass is then calculated as m TT = ^= . 

To greatly help the background rejection and to restrict our search to the region of interest we require 
m v i s > 300 GeV. Since the collinear approximation breaks down when the two T leptons are back-to- 
back, we impose the requirement that cosAfe > -0.99. Of course, since a very heavy particle tends 
to be produced at rest, the decay objects are mostly back-to-back, leading to a highly inefficient mass 
reconstruction. 
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Visible Mass GeV Collinear Mass GeV 



Figure 1 1 : Left: the visible mass distribution in the Z' — > TT — > Ih analysis for signal and background pro- 
cesses (1 fb" 1 of data is assumed). Right: the reconstructed invariant mass obtained using the collinear 
approximation. 

5.4.2 Discovery Potential for 1 fb 1 of Data 

Table 10 shows the effect of the various selection requirements for the signal as well as all background 
processes considered. Distributions of the visible and reconstructed masses for signal and background 
are shown in Fig. 11. Here we assume a 600 GeV Z' and the SSM cross-section. In 1 fb -1 of ATLAS 
data we estimate 132. signal events and 181. background events after imposing the event selection up to 
the requirement on visible mass. Using S/^/B we estimate the signal significance to be 9.9. The collinear 
approximation breaks down when the two T leptons are back-to-back, so that even a loose requirement 
(such as cosA0#, > —0.99) reduces the signal by a large factor. Hence, we expect that the search will 
proceed by looking at the visible mass. If a significant excess over background is seen, the collinear 
approximation will then be used to help establish the presence of a new resonance. 

Systematic uncertainties The systematic uncertainties that were considered are described in Section 4. 
For an analysis of 1 fb -1 of data the dominant systematic source on the signal, just over ±18%, comes 
from the uncertainty in the luminosity. The second most dominant systematic, the hadronic x energy 
scale, affects the signal at the ±10% level. Summing in quadrature the effect of all systematic uncer- 
tainties on the signal Monte Carlo sample results in a total systematic uncertainty of about ±20%. The 
current Monte Carlo samples available for the backgrounds to the ditau analysis are statistically limited 
and hence prevent a rigorous evaluation of the systematics at this time. As a conservative estimate, we 
assume that the total systematic uncertainty in the backgrounds is identical to that observed in the signal 
Monte Carlo. This is a conservative estimate because the majority of the backgrounds in the data have 
very large cross-sections (dijets, W+jets, etc.) and in principle the evaluation of systematic uncertain- 
ties there should be less sensitive to statistical fluctuations than for the signal events. Summing these 
systematic uncertainties in quadrature and using the formula 5/ \JB + 8B 2 gives a significance of 3.4 in 
1 fb" 1 . 
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5.5 G — > e + e in a Parameterized Fit Approach 

In this section we present a sensitivity study for the Randall- Sundrum G — > ee final state. In this channel, 
it is assumed that there is no interference between the G and the dilepton background. Table 1 1 shows 
the parameters of the different G samples used in this analysis. is the simulated graviton resonance 
width and o m stands for the width of the observed resonance after convolution with detector resolutions. 
For k/Mpi < 0.06 the resonance is narrow compared to the experimental resolution. 

The main Standard Model background is neutral Drell-Yan production. Other backgrounds such as 
dijets with both jets misidentified as electrons are expected to be small and neglected at this time. 



Model Parameters 


r G 

[GeV] 


[GeV] 


o-BR(G^e+e-) 
[fb] 


m G 


k/M pl 


500 GeV 


0.01 


0.08 


4.6 


187.4 


750 GeV 


0.01 


0.10 


6.4 


27.7 


1.0 TeV 


0.02 


0.57 


7.9 


26.0 


1.2 TeV 


0.03 


1.62 


10.3 


22.4 


1.3 TeV 


0.04 


2.98 


11.4 


25.3 


1.4 TeV 


0.05 


5.02 


13.1 


26.8 



Table 11: Parameters of the G —>■ ee samples used: natural width (To), Gaussian width after detector 
effects (o m ) and leading order cross-section. 



5.5.1 Event Selection 

In reconstructing the resonance mass, we require a pair of electrons - we do not make any charge re- 
quirements - with pr > 65 GeV using the loose electron selection criteria described in Section 2.1. We 
require that the events pass the e55 single electron trigger (see section 3.1). Finally we require that 
the two electrons are roughly back-to-back in <j) with cos A(j) ee < 0 between the two electrons. Table 12 
shows the remaining cross-section at each stage of the selection and the total eficiency for different mass 
points. The efficiency decreases at high graviton masses, due to the track match requirement, which is 
consistent with the Z' boson analysis (see section 5.2). Table 13 shows the same requirement flow for 
the Drell-Yan. 

The Drell-Yan background distribution after this event selection is shown in Fig. 12 along with signal 
at m G = 1 TeV and coupling k/M p[ = 0.02. The exponential described in Section 5.1 has been used to 
model the shape of the background. 

5.5.2 Discovery Potential 

We search for an excess of events in the mass range from 300 GeV up to 2 TeV and study the signal 
sensitivity by use of "extended maximum likelihood" fitting. We consider two hypotheses. The null 
hypothesis, HO, is the hypothesis that the data are described by the Standard Model. The test hypothesis, 
HI, is that the data are described by the sum of the background and a narrow Gaussian resonance. 

To investigate the potential for discovery pseudo-experiments are generated from both the null and 
test hypothesis. Each pseudo-experiment is fit twice. The first fit assumes the data are described by 
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Selection / Sample 


500 GeV 


750 GeV 


1.0 TeV 


1.2 TeV 


1.3 TeV 


1.4 TeV 


Generated 


187.4 


27.7 


26.0 


22.4 


25.3 


26.8 


Acceptance 


172.4 


25.9 


24.7 


21.2 


24.0 


25.4 


Trigger 


168.7 


25.0 


22.6 


19.1 


21.4 


22.3 


Electron Id. 


127.9 


18.3 


16.4 


12.8 


14.6 


14.7 


p T > 65 GeV 


125.7 


18.2 


16.3 


12.7 


14.5 


14.6 


cosA0 ee < 0 


123.0 


17.8 


16.0 


12.6 


14.3 


14.4 


Selection efficiency (%) 


65.6±1.1 


64.4± 1.1 


61.7±1.1 


56.3±1.1 


56.4±1.1 


53.9± 1.1 



Table 12: Requirement flow for the G — > ee analysis. The remaining cross-section (in fb) is given at each 
stage. The mass window is chosen as ±4a m around the signal peak. 



Selection/Sample 


500 GeV 


750 GeV 


1.0 TeV 


1.2 TeV 


1.3 TeV 


1.4 TeV 


Generated 


20.33 


4.91 


1.43 


0.90 


0.51 


0.51 


Acceptance 


18.53 


4.50 


1.36 


0.87 


0.48 


0.49 


Trigger 


18.45 


4.25 


1.16 


0.80 


0.45 


0.44 


Electron Id. 


14.13 


3.18 


0.88 


0.58 


0.38 


0.33 


p T > 65 GeV 


13.85 


3.15 


0.88 


0.57 


0.38 


0.33 


cosA<p ee < 0 


13.41 


3.09 


0.85 


0.56 


0.36 


0.33 



Table 13: Remaining Drell-Yan cross-section (in fb) at each stage of the G — ► ee analysis. The mass 
window is chosen as ±4o m around the signal peak. 
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Figure 12: Left: expected (histogram) and "observed" (rilled circles) Drell-Yan spectrum from full sim- 
ulation. The observed distribution includes a graviton with mass of 1 TeV and coupling k/M p i = 0.02. 
Note that for the purposes of this plot the vertical axis has been rescaled: the error bars correspond to an 
integrated luminosity of 100 fb~ 1 . Right: Log likelihood ratio curves for one million pseudo-experiments 
generated with background only (filled circles), and signal plus background (empty circles) for the same 
m = 1 TeV signal point. 



the Standard Model using the function described in Section 5.1. The second fit assumes the data are 
described by the sum of a Gaussian and the shape describing the Drell-Yan background. During this 
second fit the mean of the Gaussian is allowed to float throughout the entire mass region considered, and 
the width is fixed to the detector resolution. 




Graviton M (GeV) 



: — •— 5a Discovery Limits 




: -r* — 3o Evidence Limits 




- \ L dt=1 .Ofb" 1 




; \/s=14TeV 






ATLAS - 
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Graviton M (GeV) 



Figure 13: 5a discovery potential (full squares) as a function of the graviton mass. The 3a evidence 
potential is also shown (full circles). Left: shown with cross-sections as calculated by Pythia (LO) and 
multiplied by a K factor of 1.6 for several values of the coupling; right: dependence of the discovery 
potential on the coupling. 



We can then compare the likelihood of the signal and background hypotheses. The distribution of 
the logarithm of the likelihood ratio between HO and HI is constructed, and shown for one signal point 
in Fig. 12. Based on this, we calculate the average expected discovery potential from the fraction of 
the likelihood ratio distribution for background-only pseudo-experiments that extends beyond the mean 
of the distribution for signal plus background experiments. Figure 13 shows the 5a discovery and 3a 
evidence reach in cross-section and k/M p i coupling constant as a function of graviton mass, estimated 
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for an integrated luminosity of 1 fb -1 . 

The LO cross-sections are multiplied by the ^-factors discussed in section 4.2.2 for both signal and 
Drell-Yan background. Various sources of systematic uncertainties for signal and background are con- 
sidered in the evaluation of the experimental sensitivity, including luminosity, energy scale, energy reso- 
lution, electron identification efficiency and Drell-Yan background uncertainties as listed in section 4.4. 
The combined effect of the systematic uncertainties is to increase the amount of integrated luminosity 
needed for discovery between 10 and 15 percent for the different parameter sets. 

5.6 Technicolor Using a Non-Parameterized Approach 

Topcolor-assisted Technicolor models with walking gauge coupling predict new technihadron states that 
would be copiously produced at the LHC. The lowest mass states are the scalar technipions (ftp 0 ) and 
the vector technirho and techniomega (pp° and co T ). The vector mesons decay into a gauge boson plus 
technipion (ynr, WlZj or Zkt,) and fermion-antifermion pairs. This analysis searches for the decays 
Pt — > and g>t — > Ai + M~- The dimuon mode has a lower branching fraction than the modes involv- 

ing technipions but the signal is clean, straightforward to trigger on, and can be readily observed with 
early ATLAS data. 

The particular model studied here is the "Technicolor Strawman Model" or TCSM [12, 13]. In the 
TCSM, it is expected that techni-isospin is an approximate good symmetry and therefore the isotriplet 
p T and isosinglet <»r will be nearly degenerate. We will assume for what follows that m PT = m fflr . 
The technipions are also expected to be nearly degenerate. In the TCSM, the technipion masses are 
generically not small. In particular, if m % > m Pr /2 the decays of the pr and (Or to technipions would 
be kinematically forbidden [50]. The dimuon rate is expected to come dominantly from the (Oj with a 
smaller contribution from the pj. 

The event selection is summarized in Table 14. The technivector meson natural widths are less than 
a GeV, so the observed width a(m) is entirely due to detector resolution. 

In principle, the best search sensitivity is not obtained by examining the entire dimuon mass distribu- 
tion for a bump all at once but by using an optimized mass window that maximizes the signal significance 
for a given assumed signal mass. A prescription for the optimal window size is taken from an analytic 
calculation in Ref. [51]. Assuming a narrow Gaussian peak on a linear background, the optimal window 
was found to be ±1.4<7 about the peak mass. Since we are not really in the narrow resonance regime, 
we did a study using full-simulation ATLAS Monte Carlo for a Technicolor signal on a Drell-Yan back- 
ground. Taking S/\jB as our measure of significance, Fig. 14 (left) shows that a window size of ± ~ 1 .5a 
or a bit larger is optimal. For this study, a window size of ±1.5 a about the peak mass is used. 

Figure 14 (right) shows the integrated luminosity necessary to observe either 3a evidence or a 5a 
discovery, using the modified frequentist approach [44], of technimesons in this channel. The systematic 
uncertainties summarized in section 4.4 were included in this calculation of technimeson search sensitiv- 
ity. It should be noted that the integrated luminosity needed for 5 a discovery will be affected by the level 
of misalignment of the muon spectrometer. The contours in Fig. 14 were computed assuming the level 
of alignment we expect to achieve. The studies in sections 4.3 and 5.3 show that for an initial precision 
of 300 p,m with an uncertainty of 150 p,m the amount of data needed to reach 5a would increase by 
approximately 50%. 



1720 



Exotics - Dilepton Resonances at High Mass 



m„ ,v_ fGeV") 


400 


600 


800 


1000 


Peak mass (GeV) 


403 


603 


804 


1004 


a(ra) (GeV) 


13 


22 


34 


46 


Requirement 










Generated 










hi < 2.5 


116 


39.8 


15.8 


7.3 


p T > 30 GeV 


114 


39.5 


15.7 


7.2 


L1.MU20 


112 


38.7 


15.3 


7.0 


L2jiu20 


110 


38.0 


15.1 


6.9 


EFjiiu20 


109 


37.5 


14.9 


6.8 


Match x 2 < 100 


104 


35.7 


14.0 


6.4 


Opposite charge 


104 


35.7 


14.0 


6.4 


Mass window 


78.2 


26.3 


10.3 


4.7 


Drell-Yan background 


46.9 


14.1 


6.1 


2.8 


Selection efficiency (%) 


38.9±0.5 


43.2±0.5 


44.8 ±0.5 


46.8 ±0.5 



Table 14: Selection requirement flow for the analysis - cross-section in fb. 




Window Size (a) Mass(p T ,co T ) (GeV) 

Figure 14: Left: for two different Pt,(Ot signal masses, S/V~B is plotted as a function of mass- 
window size for windows centered on the peak mass. Right: integrated luminosity needed for 3a 
evidence or 5a discovery as a function of Pt,(Ot mass. The dashed lines include only statistical 
uncertainties while the solid lines contain the systematic uncertainties as well. 
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6 Summary and Conclusions 

Several models which lead to resonances in the dilepton final state have been studied. Various systematic 
studies have been undertaken which estimate the effect of uncertainties from both theoretical knowledge 
of Standard Model processes as well as expected and assumed early detector performance. Data-driven 
methods have been developed to evaluate efficiencies, backgrounds, and uncertainties. It has been shown 
that even with early data the discovery potential can be dramatically increased from current limits. The 
discovery potential with an integrated luminosity of 10 fb _1 depends on the particular model and varies 
in the m = 1.0 to 3.5 TeV range. It should be noted that resonance masses above 1 TeV which are 
unreachable by the Tevatron experiments could be discovered with 100 pb _1 of data already. 

A Effect of the Unknown Location and Rate 

When estimating the significance of a local excess of events, the size of the region considered and un- 
certainties in the shape of the background can significantly reduce the sensitivity of the search. This 
appendix presents an assessment of the size of this effect for the Z' boson to dilepton searches. If an 
excess is found in the dilepton invariant mass, its significance needs to be evaluated in a way that takes 
into account the possibility of background fluctuations of different masses, cross-sections and widths. 
One possible way to do this is through the use of maximum likelihood fits, where these quantities are 
free parameters. 

To estimate the effect on the sensitivity of the unknown rate and location of a dilepton resonance, 
the decay Z' SSM —> ee and Z' SSM —>■ were both generated for 16 true Z' masses between 1 and 4 TeV 
(evenly spaced every 200 GeV), with a lower cut on the true dilepton mass of 0.5 TeV in all cases. Each 
sample was simulated and reconstructed using fast simulation, and events were required to have two 
back-to-back (A0 > 2.9) leptons of opposite charge with p T > 20 GeV and within |?]| < 2.5. For an 
estimation of the expected background, Standard Model Drell-Yan production was used. 

The dilepton resonance was modeled using an ad-hoc parameterization that models appropriately the 
shapes of both the Z' — > ee and Z' — ► fljl modes, consisting of a product between a Breit-Wigner and a 
Landau distribution with a common mean, and where the width of the Landau was parameterized as a 
function of the width of the Breit-Wigner 9 . The common mean, the width parameter and the amplitude 
of the signal are allowed to float in the fits. 

Figure 15 shows the likelihood ratio distributions for an m — 3 TeV Z^^ — ► ee fit-based signifi- 
cance, where the signal rate, the peak's width and the mean mass all float in the fit, corresponding to 
an integrated luminosity of 4 fb _1 . The distributions of the log-likelihood ratio for fits to HO pseudo- 
experiments and for fits to HI pseudo-experiments are shown. The fraction p of the HO distribution 
that has a likelihood ratio larger than the mean of the HI distribution is shaded. The value of p is then 
transformed into a significance following the convention under which p = 2. SI x 10~ 7 corresponds to 
5(7 (see section 5). The fraction shown in the plot corresponds to a significance of 4.29a. 

Several million pseudo-experiments were generated and fit, covering different masses and luminosi- 
ties. Figure 16 shows the significance for different approaches in the case of an m = 3 TeV Z' SSM for 
both the dielectron (left) and the dimuon (right) cases. The plots compare the significance as obtained 
from number counting (circles), fixed mass fits (dots) and floating mass fits (squares). The floating-mass 
significances are on average 20% lower than the fixed- mass calculations for Z' —> ee, and about 15% 
lower in the dimuon case (in obtaining these numbers, we exclude the region below 2.25 fb _1 , which is 
affected by low statistics effects). 

9 The best motivated shape is a Breit-Wigner convoluted with a Gaussian resolution. Unfortunately, the convolution fit is 
very time consuming and for this study millions of fits were performed. Empirically the combination of a Breit-Wigner and a 
Landau were found to give essentially identical results. 
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Figure 15: Likelihood ratio distribution for an m — 3 TeV — ► ee\ the distribution on the left corre- 
sponds to background-only pseudo-experiments; the one on the right, to signal plus background. 
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Figure 16: Comparison of the fit-based significance for fixed-mass (dots) and floating-mass (squares) fits 
for both cases, Z' — > ee (left) and Z' — ► (right). Circles show the estimation from number counting. 
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Abstract 

The prospects for the discovery of heavy lepton-neutrino resonances with the 
ATLAS detector are evaluated using full detector simulation. The performance 
of large missing transverse momentum measurement is studied. Its impact on 
the lepton-neutrino transverse mass reconstruction, and on the backgrounds 
rejection, is then discussed. As benchmark, the sensitivity to a Standard Model 
like W' is evaluated. Emphasis is put on the discovery potential of ATLAS with 
early data, namely with an integrated luminosity of 10 pb _1 to 10 fb _1 . 

1 Introduction 

The Standard Model of particle physics has been able to predict or describe, within errors, almost all 
measurements performed within its domain. However, several fundamental questions remain unresolved. 
Its mechanism for electroweak symmetry breaking has not been experimentally confirmed. The model 
parameters still lack a theoretical explanation. There are indications, therefore, that the Standard Model 
is not a fundamental theory, but a good approximation of nature at the energy ranges that have been so far 
accessible to experiment. Thus, the search for physics beyond the Standard Model is an important part 
of the ATLAS physics program. In this document, a study is presented of the potential for the search of 
final states comprised of one electron or muon (lepton, in what follows) plus missing transverse energy. 

A large variety of theories beyond the Standard Model, predict additional gauge bosons. Any 
charged, spin 1 gauge boson which is not included in the Standard Model is called W' boson and accord- 
ing to several predictions there is at least one W' boson detectable at the LHC. These theories and mod- 
els which predict new charged gauge bosons range from the Grand Unified Theories [1-3], the various 
Left-Right Symmetric Models [1,4-10], Kaluza-Klein theories [11-15], Little Higgs models [16-18], 
dynamical symmetry breaking models [19] and even models inspired from superstrings [20-22]. As an 
example, the 45 decompositions of the SO(10) gauge group, which is a candidate for large GUT sym- 
metries, under the SU(3) C x SU(2) L x SU(2) R x U(1) B -l gives rise to a (1,1,3,0) triplet coming from 
the SU(2)r group. That is, a triplet of right-handed W ± 0 fields, which carry weak (V+A) interactions. 
A theoretical model, based on the gauge group SU (3)c x SU (2) L x SU (2) R x U(\)b-l which is called 
a Left-Right Symmetric Model (LRSM), after spontaneous symmetry breaking, predicts a right-handed 
Wr gauge boson mixes with the left-handed Wl boson of the Standard Model. The Wr gauge boson is 
a very attractive W' boson candidate. The search for these particles is an important part of the studies 
for new physics to be performed at LHC. Studies presented here are based on predictions of a "Stan- 
dard Model-like" W' boson from so-called extended gauge models [23]. This W' boson has Standard 
Model-like couplings to fermions and its decays to WZ bosons are suppressed. 

The DO experiment, at Fermilab, has set the present lower limit for the W' boson mass [24] to 
m w i > 1 TeV at 95% C.L. The LHC, with a centre-of-mass energy of 14TeV, is expected to increase 
the search reach even at early stages of data taking. Other ATLAS studies have evaluated the potential 
for discovery of W' — > £vi where £ = jj,,e [25]. This study is based on the most recent realistic detector 
description, including a complete simulation of the trigger chain. 

The remaining of this paper is organized as follows. The Monte Carlo samples which were used 
are summarized in section 2. Section 3 discusses the expected performance on lepton reconstruction, 
as well as on missing transverse energy. Section 4 briefly describes the triggers which were used in this 
study. The event selection is discussed in section 5. The discovery potential of a W' with Standard Model 
couplings is assessed in section 7, after examining the systematic uncertainties in section 6. 
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2 Monte Carlo Samples 

Table 1 summarizes the samples used in this study; a detailed account of the procedures, generators and 
settings used is given in [26]. Signal samples for masses other than 1 and 2 TeV were produced locally 
and validated against central production samples. 

For the signal, samples of W' — > £v events were generated with Pythia v6.403 [27], based on the 
leading order cross sections and the parton distribution functions CTEQ6 [28], where £ can be any type 
of lepton (t included), for true W' boson masses ranging from 1 to 4 TeV. 

The main background for a W'-type state is the high-mass tail of Standard Model W boson produc- 
tion; in order to provide enough background to study also the higher W' boson masses, two samples of 
Standard Model W boson events were produced, with different requirements on the true invariant mass 
of the W boson: one with 200 GeV < m w < 500 GeV, and one with m w > 500 GeV. In these stud- 
ies, the alignment and calibration of the detector is assumed to be well described in the reconstruction 
algorithms. 



Process 




Generator 


a x BR [fb] 


Comments 


Events 


1 TeV W -► £v 




Pythia 


9430. 




30K 


1.5 TeV W' -^£v 




Pythia 


1786. 




2.8K 


2 TeV W -► £v 




Pythia 


437. 




30K 


2.5 TeV W' -► £v 




Pythia 


146. 




2K 


3 TeV W £v 




Pythia 


54. 




10K 


3.5 TeV W'^£v 




Pythia 


20. 




2.8K 


Standard Model W -> 


£v 


Pythia 


18721.1 


200 GeV< m w <500 GeV 


20K 


Standard Model W -> 


£v 


Pythia 


708.26 


m w >500 GeV 


20K 


tt 




MC@NLO 


452000 




340K 


Dijet JO 




Pythia 


1.76 x 10 13 


p T = S- 17 GeV 


380K 


Dijet Jl 




Pythia 


1.38 x 10 12 


p T = 17-35 GeV 


380K 


Dijet J2 




Pythia 


9.33 x 10 10 


p T = 35-70 GeV 


390K 


Dijet J3 




Pythia 


5.88 x 10 9 


p> = 70-140 GeV 


380K 


Dijet J4 




Pythia 


3.08 x 10 8 


p T = 140-280 GeV 


390K 


Dijet J5 




Pythia 


1.25 x 10 7 


p T =280-560 GeV 


370K 


Dijet J6 




Pythia 


3.60 x 10 5 


p T = 560 -1120 GeV 


380K 


Dijet J7 




Pythia 


5.71 x 10 3 


p T = 1120-2240 GeV 


430K 



Table 1 : Monte Carlo samples used for the study of W' bosons. p~r represents the transverse 
momentum of the partons in their rest frame. The tt sample includes only fully leptonic and 
semi-leptonic channels. 



3 Reconstruction Performance 
3.1 Muon Reconstruction 

Muon reconstruction in ATLAS uses all main detector subsystems. The Muon Spectrometer (MS) is 
designed to provide efficient and precise stand-alone momentum measurement for muons of transverse 
momentum up to 0{pr= 1 TeV). During the back-tracking of the muon to the production vertex, energy 
loss fluctuations can be measured with the use of the calorimeters, which can also provide independent 
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Figure 1: Inverse pj resolution for muons from W' boson decays; left: pj < 400 GeV (muons 
from m = 1 TeV W' bosons), right: pj > 800 GeV (muons from m = 2 TeV W' bosons). 




Figure 2: Muon transverse momentum resolution as a function of tj (left) and pj (right) in W' 
boson decays. Filled circles represent muons from m = 1 TeV W' bosons, while open circles 
correspond to muons from m = 2 TeV W' bosons. 



muon tagging to increase the identification efficiency. For optimum performance in momentum resolu- 
tion, the MS information is combined with the track information obtained in the Inner Detector (ID). A 
full description of the algorithms for muon performance and identification can be found in [29] and [30]. 

Figure 1 shows the inverse transverse momentum (1 / pj) resolution for muons from decays of a W' 
boson for muons below pj = 400 GeV and above pj = 800 GeV. Especially relevant for the analysis 
are the negative tails in these plots, since they correspond to reconstructed muon candidates that have 
a pj larger than that of the true particle. The relative contribution of these tails can be assessed by the 
fraction of muon candidates separated by more than 2a from the mean of the distribution (which in 
both cases is consistent with zero, as it should). The fraction in that negative tail is (4.9 ±0.3)% for 
muons with p T < 400 GeV, and (3.8 ±0.4)% for p T > 800 GeV (to be compared with 2.275% for 
a gaussian distribution). The transverse momentum resolution achieved is shown as a function of the 
pseudo-rapidity rj and pj in Fig. 2. On average a resolution of 4.5 and 5.5% is recorded for muons from 
W' bosons of m = 1 TeV and 2 TeV respectively. 

Figures 3 and 4 show the efficiency for combined muon reconstruction as a function of pseudo- 
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Figure 3: Combined muon reconstruction efficiency as a function of 77 and (j) for muons from W' 
boson decays. 
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Figure 4: Combined muon reconstruction efficiency as a function of pr for muons from W' boson 
decays. 



rapidity (77), azimuthal angle (0) and transverse momentum (pj) for muons from fully simulated W' 
boson decays. An overall efficiency of 93.6% and 92.4% is measured for m = 1 TeV and 2 TeV W' 
boson samples, respectively. The regions with lower efficiency in muon reconstruction are observed, as 
expected, in the middle plane (77 = 0) and in the transition regions between the barrel and the end-cap 
sections of the MS (at |t?| ~ 1.2). The regions with low efficiency in 0 correspond to the feet of the 
detector (0 ~ —2, —1) and to passages for services. 

One important issue in this study concerns the background that can rise from badly reconstructed 
muons. Their momentum being wrongly estimated upwards, can cause both the presence of a high 
pr muon and, correspondingly, large missing transverse energy. In the definition of the muon, extra 
quality criteria may be imposed in order to diminish this probability. In this study, the following mild 
requirements are adopted: 

• A matching % 2 , between MS and ID tracks, smaller than 100 (further discussed in [30]). 

• An impact parameter in the z-axis (i.e. the beam axis) smaller than 200 mm. 

• An impact parameter significance in the transverse (R-0) plane smaller than 10. 
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Figure 5: Electron energy resolution as a function of pseudo-rapidity (left) and energy (right) in 
W' boson decays. Filled circles represent electrons from m = 1 TeV W' bosons, while open circles 
correspond to m = 2 TeV W' bosons. 




Figure 6: Electron pj resolution in W' boson decays; left: pj < 400 GeV (muons from m = 
1 TeV W' bosons), right: p T > 800 GeV (muons from m = 2 TeV W' bosons). 



3.2 Electron Reconstruction 

Electron candidates are built starting from clusters of calorimeter cell energy depositions, which are 
matched to a track from the inner detector. Electron identification and reconstruction are described in 
detail in [31] and [32], where three standard selections were developed to be used in physics searches. 
The present study uses the medium set of selection requirements, which consists in several requirements 
on the clusters used (size, containment, association with a track, shower shapes and quality of the track 
match). 

Figure 5 shows the electron energy resolution (in percentage) as a function of pseudo-rapidity (|tj|) 
and true energy. The average energy resolution for electrons in this energy range is close to 1%, and is 
worse in the transition region between the two calorimeter systems. Figure 6 shows the relative differ- 
ence between reconstructed and true transverse momenta of isolated electrons with true pj lower than 
400 GeV and higher than 800 GeV. The fractions of events in the upper tails (more than 2a over the 
fitted mean) are (11.8 ±0.6)% and (5.3±0.5)%, respectively. These non-Gaussian tails are due to the 
amount of material in the inner detector and are, therefore, tj -dependent. 
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Figure 7: # T resolution in muonic W' boson decays. m w i = 1 TeV (left) and 2 TeV (right). 
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Figure 8: $ T resolution in W' boson decays to electrons. m w > = 1 TeV (left) and 2 TeV (right). 



3.3 Reconstruction of the Missing Transverse Energy 

The final state under consideration includes a neutrino, whose momentum information can be inferred 
only partially from the energy imbalance in the detector (since the total transverse momentum of the 
event has to add up to zero). The reconstruction of the missing transverse energy (Jfj) in ATLAS is 
described in detail in [33]. 

The resolution of Tf.j reconstruction in W' boson events containing muons can be seen in Fig. 7. An 
average resolution of about 18 GeV is observed for m w > = 1 TeV (25 GeV for m w i = 2 TeV). In the 
case of m w > = 2 TeV the non-Gaussian tails in the resolution are more pronounced, and come from the 
degraded performance of muon reconstruction at high pj. 

Figure 8 shows the tfj resolution for events that contain one high-pj electron from a W' boson 
decay. The left plot corresponds to the m =1 TeV W' boson, and the right plot to 2 TeV; the resolutions 
are around 10 and 14 GeV, respectively. These values agree well with the expected resolution from 
the mean of the scalar sum of transverse energy (< ££y >) in each case; for the m = 1 TeV sample, 
< L^t > for the selected events is 439GeV, which yields an estimated a{$ T ) ~ 0.5^/£E T = 10.5 GeV; 
for m = 2 TeV, the expected value (based on < ££V >) is 13.3 GeV. 
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Figure 9: Transverse mass distribution for m = 1 TeV W' bosons, as obtained from the true 
particles' momenta (filled histograms), and from reconstructed information after basic selection 
(black outline). Left: electron mode; right: muon mode. 





Figure 10: As Fig. 9, for m 
structed transverse mass. 
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= 2 TeV W' bosons, filled: from true information; outline: recon- 



3.4 Transverse Mass Reconstruction 

In the W' boson search, the transverse momentum pj of the single lepton in the event and the missing 
transverse energy $t are combined to obtain the transverse mass as follows: 

m T = ^2p T # T {\-cosA$ e p T ) (1) 

where A<t> g ]£ T is the angle between the momentum of the lepton and the missing momentum, in the 
transverse plane. Figures 9 and 10 show the transverse mass distributions for m = 1 and 2 TeV signals, 
respectively, as obtained from truth information (light gray filled histograms) and the degradation due to 
detector resolution and efficiency (black hollow histograms). As can be expected from Figs. 2 and 5, the 
shape of the transverse mass spectrum has a larger distortion in the muon channel than in the electron 
channel, with larger tails for higher W' boson masses. On the other hand, the reconstruction efficiency is 
higher in the muon channel (over 86% for each mass) than in the electron channel (about 72%). 

Figures 11 and 12 show the distribution of the difference between the "true" transverse mass (i.e. as 
obtained from the true momenta of the lepton and the neutrino) and its reconstructed value, for electron 
and muon modes, and for m = 1 and 2 TeV W' boson masses. In Fig. 11, single Gaussian fits are shown; 
a fitted width of about 12 GeV is obtained for the electron channel, while the muon channel, besides 
having much larger non-Gaussian tails, has a fitted width of about 23 GeV. 
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Figure 11: Distribution of the event-by-event difference between the reconstructed and "true" 
transverse mass for the electron and muon channel, for m = 1 TeV W' bosons. 
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Figure 12: As Fig. 1 1, for m = 2 TeV W' bosons. 



Figure 12 shows the corresponding comparison for a 2 TeV signal; however, in this case, the muon 
channel (on the right) has a stronger non-Gaussian character, which is why no fit was performed. The 
quadratic mean of the distribution is about 84 GeV 

4 Trigger 

The ATLAS trigger [34] has three levels: events passed by the LI (level 1) hardware trigger are partially 
reconstructed in L2 (level 2) processors and, if accepted there, are fully processed in the EF (event filter) 
processor farm. Only events accepted by the EF (and thus also by LI and L2) are recorded for later 
reconstruction and analysis. 

Trigger rates are estimated in separate studies of the electron [35] and muon [36] trigger systems. 
However, this studies were performed with a more recent version of the software than the one used here 1 . 
Therefore, we have measured some of the rates directly in simulation using the dijet and top samples 
described earlier in this note. We additionally measured LI rates and efficiencies for single-electron and 
single-muon triggers with thresholds higher than those defined in the simulated trigger menu. The errors 
we assign to our rate estimates are purely statistical. 

Especially for electrons, the trigger menu and algorithms in the simulated samples are quite different from those in the 
above notes which are much closer to those expected to be used during actual data acquisition. 
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4.1 Electron trigger 

For a single electron trigger with an Ej threshold of 100 GeV, we measure a LI rate of 14 ± 1 Hz at 
an instantaneous luminosity of 10 32 cm _2 s _1 , similar to the electron trigger study estimate of 10 Hz. 
The efficiency to trigger on W' —> ev events for |t]| < 2.5 is 98% for a mass of either 1 or 2 TeV. If the 
threshold is raised to 250 GeV, we measure a rate of 25 ±4 Hz at 10 33 cm _2 s _1 and an efficiency of 96% 
for the 2 TeV mass. 

Loose requirements in L2 and EF can further reduce these rates with a moderate degradation of 
the efficiency. For definiteness in the calculations in the following sections, we assume that a trigger 
efficiency (applied after all requirements) of 0.90 ±0.10 is achieved with an acceptable rate for all W' 
boson masses. 

4.2 Muon trigger 

The trigger menu and algorithms in the simulation samples are similar to those in the muon trigger study 
and those expected for data acquisition. In contrast to the electron case, lower thresholds can be applied 
thanks to the lower fake rates. A significant decrease in rate is then obtained thanks to an improved 
measurement of p T at each level. Applying a threshold of 20 GeV at each trigger level, we obtain an 
EF rate of 20 ± 10 Hz for an instantaneous luminosity of 10 32 cm _2 s _1 , consistent with the muon study 
prediction of 13 Hz. We measure aW'^^v trigger efficiency for |tj| < 2.5 of 74% for m = 1 TeV and 
73% at 2 TeV. At 10 33 cm~ 2 s _1 , we apply a pj threshold of 40 GeV, the maximum LI value, and obtain 
a trigger rate of 4.1 ±0.7 Hz close to the 5.6 Hz obtained in the trigger study. The corresponding trigger 
efficiency for the m = 2 TeV W' boson is 69%. 

It should be noted that most of the efficiency loss comes from holes in the coverage of the muon 
system, where the reconstruction is also inefficient. 

5 Event Selection 

The decay W' — »■ iv provides a rather clean signature consisting of a high-energy isolated lepton and 
large missing transverse energy. The largest backgrounds are the high-/?/- tail of the W —>■ iv decays and 
tt production. Both these final states are accompanied by significant jet activity, but contain also leptons 
that are as isolated as those expected from W' — ► iv decays. 

A potentially dangerous background is the one arising from fake leptons; since this issue is more 
likely to be significant for electrons than for muons, the backgrounds will be presented separately for 
W' — ► ev and W' — > piv final states. 

5.1 Event Preselection 

In addition to the electron and muon identification criteria described above, events are required to have: 

• Only one reconstructed lepton with pj > 50 GeV within \r\ | < 2.5. 

• Missing transverse energy $ T > 50 GeV. 

Figure 13 shows, on top, differential cross-section as a function of the lepton pj for the m = 1 
TeV and 2 TeV signal samples, Standard Model W boson, tt and dijet production. The dashed vertical 
line shows the requirement value (50 GeV). The bottom plots in Fig. 13 show the distributions for 
the same processes after requiring only one lepton with pj > 50GeV; again, the requirement value (at 
50 GeV) is shown with the dashed vertical line. This selection provides a relatively clean signal in the 
high transverse mass region, as shown in Fig. 14, which shows the differential cross-section as a function 
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Figure 13: Top: leading lepton pj distributions (left: electron events, right: muon events). Bottom: 
$t distribution of events with only one reconstructed lepton with pj > 50 GeV (left: electron 
events, right: muon events). 



of the transverse mass after the requirements on pj and Jf.j. The background can be further rejected 
by exploiting additional observables, described in next sections: lepton isolation, lepton fraction and jet 
veto criteria. 

5.2 Background Rejection 

After the kinematic requirements are applied, the it and dijets backgrounds are still larger than the high- 
mass tail of the Standard Model W boson close to the threshold value on the lepton pj and on Tf.j. 
Since the uncertainties on the rate of these backgrounds are large, it is desirable to bring them below the 
irreducible background from W bosons. To achieve this, additional requirements are imposed on lepton 
isolation and on the lepton fraction, described below. A simpler selection strategy, based on a jet veto, is 
also explored, since it could prove useful during the first stages of data taking. 

5.2.1 Lepton Isolation 

As the lepton from a W' boson decay is expected to be isolated, only events without high energy tracks 
around the lepton trajectory are accepted. The tracking isolation is done by requiring that the sum of the 
p T of tracks in a A/?-cone around the lepton be below a threshold; AR is defined as 

AK = ^/(A0)2 + (An)2, 

where A(j) and Atj are the distances in azimuthal angle and in pseudo-rapidity, respectively, from the 
lepton under consideration. 
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Figure 14: Transverse mass spectrum after the basic kinematic requirements for background and 
signal (mw> = 1 and 2 TeV). Left: electron mode; right: muon mode. 




Signal Efficiency 

Figure 15: it background rejection and signal efficiency for different requirement values on the 
£ Pt (open markers) and (Y,Pt)/ ^riepton (filled markers), for the muon channel. Each marker type 
corresponds to a different value for AR, from 0.1 to 0.5. 



Calorimeter isolation was also explored (the calorimetric energy deposited within the volume be- 
tween two A/?-cones is required to be below a threshold). 

Besides requiring a maximum value of £/?rtracks (of 10 GeV to 1 GeV), the use of a normalized 
isolation requirement was also explored, in which the requirement is applied to the L/?rtracks/priepton 
ratio. This ratio is required to be smaller than 0.1 to 0.01. Five different AR values were used in both 
cases; as shown in Fig. 15 for muons, the normalized isolation selection achieves a higher tt rejection for 
the same efficiencies. The efficiencies and rejections achieved for electrons are similar. 

The calorimeter energy difference in two cones is not only of use in the electron case, but also in 
the muon one. High pj muons coming from W' boson decays can also radiate a lot inside the material 
preceding the MS. This radiation appears as energy depositions close to the muon in the calorimeters. 
As can be seen in Fig. 16 (right), the energy deposition in a cone of AR < 0.1 around the muon is much 
higher, around 30 GeV on average, than the deposition in a cone of AR < 0.5 when the inner cone is 
subtracted (in this case the average is aboout 7 GeV). Moreover, in Fig. 16 (left) it is shown that in the 
majority of the cases, a high reconstructed energy deposition indicates the existence of high final state 
radiation. Therefore, the energy deposition in an inner cone (e.g. AR < 0.1) must be subtracted also in 
the case of muons when isolation criteria based on calorimetry are applied. Also on track based isolation 
criteria an inner cone containing the muon track itself must be subtracted. In this case however, the 
inner cone can be much narrower, since it only needs to be able to exclude the track associated with the 
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Figure 16: Left: the true energy deposition as a function of the reconstructed one in a cone of 
AR=0. 1 for muons coming from decays of m = 2 TeV W' bosons. Right: the solid histogram 
shows the energy recorded in a cone of AR=0. 1 around the muon. The dashed histogram shows 
the energy recorded in a cone of A/?=0.5 after the subtraction of the inner cone deposition. 




Sum p_T ,racks in cone dR=0.3 Sum E T in cone dR=0.3 /p_T mu °" 



Figure 17: Left: distribution of an absolute track based isolation variable for muons. Right: 
distribution of a relative calorimetry based isolation variable for muons. In both cases the inner 
cone of AR=0. 1 is subtracted. 



lepton under consideration. Figure 17 shows the distributions of the isolation energy for different event 
categories. For these plots, muons with pj > 20 GeV are considered. 

For the analysis, a loose requirement of 0.05 is used on the normalized track-based isolation for 
both channels (electron and muon), and no requirement on the calorimeter-based isolation is applied. 
Tracks are included in the sum if 0.02 < AR (track, lepton) < 0.3. This requirement keeps about 99% of 
the signal for both masses (mw> = 1 and 2 TeV), rejects about 10% of the tt events left after the basic 
selection and rejects over 99% of the dijet background. 

5.2.2 Lepton Fraction 

Another variable that can be used to reduce the dijet and tt backgrounds is the "lepton fraction" of the 
event, which can be expressed as Pt ! "°" > ' / (L Pt P "" 1S + HEt), where the scalar sum on the lepton pj 
sums over as well. Essentially this variable measures the fraction of energy that can be attributed to 
leptons (including neutrinos, which are assumed to be the main contribution to $t) in an event. Here, 
out of the visible leptons, only the most energetic one is included in the sum (its pj is added to the Jf.j to 
form Ypj P ' om ). The distribution of this variable is shown for different event categories in Fig. 18 (left). 
As expected, it shows a much lower value for tt events (in pink) than for the rest of the samples used 
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Figure 18: Left: distribution of the lepton fraction variable (see text) for different event categories. 
Right: signal efficiency versus tt efficiency for different requirement values on the lepton fraction 
variable. 



(W 7 boson signals and Standard Model W bosons). The efficiency for signal versus tt events for different 
values of the variable, is shown in Fig. 18 (right). A requirement at 0.5 results in a signal efficiency of 
~ 96% in both channels and a rejection factor of ~45 against the tt background, and it suppresses all the 
remaining dijet events. This value will be used subsequently. 

5.2.3 Jet Veto and Jet Multiplicity Requirements 

A selection procedure based solely on veto-ing events with high jet activity could provide an alternative 
way to extract a signal in this search. Several requirements on jet activity were explored; in some, events 
are rejected if they include any jet over an energy threshold, in others, jet multiplicity information is used. 
The jet veto was applied just after the basic selection (i.e., lepton identification, pj and #t requirements). 

Figure 19 shows the distribution of the pj of the leading jet after the basic selection; the distribution 
on the left corresponds to the electron channel and the one on the right to the muon channel. Tables 2 
and 3 show the expected rates for several jet veto criteria . 

Figure 20 shows how after a 200 GeV jet veto requirement (and without isolation or lepton fraction 
requirements), most of the tt and dijet background is rejected, and the signal to background ratio is good 
for high transverse mass values. Although the signal is reduced by between 5 and 10% with respect to 
selecting on isolation and lepton fraction, a jet veto requirement may be a good tool if the calibration of 
the Y*Et (used to compute the lepton fraction) is not well understood in early data. However, in what 
follows, this requirement is not used. 

5.3 Event Selection Results 

Figure 21 shows the expected transverse momentum spectra for signal and background for both channels 
after all requirements (preselection, isolation, and lepton fraction). The selection requirement flow is 
shown in Tables 4 and 5. The transverse mass requirement has been chosen by minimizing the luminosity 
needed to get a 5a excess. The initial cross-sections for the W' boson signals and for the high mass W 
boson tail include the ^-factor obtained in section 6. 1. 
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Figure 19: Distributions for the pr of the leading jet after the basic selection. Left: electron 
selection. Right: muon selection. 
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Figure 20: mj spectrum after preselection requirements and a jet veto of Ej < 200 GeV. Left: 
events with a high-py electron; right: events with a high pr muon. 
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Figure 21: Expected transverse mass spectra after all requirements. Left: electron channel; right: 
muon channel. 
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a[pb] 


Requirement 


W' ITeV 


W 2TeV 


W 


tt 


Dijets 


No jets with p T > lOOGeV 


2.71(4) 


0.112(2) 


4.74(5) 


7.07(7) 


17±16 


No jets with p T > 200GeV 


3.13(4) 


0.132(2) 


5.09(5) 


15.7(1) 


27±16 


No jets with p T > 500GeV 


3.38(4) 


0.146(2) 


5.18(5) 


18.7(1) 


44±17 


Less than 4 jets with p T > 40GeV 


3.38(4) 


0.148(2) 


5.18(5) 


14.0(1) 


43±17 


Less than 3 jets with p T > 100 GeV 


3.39(4) 


0.148(2) 


5.18(5) 


17.8(1) 


44±17 


Less than 2 jets with p T > 200GeV 


3.38(4) 


0.148(2) 


5.18(5) 


18.4(1) 


44±17 


200GeV veto, m T > 0.7 TeV 


1.73(3) 




0.0290(8) 






200 GeV veto, m T > 1.4 TeV 




0.066(1) 


0.0013(1) 







Table 2: Cross-sections for signal and backgrounds for dijets, tt, W and W' boson samples for different 
requirements on jet content for the electron channel. The number in brackets is the error on the least 
significant digit. 





a[pb] 


Requirement 


W ITeV 


W 2TeV 


W 


ft 


Dijets 


No jets with p T > lOOGeV 


3.22(4) 


0.141(2) 


5.50(5) 


8.77(8) 


2(1) 


No jets with p T > 200GeV 


3.70(4) 


0.166(2) 


5.92(5) 


19.1(1) 


17(4) 


No jets with p T > 500GeV 


3.96(4) 


0.182(2) 


6.04(5) 


22.7(1) 


39(5) 


Less than 4 jets with p T > 40 GeV 


3.98(4) 


0.184(2) 


6.03(5) 


16.9(1) 


53(5) 


Less than 3 jets with p T > 100 GeV 


3.98(4) 


0.185(2) 


6.04(5) 


21.5(1) 


73(5) 


Less than 2 jets with p T > 200 GeV 


3.98(4) 


0.185(2) 


6.04(5) 


22.3(1) 


73(5) 


200GeV veto, m T > 0.7 TeV 


2.07(3) 




0.040(1) 


0.005(2) 




200GeV veto, m T > lATeV 




0.084(1) 


0.0033(8) 


0.0008(8) 





Table 3: Cross-sections for signal and backgrounds for dijets, tt, W and W' boson samples for different 
requirements on jet content for the muon channel. The number in brackets is the error on the least 
significant digit. 



6 Systematic Uncertainties 

6.1 Generator- level Systematic Uncertainties 

The input for the full simulation studies described in earlier sections was obtained by generating W' boson 
events using Pythia [27]. Events in the high-mass tail of the W boson were generated using Pythia 
as well. Both use the default Pythia parton distribution functions (PDFs), CTEQ61, the CTEQ6 [37] 
LO (leading-order) fit with NLO (next-to-leading-order) a s . Here we report on generator-level studies 
which examine the effects of making use of the NLO matrix elements and varying the PDFs. 

6.1.1 Higher Orders 

To evaluate contributions from higher order diagrams, we used MC@NLO [38] input to the HER- 
WIG [39] event generator. Both W and W' boson events were generated using the W boson production 
process with the W boson mass set to the W' boson value for the latter. The W' boson widths were set 
to the values calculated by Pythia. The masses and widths used are listed in Table 6. Both MC@NLO 
and HERWIG were run using the default HERWIG PDFs, MRST2004nlo, the MRST 2004 fit using the 
standard MS scheme at NLO [40]. 

One million events were generated for each generator at each of the masses. The cross-section is 
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<T[pb] 


Requirement 


W (1 TeV) 


W (2 TeV) 


W tail 




Dijets[l-7] 


(No requirement) 


4.99 


0.231 


10.28 


452 


1.91 x 10 1U 


Preselection 


3.67±0.04 


0.160±0.002 


6.80±0.06 


150.57±0.40 


(13.6±0.2) x 10 6 


p T > 50 GeV 


3.43±0.04 


0.150±0.002 


5.53±0.05 


51.13±0.23 


(7.23±0.6) x 10 3 


$ T > 50 GeV 


3.40±0.04 


0.149±0.002 


5.19±0.05 


25.78±0.16 


45.33±16.65 


Isolation 


3.36±0.04 


0.148±0.002 


5.01±0.05 


23.30±0.16 


0.65±0.13 


Lepton fraction 


3.25±0.04 


0.145±0.002 


4.10±0.04 


0.50±0.02 




m T > 700 GeV 


1.86±0.03 




0.0317±0.0008 


0 




m T > 1400 GeV 




0.0740±0.001 


0.0014±0.0002 


0 





Table 4: Cross-section for signal and backgrounds after each requirement. Electron mode. 





C7[pb] 


Requirement 


W (1 TeV) 


W (2 TeV) 


W tail 




Dijets[l-7] 


(No requirement) 


4.99 


0.231 


10.28 


452 


1.91 x 10 1U 


Preselection 


4.28±0.05 


0.199±0.002 


7.77±0.06 


205.30±0.46 


(11.2±0.19) x 10 6 


p T > 50 GeV 


4.03±0.04 


0.187±0.002 


6.40±0.06 


61.71±0.25 


(1.24±0.26) x 10 3 


$ T > 50 GeV 


4.00±0.04 


0.186±0.002 


6.04±0.05 


31.34±0.18 


74.32±23.28 


Isolation 


3.95±0.04 


0.185±0.002 


5.99±0.05 


28.70±0.17 


1.00±0.82 


Lepton fraction 


3. 81 ±0.04 


0.181±0.002 


4.85±0.05 


0.64±0.03 


(1.96±1.38)x 10" 3 


m T > 700 GeV 


2.20±0.03 




0.043±0.002 


0.007±0.003 


0.001±0.001 


m T > 1400 GeV 




0.094±0.0001 


0.0031±0.0006 


0.001±0.001 


0.001±0.001 



Table 5: Cross-section for signal and backgrounds after each requirement. Muon mode. 



calculated for transverse mass above 70% of the W' boson mass, i.e. above the values listed in Table 6. 

We define the ^-factor to be the ratio of the MC@NLO cross-section to that from Pythia. These 
are shown as functions of tj in Fig. 22. 

Integrals of the W' boson and W boson tail differential cross-sections are given in Table 7. The NLO 
predictions are 30-40% higher than those from Pythia, with little change with the variations in scale. 
Although the NLO/LO cross-section and acceptance ratios are of order 40%, the uncertainties on the 
NLO values are expected to be significantly smaller. Also, the QED corrections are partially included 
through PHOTOS [41] for FSR, and should have a small impact on the measurements in case of the 
observation of a signal. 

6.1.2 Parton Distribution Functions 

The LHC will take data in a new energy regime and so we expect significant uncertainty in signal and 
background predictions due to our uncertainty in knowledge of the PDFs. 

The CTEQ6.1 fits include 40 error PDFs corresponding to the two limits on each of 20 eigenvectors. 



M (GeV) 


T(GeV) 


Minimum mj (GeV) 


1000 


34.739 


700 


2000 


70.540 


1400 


3000 


106.390 


2100 



Table 6: Masses and widths used as input to MC@NLO/HERWIG generation of W' boson events. 
The third column gives the lower limit for the masses used to calculate cross-sections. 
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Figure 22: W' boson ^-factors (ratios of MC@NLO and Pythia cross-sections) as functions of X] 
for positive (left) and negative (right) charge for masses of 1 (top), 2 (middle) and 3 TeV (bottom). 
S is the common scale factor. The errors are statistical. 
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Process 




Min. mj 


Pythia a (fb) NLO a (fb) 


^-factor 


S=0.5 


S=2.0 


W'(m = 


1 TeV)+ 


700 


534. (1) 


742. (1) 


1.389 (4) 


1.8% (2) 


-1.8% (2) 


W'(m = 


1 TeV)- 


700 


1204. (1) 


1644. (2) 


1.365 (3) 


1.7% (2) 


-1.8% (2) 


W'(m = 


2 TeV)+ 


1400 


62.6 (1) 


83.0 (1) 


1.327 (3) 


2.7% (2) 


-1.6% (2) 


W'(m = 


2 TeV)- 


1400 


20.3 (6) 


27.7 (4) 


1.362(4) 


3.0% (2) 


-1.4% (2) 


W'{m = 


3 TeV)+ 


2100 


6.73 (1) 


8.69 (1) 


1.292 (3) 


3.7% (2) 


4.4% (2) 


W'(m = 


3 TeV)- 


2100 


1.791 (6) 


2.540 (4) 


1.370 (5) 


3.7% (2) 


4.4% (2) 


W+ 




700 


20.22 (7) 


27.66 (8) 


1.368 (6) 


2.2% (4) 


-0.6% (4) 


w- 




700 


8.93 (5) 


12.56 (4) 


1.407 (9) 


2.6% (5) 


-0.8% (5) 


w+ 




1400 


1.042 (4) 


1.424 (4) 


1.366 (7) 


2.2% (5) 


-1.5% (5) 


w- 




1400 


0.354 (2) 


0.499 (2) 


1.41 (1) 


1.8% (6) 


-1.6% (5) 


w+ 




2100 


0.1231 (3) 


0.1657 (3) 


1.346(4) 


3.0% (3) 


2.4% (3) 


w- 




2100 


0.0346(1) 


0.0492 (1) 


1.421 (6) 


3.1% (3) 


2.5% (3) 



Table 7: Integrated W' boson and W boson tail cross-sections for PYTHIA and MC@NLO with 
common scale factor S=l. Integral is over the full Tj range —2.5 < T] < 2.5. The listed ^-factors 
are the ratios of the integrated MC@NLO and Pythia cross-sections. The last two columns give 
the change in the MC@NLO cross-section when the common scale factor is changed by a factor 
of two. The statistical error in the last digit of each calculated quantity is shown in parentheses. 



These can be used to estimate the uncertainty in predictions obtained with the fit. Figure 23 shows the 
PYTHIA prediction for the m = 1 TeV W' boson differential cross-section as a function of Tj for the 
CTEQ6.1 central value and each of the 40 error sets. Events are required to have transverse mass above 
the threshold in Table 6. The difference in shape between the positively and negatively charged bosons is 
a consequence of the parton distribution functions since W' + are from ud fusion and W'~ from du fusion. 




Figure 23: Muon Tj distributions for positively- and negatively-charged m = 1 TeV W' bosons 
using the CTEQ 6.1 PDF central value (black) and 40 error sets. 

We calculated cross sections for W' boson production with mass of 1 TeV using the CTEQ6.1 cen- 
tral value and error PDFs by integrating over the full Tj range (|t]| < 2.5) in Fig. 23. To estimate the 
overall uncertainty, the positive and negative deviations for each eigenvector were summed separately in 
quadrature for each charge sign. Where both deviations for an eigenvector had the same sign, only the 
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larger magnitude was included in the sums. Table 8 shows the results. 



Process 


Min. mj 


W+ 


W- 


W'{m=\ TeV) 


700 


-4.1% (5), +8.2% (5) 


-11.1% (7), +3.5% (8) 



Table 8: CTEQ6.1 combined error set deviations for W' boson cross-sections. The statistical error 
on the last digit is shown in parentheses. 

Combining all the above, we assign a common ^-factor of 1.37 for all masses and charges and assign 
an 8% uncertainty on this factor. 

6.2 Instrumental Uncertainties 

Detector related uncertainties for these studies can be divided in two categories: the ones related to the 
reconstruction of the leptons and the ones corresponding to the global event activity as the lf.j and the 
jet characteristics. However, the lepton reconstruction uncertainties can be the dominant factor in the $t 
resolution. 

6.2.1 Lepton Reconstruction 

Three main contributions can be identified in this category. The efficiency of lepton identification, as 
well as the fake rates associated with this, the pj or Ej scale and its measurement resolution. 

Systematic errors on the momentum scale of the muons can arise for instance due to the non-perfect 
knowledge of the magnetic field. To take into account such effects, a variation of ±1% is applied to the 
p T of the reconstructed muons. Positive and negative variations are considered separately. In a similar 
way but for energy, a variation of ±0.5% was made for electrons. 

An incomplete understanding of the material distributions inside the detector as well as possible mis- 
alignments in the MS can lead to an additional smearing of the momentum measurement resolution of 
muons. To evaluate the impact of such contributions on the analysis, a smearing, based on early cali- 
brations of o{\/p T ) = 0.011/^r ©0.00017 is applied. The first term enhances the Coulomb scattering 
smearing, while the second enhances the alignment contribution, and is the crucial factor in this study. 

For the energy measurement resolution for electrons, the total o{Ej) is smeared by 0.0073 x Ej, 
which enhances the constant term only. 

Lepton identification efficiency is obviously important for this analysis. The identification efficiency 
can be estimated from the data, using the tag-and-probe method described in [42] for muons in the region 
20 < pr < 50 GeV and extrapolated to higher pj using simulated data. A value of ±5% has been chosen 
for the evaluation of this uncertainty, corresponding to the early running period of integrated luminosities 
Jzf < 100 pb~ ! . In the case of electrons a ±1% variation has been applied. 

6.2.2 Jet Reconstruction 

An uncertainty on the jet energy scale of ±7% was imposed, together with an uncertainty on its resolution 
of o{E T ) = 0.45 x ^Et®5%. 

6.2.3 Missing Energy 

If jets or leptons are systematically shifted, then missing transverse energy should be systematically 
shifted in a known direction. Based on the jet and leptons performance, the missing energy is shifted as 
follows: 
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«. v J7 — V J7 _l T^lepton/jet „lepton/ jet 

• L^T( S hifted) ~ L^T +t T ~ ^{shifted) 

In the case of muons, momentum is used instead of energy. 
6.2.4 Summary of Experimental Systematic Uncertainties 

The effects of the experimental uncertainties are summarized in Tables 9 and 10. In the high-pr range 



Description of systematic 


electrons 


muons 


8 S [%] 


8b [%] 


8 S [%] 


8b [%] 


Lepton energy scale + 


+0.8 


+1.8 


+1.2 


+4.6 


Lepton energy scale - 


-0.7 


-2.1 


-1.2 


-4.4 


Lepton energy resolution 


+0.1 


+0.2 


-1.0 


+3.7 


Lepton identification efficiency + 


+1.0 


+1.0 


+5. 


+5. 


Lepton identification efficiency - 


-1.0 


-1.0 


-5. 


-5. 


Jet energy scale + 


+0.1 


-0.2 


-0.1 


+0.1 


Jet energy scale - 


+0.1 


-0.2 


+0.1 


+0.7 


Jet energy resolution 


+0.0 


+0.1 


-0.1 


+0.3 


Luminosity 


±3. 


±3. 



Table 9: Effect of the detector systematics in percentage for m w i = 1 TeV. <5 S is the uncertainty on 
the signal, 8b is the uncertainty on the background. 



Description of systematic 


electrons 


muons 


8 S [%] 


8 b [%] 


8 S [%] 


8 b [%] 


Lepton energy scale + 


+0.7 


+1.2 


+1.5 


+3.4 


Lepton energy scale - 


-0.4 


-3.7 


-1.7 


-2.5 


Lepton energy resolution 


-0.03 


0.0 


-4.2 


+6.8 


Lepton identification efficiency + 


+1.0 


+1.0 


+5. 


+5. 


Lepton identification efficiency - 


-1.0 


-1.0 


-5. 


-5. 


Jet energy scale + 


+0.1 


1.2 


+0.1 


+0.8 


Jet energy scale - 


-0.1 


0 


-0.3 


-0.1 


Jet energy resolution 


-0.1 


0 


+0.1 


-0.1 


Luminosity 


±3. 


±3. 



Table 10: Effect of the detector systematics in percentage for m w > = 2 TeV. 8 S is the uncertainty 
on the signal, 5/, is the uncertainty on the background. 



under consideration, the systematic uncertainties on the quality of single lepton reconstruction have a 
stronger effect on the muon channel, for which there are comparable contributions from energy scale, 
resolution and identification efficiency (with the resolution uncertainty becoming more important for a 
higher W' mass); out of these three, the energy scale uncertainty dominates in the electron channel for 
mw> = 1 TeV, but becomes less important for m^i = 2 TeV. Jet uncertainties do not play a strong role 
on either channel. 
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7 Discovery Potential 

In order to assess the ATLAS discovery potential in the search for a W' — ► t + #t signal, the luminosity 
needed for a 5a excess is obtained as a function of the mass of the W' boson. 

The significance is obtained from the expected number of signal and background events in the region 
m T > 0.7m W i, where m w > is the mass of the hypothesized W' boson. Calling these expected numbers s 
and b, respectively, the significance S is obtained as 

S=y/2((s + b)]n(l+s/b)-s) 

which gives a good approximation to the likelihood-ratio based significance in the low statistics regime. 
Figure 24 shows the expected integrated luminosity needed for a 5-sigma excess as a function of the 
mass of the W' boson. 

Higher order corrections for W' /W — >• £v processes are taken into account as stated in section 6.1. 
Systematic uncertainties listed in Tables 9 and 10 are taken into account by increasing the expected 
background by the sum in quadrature of its positive expected variations, and by reducing the signal by 
the sum in quadrature of its expected negative variations; this assumes no correlations of the expected 
signal and background expectations and, as a result, produces a conservative estimate. 

For comparison, the integrated luminosity values for a 5a significance were also obtained taking 
into account the shape of the signal and background mj distributions. This was done using a technique 
in which, instead of an ensemble of Monte Carlo pseudo-experiments [43], a Fast Fourier Transform 
(FFT) is used to calculate the experimental estimator distributions [44]. This method allows a fast de- 
termination of the probability that background fluctuations produce a signal-like result, but it depends 
on the assumption that both the location of the signal and its shape are well known. Treating each bin 
of the transverse mass distribution as an independent search channel, and combining them accordingly, 
the resulting sensitivity is in general higher than the estimation given in the number counting approach. 
With this method, the luminosity required for a 5a effect was reduced between 20 and 35% with respect 
to the values shown in Fig. 24. 

Even for very low integrated luminosities (of the order of picobarns), a W' boson with a mass above 
the current experimental limits could be found with a significance in excess of 5a, while, with 1 fb -1 , 
masses of the order of 3 TeV can be reached. As an illustration, Figs 25 and 26 show Monte Carlo 
outcomes of pseudo-experiments corresponding to lOpb" 1 and 100 pb" 1 , respectively, for both channels. 
The solid line histograms depict the expected background, those in dotted lines the m = 1 TeV W' boson 
signal and the dashed-dotted line histograms show possible m = 2 TeV W' boson signals. 

8 Summary and Conclusion 

The potential for the ATLAS experiment to reconstruct and identify the decay of a heavy, charged gauge 
boson into a lepton and a neutrino has been studied. Various systematic and theoretical uncertainties 
have been considered, as well as plausible estimations of our uncertainties about the performance of the 
detector in the early stages of data taking. These studies show that, even with integrated luminosities 
as low as 10 pb" 1 of data, it would be possible to discover this type of bosons, should they exist not 
far beyond the current experimental limits and have Standard Model like couplings. With an integrated 
luminosity of a few fb" 1 , ATLAS has the potential to discover these particles for masses up to 4 TeV. 
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Figure 24: Integrated luminosity needed to have a 5a discovery as a function of the mass of the 
W' bosons; triangles correspond to the ev search, squares to }XV, circles to the combined search. 
Filled markers include the effect of systematic uncertainties. 




Figure 25: Monte Carlo pseudo-experiment for 10 pb l . Left: electron channel; right: muon 




2.5 



Figure 26: Monte Carlo pseudo-experiment for 100 pb l . Left: electron channel; right: muon 
channel. 
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Search for Leptoquark Pairs and Majorana Neutrinos from 
Right-Handed W Boson Decays in Dilepton-Jets Final States 

Abstract 

Final states with high-pr leptons and jets are predicted by many Beyond the 
Standard Model scenarios. Two prominent models are used here as guides 
to understanding the event topologies: the scalar leptoquarks and the Left- 
Right Symmetry. In contrast to many SUSY signatures, their topologies rarely 
contain missing energy. Their discovery potential with early ATLAS data, 
corresponding to an integrated luminosity of a few hundred inverse picobarns, 
is discussed. 



1 Introduction 

Grand Unification has inspired many extensions of the Standard Model. Such models introduce new, 
usually very heavy particles, and previous searches for Grand Unification Theory (GUT) signatures have 
placed limits on masses and interaction strengths of the new particles. The LHC will probe new regions 
of parameter space, allowing for a direct search for these particles. Decays characterized by final states 
with two highly energetic leptons, two jets and no missing transverse energy are studied in this note. 
The models for new physics considered for this note are described below. The simulation of signal and 
background processes is described in section 2. In section 4 the baseline selection that is used for all 
analyses is explained. After the trigger requirements are given (section 3), section 5 details the specifics 
of each of the analyses. The systematics are described in section 6 and the final sensitivity estimates are 
given in section 7. 

1.1 Leptoquarks 

The experimentally observed symmetry between leptons and quarks has motivated the search for lepto- 
quarks (LQ), hypothetical bosons carrying both quark and lepton quantum numbers, as well as fractional 
electric charge [1-5]. Leptoquarks could, in principle, decay into any combination of a lepton and a 
quark. Experimental limits on lepton number violation, flavor-changing neutral currents, and proton de- 
cay favour three generations of leptoquarks. In such a scenario, each leptoquark couples to a lepton and 
a quark from the same Standard Model generation [6]. Leptoquarks can either be produced in pairs by 
the strong interaction or in association with a lepton via the leptoquark-quark-lepton coupling. Figure 1 
shows the Feynman diagrams for leptoquark production processes accessible at the LHC. 

This note describes the search for leptoquarks decaying to either an electron and a quark or a muon 
and a quark. The branching ratio of a leptoquark to a charged lepton and a quark is denoted as /3. 
Decays to neutrinos are not considered, and events are not explicitly selected based on the flavor of the 
quark. The experiments at the Tevatron have searched for first (decaying to eq), second (decaying to jj,q), 
and third (decaying to xq) generation scalar leptoquarks. For /3 = £%(LQ —>■ i^q) = 1, the D0 [7] and 
CDF [8] collaborations have set 95%CL limits for first generation scalar leptoquarks of otlq, > 256 GeV 
and mLQj > 236 GeV, respectively. These limits are based on integrated pp luminosities of approxi- 
mately 250 pb -1 and 200 pb -1 . The results for second generation leptoquarks, > 251 GeV and 
m L Q 2 > 226 GeV, were obtained with 300 pb _1 and 200 pb _1 by the D0 [9] and CDF [10] experiments, 
respectively. 

The Tevatron exclusion limits are expected to reach 300-350 GeV in the near future. 
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Figure 1 : Feynman diagrams for leptoquark production. 
1.2 Left-Right Symmetry 

Left-Right Symmetric Models (LRSMs) of the weak interaction address two important topics: the 
nonzero masses of the three known left-handed neutrinos [11] and baryogenesis. LRSMs conserve parity 
at high energies by introducing three new heavy right-handed Majorana neutrinos N e , and N z . The 
smallest gauge group that implements an LRSM is SU(2)l x SU(2)r x U(\)b-l- At low energies, the 
left-right symmetry is broken and parity is violated. The Majorana nature of the new heavy neutrinos 
explains the masses of the three left-handed neutrinos through the see-saw mechanism [12]. The lepton 
number L could be violated in processes that involve the Majorana neutrinos. This opens a window to the 
very attractive theoretical scenario for baryogenesis via leptogenesis, where baryon and lepton numbers 
B and L are violated but B - L is conserved. 

In addition to the Majorana neutrinos, most general LRSMs also introduce the new intermediate 
vector bosons Wr and Z', Higgs bosons, and a left-right mixing parameter. The most restrictive lower 
limit on the mass of the Wr boson comes from the Ki — K$ mass difference which requires mw R > 
1.6 TeV. This lower limit is subject to large corrections from higher-order QCD effects. Heavy right- 
handed Majorana neutrinos with masses of about a few hundred GeV would be consistent with the data 
from supernova SN1987A. Such heavy neutrinos would allow for a W R boson at the TeV mass scale. 
This scenario would also be consistent with LEP data on the invisible width of the Z boson. Present 
experimental data on neutral currents imply a lower limit on the mass of a Z' boson of approximately 
400 GeV. Recent direct searches [13] for the Wr boson at D0 give a lower mass limit of 739 GeV and 
768 GeV, assuming the Wr boson could decay to both lepton pairs and quark pairs, or only to quark pairs, 
respectively. However, heavy Majorana neutrinos decaying to a lepton and a pair of quarks (detected as 
jets) were not searched for in those analyses. 

The new intermediate vector bosons Wr and Z' would be produced at the LHC via the Drell-Yan 
(DY) process like Standard Model W and Z bosons. Their decays would be a source of new Majorana 
neutrinos. The Feynman diagram for Wr boson production and its subsequent decay to a Majorana 
neutrino is shown in Fig. 2. This note describes an analysis of Wr boson production and its decays 
Wr — > eN e and Wr — > flN^, followed by the decays N e — »■ eq'q and — »■ Hq'q, which can be detected in 
final states with (at least) two leptons and two jets. 



1751 



Exotics - Search for Leptoquark Pairs and Majorana Neutrinos from Right- 




q' 

Figure 2: Feynman diagram for Wr boson production and its decay to a Majorana neutrino Ng. 



m LQ in GeV 


a(pp — > LQLQ) (NLO) inpb 


300 


10.1 ± 1.5 


400 


2.24 ± 0.38 


600 


0.225 ± 0.048 


800 


0.0378 ± 0.0105 



Table 1: NLO cross-sections for scalar leptoquark pair production at the LHC [16]. 



2 Simulation of Physics Processes 

2.1 Leptoquarks 

The signals have been studied using samples of first generation (1st gen.) and second generation (2nd gen.) 
scalar leptoquarks simulated with the Monte Carlo (MC) generator Pythia [14] and using the CTEQ6L1 
parameterization [15] of the parton density functions (PDFs). A leptoquark-lepton-quark coupling X = 
0.8 was used in the event generation leading to a natural width of the leptoquarks of 0.63 GeV and 
1.3 GeV for leptoquark masses of 400 GeV and 800 GeV respectively. The next to leading order 
(NLO) cross-sections for leptoquark pair production at 14 TeV pp centre-of-mass energy were taken 
from Ref. [16] and are shown in Table 1 for the four simulated leptoquark masses. 

2.2 Left-Right Symmetry 

Studies of the discovery potential for Wr bosons and the Majorana neutrinos, N e and produced in their 
decays, were performed using datasets simulated with the MC generator Pythia according to a particu- 
lar implementation [17] of an LRSM described in [18]. The Standard Model axial and vector couplings, 
the CKM matrix for the quark sector, no mixing between the new and Standard Model intermediate vector 
bosons, and phase space isotropic decays of Majorana neutrinos are assumed for the right-handed sec- 
tor in this model. The products of leading-order production cross-sections o(pp — > WrX) and branching 
fractions to studied final states W R £N e ttjj are 24. 8 pb for m WR = 1 800 GeV, m Ne = m Nfl = 300 GeV 
and 47.0 pb for mw R = 1500 GeV,m^ e = mjq^ = 500 GeV. In the rest of this note, these samples are re- 
ferred to as LRSM_18_3 and LRSM_15_5, respectively. The Majorana nature of the new heavy neutrinos 
allows for same-sign and opposite-sign dileptons. 
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2.3 Background Processes 

The main sources of background for the analyses presented here are tt and inclusive Z/y* production pro- 
cesses. Multijet production, where two jets are misidentified as leptons, represents another background. 
In addition, minor contributions arise from diboson production. Other potential background sources, 
such as single-top production, were also studied. Their contribution was found to be insignificant. 

• Z/y* background was studied using a combination of two MC samples with generator-level dilep- 
ton invariant mass preselections of m« > 60 GeV and m« > 150 GeV, the latter sample corre- 
sponding to a much larger integrated luminosity than the former. The samples were normalized to 
the given luminosity using their partial cross-sections and the NLO estimate o(pp —> Z) x 3§(Z — > 
£+£-) = 2032 pb, obtained with the MC generator FEWZ [19,20]. A lepton filter was applied at 
the event generation, requiring at least one electron or muon with transverse momentum greater 
than 10 GeV and absolute pseudo-rapidity smaller than 2.7, resulting in an effective cross-section 
of 1808 pb. 

For logistical reasons, the sample with the lower mass preselection was generated using the MC 
generator Pythia [14], and the sample with higher mass preselection was generated using HER- 
WIG [21]. In both cases, the CTEQ6L1 [15] parton distribution functions were used. The consis- 
tency between the two samples was verified at high dilepton masses. 

• background was simulated using the MC generator MC@NLO [22] using the CTEQ6M [15] 
parton distribution functions. It was normalized to the given integrated luminosity using a produc- 
tion cross-section of 833 pb estimated to the next-to-leading order (NLO+NLL) [23]. In addition, 
a lepton filter was applied, requiring at least one electron or muon with transverse momentum 
greater than 1 GeV, which resulted in an effective cross-section of 450 pb. 

• The diboson samples were generated using HERWIG with a generator-level preselection on the 
invariant mass of Z/y* > 20 GeV. With this requirement, the NLO partial cross-sections for WW, 
WZ and ZZ boson pair production processes were numerically estimated (using MC@NLO) to be 
117.6 pb, 56.4 pb, 17.8 pb, respectively. The CTEQ6L1 parton distribution functions were used 
for event generation. Again, a lepton filter was applied, with a transverse momentum threshold 
of 10 GeV and a maximum absolute pseudo-rapidity of 2.8. This resulted in a total effective 
cross-section of 60.9 pb. 

• The multijet background was simulated using Pythia with the CTEQ6L1 structure functions. 
The normalization was based on Pythia cross-section estimates. The statistics of these samples 
are very limited, such that no reliable estimate of this background could be made at this time. 

3 Trigger Requirements 

The trigger system [24] of the ATLAS experiment has three levels, LI, L2 and the Event Filter (EF). To 
ensure high overall trigger efficiencies, our analyses rely on single lepton trigger streams with relatively 
low thresholds. The dielectron analyses rely on the single electron-based trigger called e55 which has a 
threshold of around 60 GeV [24]. When selected events fail this trigger, the analyses rely on the lower- 
threshold (about 25 GeV) single electron trigger called e22i [24] in which the electron is required to be 
isolated. A single muon trigger with threshold about 20 GeV (mu20 [24]) is used in the dimuon analyses. 

Final states studied in this note always contain two high-pr leptons. While the baseline selection 
described in section 4 requires two leptons with pj > 20 GeV, most signal events contain at least one 
lepton with significantly higher p T . As a result, the overall trigger efficiency for events that satisfy all 
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analysis selection criteria (section 5) exceeds 95%. The trigger efficiencies for signal MC events that 
satisfy all selection criteria are shown in Table 2. 



Process 


LI 


L2 


EF 


L1*L2*EF 


1st gen. leptoquarks m LQ = 400 GeV 
2nd gen. leptoquarks m^Q = 400 GeV 


100.0% 
97.7% 


99.4% 
99.1% 


97.6% 
99.7% 


97.0% 
96.5% 


LRSM (ee) m WR = 1800 GeV, m Ne = 300 GeV 
LRSM (jip) m WR = 1800 GeV, m Nlk = 300 GeV 


100.0% 
96.8% 


99.2% 
98.7% 


97.2% 
98.9% 


96.4% 
94.5% 



Table 2: Overall trigger efficiencies for signal events that satisfy all selection criteria. 



4 Baseline Event Selection 

The baseline event selection, common for all analyses presented in this note, requires two leptons and 
two jets. All analyses use the same selection criteria for signal electron, muon, and jet candidates. The 
baseline selection criteria for these reconstructed objects are summarized below. Performance studies 
are described elsewhere [25-28]. 

Electron candidates are identified as energy clusters reconstructed in the liquid argon electromagnetic 
calorimeter that match tracks reconstructed in the inner tracking detector and satisfy the medium electron 
identification requirements [25]. 

Muon candidates are identified as tracks reconstructed in the muon spectrometer [26] that, when 
extrapolated to the beam axis, match a track reconstructed in the inner detector, and satisfy relative 
isolation energy requirements Ej°j pj < 0.3. pj is the muon candidate's transverse momentum and E l f 
is the energy detected in the calorimeters in a cone of AR=y / ATj 2 + A0 2 =O.2 around the muon candidate's 
reconstructed trajectory, corrected for the expected energy deposition by a muon. 

Jets are identified as energy clusters reconstructed in the calorimeters using a AR=0.4 cone algo- 
rithm [27]. AR between a jet and any electron candidate (as defined above) must be larger than 0.1. This 
veto is imposed to avoid electrons being misidentified as jets. It is applied in all analyses, regardless of 
whether electrons are explicitly considered in the final states or not. The jet energy scale calibration is 
performed using full MC simulation and requires that the average reconstructed jet energy agrees with the 
average energy of the jets reconstructed with the Monte Carlo truth particles. The same jet reconstruction 
algorithm, with cone size AR = 0.4, is used for both reconstruction and calibration. 

All objects are required to have pr > 20 GeV, the leptons must have an absolute pseudo-rapidity | r\ \ 
smaller than 2.5 and jets must have |tj| < 4.5. 

To suppress contributions from Drell-Yan backgrounds, the dilepton invariant mass is required to be 
at least 70 GeV. Tighter analysis-specific requirements are later applied to this and other variables in 
order to achieve the best sensitivities in individual studies, as described in the following section. 

5 Individual Analyses 

5.1 Search for Leptoquark Pair Production 

Following the baseline object identification criteria described above, the leptoquark pair analyses require 
events to have at least two oppositely charged leptons of the same flavour and at least two jets. Signal 
sensitivity and discovery potential are estimated using a sliding mass window algorithm: only events in 
the mass region around the assumed mass of the leptoquark are analyzed. 
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For large leptoquark masses, signal leptons and jets have, on average, larger transverse momenta 
than background particles. The following kinematic quantities are used to separate the signal from back- 
grounds: the transverse momentum of the leptons (pr ), the scalar sum of the transverse momenta of 
the two most energetic jets and leptons (S T = L|Pr|y e , + 'L\pT\i ep ), the dilepton invariant mass (m«), 
and lepton-jet invariant mass. The lepton-jet invariant mass represents the mass of the leptoquark if the 
correct lepton-jet combination is chosen. Since there are two leptons and two jets there are two possible 
combinations, and we choose the combination which gives the smallest difference between the masses 
of the first and second leptoquark candidates. 



Physics 


Before 


Baseline 


S T > 


m ee > 


m\- - mf- window (GeV) 


sample 


selection 


selection 


490 GeV 


120 GeV 


[320-480] - 


[700-900] - 












[320-480] 


[700-900] 


LQ (m = 400 GeV) 


2.24 


1.12 


1.07 


1.00 


0.534 




LQ (m = 800 GeV) 


0.0378 


0.0177 


0.0177 


0.0174 




0.0075 


Z/f > 60 GeV 


1808. 


49.77 


0.722 


0.0664 


0.0036 


0.00045 


tt 


450. 


3.23 


0.298 


0.215 


0.0144 


< 0.0012 


Vector Boson pairs 


60.9 


0.610 


0.0174 


0.00384 


< 0.002 


< 0.0014 


Multijet 


10 s 


20.51 


0.229 


0.184 


0.0 


0.0 



Table 3: 1st generation leptoquark analysis. Partial cross-sections (pb) that survive selection criteria. 
The upper limits are given at 68% confidence level. 



Physics 


Before 


Baseline 


p$>60 GeV 


S T > 


m m > 


mij window (GeV) 


sample 


selection 


selection 


p^>25 GeV 


600 GeV 


110 GeV 


[300-500] 


[600-1000] 


LQ (400 GeV) 


2.24 


1.70 


1.53 


1.27 


1.23 


0.974 




LQ (800 GeV) 


0.0378 


0.0313 


0.0306 


0.0304 


0.030 




0.0217 


Z/f >60 GeV 


1808. 


79.99 


2.975 


0.338 


0.0611 


0.021 


0.014 


tt 


450. 


4.17 


0.698 


0.0791 


0.0758 


0.0271 


0.0065 


VB pairs 


60.9 


0.876 


0.0654 


0.00864 


0.00316 


0.00185 


0.00076 


Multijet 


10 s 


0.0 


0.0 


0.0 


0.0 


0.0 


0.0 



Table 4: 2nd generation leptoquark analysis. Partial cross-sections (pb) that survive selection criteria. 

In both channels, the values of these selection criteria are optimized 1 to achieve discovery with 5a 
significance at the lowest luminosity possible. Tables 3 and 4 show the values of the selection criteria 
and resulting signal and background cross-sections for 1st and 2nd generation channels, respectively. 
One important difference between the two channels is the background due to jets being misidentified 
as electrons. This background can be significantly reduced by requiring both reconstructed jet-electron 
masses, (mjj, mfj), to be close to the tested leptoquark mass. However, such a selection in the 2nd gen- 
eration analysis would significantly reduce the signal efficiency, especially for larger leptoquark masses. 
Therefore, a less strigent selection is applied, and only the average of the two muon-jet masses {mfj) is 
required to be near the tested leptoquark mass. 

Figure 3 shows the St variable distribution with m^Q = 400 GeV, along with the main backgrounds, 
Drell-Yan and tt production, after baseline selection plus, for the 2nd generation case, the requirements 

1 At this stage, only statistical uncertainties are taken into account. 
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Figure 3: St in leptoquark MC events {m^Q = 400 GeV) after baseline selection. Left: 1st generation, right: 2nd 
generation with the additional requirements pj > 60 GeV and p J T > 25 GeV. 



p$ > 60 GeV and p J T et > 25 GeV. 
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Figure 4: mm of the selected lepton pair after St selection in leptoquark 1st generation (left) and 2nd generation 
(right) events (m LQ = 400 GeV). 

The dilepton mass distribution after the Sj selection is shown in Figure 4. 

Figures 5 and 6 show the reconstructed invariant mass of leptoquark candidates (m/,g=400 GeV) in 
signal events and the main backgrounds, Drell-Yan and tt production, after the subsequent selections 
on dimuon mass and St- Due to gluon radiation, quarks produced in the decays of heavy particles are 
not equivalent to standard jets. This shifts the peak of the jet energy resolution function towards smaller 
energies and results in a low-mass shoulder in the distribution of reconstructed masses of heavy particles. 
Figure 5 shows two entries per event corresponding to the two reconstructed electron-jet objects obtained 
by adding x and y mass projections of (mjj, mfj) on a common axis, mij. 

All figures show predicted distributions for an integrated luminosity of 100 pb _1 . 

The trigger efficiency is not included in the plots and tables shown in this section. However events 
satisfying all selection criteria would trigger with an efficiency exceeding 95%, as discussed in Section 3. 
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Figure 5: Reconstructed electron-jet invariant mass in the 1st generation leptoquark (m£g=400 GeV) analysis 
for signal and background MC events after baseline selection (left) and after all selection criteria (right). All 
distributions are given for 100 pb~' of integrated luminosity. 
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Figure 6: Reconstructed muon-jet invariant mass for 2nd generation leptoquarks (m L Q = 400 GeV) in signal and 
background MC events after baseline selection (left) and after all selection criteria (right). All distributions are 
given for 100 pb~' of integrated luminosity. 

5.2 Search for New Particles from Left- Right Symmetric Models 

Signal event candidates are required to contain (at least) two electron or muon candidates and two or 
more jets that pass the baseline selection criteria. As previously described, the minimum separation 
between a jet and an electron candidate AR > 0.1 is required. The two leading pj lepton candidates and 
the two leading pj jets are assumed to be the decay products of the Wr boson. The signal jet candidates 
are combined with each signal lepton, and the combination that gives the smallest invariant mass is 
considered as the heavy neutrino (Nc in Fig. 2). This assignment is correct in more than 99% of signal 
MC events. The other lepton is assumed to come directly from the decay of the Wr boson. 

When the Wr boson is at least twice as heavy as the Majorana neutrino, the daughter lepton from the 
neutrino's decay often begins to partially merge with one of the daughter jets. In the dielectron analysis, 
when the separation between this lepton and a signal jet candidate is in the range 0.1 < AR < 0.4, using 
all three reconstructed objects to estimate the invariant mass of the neutrino would often result in double- 
counting. To solve this problem, signal event candidates in the dielectron analysis are divided into two 
groups. When the separation is outside the discussed range, i.e. AR > 0.4, all three objects are used. 
However, when the separation is in the critical range, i.e. 0.1 < AR < 0.4, only jets are used to estimate 
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the mass of the N e neutrino. It must be noted that this procedure has little effect on the N e neutrino mass 
resolution, because it is dominated by the resolution on the jets energy. The fraction of events falling in 
the critical range depends on the ratio of Wr boson and Majorana neutrino masses, and increases with 
this ratio, being 8% for the 1500 GeV to 500 GeV ratio and 26% for the 1800 GeV to 300 GeV ratio 
considered here. No such problem exists in the dimuon analysis because muon reconstruction is possible 
even when the reconstructed trajectory's projection into the calorimeters randomly coincides with jet 
activity. The mass of the Majorana neutrino can be reconstructed with a relative resolution of about 
6%, and the mass of the Wr boson can be reconstructed with a relative resolution of 5% to 8%; better 
resolution on the latter is achieved in the dielectron analyses because the muon spectrometer resolution 
is degraded at high transverse momenta. 




0 500 1000 1500 2000 2500 0 500 1000 1500 2000 2500 

S T [ GeV ], dielectron channel S T [ GeV ], dimjon channel 



Figure 7: LRSM analysis. St distributions for signals and backgrounds normalized to 100 pb _1 of integrated 
luminosity after baseline selection in dielectron (left) and dimuon (right) analyses. Vertical lines indicate the region 
used in the analysis. 

While the main background sources in LRSM analyses are tt, Z/y* , and vector boson pair production 
processes, multijets were also identified as a source of potentially dangerous background in the dielectron 
analysis. The distributions of the scalar sum of signal object candidates' transverse momenta Sj, and the 
reconstructed dilepton invariant mass mm for signal and background events, normalized to an integrated 
luminosity of 100 pb~\ are shown in Figs. 7 and 8. 

The choice of the selection criteria St > 700 GeV and mu > 300 GeV is made in order to maintain 
good efficiency not only for mass values used in this study, but also for signals with mw R > 1000 GeV. 

Partial cross-sections for signal and background processes passing the selection criteria are shown 
in Tables 5 and 6. Some remarks are in order concerning the selection criteria's efficiencies. First, the 
dimuon channel is more efficient than the dielectron channel. This is due to the jet-electron merging 
discussed previously. This issue becomes especially important for a larger ratio of masses mw R /mN e - 
However, for a very heavy Wr boson, the dielectron channel could become more significant because the 
Wr boson mass resolution does not become as wide in the dielectron channel as it does in the dimuon 
channel. Also, because of its heavy mass, the potential to discover the Wr boson and the heavy neutrino 
together is much better than in the inclusive search for the new heavy neutrino (assuming the same 
production mechanism) because of backgrounds. 



1758 



Exotics - Search for Leptoquark Pairs and Majorana Neutrinos from Right- 




100 600 1100 1600 100 600 1100 1600 

M(ee) [ GeV ] M(nn) [ GeV ] 



Figure 8: LRSM analysis. The distributions of m« for signals and backgrounds normalized to 100 pb _1 of 
integrated luminosity after baseline selection in dielectron (left) and dimuon (right) analyses. Vertical lines indicate 
the region used in the analysis. 



Physics 


Before 


Baseline 


m e jj 




m ee 


St 


sample 
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0.0882 
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LRSM_15_5 


0.470 
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Z/f,m> 60 GeV 


1808. 
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43.36 
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0.0132 


0.0064 


tt 
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3.23 


3.13 


0.215 


0.0422 


0.0165 


VB pairs 


60.9 


0.610 


0.522 


0.0160 


0.0016 


0.0002 


Multijet 


10* 


20.51 


19.67 


0.0490 


0.0444 


0.0444 



Table 5: LRSM dielectron analysis. Partial cross-sections (pb) that survive the selection criteria. 



Figures 9 and 10 show the distributions of the reconstructed invariant masses of the heavy neutrino 
and Wr boson candidates for signal and background MC samples before and after the selection criteria 
are applied. All distributions are normalized to 100 pb -1 of integrated luminosity. It should be remarked 
that the trigger efficiency is not included in the plots and tables shown in this section. However, events 
satisfying all selection criteria would trigger with an efficiency exceeding 95%, as discussed in Section 3. 

Background contributions to signal invariant mass spectra could also arise from jets that are misiden- 
tified as signal electrons. In principle, such misidentified jets are efficiently suppressed because at least 
two signal electron candidates are required, but at present this background remains poorly understood be- 
cause larger statistics of multijet MC, or better, real data, would be necessary to evaluate its contribution 
reliably. If needed, a better suppression of events with multijets that are misidentified as electrons is pos- 
sible by applying a more sophisticated isolation energy requirement. The multijet background does not 
pose a problem in the dimuon analysis, where estimates of the misidentification rate predict a vanishing 
contribution from multijet to dimuon events. 
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Table 6: LRSM dimuon analysis. Partial cross-sections (pb) that survive selection criteria. 



Finally, the analyses described in this note do not discriminate between same-sign and opposite-sign 
dileptons. Same-sign dileptons, however, are a very important signature of Majorana neutrinos, which, 
being their own anti-particles, could decay to a lepton of either charge. The background contribution to 
same-sign dileptons is much smaller than to opposite-sign dileptons. Of course, both channels would 
have to be studied if the discovery is made. The studies of charge misidentification performed in the 
framework of the presented analyses, predict a rate as high as 5% for high-p^ leptons which is strongly 
r\ -dependent. 

6 Systematic Uncertainties 

The following sources of systematic uncertainties have been considered in the described analyses: 

• 20% uncertainty was assumed on the integrated luminosity. 

• In the dielectron analyses, 1 % was used for the uncertainty in overall trigger efficiency. 

• For electron identification and reconstruction efficiency, an uncertainty of 1 % was assumed. 

• For muon identification, including trigger and reconstruction efficiencies, an uncertainty of 5% 
was assumed. 

• The uncertainty on the electron energy scale was assumed to be ±1%. 

• The uncertainty on the muon momentum scale was assumed to be ±1%. 

• The uncertainty on the jet energy scale was estimated by changing the energies of all jets simulta- 
neously by ±10% and ±20%, for \ r\ jet \ < 3.2 and \ r\ jet \ > 3.2, respectively. 

• The 20% uncertainty in electron pj resolution was estimated using a Gaussian smearing of pr 
with a relative width of 0.66 * (0.10/^700.007), where p T is in GeV. 

• The uncertainty due to muon l/pr resolution was estimated using a Gaussian smearing of l/pr 
with a width of 0.01 \/p T 0 0.00017, where p T is in GeV. 

• The uncertainty due to jet energy resolution was estimated using a Gaussian smearing of jet 
energies in such a way that the relative jet energy resolution widens from 0.60/\/£ 00.05 to 
0.75/ ^£00.07 for \r} jet \ < 3.2, and from 0.90/ \/E 00.07 to 1.1O/^E0O.1O for \r\ jet \ > 3.2, 
where E is in GeV. 
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Figure 9: LRSM analysis. The distributions of the reconstructed invariant masses for N e (top) and (bottom) 
candidates in background and signal (LRSM_18_3 and LRSM_15_5) events before (left) and after (right) back- 
ground suppression is performed in dielectron and dimuon analyses. All distributions are normalized to 100 pb~' 
of integrated luminosity. LRSM_15_5 and LRSM_18_3 refer to two sets of LRSM mass hypotheses. See the text 
for more information. 

• Statistical uncertainties on the number of background MC events were considered as systematic 
uncertainties on the number of background events. 

• The systematic uncertainty on the leptoquark cross-section (NLO) [16] was calculated by taking 
the 40 PDF CTEQ6M tables (two per eigenvector of PDF variations, provided by the CTEQ group 
for calculating uncertainties [15]), recalculating the leptoquark cross-section with each of these 
tables, and taking the largest difference of the two variations for each of the 20 eigenvectors to 
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Figure 10: LRSM analysis. The distributions of the reconstructed invariant masses for Wr — » eN e (top) and 
Wr — »• fiN^ (bottom) candidates in background and signal (LRSM_18_3 and LRSM_15_5) events before (left) 
and after (right) background suppression is performed in dielectron and dimuon analyses. All distributions are 
normalized to 100 pb _1 of integrated luminosity. Notice that the invariant mass of the Wr boson is shown before the 
requirement m^jj > 1000 GeV is imposed. This variable is strongly correlated with the background-suppressing 
variables St and my*. LRSM_15_5 and LRSM_18_3 refer to two sets of LRSM mass hypotheses. See the text for 
more information. 

the cross-section calculated with the standard CTEQ6M table. The estimate shown is the sum in 
quadrature of these 20 differences and the relative difference in cross-section obtained by varying 
renormalization and factorization scales by a factor of 2. The systematic uncertainty is between 
15% and 28% for the tested leptoquark masses. 

• The uncertainty of the jet modeling in Z/y* events was estimated by comparing the background 
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predictions obtained using MC samples produced with Pythia to MC samples produced with 
Alpgen. For the leptoquark pair analysis, this results in an uncertainty of about 30% on the 
background from Z/y* events. 

• Background cross-sections for tt and Z/y* processes were assumed to have uncertainties of 12% 
and 10%, respectively. 

Systematic uncertainties affect both signal and background efficiencies, however the significance compu- 
tation (next section) is mainly affected by the uncertainty on the background. The dominant systematic 
effects on the background are due to the uncertainties in integrated luminosity (20%), the jet energy 
scale (16%-35%), jet energy resolution (6%-28%), and the limited statistics of background MC samples 
(15%-30%). Possible other sources of systematic uncertainties such as initial and final state radiation 
modeling, or pile-up, were not evaluated. The total systematic uncertainties for signals and backgrounds 
are summarized in Table 7. 



analysis 


effect on signal events 


effect on background events 




1st gen. 


2nd gen. 


1st gen. 


2nd gen. 


leptoquark 


±27% 


±29% 


±53% 


±51% 


LRSM 


±23% 


±25% 


±45% 


±40% 



Table 7: Summary of total systematic uncertainties (%) for 100 pb 1 luminosity. 



7 Results 

The program S cp [29] is used to calculate the significances of possible observations of the signals studied 
in this note. The significance is defined in units of Gaussian standard deviations, corresponding to the 
(one-sided) probability of observing a certain number of events exceeding the MC-predicted background 
Ni, at a given integrated luminosity. This probability is usually referred to as CLb(N), where N is the 
number of observed events. We report the 5a discovery potential evaluated in terms of CLb(N s +Nb), 
where N s is the expected number of signal events. Systematic uncertainties in the number of background 
events were also included in the significance calculations. For second generation leptoquarks, the signal 
selection was optimized at each mass point to minimize the cross-section times branching ratio needed 
to reach a 5a discovery, while for all other analyses the selection cuts presented in earlier sections were 
used. 

The overall reconstruction and trigger efficiencies discussed earlier are used to estimate ATLAS' 
sensitivity and discovery potential for the studied final states below. These estimates include the trigger 
efficiency for signal and background events, as discussed in Section 5, Table 2. 

7.1 Leptoquarks 

The integrated luminosities needed for a 5a discovery of the 1st and 2nd generation scalar leptoquark 
signals are shown in Table 8 as function of leptoquark mass, assuming j3 = 1. Also, Fig. 1 1 predicts the 
integrated luminosities needed for a 400 GeV leptoquark mass discovery, with various values of j3 2 , at a 
5 a level. 

Finally, Fig. 12 shows the minimum j3 2 that can be probed with ATLAS with 100 pb -1 of integrated 
luminosity as a function of leptoquark mass. Lighter leptoquark masses can be probed with a smaller j3 
because of their larger cross-section. It is evident from this figure that ATLAS is sensitive to leptoquark 



1763 



Exotics - Search for Leptoquark Pairs and Majorana Neutrinos from Right- 



Leptoquark mass 


Expected luminosity needed for a 5a discovery 




1st gen. 2nd gen. 


300 GeV 


2.8 pb- 1 


1.6 pb" 1 


400 GeV 


11.8 pb 1 


7.7 pb" 1 


600 GeV 


123 pb" 1 


103 pb 1 


800 GeV 


1094 pb" 1 


664 pb- 1 



Table 8: The integrated luminosities needed for a 5(7 discovery of 1st and 2nd gen. scalar leptoquarks for different 
mass hypotheses. 




Figure 11: 5 a discovery potential for 1st and 2nd 
gen. m = 400 GeV scalar leptoquarks versus j3 2 
with and without background systematic uncertainty 
included. 



Figure 12: Minimum f5 2 of scalar leptoquarks versus 
leptoquark mass for 100 pb- 1 of integrated luminosity 
at 5(7 (background systematic uncertainty included.) 



masses of about 565 GeV and 575 GeV for 1st gen. and 2nd gen., respectively, at the given integrated 
luminosity, provided leptoquarks always decay into charged leptons and quarks. 

7.2 Left-Right Symmetry 

The significances of studied signals versus integrated luminosity are shown in Fig. 13. Figure 14 shows 
the product of signal cross-section and dilepton branching fraction versus the integrated luminosity nec- 
essary for a 5a discovery. 

The overall relative systematic uncertainties on Drell-Yan and tt backgrounds are approximately 
45% and 40% in the dielectron and dimuon analyses, respectively. These estimates are dominated by 
contributions from jet reconstruction, uncertainty in integrated luminosity and insufficient MC statis- 
tics. Currently, multijet background is poorly understood and is not included in the presented sensitivity 
estimates for the dielectron channel. 



8 Summary and Conclusions 

Studies of final states with two leptons and multiple jets have been discussed, considering both electrons 
and muons. The early-data discovery potential for Beyond the Standard Model physics predicted by two 
prominent GUT-inspired models has been investigated. 

Both 1st and 2nd generation scalar leptoquark pair production could be discovered with less than 
100 pb _1 of integrated luminosity, provided that the mass of the leptoquarks is smaller than 500 GeV 
and the branching ratio into a charged lepton and a quark is 100%. 
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Figure 13: LRSM analysis. Expected signal significances versus integrated luminosity for N e , neutrino and 
Wr boson mass hypotheses, according to signal MC samples LRSM_18_3 and LRSM_15_5. Open symbols show 
sensitivities without systematic uncertainties. Sensitivities shown with closed symbols include an overall relative 
uncertainty of 45% (40%) estimated for background contributions in the dielectron (dimuon) analysis. LRSM_15_5 
and LRSM_18_3 refer to two sets of LRSM mass hypotheses. See the text for more information. 

Two LRSM mass points (m WR = 1.8 TeV,m Nt = 300 GeV and m Wr = 1.5 TeV,m N( , = 500 GeV) for 
the right-handed Wr boson and Majorana neutrinos Ne have been studied in the dielectron and dimuon 
channels. It was found that discovery of these new particles at these mass points would require integrated 
luminosities of 150 pb -1 and 40 pb _1 , respectively. 
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Figure 14: LRSM analysis. The product of signal cross-section and branching fraction to dielectron and dimuon 
final states versus integrated luminosity necessary for a 5a discovery. N e , neutrino and Wr boson mass hypothe- 
ses are for signal MC samples LRSM_18_3 and LRSM_15_5. Horizontal lines indicate nominal cross-sections for 
two signal MC samples, according to the LRSM implementation in the MC simulation. Open symbols show dis- 
covery potentials without systematic uncertainties. Discovery potentials shown with closed symbols include an 
overall relative uncertainty of 45% (40%) assumed for the background contribution in the dielectron (dimuon) 
analysis. LRSM_15_5 and LRSM_18_3 refer to two sets of LRSM mass hypotheses. See the text for more infor- 
mation. 
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Abstract 

In the absence of a light Higgs boson, the mechanism of electroweak symme- 
try breaking will be best studied in processes of vector boson scattering at high 
mass. Various models predict resonances in this channel. Here, we investigate 
WWscalar and vector resonances, WZ vector resonances and a ZZ scalar reso- 
nance over a range of diboson centre-of-mass energies. Particular attention is 
paid to the application of forward jet tagging and to the reconstruction of dijet 
pairs with low opening angle resulting from the decay of highly boosted vector 
bosons. The performances of different jet algorithms are compared. We find 
that resonances in vector boson scattering can be discovered with a few tens of 
inverse femtobarns of integrated luminosity. 

1 Introduction 

In the absence of a light Higgs boson, an alternative scenario to the Standard Model, Supersymmetry, 
or Little Higgs models must be invoked. In particular, Electroweak Symmetry Breaking (EWSB) could 
result from a strong coupling interaction. Here, we will make no assumptions about the underlying dy- 
namics of EWSB; we treat the Standard Model as a low energy effective theory, and evaluate the potential 
for measuring vector boson scattering. In the Standard Model, perturbative unitarity is violated [1] in 
vector boson scattering at high energy for a Higgs mass m# > 870 GeV or, if there is no Higgs (mg — > °°), 
for a centre-of-mass energy above a critical value of around 1.7 TeV. The only way to avoid a light Higgs 
boson is therefore to presume new physics at high energy [2], possibly in the form of vector boson pair 
resonances. Such resonances are predicted in many models such as QCD-like technicolour models with 
the required Goldstone bosons resulting from chiral symmetry breaking [3] ; Higgsless extra dimension 
models [4], where Kaluza- Klein states of gauge bosons are exchanged in the ^-channel [5]; as well as in 
models with extra vector bosons, from GUT or from strong interaction (BESS models [6]) mixing with 
the Standard Model vector bosons. The present search for resonances in vector boson scattering can be 
considered generic and may be interpreted in terms of any of these models. 

1.1 The Chiral Lagrangian Model 

The Chiral Lagrangian (ChL) model is an effective theory valid up to 4nv ~ 3 TeV, where v = 246 GeV 
is the vacuum expectation value of the Standard Model Higgs field. It can provide a description of longi- 
tudinal gauge boson scattering at the TeV scale when no light scalar Higgs boson is present. Electroweak 
symmetry breaking is realised non-linearly. A set of dimension-4 effective operators describe the low 
energy interactions (see for example [7]). Since, at the LHC, vector boson scattering can occur at the 
TeV energy scale where the interaction becomes strong, it is necessary to unitarise the scattering am- 
plitudes. One popular unitarisation prescription is the so-called Pade prescription, or Inverse Amplitude 
Method [8]. This is based on meson scattering in QCD, where it gives an excellent description [9], 
reproducing observed resonances. Among the terms of the Lagrangian which describe vector boson 
scattering, under some basic assumptions (custodial symmetry and CP conservation), only 2 parameters 
(namely CC4 and as) are important for this process. Depending on the values of these two parameters, one 
can obtain Higgs-like scalar resonances and/or technicolour-like vector resonances [10]. The resulting 
properly- unitarised amplitudes for vector boson scattering may therefore give information in a higher 
energy range. They yield poles for certain values of CC4 and 0:5 that can be interpreted as resonances, as 
shown in Fig. 1. The hashed region in the figure is forbidden by causality arguments [11]. 
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Figure 1: Left: regions in the (c^Cfct) parameter space indicating which values exhibit vector and/or 
scalar resonances in the Pade unitarisation scheme. Right: number of events per fb _1 as a function of the 
di-boson invariant mass for different resonance masses studied here. 

Other unitarisation procedures are possible, such as the K-matrix method [ 1 2] or the N/D method [13]. 
In general, resonances are not necessarily produced. In non-resonant cases, it remains vital to measure 
the vector boson scattering cross section, but high luminosity and a very good understanding of back- 
grounds will be required in order to measure the regularisation of the cross section. 

1.2 Characteristic Signatures of Vector Boson Scattering 

Discovery of the physics signals studied here will, in general, require high integrated luminosity. It 
will require also extremely large samples of simulated backgrounds, fine tuning of all reconstruction 
algorithms, and a good understanding of the detector performance, which will only gradually develop 
after the first few years of LHC running. The main purpose of this note is not, therefore, to evaluate 
with precision the discovery potential of ChL resonances, but to establish a strategy for the search of this 
important signal. The main emphasis will be put on those aspects most particular to the high mass vector 
boson scattering process; that is, the reconstruction of hadronically decaying vector bosons at high pj 
and the reconstruction of the high rapidity tag jets. 

The decay of a high mass ChL resonance will produce two highly boosted vector bosons in the central 
rapidity region of the detector. For transverse momenta greater than about 250 GeV, a hadronically 
decaying vector boson will be seen as one single wide and heavy jet. Methods of distinguishing such jets 
from single-parton jets will be investigated with different jet algorithms. 

A characteristic signature of vector boson scattering is the presence of two high rapidity and high 
energy "tag" jets [14], arising from the quarks which radiate the incoming vector bosons. The process can 
thus be efficiently distinguished from contributions to the production of (mostly transversely polarised) 
final state vector bosons due to bremsstrahlung of these vector bosons from the quarks. In that case, the 
accompanying jets are softer and more central. A further component of the signature is the suppression 
of QCD radiation in the rapidity interval between the tag jets due to the fact that no colour is exchanged 
between the protons in these processes [15]. This characteristic feature allows for efficient use of central 
jet veto to suppress backgrounds. 

The high QCD background at the LHC naturally leads us to focus on "semi-leptonic" vector boson 
events; that is, those events when one W or Z boson decays leptonically, and the other decays hadron- 
ically. These channels represent the best compromise in that there is only at most one neutrino, so the 
diboson mass may be reconstructed with reasonable resolution, and the backgrounds can be reduced to 
a manageable level by the requirement of leptons and/or missing transverse energy (J$j)- Fully-leptonic 
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events are also useful in cases where clear resonances are present, where a kinematic edge may be visible 
and the backgrounds may be reduced even further. The case of resonant ZZ — > £ + £~ vv can also lead to a 
clean signature. Fully hadronic events may be useable at very high diboson energies, but this possibility 
is not considered further here. Thus, the study of vector boson scattering events will also require a good 
understanding of detector performance for electrons, muons and Jf.j. Although many ATLAS analyses 
will depend on the reconstruction of these objects, the quality of such reconstruction is evaluated here 
for the case of high energy leptons. 

The note is organised as follows. In the next section (2) we describe the Monte Carlo simulations and 
the samples used. Next, the trigger is discussed (Section 3), then the detector performance with particular 
focus on the challenges of this analysis (Section 4). After this, the event selections, efficiencies and 
purities for the various final states are given (Section 5). An attempt to evaluate the expected sensitivity 
is made (Section 6), and the systematic uncertainties are discussed (Section 7), before a final summary 
and conclusion. 

2 Signals and Background Simulation 

2.1 Definition of Signal 

In order to have a gauge invariant set of diagrams for the background, in spite of a Higgsless scenario, a 
low mass Higgs will be assumed. A resonance signal will be defined here as an excess of events in the 
resonance mass region over the number expected from the Standard Model continuum when the Higgs 
boson mass is set at 100 GeV. This ensures that longitudinal vector boson scattering will contribute 
negligibly to the process. This definition follows the prescription of [16]. We note that measurement 
of even a continuum cross section for this process at such high energies would be of great importance, 
but will not be considered here as it should require high luminosity and a very good understanding of 
backgrounds. 

2.2 Overview of Generators 

The Monte Carlo (MC) generators used in the main analysis are as follows. 

• Pythia [17] version 6.4.0.3 was used for the signal, with the CTEQ6L partem distribution function 
and the renormalisation and factorisation scale Q 2 = m 2 v . The hard process was modified to include 
new vector boson scattering amplitudes (see below). 

• MadGraph [18], version 3.95, with Pythia for partem shower, hadronisation and underlying 
event, was used for W+jets and Z+jets backgrounds. The default values of fixed renormalisation 
and factorisation scales of Q 2 = m\ were set and CTEQ6L1 parton distribution functions were 
used. 

• MC@NLO [19], with Herwig [20], for parton shower and hadronisation and Jimmy [21,22] for 
underlying event, was used for tt background. 

The underlying event samples were tuned to data from previous experiments [22]. All samples use 
Photos [23] to simulate final state radiation. Whizard [24] and Alpgen [25, 26] are also used for 
some generator level comparisons. Whizard uses Pythia for parton showering, hadronisation, and 
underlying event. Alpgen uses Herwig/Jimmy. 

The different choice of scales for MadGraph and Pythia is not ideal, but retained for histori- 
cal reasons since large samples were generated with these choices. However, studies showed that the 
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Sample name 


Generator 


0 x Br, fb 


qqWZ — > qqjjil, m = 500 GeV 


Pythia-73 


25.2 


qqWZ — > qqivjj, m = 500 GeV 


Pythia-73 


83.9 


ggjyz _> qq£v££, m = 500 GeV 


Pythia-73 


8.0 


ggWZ — >■ qqjjU, m = 800 GeV 


PYTHIA-ChL 


10.5 


— > qqivjj, m = 800 GeV 


PYTHIA-ChL 


35.2 


ggWZ qqivii, m = 800 GeV 


PYTHIA-ChL 


3.4 


<7<?WZ qqjjU, m=\.\ TeV 


PYTHIA-ChL 


3.7 


ggWZ — ► qqivjj, m=l.\ TeV 


PYTHIA-ChL 


12.3 


9tfWZ -»• ^v^, m = 1.1 TeV 


PYTHIA-ChL 


1.18 


«gW -► 77, m = 499 GeV (s) 


PYTHIA-ChL 


66.5 


9 #W ->■ g^v 77, m = 821 GeV (s) 


PYTHIA-ChL 


27.5 


qqWW -> qqivjj, m = 1134 GeV (s) 


PYTHIA-ChL 


17.0 


^V77, m = 808 GeV (v) 


PYTHIA-ChL 


29.8 


qqWW ->■ 77, m = 1 1 15 GeV (v) 


PYTHIA-ChL 


17.9 


^WW — > ^r^r£v 77, non-resonant 


PYTHIA-ChL 


10.0 


^ZZ OT w^, m = 500 GeV 


PYTHIA-ChL 


4.0 


77WZ 7'7'^V^, background 


MadGraph 


96 


77ZZ ->■ jjwti, background 


MadGraph 


45.5 






a (no Br), pb 


W+ + 4 jets 


MadGraph 


165 ±0.1 


Z + 4 jets 


MadGraph 


87 ± 0.7 


W+ + 3jets 


MadGraph 


6.2 ± 0.02 


Z + 3 jets 


MadGraph 


3.8 ± 0.02 


tt 


MC@NLO 


833 ± 100 



Table 1 : Data samples and generators used in the present study 



major effect is on the cross section rather than on event shapes, and the cross section normalisation is 
determined independently as described below. 

Further details specific to the samples are given below. 

2.3 List of samples 

Table 1 lists the Monte Carlo samples, produced with full detector simulation, used in the present analy- 
sis. 

The first set of samples represents different reference cases of vector boson scattering signals: 

• Pythia-73: For the samples labelled "Pythia-73", the process 73 (longitudinal WZ scattering) 
was selected, with MSTP(46)=5 (QCD-like model of [27] with Pade unitarisation). All other 
switches were left as default. This is meant to represent a generic narrow WZ resonance. 

• PYTHIA-ChL: datasets with generator labelled "PYTHIA-ChL" in the table use a modified version 
of Pythia routine PYSGHG. The modification involves replacing the scattering amplitudes cal- 
culated for processes 73-77 by those given by Dobado et al [10] with parameters a\ and a$. These 
parameters were chosen so as to produce a vector or scalar (indicated by a (v) or an (s) in the table) 
resonance at the desired mass, or signal with no resonances at all. Note that only vector WZ and 
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scalar ZZ resonances are possible, but both scalar and vector WW resonances can be produced. A 
continuum sample was also generated using this model. 

Background samples include events with two vector bosons and two jets in the final state, arising 
from gluon or electroweak vector boson exchange between incoming quarks. The vector bosons are here 
mostly transverse and emitted more centrally than in the case of longitudinal vector boson pair scattering. 

• jjWZ final state, where j is a quark or gluon: The decays of the vector bosons are performed in 
Pythia. Note that the semi-leptonic cases are already included in samples W+jets and Z+ jets 
(see below). Only the purely leptonic cases make use of this background. 

• The background process: tiw with a pair of jets (quark or gluons). The cross section shown in 
Table 1 is for non-hadronic decay of the ZZ's, with a filter requiring two leptons with pj > 5 GeV 
and 1 77 1 < 2.8. 

• W/Z + 3 jets and W/Z + 4 jets: they constitute backgrounds for the cases of high mass and lower 
mass resonances respectively since, in the former case, we expect that most of the vector bosons 
which decay hadronically will be reconstructed as a single jet. A correction factor of 1.38 is ap- 
plied to the W + +jets cross sections to account for the W~+j process. These datasets include all 
tree level diagrams leading to W+4j, Z+4j, W+3j and Z+3j, with the vector bosons decaying lep- 
tonically, including all QCD and electroweak contributions. To keep the cross section manageable, 
preselection cuts were applied at MadGraph level. For the W,Z + 4 jets case, we tag the highest 
rapidity jet (fjet), backward jet (bjet) and 2 central jets by requiring that \ r\fj e t \ > 1-5, \i]bjet\ > 1-5, 
that the forward and backward jet candidates be on different hemispheres: r\fj et r\bjet < 0, that at 
least one forward jet have energy E > 300 GeV, and that the invariant mass of the combined for- 
ward jets be nijj > 250 GeV. We further require pj of at least one of the central jets to be p J T > 50 
GeV, the pj of the vectorial addition of the central jets to be p ] j > 60 GeV and the invariant mass 
of the combined central jets m ; y > 60 GeV. We note that the forward jet preselection suppresses 
this background by a factor of 3.5. 

For the case W,Z + 3 jets, we add the requirements: pj of the W or Z boson pj > 200 GeV, 
\t] w /z\ < 2 > and Pt of one jet (central) p{ > 200GeV, |tj 7 | < 2. 

Additional samples were produced with fast detector simulation to improve background statistics. 

2.4 Comparative studies of generators 

2.4.1 Parton shower matching to matrix elements 

Here, MadGraph was used to generate the W+jets background. A better evaluation of this background 
would be obtained using a generator for which W+n partons, n=0, 1 , 2, 3 or 4 inclusive, are combined in a 
manner which avoids double counting of jets produced by the parton shower in Pythia. Alpgen is one 
such generator (and in fact such matching is now implemented in more recent versions of MadGraph). 
However, due to time constraints, and in order to have a manageable size of background samples, it was 
not practical to use this technique. In order to validate the use of MadGraph, a comparison was made 
of the W + 4jets sample with an appropriate Alpgen sample, with same analysis cuts applied. The 
Alpgen samples are not used in the final analysis since they lack sufficient statistics. 

Distributions of the vector bosons and jets were compared. As an example, the distributions for 
the forward jets are shown in Fig. 2. The overall conclusion is that the shapes of the distributions are 
in reasonable agreement. Therefore, neglecting the effect of parton shower double counting does not 
significantly affect the event topology. To the extent that such an error is made, the tag jets in the 
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Figure 2: Distributions of the forward jets in the W + 4jet background for MadGraph (red) and Alp- 
GEN (black) samples (area normalised). The error bars show the statistical error in each sample. 

Alpgen sample have a lower energy (leading to a depletion with respect to the MadGraph samples at 
high energies of a few %) and so the backgrounds in this analysis can be considered to be conservatively 
overestimated. The sensitivity of such backgrounds to the scales has been discussed, for example, in [28, 
29]. The difference in Q 2 scale of the two samples (Alpgen uses Q 2 = + Pt(W) 2 ) leads to about a 
factor two discrepancy in cross section. This was confirmed by running MadGraph on a small sample 
with the same scale as Alpgen, yielding cross sections smaller by factors 2.05 and 1.77 for the QCD 
and QED processes respectively. These factors will be applied in the present analysis. 

2.4.2 Effective W Approximation 

Whizard [24] is a relatively new event generator originally developed for the ILC. It is able to calculate 
the full 2^6 matrix element needed for the vector boson scattering processes and it implements the 
Chiral Lagrangian model with the K-Matrix unitarisation scheme [27] which does not lead to vector 
boson resonances. Further unitarisation schemes in the form of arbitrary resonances are planned. 

The generator does not assume an effective W approximation, whereby the bosons emitted from 
the quarks are treated as partons, allowing vector boson scattering diagrams to form a gauge-invariant 
subset. This approximation is made in the Pythia signal samples, and might be expected to particularly 
affect the tag jet kinematics. Comparisons between the tag jet distributions from Whizard and Pythia 
are shown in Fig. 3. The two samples are not strictly comparable since here, WHIZARD simulates 
the 2 — > 4 processes including non-scattering electroweak diagrams and applies K-matrix unitarisation. 
Although there are differences (e.g. the tag jets from Whizard are somewhat harder than those from 
PYTHIA) they do not strongly depend upon the vector boson centre-of-mass, and thus the effective W 
approximation is unlikely to be the culprit. The harder tag jets in Whizard mean that if the signal looks 
more like Whizard than Pythia, it would be more likely to pass the selection cuts, thus improving the 
sensitivity. The potential size of the effect was investigated and estimated to be at the few per cent level. 
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Figure 3: Differences between WfflZARD (red) and Pythia (black) for vanishing anomalous couplings 
for tag jet distributions of: transverse momentum (a), pseudo rapidity (b), energy (c) and pseudo rapidity 
difference (d). 
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3 Trigger 

As a first step in the analysis, it is important to evaluate the efficiency of the basic trigger menus for 
a luminosity of 10 33 cm _2 s _1 . The triggers chosen were based on an early menu [30] as an example, 
and the real physics menu is likely to be very different. However, since the signal is at relatively high 
pr, and triggering on vector bosons is a high priority, this is not likely to have a large impact. To 
evaluate this efficiency, we apply the following cuts: for electrons and muons we require single leptons 
to have pr greater than the value corresponding to the threshold dictated by the trigger signature and 
|tj| < 2.5; similarly for jets, but with a pseudorapidity cut of 1 77 1 < 3.2. This is necessary because trigger 
signatures for forward jets exist separately, but unfortunately that trigger information was not available 
in the simulation version used for this study. The trigger efficiency is defined as the number of times 
the trigger passed (with the corresponding cuts applied) divided by the number of truth events in the 
samples (with the same cuts applied). In Table 2 we present a detailed list of efficiencies for the signals 
qqWZ —> qqjjU (m = 1.1 TeV) in the left column and qqWW — > qqjjiv (non-resonant) in the right 1 . 

The poor efficiency of the e25i (see Fig. 4, left) and 2el2i triggers is understood to be due to the 
isolation criterion, which was not optimised for high energy electrons. 




Figure 4: Trigger efficiencies computed with the WW continuum signal. Left: efficiency of the e25i 
trigger as a function of the pj of electrons from the true leptonically-decaying W boson. Right: efficiency 
of the j 160 trigger (black triangles) as a function of the pj of the true hadronically-decaying W boson. 
Also shown with blue circles is the efficiency when the j 160 and 2 j 120 triggers are logically OR'ed. 

It is worth mentioning that the efficiency for the 2j 120 trigger (Fig. 4, right), which requires two jets 
with pj > 120 GeV suffers partly from the fact that the two jets from the vector boson decay are merged 
due to the boost as described in Section 4.1. It is also significantly higher for events with true electrons 
than for those with true muons, probably because the electrons themselves are also reconstructed as jets 
in the calorimeter. 

Finally, various combinations of the trigger signatures might be explored in the future to improve 
the efficiency. For instance, the e60 trigger might be used in conjunction with the e25i to compensate 
the low efficiency of the latter for high-momentum electrons (Fig. 4). Likewise, the 2j 120 trigger might 
be used together with j 160, since the efficiency of the latter drops significantly when the hadronically- 
decaying vector boson has pj < 300 GeV and decays into two distinctly resolvable jets. 

'Triggers 2j 120 and j 160 have been removed from the menu as they are expected to give too high a rate. However, forward 
jet triggering will be available, and more recent developments in the electron trigger has resulted in improved efficiency. 
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WZ signal WW signal 



Trigger Signature 


Cut Loss 


Efficiency 


Cut Loss 


Efficiency 


Electrons 


2el2i 


13% 


36% 


>99% 




e25i 


1% 


78% 


11% 


65% 


e60 


5% 


82% 


29% 


73% 


Muons 


mu6 


5% 


95% 


6% 


80% 


mu20 


5% 


92% 


9% 


73% 


Jets 


2jl20 


67% 


73% 


50% 


80% 


jl60 


34% 


96% 


30% 


86% 



Table 2: Table of high level trigger efficiencies for qqWZ —>■ qqjjii (m = 1.1 TeV) and qqWW —>■ 
qqjjiv (non-resonant). The "Cut Loss" columns indicate the fraction of true events that would be lost 
by applying the pj requirements of each trigger signature on the true electrons, muons and jets. Since 
such events are unlikely to satisfy the trigger conditions, they are not taken into account when the trigger 
efficiencies are evaluated. 

4 Reconstruction Challenges 

In this section, we focus on those parts of the reconstruction which are most particular to vector boson 
fusion at high masses. We discuss the following: 

• Reconstruction of hadronically-decaying vector bosons. In our regime these typically have high 
pj and the decay products are very collimated. We discuss two alternative methods, using k± jets 
and subjets, and using cone jets with different radii. 

• Leptonically decaying vector bosons. These require good lepton and $t measurement, but the 
challenges here are not unique to these channels. 

• Forward 'tag' jets. Measuring jets close to the edge of the detector rapidity acceptance is a chal- 
lenge in common with low mass Higgs searches in vector boson fusion. 

• Central jet veto. Since the vector boson scattering process involves no colour exchange between 
the protons, a suppression of QCD radiation is expected. This can be used to distinguish between 
signal and background, but is sensitive to underlying event and pile-up. 

• Top veto, tt production is a major background for the channels which do not contain leptonic Z 
boson decays. A large fraction of this is removed by explicitly rejecting events containing top 
candidates. 

4.1 Hadronic Vector Boson Identification 

At lower masses and pj, the hadronically-decaying vector bosons are identified as dijet pairs. However, 
for events where a hadronically-decaying vector boson is highly boosted, the decay products are often 
collimated into a single jet. Cuts such as a dijet invariant mass window are no longer applicable in this 
scenario, but a single jet mass cut can be used. 
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The single jet mass is defined as the invariant mass evaluated from the 4- vectors of the constituents of 
the jet. In the ATLAS detector, these constituents are at present calorimeter objects, either topologically 
defined clusters with some local hadronic calibration, called here topoclusters, or calorimeter towers. 
For jets containing the decay products of a boosted vector boson, this single jet mass is near the mass of 
the parent boson. For light quark and gluon jets this mass is generally much lower. Since the background 
to hadronic vector boson identification is so severe, further cuts may be applied on the subjet structure of 
the candidate jet. 

In addition, the transition between the dijet and single jet case as pj increases needs to be dealt with. 
Two methods are used, as follows; 

1. Dynamically select the appropriate method. To do this, we first look at the highest p T jet. If this 
passes the mass window cut, then the single jet selection is applied, as described below. If it does 
not, then combinations of jet pairs in the event are considered. The vector boson is still expected 
to be the highest pj hadronic system, and so the pj of all jet pairs is evaluated, and the highest 
Pt pair is taken to be the vector boson candidate. A mass window cut (dependent upon the jet 
algorithm) is then applied to this pair. Thus a single analysis can be used to scan the data for signs 
of resonances without bias. 

2. When the single jet and jet pair cases yield very different signal to background ratios, it is prefer- 
able to choose a priori which mass region is being investigated, and to use the single jet recon- 
struction for high masses and the dijet reconstruction for low masses. This approach is used in the 
cone algorithm analysis. For the m = 800 GeV resonance, both the dijet and single jet approaches 
are tried independently. 

4.1.1 K± Algorithm 

The k± algorithm is run with an /^-parameter (which determines the "jet size") of 0.6 on calibrated 
topoclusters. The algorithm [31] merges pairs of constituents. 

The k± analysis uses the dynamic selection technique described above to decide whether to use a 
dijet or a single jet for the vector boson candidate. The fraction of vector bosons reconstructed as a 
single jet, as a function of p T of the vector boson candidate, is given in Fig. 5. The transition between 
dijet and single jet takes place between pj = 200 and 300 GeV for this algorithm. 




P T ofW[GeV] 

Figure 5: Fraction of W boson candidates reconstructed from a single jet, as a function of the transverse 
momentum of the reconstructed vector boson, for the WW m = 1 . 1 TeV signal sample. 
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Jet Mass (Reco - True) GeV Y Scale (Reco - True) GeV 



Figure 6: Single jet mass residuals (left) and Y scale residuals (right) from different detector simulations, 
using the k± algorithm. The truth is defined by running the jet algorithm on the hadronic final state of 
the MC generator. 

Single jet mode 

The resolution of the single jet mass for the algorithm has been evaluated for both detector simulations 
(full and fast) for several samples. For the sample with a WW resonance at m = 1.1 TeV (Fig. 6 left) 
for example, the W boson singlet jet mass resolution was found to be 7.4 ± 0.2% GeV from full and fast 
detector simulation. 

A mass cut around the window from m = 68.4 GeV to 97.2 GeV is applied to W boson candidates, 
and from m = 68.7 GeV to 106.3 GeV for hadronic Z boson candidates reconstructed in the single jet 
mode. These mass windows are determined by considering the resolution, the tails, and the background 
contamination. 

The kx merging is intrinsically ordered in scale, making the final merging the hardest. The algorithm 
provides a y value for this final merging, which is a measure of the highest scale at which a jet can be 
resolved into two subjets. The y value can be converted into a "Y scale" in GeV using the relation Y scale 
= Ej x Jy, where Ej is the jet transverse energy. This Y scale is expected to be G{my /2) (where my is 
the mass of the vector boson) for boosted vector boson jets, and much lower than Ej for light jets [32]. 
At the truth and fast simulation levels this variable has been shown to have discriminating power even 
after a single jet mass cut [32-35]. The resolution of the ATLAS detector for this variable is presented in 
Fig. 6 (right). The resolutions, for the same sample as above, are 12.3 ±0.3% and 8.8 ±0.2% with full 
and fast detector simulation, respectively. 

Based on the resolution, the tails, and the background contamination, a Y scale cut around the win- 
dow from 30 GeV to 100 GeV is applied to W and Z boson candidates reconstructed in the single jet 
mode. To evaluate the benefit of cutting on Y scale, a sample of single jet vector boson candidates is 
selected in signal and background by applying a pr > 300 GeV cut, motivated by Fig. 5, and a mass 
window cut. Starting from this sample, the efficiency of the Y scale cut is given in Table 3 for full and 
fast simulation. The numbers suggest that for the W+jets background, an additional rejection factor of 
approximately 2 is provided by the Y scale cut even after a single jet mass cut has been applied. This is 
achieved with a signal efficiency of approximately 80%. 
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Dijet Mass (Reco - True) [GeV] Dijetj^T (Reco - True) 

Figure 7: Dijet mass residuals (left) and y residuals (right) from different detector simulations, using the 
k± algorithm. The truth is defined by running the jet algorithm on the hadronic final state of the MC 
generator. 





1 . 1 TeV Vector Resonance 


W+4 jets 


tt 


Jet Mass 


68% (67%) 


14% (14%) 


28% (28%) 


Y Scale 


77% (84%) 


29% (40%) 


63% (70%) 



Table 3: Efficiency of the jet mass cut and of the Y scale cut in the one-jet case for full (fast) simulation. 
The Y scale cut is applied after the mass cut. 





800 GeV Scalar Resonance 


W+4 jets 


tt 


Jet Mass 


17% (20%) 


6% ( 7%) 


14% (14%) 


Y Scale 


79% (83%) 


48% (49%) 


84% (82%) 



Table 4: Efficiency of the jet mass cut and of the Y scale cut in the two-jet case for full (fast) simulation. 
The Y scale cut is applied after the mass cut. 
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Dijet mode 

A mass cut around the window from m = 62 GeV to 94 GeV is applied to W boson candidates, and from 
m = 66.6 GeV to 106.2 GeV for hadronic Z boson candidates reconstructed in the dijet mode. 

A variable analogous to the y may be calculated, using the pj of the softer jet relative to the harder 
one. This variable is required to be in the range 0. 1 < Jy < 0.45. The efficiency is shown in Table 4. 

The mass and y windows are again determined by considering the resolution, the tails, and the back- 
ground contamination. 

The resolutions of the dijet mass and the y variable for dijet vector boson candidates are shown in 
Fig. 7. They are found to be approximately 5% for the mass and for the y variable, and are comparable 
in fast and full simulation. 

4.1.2 Cone Algorithm 

The problem of the two jets from a boosted hadronically decaying vector boson merging into a single 
jet has also been studied for jets reconstructed using the Cone Algorithm. With this algorithm, jet re- 
construction starts from seeds i.e. constituents (clusters) with pj > 1 GeV. The algorithm collects all 
constituents around a seed within AR = \J (Atj) 2 + A0) 2 < Rq (where Rq can be, for instance 0.4) and 
adds their momenta vectorially. Then it repeats the procedure over the collection around the direction of 
the sum, and computes a new sum. It continues repeating this operation until the resulting sum direction 
is stable. 

Single jet hadronic W boson candidates are identified with the highest pj object in the central region, 
after having removed overlaps with all electrons in the event within a AR of 0. 1. A mass cut in a window 
around the reconstructed W boson mass is applied. 

Figure 8 shows an example of W boson reconstruction using the cone algorithm for the jet-pair case 
(m = 500 GeV resonance) and single jet case (m = 800 GeV resonance). A cone size of 0.8 is used 
for selecting a single jet W boson and 0.4 for the case of a jet pair. The low mass tail is due to events 
where the two jets from the W are well separated. There is a small difference in the W boson mass peak 
reconstruction for the two cases. The jets chosen for this selection have a minimum pj cut of 20 GeV, 
and those overlapping with electrons have been removed. 




Figure 8: Reconstructed W boson for cases where it forms two separated jets (500 GeV) and a single jet 
(800 GeV). The samples used are the m = 500 GeV resonance (in green) and m = 800 GeV resonance 
(in red). 

The exploration of the substructure of a wide jet (typically of size 0.8) is done by searching for 2 
narrow jets (size ~ 0.2) fitting inside the big jet. Various variables can then be studied, among which 
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1.1 TeV Vector Resonance 


W+4jets (QCD) 


tt 


Z+3jets (QED) 


small jets cut 


76.3% 


15.2% 


38.6% 


13.8% 



Table 5: Comparison of efficiencies for the jet sub-structure selection for a typical signal and back- 
grounds. Efficiencies are relative to the selection of a single jet as described in the Y scale method (Table 
3). For the subjet selection, we require 2 small jets with pj > 15 GeV and invariant mass > 60 GeV (see 
text). 




Wide jet invariant Mass [GeV] 




Narrow jet transverse P [GeV] 



Figure 9: Profile histogram of the momentum of the narrow jet orthogonal to the wide jet direction vs 
the invariant mass of the wide jet, for W boson hadronic decay of the resonance signal qqWjjZu ofm = 

1.1 TeV (red) and for Z+3 jets sample (black). Lower graph : normalized distributions of narrow jet 
orthogonal momentum. 

are the energy ratio of the narrow jets, their invariant mass, the distance AR between the leading narrow 
jet and the wide jet, or the momentum component of this narrow jet transverse to the wide jet direction. 
The discriminating power is illustrated for the WZ — > 11 jj channel (1.1 TeV resonance) and its principal 
background in Fig. 9 which shows the latter variable (called here 'p transverse') versus the invariant mass 
reconstructed from two narrow jets. Cutting in the {pj, invariant mass) plane gives results comparable 
to those obtained with the Y scale method above, as illustrated in Table 5. Similarly, Fig. 10 shows the 
Pt versus AR between the leading narrow jet and the wide jet. 

4.2 Leptonic Vector Boson Identification 
4.2.1 Lepton Reconstruction Efficiencies 

All the signals studied in this note involve at least one leptonic vector boson decay. Electrons and muons 
are selected using standard ATLAS criteria [36, 37] for the case of a resonance m(WZ) of 800 MeV 
Figure 1 1 shows the efficiency for W boson daughter leptons, where the trigger consition has not been 
applied. The results for different electron selection criteria are given. The loss of efficiency occurs in the 
forward regions, near the limits of the tracking detectors and at pj values close to the applied cut. The 
efficiencies for the leptonic Z boson channels have been found to be similar. 
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Figure 1 1 : Efficiency of reconstructing and identifying W-daughter electrons (left) and muons (right) 
as functions of true lepton momentum (top) and pseudo-rapidity (bottom). The electron plots show the 
efficiency for 4 different electron selection criteria: All candidate objects (green), loose (black circles), 
medium (red squares), and tight (blue triangles). 
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4.2.2 Leptonic Z Boson Reconstruction 

The Z boson candidates are reconstructed from pairs of e + e~ or i u + i u _ . In the electron case, the mass 
resolution is about 2.7 GeV as is shown in Fig. 12 left, suggesting a mass window selection between 
m = 85 GeV and 97 GeV for m ee . In the case of muons, the resolution for the Z mass reconstruction is 
3.6 GeV (see Fig. 12 right), so the mass requirement is loosened to be between m = 83 GeV and 99 GeV. 
Furthermore, to reduce the backgrounds (particularly the background from tt events), the pr of one of 
the leptons is required to be pj > 50 GeV, and that of the other pj > 35 GeV. In the unlikely case that 
more than one combination of leptons satisfy all these requirements, we choose the composite Zi+e- with 
the mass closest to the actual Z boson mass. 

4.2.3 Leptonic W Boson Reconstruction 

For the signal, after reconstruction of the hadronic vector boson candidate, the highest pj lepton corre- 
sponds to the lepton from the W boson decay in 96% of cases. Attributing the missing momentum to the 
neutrino, and taking the nominal W boson mass (mw = 80.42 GeV) as a constraint, a quadratic equation 
is obtained for the z-component of the neutrino's momentum. The z component is required in order to 
reconstruct the diboson mass in the final analysis. Only events for which at least one real solution exists 
for this quadratic equation are retained. When there are two possible solutions, one is chosen at random 
to avoid kinematic bias on the resonance mass. Options such as selecting the reconstructed W boson 
which is more central have also been considered, and little difference was found in the purity of the 
reconstruction. In the fully leptonic case, the leptonic Z boson is reconstructed and its daughter leptons 
removed before applying the above procedure. 

4.3 Tagged Forward Jets 

One of the well known characteristic features of vector boson scattering is the presence of high energy 
forward jets [14], resulting from the primary quarks from which the vector bosons have radiated (see 
Fig. 13). Such forward jets are expected to be much less prominent in processes involving gluon or 
electroweak boson exchange with bremsstrahlung of vector bosons. In the latter case, these vector bosons 
are mostly transverse and have a harder pj spectrum than in WlWl scattering. Correspondingly, the 
outgoing primary quarks have a harder pj and are therefore less forward. 
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Figure 13: Pseudo-rapidities of the forward and backward quarks in signal events after they radiate the 
vector bosons, obtained from Pythia before any showering, fragmentation, etc. 

Many different strategies are possible for implementing a tag-jet selection. A number of these were 
compared, and the best rejection factors for a given efficiency were obtained as follows: 

1. Require two jets with 

• \n{jet)\ > TJcut and p T (jet) > Prcut 

• opposite signed rapidity 

• at least one of them has an energy greater than a critical value E cu i 

2. If more than one jet with the same sign rapidity satisfies the above cuts, choose the most energetic, 
labelled FJ1. The next one is labelled FJ2. 

• Require the tag -jet with the opposite sign of rapidity to satisfy Ari(FJ\,FJ2) > Ar] cu t and 
E(FJ2)>E 2cut 

In addition a dijet mass cut is currently applied in the cone algorithm analyses. The specific values of the 
cuts in each case are to be optimised depending upon the kinematic region under study. 

4.4 Central Jet Veto 

A useful analysis strategy to suppress backgrounds such as tt is to apply a central jet veto [15,38,39]. 
For vector boson scattering, one expects little QCD radiation in the central region since only colourless 
electroweak vector bosons are produced and the forward jets are not colour connected. Given the forward 
jet cut definition, we unambiguously define the central region of the event as the r\ region between them. 
The central jet veto then simply requires that no other high pj jet (here taken as pj > 30 GeV) other 
than those resulting from the hadronically decaying vector boson lie in the central region. 

Specifically, in the analyses where it is applied, the central jet veto rejects events if there are any 
additional jets with a chosen maximum value for |tj| and minimum value for pj. 
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4.5 Top Quark Rejection 

While the final states from a leptonically decaying Z boson are mostly free of background from top 
processes, tt and tW events form an important source of background for the WW signals. To suppress 
them, events can be vetoed if a reconstructed W boson candidate, combined with another jet in the event 
(excluding those overlapping with an identified electron or within AR < 0.8 of a W candidate), leads to 
an invariant mass close to that of the top quark [32]. A typical mass window is 130 < m t < 240 GeV. 

In a future analysis it is likely that this cut can be improved using b-tag information and better jet 
mass reconstruction, but this has not been investigated here. 

5 Event Selection 

Using the tools outlined in the previous section, we now characterise the samples and outline the specific 
cuts applied for each final state considered. 

5.1 W+W~ -> jj and W+Z -> l±v jj 

The hadronic vector boson candidates and the tag jets are obtained using the k± algorithm as discussed 
in Section 4.1. The leptonic W is identified as described in Section 4.2.3. Both vector boson candidates 
are required to have p T > 200 GeV, |r/| < 2. Tag jet cuts are made as described in Section 4.3, with 
Prcut = 10 GeV, £" cut = £ 2 cut = 300 GeV and Arj cut = 5. The top veto (Section 4.5) and the the central 
jet veto (Section 4.4) are applied. 

The kinematic distributions for the WW channel are shown in Fig. 14. Note that these are signifi- 
cantly biased by the generator-level cuts. The selection efficiencies of the cuts on WW events from four 
example scenarios are summarised in Table 6, along with the efficiency of the combined trigger selection 
described in Section 3. No significant differences are observed between the scalar and vector resonances, 
nor between the WZ and WW channels, except for the m = 500 GeV-resonant samples, where the pr re- 
quirements are found to be less efficient for the WZ sample. The QCD-like model of [27] tends to predict 
softer vector bosons. 

Due to the small background statistics available with full simulation, the full-simulation signal sam- 
ples are used together with fast-simulation background samples in obtaining the final results. The mod- 
elling of the kinematics is good, as shown in Fig. 14 2 . However, in general the efficiency for selecting 
both signal and background is higher in fast simulation by about 25% compared to full simulation. To 
account for this, a constant scaling factor is applied to the fast-simulation samples in estimating the 
significance of the signal over the background (Section 6). 

Figure 15 shows a comparison of the final WW mass for the signal samples using fast and full 
simulation. 

The final WW mass spectra obtained using this analysis are shown in Fig. 16. The backgrounds 
shown have been obtained from the fast-simulation samples and the above mentioned scaling factor has 
been applied. 

5.2 w±z -> jj e+r 

This channel benefits from a very good resolution on the Z boson leptonic reconstruction, which allows 
good suppression of the tt background. 

2 To achieve this agreement it was necessary to correct the lepton-finding efficiency in the fast simulation by fitting a function 
to the efficiency from fast and full simulation as a function of lepton pj and correcting the fast simulation by the ratio of the 
functions. 
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Figure 14: Kinematic distributions for the generated signal and backgrounds (tt and W +4 jets QCD) in 
the 1.1 TeV —> £ ± v channel. The top two plots show the pj and tj for the hadronic W boson 

candidate. The middle two show the same variables for the leptonic W boson candidate. The bottom 
two plots show the tj distribution of all jets which are at higher rapidity than the W boson candidates, 
and the pj distribution of the highest-^r central jet. In each plot, the full-simulation histograms (solid 
lines) have been normalised to unit area and the fast-simulation histograms (dashed lines) have been 
normalised to the same cross section as their full-simulation counterparts. 
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Figure 15: WW (left) and WZ (right) invariant mass spectra in the £v semileptonic channel for the 
three resonant signal samples. The WW plot shows a comparison of the fast- and full-simulation results. 



For the m = 1.1 TeV WZ resonance, only the case of a single heavy jet from the W boson decay will 
be considered as it constitutes the majority of the events. For the m = 800 GeV resonance, not all W 
bosons are boosted sufficiently to produce a single jet. We therefore consider separately the cases of a 
W boson from a single heavy jet and from a jet pair. Finally, for the m = 500 GeV resonance, we only 
consider the jet pair case. In this section, the cone algorithm will be used and compared with an analysis 
using the k± jet algorithm. 

5.2.1 W boson from a single jet 

The main backgrounds will here be Z+ 3 jets and it. 

Table 7 shows the cut flow for the electron-based and the muon-based analyses for the ChL WZ 
resonances of mass m = 1.1 TeV and m = 800 GeV. The m = 500 GeV case is not considered here since 
the W and Z bosons will not be sufficiently boosted, in general, to produce a single jet. The Z — * e + e~ 
and Z — ► }JL + jJL~ selections are shown, which correspond each to about 50% of the sample events. After 
applying electron quality cuts (medium electrons) we select the two highest pj leptons which should 
satisfy respectively: pj(e,n) > 50 GeV and pj{e,\i) > 35 GeV. The low efficiency of the lepton pair 
cut is approximately consistent with the expected selection efficiency per lepton, as shown in Fig. 11, 
as well as the detector acceptance. As can be seen in Fig. 17, the pj cut suppresses mostly the it 
background. A leptonic Z e + e - or Z^+^- is afterwards reconstructed as described in Section 4.2.2, almost 
eliminating completely this background. The efficiency of this cut is somewhat poorer for the signal 
than for the background because most of the background events have Z bosons of relatively low pj, with 
different lepton pair energies and opening angle. 

Using the cone algorithm, size 0.8, the hadronic W boson candidate is identified as a heavy single jet 
having a mass between 70 and 100 GeV, and separated in azimuthal angle from the Z boson candidate 
by A(j)(W,Z) > 2, as described in Section 4.1 (see Fig. 18). At this stage, considering that the fraction 
of single jet W bosons becomes important for pj > 250 GeV (see Fig. 5), and in order to be consistent 
with the preselection cuts on the Z+3 jets background, we apply the following cuts to the reconstructed 
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Figure 16: WW (top 4) and WZ (bottom 3) invariant mass spectra in the £v semileptonic channel, 
showing the total W+jets and it backgrounds and the signal for the three resonant signal samples and the 
continuum sample. The error bars reflect the uncertainty from the Monte Carlo statistics. 
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Figure 18: Mass of the heavy jet for the m = Figure 19: Reconstructed W boson mass from a 
1 . 1 TeV ChL resonance and corresponding back- jet pair for the m = 800 GeV resonance . 
grounds. No tt event is left. 



W and Z bosons: z > 250 GeV and |t] W ' Z | < 2.0. 

After a forward jet selection, (see Section 4.3, PtcxxX. = 20 GeV, E cu t = E 2cu t = 300 GeV, T] cu t = 1.5, 
\ilfjet | > T]centraijet, A^cut = 4.5), the invariant mass of these two jets is required to be greater than 700 
GeV. Note that the efficiency of the forward jet cuts appearing in Table 7 appears artificially good for the 
background because a preselection was already applied. 

A central jet veto was found to be unnecessary, as no tt event survived the selection. Because of 
the lack of statistics for the tt sample, it is not possible to exclude completely a contribution from this 
background. The normalisation factor is 4.9, meaning that tt is excluded, over the whole mass range, 
at the level of 11.3 fb at 90% C.L. To have an estimate of the efficiency of the last two cuts at rejecting 
this background, the mass window for the cut on the Z boson mass was loosened: 60 < mz < 120 GeV, 
allowing 44 events (215 fb) to pass for the Z — > ee channel and 38 events (185 fb) for the Z —> jiji. 
channel. The W boson mass cut alone is found to have an efficiency of 12% and the forward jet cut alone 
lets no event survive. Assuming that the cuts are independent, the overall efficiency of the heavy jet mass 
cut and forward jet tagging combined is higher than 0.15%. The exclusion limit at 95% C.L. (1.64 a) for 
the tt background is shown in Table 7 and it will be assumed that this is negligible in the mass window of 
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Figure 20: Reconstruction of ChL resonance at m = 1.1 TeV (left) and m = 800 GeV (right) in the 
channel qqWjZu (with £ = e,jx), where a single jet cone 0.8 has been used to reconstruct the W. 

the resonance. The Z+4 jets background was not included here because it may be double-counting with 
Z+3 jets with parton shower. In order to evaluate the level of this background, an average over the high 
mass region was taken because of the relatively poor Monte Carlo statistics, yielding about 0.03 fb/100 
GeV. 

For the m = 1.1 TeV case, it was found that the trigger efficiency, based on the OR of e60 , mu20 
and j 160, was 100% at the end of the selection. 

Figure 20 shows the resonance mass resulting when the Z boson has been reconstructed from elec- 
trons or muons and the W boson from a single jet of size 0.8. 

5.2.2 W boson from a jet pair 

As above, after applying electron quality cuts, the lepton transverse momenta are required to satisfy 
PT(e\,fli) > 50 GeV and PT{ei,\l?) > 35 GeV, and a Z e + e - (Z^+ jU ) boson having a mass between 85 
and 97 GeV (83 and 99 GeV) is then reconstructed. Considering all pairs of jets with pj > 30 GeV in 
the central region (|tj| < 3.0) not overlapping with the electron jets from the Z decay, the one yielding 
an invariant mass closest to the mass of a W boson will be the W boson candidate (see Fig. 19). The low 
efficiency of this cut can be explained in part by the fact that a good fraction of events are constituted of 
a single jet W boson. Forward and backward jet selection proceeds as in 5.2.1. A central jet veto is also 
applied: we exclude events with an extra jet, having a pj > 30 GeV, not corresponding to the jets from 
the W boson or the forward and backward jets and we require the W and Z directions to be in the central 
region |t]| < 2. Figure 21 show the resulting reconstructed resonance masses. Table 7 summarizes the 
cut flow for this analysis. Here, by using the technique of widening the Z boson mass window as in 
Sect. 5.2.1, it is estimated that the tt background could be approximately 0.13 fb and it will be assumed 
that this is negligible in the mass window of the resonance. 

5.2.3 Comparison to &j_ analysis 

This channel was also studied with the analysis techniques described in Section 5.1, with the k± algorithm 
but using the leptonic Z boson identification (Section 4.2.2) instead of the leptonic W boson identification 
(Section 4.2.3). The hadronically-decaying W boson is reconstructed dynamically from one or two jets 
(Section 4.1). 

The signal mass distributions for fast and full simulation are shown in Fig. 22. The final mass 
distributions are shown in Fig. 23, and are comparable to the results from the cone algorithm method for 
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Figure 2 1 : Reconstructed ChL resonance at m = 800 GeV (left) and m = 500 GeV (right) in the channel 
qqWjjZu (with £ = e,jJ.) where two jets of cone size 0.4 have been used to reconstruct the W boson. No 
tt events survive the selection. 







(m = 500 GeV) 


WtvZu 


(m = 1.1 TeV) 


W ev Z u jj(SM) 




<r(fb) 


eff. 


a(fb) 


eff. 


<7(fb) 


eff. 


Z ee 


1.47 


18% 


0.23 


20% 


20.7 


16% 




1.09 


14% 


0.18 


15% 


16.7 


13% 


W reconstruction 


1.43 


56% 


0.25 


61% 


18.9 


51% 


Forward jet tagging 
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0.14 


56% 
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Table 8: Cut flow for the We v Z u (m = 500 GeV and 1.1 TeV) signals. All the cuts are described in detail 
in this section. 




the 1.1 TeV case which is not sensitive to the cut on the pj of the VB's of 200 GeV. 

5.3 W ± Z^£ ± v £+£- 

This purely leptonic channel consists of four different signatures: We± v Z£±£? with £ = <?,/!. The main 
background will be WZjj production from the Standard Model. The analysis starts by identifying lep- 
tonic Z e + e - (Z jU + jlt -) bosons as described in Section 4.2.2, after requiring two leptons with pj greater than 
50 and 35 GeV. 

As a second step, we proceed to reconstruct the W boson from the highest pj lepton among those 
remaining in the event, if there is one, and the measured missing transverse energy, as described in 
Section 4.2.3. The solution which yields the highest pj W boson is kept. 

The forward and backward jet selection follows the prescription of the Section 4.3 (Prcut = 20 GeV, 
^CUt = £ 2CUt = 300 GeV, T] cut = 1.5, \r} fjet \ > TJcentraljet), AT] cut = 4.5). 

In Table 8 we present the cut flow of the reconstruction of the resonances for 1.1 TeV and 500 GeV. 
Also in Fig. 24 and Fig. 25 we present the reconstructed resonance and the background WZjj for the 
same resonance mass. 

5.4 ZZ^vv£+£- 

This scalar resonance can be interpreted as a Standard Model Higgs boson produced by vector boson 
fusion. At leading order, the cross section times branching ratio would be 6 fb, compared to 4 fb obtained 
for the ChL model. This signal is characterised by a leptonic Z boson accompanied by large $t, yielding 
a large transverse mass. The backgrounds considered are: ZZjj —>■ ££vvjj and WZjj — > £v££jj. Other 
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Figure 22: WZ invariant mass spectrum in the t + t channel for the three resonant signal samples 
obtained using k± algorithm approach. Dotted lines indicate the fast simulation results. 
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Table 9: Cut flow for the Z vv Zuqq (m = 500 GeV) signal. All the cuts are described in detail in this 
section. 



background can result from Z+jets production, where the tail of the missing transverse energy distribution 
can fake a signal. 

After selecting the leptonically decaying Z e + e - (Z^+^-) boson as usual, with mass between 85 and 
97 GeV (83 and 99 GeV), a minimum $ T of 150 GeV is required. For this high value of $ T , Z+jets 
background is expected to be negligible for a Standard Model Higgs boson signal [39]. The forward jet 
selection is applied (see Section 4.3, p TC ut = 2 0 GeV, £" cut = £" 2 cut = 300 GeV, T] cut = 1.5, A?7 cut = 
4.5). 

The transverse mass, defined as: 

m 2 T = (^p T (Z)2+m 2 z +$ T ) 2 -(p T (Z)+if T ) 2 (1) 
is shown in Fig. 26 and the cut flow can be found in Table 9. 
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Figure 23: WZ invariant mass spectrum in the £ + £~j(j) channel for the three resonant signal samples 
obtained using algorithm approach. Z+jet histogram (in red) represents a direct sum of all Z+3 or 4 
jets backgrounds with no matching, and hence is a conservative estimate. The background from tt events 
has been found to be negligible. 
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Figure 24: Full reconstruction of ChL resonance 
1.1 TeV (W(± v Zf± f T). 



Figure 25: Full reconstruction of QCD-like reso- 
nance m ~ 500 GeV (W^± V Z^ T ). 
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6 Results 

The significance of the signals and the luminosity required for a possible discovery is estimated here. 
From the reconstructed resonance mass distributions in Section 5 one can evaluate the size of the signal 
and background in the resonance mass window. Table 10 summarises the approximate cross sections 
expected after the analyses described above. The table also gives the luminosity required to observe a 
significant excess over the background, showing the uncertainty from MC statistics only, and the signifi- 
cance of a signal for an integrated luminosity of 100 fb _1 . Because of the large statistical and systematic 
uncertainties (see Section 7), the numbers given here must be taken as an only an approximate indication 
of the reach of the LHC for such resonances. 
The significance is calculated as 

significance = ^2((S+B)ln(l+S/B)-S) , (2) 

where S (B) is the number of expected signal (background) events in the signal peak region, which is 
defined as the three consecutive bins (of size given in the figures, chosen to represent the resolution), 
with the highest total number of signal events. The background is averaged over this region. 

In the WW case, only the semileptonic channel is accessible. Thus, as shown in Table 10, around 25 
fb~ ! is needed to start seeing indications of a resonance even in the most optimistic case studied here, 
and around 70 fb _1 is needed for a discovery. 

In the continuum case, the "signal" is spread over an extended mass region with a total of about 0.3 
events expected for each fb~ ! in the mass range m = 400-1900 GeV, compared to about 2.5 background 
events. Measuring this cross-section with any accuracy using the techniques developed here would re- 
quire an integrated luminosity of several hundred fb~ 1 . 

Since the mass windows for hadronic W and Z boson decays overlap, in practice these scenarios 
can probably not be distinguished in this channel, and a combined analysis would in reality have to be 
performed, which would then be compared to those channels containing a leptonic Z boson decay. 

For each of the WZ resonances, results of the different channels, Wf V Z«, WjjZ^ and We v Zjj can, 
in principle, be combined. From Table 10, one can conclude that for two of three mass regions, m = 
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Process 


Cross section (fb) 


Luminosity (fb l ) 


Significance 
for 100 fb" 1 


signal 


background 


for 3a 


for 5a 


WW/WZ->£v jj, 
m = 500 GeV 


0.31 ±0.05 


0.79 ±0.26 


85 


235 


3.3±0.7 


ww/wz^ev jj, 

m = 800 GeV 


0.65 ±0.04 


0.87 ±0.28 


20 


60 


6.3 ±0.9 


ww/wz^ev jj, 

m = 1.1 TeV 


0.24 ±0.03 


0.46 ±0.25 


85 


230 


3.3±0.8 


WjjZu, m = 500 GeV 


0.28 ±0.04 


0.20±0.18 


30 


90 


5.3±1.9 


W tv Z u , m = 500 GeV 


0.40 ±0.03 


0.25 ±0.03 


20 


55 


6.6 ±0.5 


WjjZ u , m = 800 GeV 


0.24 ±0.02 


0.30 ±0.22 


60 


160 


3.9±1.2 


WjZtt, m = 800 GeV 


0.27 ±0.02 ±0.05 


0.23 ±0.07 ±0.05 


38 


105 


4.9±1.1 


WjZ u ,m= 1.1 TeV 


0.19 ±0.01 ±0.04 


0.22 ±0.07 ±0.05 


68 


191 


3.6±1.0 


W* v Z«,m= 1.1 TeV 


0.070 ±0.004 


0.020 ±0.009 


70 


200 


3.6±0.5 


ZyyZu, m = 500 GeV 


0.32 ±0.02 


0.15 ±0.03 


20 


60 


6.6±0.6 



Table 10: Approximate signal and background cross sections expected after the analyses. An approxi- 
mate value of the luminosity required for 3 a and 5 a significance, and the expected significance for 100 
fb -1 are shown. The uncertainties, when given, are due to Monte Carlo statistics only. 



500 GeV and 800 GeV, a chiral Lagrangian vector resonance can be discovered with less than 100 fb -1 . 
The expectations with the alternative k ± analysis described in Section 5.2.3 are not far from the values 
in Table 10. As an example, the integrated luminosity needed for 3 a observation of the m = 800 GeV 
signal is 63 fb -1 , and of the m = 1.1 TeV signal is 81 fb -1 . 

A scalar resonance at m = 500 GeV will require about 60 fb~ 1 to be seen in the ZZ — >• wit channel. 

7 Systematic Uncertainties 

A number of large systematic uncertainties affect the signals studied here. Because of the small cross 
sections and the important backgrounds, it is difficult to estimate them with precision from Monte Carlo 
simulations. Data driven tests will be required to understand better the systematic effects. Some discus- 
sion of the most significant effects is given here. 

7.1 Background Cross sections 

As was discussed in Section 2, the renormalisation and factorisation scales, Q 2 , can affect the cross 
section by as much as a factor of two. This is especially true at high centre of mass energies, where 
the degree of virtuality of partons and choice of scale for a s are quite critical [28,29]. At present 
this represents a theoretical uncertainty on the current sensitivity estimate. While the predictions may 
improve in future, in an eventual analysis, the backgrounds would have to be measured from data and 
the eventual size of the associated systematic uncertainty has not been studied here. 

Another consideration is that for the analyses which dynamically move between the dijet and single 
jet reconstruction technique for the hadronically decaying vector boson (see Section 4.1), to evaluate 
the background with the samples available both W+3 jet and W+4 jet samples must be used. This 
implies some double-counting due to the lack of parton-shower matching in these samples, and so the 
background will be overestimated. This is in addition to the fact that as shown in Section 2.4.1, the 
MadGraph samples used overestimate slightly the energy of the tag jets. The effect is expected to be 
at the few per cent level. 
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7.2 Signal Cross sections 

The PYTHIA signal generation produces softer tag jets than the more exact Whizard MC, and thus the 
signal efficiencies are likely to be underestimated. 

7.3 Monte Carlo Statistics 

We are limited by the very large size of the background samples required. Fast MC simulation was 
shown to be in good agreement with full simulation and was used to evaluate it background for the 
WW signals. In some cases, only upper limits on the backgrounds can be given, although it is expected 
that these limits are very conservative. Again, this represents a systematic uncertainty on the current 
sensitivity estimates, but will not be present in a final data analysis, assuming sufficient simulated data 
will eventually be available. 

7.4 Pile-up and Underlying Event 

Pile-up and underlying event are separate effects which have potentially similar and crucial impact on 
the efficiency of the forward jet cuts, the central jet veto and the top veto, as well as on the jet mass 
resolution. 

Of particular concern is the fact that the top veto in the WW analysis uses jets down to pj = 10 GeV, 
expected to be strongly affected by these effects [40] . Simply raising the cut to 20 GeV admits signif- 
icantly larger background. Some of this can be removed for the higher mass resonances by raising the 
Pt cut on the vector boson. However, a more promising approach is likely to be to exploit ^-tagging and 
improved jet mass reconstruction to improve the veto. 

7.4.1 Pile-up 

Fully simulated samples with pile-up at low luminosity (10 33 cm~ 2 s _1 ) were available, but with much 
lower statistics: we restricted the analysis to one signal sample and one background sample. However 
pile-up effects should be approximately independent of the underlying physics sample, and we assume 
we can safely generalise the results obtained here. 

We compared the same events of the WjjZu at m = 1.1 TeV sample reconstructed with and without 
pile-up simulation. This allows computation of the fraction of events with pile-up having tagged forward 
jets with respect to corresponding non-pile-up events which fail the tagged jet criterion, thus defining a 
'fake' rate. The reciprocal fraction defines a 'miss' rate. The effect of pile-up increases with increasing 
jet radius and decreasing energy threshold, as would be expected. We found that both 'fake' and 'miss' 
effects are essentially due to the degradation of energy resolution in presence of pile-up and that their 
combination contributes to an uncertainty on the efficiency of the order of 5%. 

7.4.2 Underlying Event 

Current simulations use underlying event models tuned to Tevatron and other data [22], but there is a 
large extrapolation needed to 14 TeV. The underlying event would be have to be measured in LHC data, 
and its level is not currently known. 

7.5 Other Systematic Effects 

Systematic effects, such as uncertainties in the luminosity, in efficiencies and resolutions, jet energy 
scale, etc. are of the order of a few percent and will therefore be completely dominated by the above 
effects and by statistical uncertainties. 
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8 Summary and Conclusion 

The Chiral Lagrangian model with Pade unitarisation provides a framework for studying vector boson 
scattering at high mass, in case a light Higgs boson is not found at the LHC in the first years of running. 
With full detector simulation, the search for vector and scalar resonances of masses m = 500, 800 and 
1 100 GeV is studied. To suppress the very high backgrounds from W+jets and Z+jets to acceptable levels 
requires special techniques investigated here. In particular, at these high masses, hadronic vector boson 
decay results in a single jet. The reconstruction of the jet mass is found to be generally quite efficient 
at rejecting QCD jets. The k± and the cone algorithms can be applied to this heavy jet to resolve it into 
two light jets, suppressing further the background. Other conventional techniques for the study of vector 
boson fusion are also found essential for the present analysis: forward jet tagging, central jet veto and 
top-jet veto. 

The cut-based analysis presented here is performed with realistic simulation and reconstruction of 
leptons and jets. Improvements can be expected by more sophisticated analysis and, with real data and a 
good understanding of the detector, further gains can be achieved by improvements in the reconstruction 
efficiencies. 

The discovery of resonances in vector boson scattering at high mass will take a few tens of fb _1 , but 
the different decay channels of the vector boson pairs allow a cross-check of the presence of a resonance. 
These results can be considered generic of vector boson scattering and can therefore be interpreted in 
terms of other theoretical models with possibly different cross-sections. 
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Abstract 

Models with extra space dimensions, in which our Universe exists on a 4- 
dimensional brane embedded in a higher dimensional bulk space-time, offer a 
new way to address outstanding problems in and beyond the Standard Model. 
In such models the Planck scale in the bulk can be of the order of the elec- 
troweak symmetry breaking scale. This allows the coupling strength of gravity 
to increase to a size similar to the other interactions, opening the way to the 
unification of gravity and the gauge interactions. The increased strength of 
gravity in the bulk space-time means that quantum gravity effects would be 
observable in the TeV energy range reachable by the LHC. The most spectac- 
ular phenomenon would be the production of black holes, which would decay 
semi-classically by Hawking radiation emitting high energy particles. In this 
note, we discuss the potential for the ATLAS experiment to discover such black 
holes in the early data (1-1000 pb _1 ) 

1 Introduction 

In this study we simulate the search for black holes in the first 100pb _1 of LHC data with the ATLAS 
detector and software framework. 

The document's structure is as follows: Section 2 gives an overview of the extra dimension models, 
present limits on the size of the extra dimensions and a discussion of black hole production and decay. In 
Section 3 the Monte Carlo simulation samples are described. Section 4 presents basic event properties, 
and is followed by Sections 5 and 6 dealing with the triggering and analysis selection, respectively. The 
expected systematic uncertainties are given in Section 7. Finally, the extraction of model parameters, 
especially of black hole properties, is covered in Section 8. A summary is given in Section 9. 

2 Theory 

2.1 Theoretical Motivation 

The electroweak energy scale and the Planck scale, at which gravitational interactions become strong, 
differ by about sixteen orders of magnitude. This large difference between the scales of the two funda- 
mental interactions is known as the hierarchy problem. Explaining the hierarchy problem is one of the 
outstanding challenges in particle physics. 

Arkani-Hamed, Dimopoulos and Dvali (ADD) [1-3], and Randall and Sundrum (RS) [4,5] have 
pioneered approaches to solving the hierarchy problem by using extra-dimensional space. The hierarchy 
is generated by the geometry of the additional spatial dimensions. ADD models postulate additional flat 
extra dimensions, while RS models invoke a single warped extra dimension. The observed weakness of 
gravity is thus due to the gravitational field being allowed to expand into the higher-dimensional space 
(bulk), while the Standard Model particles are confined to our familiar three-dimensional space (3-brane). 
Extra-dimensional models can also be motivated by string theory. 

In extra-dimensional models, the D-dimensional Planck scale Mo is the fundamental scale from 
which the Planck scale Mp\ = 1.22 x 10 19 GeV in four dimensions is derived 1 . The relationship between 

Several conventions exist for the D-dimensional Planck scale in the ADD model. We denote by M D the parameter defined 
by Giudice, Rattazzi and Wells [6] and used by the PDG [7]: M^~ 2 = (2n) D ^ 4 /(SkGd), where G D is the D-dimensional 
Newton gravity constant. In an alternative convention given by Dimopoulos and Landsberg [8] the D-dimensional Planck scale 
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the two scales is determined by the volume of the extra dimensions in ADD models or by the warp factor 
in RS models. For large extra dimensions or a strongly warped extra dimension, the fundamental scale 
of gravity can be as low as the electroweak scale. If the Planck scale is low enough, black holes could be 
produced at the Large Hadron Collider (LHC) [9, 10]. Detecting them will not only test general relativity 
and probe extra dimensions, but would also teach us about quantum gravity. 

2.2 Experimental Limits 

Assuming that low-scale gravity is due to the existence of extra dimensions 2 , most experimental searches 
for unusual low-scale gravity effects have focused on detecting evidence for extra dimensions. Current 
experimental limits allow the fundamental scale of gravity to be as low as about 1 TeV. In testing the 
ADD models and deriving limits in these models, the compactification radius of all extra dimensions is 
assumed to be the same. ADD models have been tested at length scales comparable to the radius of the 
compactified (i.e. curled-up) dimensions R. Were the effective number of large extra dimensions to be 
n = D — 4, the inverse-square law would smoothly change from the 1 /r 2 form for r » R to a 1 / r 2+n 
form for r <C R. Searches have been performed and constraints on the Planck scale have been set by 
tabletop and particle accelerator experiments, astrophysical observations, cosmic -ray measurements and 
cosmological considerations. Direct searches for black holes at collider experiments have not yet been 
performed. The only direct limits on black hole production in high energy interactions were obtained 
using cosmic-ray data. Table top experiments lead to an upper bound of R < 44 um, at the 95% confidence 
level [12]. The LEP bounds obtained with direct searches vary from 1.5 TeV for n = 2 extra dimensions 
to 0.75 TeV for 5 extra dimensions [13]. The latest direct search result from the CDF collaboration has 
set lower bounds with 1 .1 fb _1 of Run II data on M D of 1 .33 TeV for n = 2 to 0.88 TeV for n = 6 [14]. All 
four LEP experiments combined set a lower limit on M/> of 1 .2 TeV for positive interference, or 1 . 1 TeV 
for negative interference between Standard Model diagrams and graviton exchange [15, 16]. Indirect 
searches by the D0 [17] collaboration set lower limits around 1.28 TeV 3 . Astrophysics places the most 
stringent lower limits on Md in ADD models which however fall sharply with increasing number of 
extra dimension [18-24]. Considerations of neutron star dynamics imposes the strongest constraints: 
Mo > 1760, 77, 9 and 2 TeV for n = 2, 3, 4 and 5 extra dimensions, respectively [18]. One should 
note that all astrophysical and cosmological constraints are based on a number of assumptions, whose 
uncertainties are not included in the limit derivations, so the results are reliable only as order of magnitude 
estimates. Ultra high-energy cosmic-ray particles, through their interaction with the Earth's atmosphere, 
offer a complementary probe of extra dimensions. Cosmic-rays interact with the atmosphere and earth's 
crust with centre-of-mass energies of the order of 100 TeV. The particles can produce black holes deep 
in the atmosphere, leading to quasi-horizontal giant air showers. So far a lower bound on M D in the 
ADD model, ranging from 1 .0 to 1 .4 TeV for scenarios with 4 to 7 extra dimensions has been set at 95% 
confidence level [25]. It is expected that the Pierre Auger Observatory will be able to set more stringent 
limits during the first five years of operation; the estimates place Mq > 3 TeV for n > 4 [26]. 

2.3 Working Model 

Our working model for black holes uses the black disk cross-section, which depends only on the horizon 
radius. The (4 + n) -dimensional Myers-Perry solution [27], similar to the 4-dimensional Schwarzschild 
radius, is chosen for the horizon radius r^. It depends only on the number of dimensions and the Planck 

M DL is defined via Mg L 2 = \/G D . 

2 See Ref. [1 1] for a model of TeV gravity in four dimensions. 

3 This lower limit uses the Hewett approach for the calculation of M D and implies a positive interference term X with the 
Standard Model diagrams. 
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scale. The classical black hole cross-section at the parton level is 

o-a6^BH = , (1) 

where a and b are the parton types. In most cases, we work with initial black hole masses at least five 
times higher than the Planck scale at which the expression for the cross section should be valid. 

The total cross-section is obtained by convoluting the parton-level cross-section with the parton dis- 
tribution functions (PDFs), integrating over the phase space, and summing over the parton types. 

Throughout this study we use the CTEQ6L1 (leading order with leading order a s ) parton distribution 
functions [28] within the LHAPDF framework [29]. The momentum scale for the PDFs is set equal to 
the black hole mass for convenience. 

The transition from the parton-level to the hadron-level cross-section is based on a factorisation 
ansatz. The validity of this formula for the energy region above the Planck scale is unclear. Even if 
factorisation is valid, the extrapolation of the parton distribution functions into this transplanckian region 
based on Standard Model evolution from present energies is questionable, since the evolution equations 
neglect gravity and possible KK states in the proton. 

The details of horizon formation, and the balding and spin-down phases have been ignored 4 . The 
important effects of angular momentum in the production and decay of the black hole in extra dimen- 
sions are not accounted for in the Monte Carlo event generator. The black holes are considered as 
D-dimensional Schwarzschild solutions. Only the Hawking evaporation phase is generated by the simu- 
lation. 

We can view the Hawking evaporation phase as consisting of two parts: determination of the particle 
species and assigning energy to the decay products. A particle species is selected randomly with a 
probability determined by its number of degrees of freedom and the ratio of emissivities 5 . The degrees 
of freedom take into account polarisation, charge and colour. The emitted charge is chosen such that the 
magnitude of the black hole charge decreases. All Standard Model particles are considered, including a 
Higgs boson 6 . The particles are treated as massless, including the gauge bosons and heavy quarks. 

Gravitons have not been included in the simulation, which is another drawback of the current model. 
Because the graviton lives in the bulk, the number of degrees of freedom of the graviton becomes sig- 
nificant for high numbers of dimensions. In addition, the graviton emissivity is highly enhanced as the 
space-time dimensionality increases. Therefore the black hole may lose a significant fraction of its mass 
into the bulk, resulting in missing transverse energy. 

The energy assignment to the decay particles in the Hawking evaporation phase has been imple- 
mented as follows. The particle species selected by the model described above is given an energy ran- 
domly according to its extra-dimensional decay spectrum. A different decay spectrum is used for scalars, 
fermions and vector bosons, i.e. the spin statistics factor is taken into account. Grey-body spectra are 
used without approximations [30]. The grey -body factors depend on the number of dimensions. The 
Hawking temperature is updated after each decay. It is assumed the decay is quasi-stationary in the sense 
that the black hole has time to come into equilibrium at each new temperature before the next particle is 
emitted. The energy of the particle given by the spectrum must be constrained to conserve energy and 
momentum at each step. 

The evaporation phase ends when the chosen energy for the emitted particle is ruled out by the 
kinematics of a two-body decay. At this point an isotropic two-body phase-space decay is performed. In 
our simulation, the decay is performed totally to Standard Model particles and no stable exotic remnants 
survive. 

4 The main effects of these are to reduce the black hole production cross-section, possibly by as much as a few orders of 
magnitude. Work is underway to try to estimate the magnitude of this. 

5 Here emissivity is the fractional emission rate per degree of freedom. Mathematically it is ratio of dE/dt for different 
species, or roughly speaking the area under the black body distribution for a particular type of particle. 

including a scalar Higgs boson is not significant since it has only one degree of freedom. 
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Name 


Description 


Value 


MINMSS 


Minimum mass of black holes 


5TeV 


MAXMSS 


Maximum mass of black holes 


14TeV 


MPLNCK 


Planck scale 


ITeV 


MSSDEF 


Convention for Planck scale 


2 


TOTDIM 


Total number of dimensions 


6 


NBODY 


Number of particles in remnant decay 


2 


GTSCA 


Black hole mass used as PDF momentum scale 


True 


TIMVAR 


Allow Th to change with time 


True 


MSSDEC 


Use all Standard Model particles as decay products 


True 


GRYBDY 


Include grey-body effects 


True 


KINCUT 


Use a kinematic cut-off on the decay 


True 



Table 1 : Default parameters used in the CHARYBDIS generator. 



Baryon number, colour and electric charge are conserved in the black hole production and decay in 
this model. Missing transverse energy in the generator comes only from the neutrinos, while in reality 
missing transverse energy is also possible due to the lost energy in inelastic production, graviton emis- 
sion, a non-detectable black hole remnant and the possibility that the black hole can leave the Standard 
Model brane. For the black holes we consider, only a small amount of energy, on average, is lost due to 
neutrinos. If gravitons were considered, the average energy loss would be approximately 9% [31]. 

3 Monte Carlo Simulations 

3.1 Production of Signal and Background Events 

The event generator CHARYBDIS [32,33] version 1.003 was used within the ATLAS software frame- 
work to generate Monte Carlo signal samples. It was interfaced via the Les Houches accord [29] to 
HERWIG [34, 35] which provides the parton evolution and hadronisation, as well as Standard Model 
particle decays. 

Table 1 shows the default CHARYBDIS parameters used, for which approximately 25000 events 
were generated. Three other black hole signal samples were generated with variations in the number of 
dimensions and in the black hole minimum mass. In all simulations, the parameter MSSDEF was set 
equal to 2, setting the Planck scale MPLNCK to be the D-dimensional Planck scale Mdl in the con- 
vention of Ref . [8] . The above samples subsequently underwent the full ATLAS detector simulation 
and reconstruction. Fast simulation using ATLFAST [36] was employed to widen the range of signal 
samples studied, enabling investigation of the many theoretical uncertainties modelled by generator pa- 
rameter switches (see Table 2). 

Black holes decay democratically to all particles of the Standard Model, so few Standard Model 
processes should produce the same particle spectrum. Black hole decays are characterised by a number 
of high energy and transverse momentum objects, so the primary Standard Model backgrounds are states 
with high multiplicity or high energy jets. The predominant backgrounds to our signal are described 
below and their datasets and cross-sections are listed in Table 3. Sizeable samples are required due to 
their large cross-sections at the LHC. 

• it leptonic and hadronic decay modes. This process yields the largest contribution to the back- 
ground due to its large cross-section at the LHC and the large branching ratio to hadronic final 
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n 


TTiBH ( TeV) 


a(pb) 


Note 


2 


5-14 


40.7 




2 


8-14 


0.34 




4 


5-14 


24.3 




7 


5-14 


22.3 




2 


5-14 


6.4 


MPLNCK=2 


3 


5-14 


28.5 




5 


5-14 


22.7 




2 


5-14 


40.7 


KINCUT=0 


7 


5-14 


22.3 


KINCUT=0 


2 


5-14 


40.7 


TIMEVAR=0 


7 


5-14 


22.3 


TIMEVAR=0 


2 


5-14 


40.7 


NBODY=4 


7 


5-14 


22.3 


NBODY=4 



Table 2: Monte Carlo datasets and their respective cross-sections used in this analysis. The first four 
samples were simulated using both full and fast simulations; the lower nine samples were simulated using 
the fast simulation ATLFAST. The final column shows the CHARYBDIS parameter that was changed 
with respect to the reference set shown in Table 1 . 



states. The matrix element calculation is done with MC@NLO [37] and HERWIG is used to 
perform the parton shower evolution, parton decay and their hadronisation. 

• QCD dijet production. The requirements placed on the hadronic part of the signal events reduces 
the contribution from low-prQCD jets. This background is generated using Pythia 6.4 [38]. 
Note that the complete QCD inclusive jet production is not fully modelled by the Pythia dijet 
simulation due to the lack of higher-order QCD contributions. Very low-statistics samples of 
multijet samples generated by ALPGEN [39] were also used. 

• W —>■ £v + jets production. These backgrounds, though coming from the hard process, have cross- 
sections that rapidly become small compared to the signal as more jets are added. Vector boson 
plus jets samples were generated using ALPGEN. 

• Z — >• ££+ jets production. 

• 7(7) + jets production. 

3.2 Detector Simulation 

The detector simulation and reconstruction of both signal and background Monte Carlo events were 
performed within the ATLAS offline framework. 

Fast simulation (ATLFAST) was used to widen the range of signal samples studied. The primary 
advantage of this is one of processing rate: since no detector interactions are modelled it requires less than 
one second per event. In contrast, full simulation requires approximately 15 minutes for typical Standard 
Model events, and over 30 minutes per black hole event. Despite this advantage, there are drawbacks: 
the fast simulation does not include a complete treatment of lepton isolation and misidentification nor of 
photon conversion. 

The same generator signal samples were passed through the full and fast simulations in order to 
understand the differences in black hole events. Variables ranging from simple multiplicities to event 
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Process 


a(pb) 


Semi/Fully Lep tonic it 


463 


Hadronic tt 


370 


QCD dijets 


12.84 x 10 3 


W —* ev e + jets 


281 


W — > liv^ +jets 


279 


Z — ► ee+jets 


25.8 


Z—^nn +jets 


26.0 


7+jets 


5.00 x 10 3 


77+jets 


67.6 



Table 3: Background Monte Carlo datasets and their respective branching ratio times cross-sections. 




Figure 1 : Total particle multiplicity and missing transverse energy distributions for signal samples from 
fast and full simulation. 

shapes were compared. Sample distributions of particle multiplicity and |?Yare shown in Figure 1. The 
multiplicity difference is due to differing default jet algorithms; we use a cone algorithm with radius 
AR = 0.4 in the full simulation, whereas ATLFAST uses a kj_ algorithm. Using the same algorithm for 
both samples gives very close agreement, nonetheless this discrepancy will have an effect in analyses 
dependent purely upon multiplicity information. All other variables investigated showed concordant 
results. 

4 Event Properties 

The high mass scale, and the thermal nature of the decay process, result in black hole events being 
characterised by a large number of high-prfinal state particles, including all the Standard Model fields. 
Graviton emission is also expected, but is not simulated in CHARYBDIS. Of the final state particles, the 
detector can measure jets, electrons, muons and photons well, and will be able to reconstruct some of the 
Z and W bosons. The missing transverse energy, produced mainly by neutrino and graviton emission, 
can also be measured. In this section, the data sample with two extra dimensions and black hole masses 
above 5 TeV is used as the reference signal sample. 

A key feature of black hole decays is that the Hawking temperature is higher for larger n, for a given 
black hole mass. A higher temperature produces higher energy emissions, with the consequence that the 
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Figure 2: PDG code of particles emitted from black hole decay for a minimum black hole mass of 5 TeV 
and n = 2, 4 and 7 and for a minimum black hole mass of 8 TeV and n = 2 (\PdgId\ = 1 - 6 are quarks, 
11-16 leptons, and 21 - 25 gauge and Higgs bosons). The vertical axis shows multiplicity per black 
hole decay. 



energy is shared between fewer particles. This has a significant effect on the multiplicity and event shape 
distributions. Similarly, the samples with a higher black hole low-mass cutoff produce more high energy 
final state particles. 

4.1 Particle Types and Multiplicities 

Figure 2 shows the types of particles produced directly by black hole decay. The vertical axis shows 
the average number of particles per black hole decay. From this figure, we see that a heavier black hole 
has more decay products. The particle-antiparticle balance is broken by the initial state of two protons 
colliding. Moreover, due to conservation of energy and momentum, colour connection etc., a perfect 
democratic decay cannot be achieved, e.g., the number of top quarks is smaller than that of ligher quarks. 
The possibility of identifying fermions and bosons and determining their branching ratios in black hole 
decays was studied in [40] . 

Figure 3 shows pj and pseudorapidity (r\) distributions of particles produced directly from black 
hole decays. As expected, the shape depends little on particle type. Figure 4 shows the reconstructed 
multiplicity of final-state jets, leptons and photons. Four signal samples are shown for n = 2, 4 and 7 
with a minimum black hole mass of 5 TeV, and for n = 2 with a minimum black hole mass of 8 TeV. The 
figure also compares the reference signal to the backgrounds. The multiplicity in the signal falls as n 
rises, because the black holes decay at a higher temperature. 

4.2 Event Shape 

At first sight, one would expect black hole events to be very different from the background in event 
shape variables [38,41,42] such as sphericity, because of the high multiplicity thermal decay. However, 
the event shape of the black hole events varies considerably with n, making such variables less useful 
than could be hoped. Though the background distributions show less variation, when these are scaled 
by their large cross sections, there is a large degree of overlap, disfavouring their use as a cut variable. 
Additionally, our ignorance of the decay modes of the final black hole remnant introduces a significant 
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Pt[GeV] Eta 



Figure 3: Generator pj distributions (top row): leptons (left) and Z bosons (right) emitted from the black 
hole. The bottom row shows pr and Y\ spectra for all particles emitted from the black hole. 




Multiplicity Multiplicity 



Figure 4: Multiplicities of reconstructed objects for (left) black hole samples and (right) backgrounds. 
They are normalised to the integrated luminosity of 1 fb _1 . 
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Figure 6: Thrust calculated from reconstructed objects for (left) black hole samples and (right) QCD 
dijet and tt backgrounds. 

systematic effect. In our version of CHARYBDIS, once the mass of the black hole has dropped below 
the Planck scale, the remnant decays to either 2 or 4 bodies. We have selected the two-body option for 
our standard samples. This means that at high n, where events can reach this stage after few emissions, 
the circularity of the events is reduced, and the thrust increased. 

The distinguishing power between signal and backgrounds of a selection of event shape variables was 
studied; Figure 5 shows the circularity distribution for the same samples as Figure 4; similarly Figs. 6 
and 7 show their thrust distributions, sphericity and aplanarity. The expected bias towards more "jet- 
like" events is clearly seen at high n. For this reason, we choose not to use event shape variables as a 
discriminant in this analysis. 

5 Trigger 

The ATLAS trigger and data-acquisition system consists of three levels (LI, L2, EF) of online event 
selection [43]. Each subsequent trigger level refines the decisions made at the previous level and may 
apply additional selection criteria. The ATLAS trigger is described in detail in ref. [44]. 
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Figure 7: Different event shape variables for black hole samples and backgrounds. They are normalised 
to the integrated luminosity of 1 fb _1 . 
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Figure 8: Simulated jet trigger efficiencies as functions of the offline reconstructed jet prfor a) LI, b) L2 
and c) EF. The efficiencies are determined for different pj -thresholds: 150 GeV (black), 300 GeV (red), 
400 GeV (blue), 600 GeV (magenta) and 800 GeV (cyan). 

5.1 Triggering on Black Holes 

Each black hole produces multiple decay products, including hadronic jets, leptons and photons, as 
described in Section 4. The jets typically carry a dominant fraction of the visible decay energy and hence 
provide the best option for triggering black hole events. 

The response of the jet trigger, as simulated in the current version of the ATLAS detector simulation, 
is demonstrated in Figure 8. The plots show the trigger efficiencies for various /^-thresholds as functions 
of the jet pj reconstructed offline. For these plots, a match between the jet reconstructed offline and at 
the respective trigger level is required. The matching consists of searching for the closest offline jet in 
AR = y/ (At]) 2 + (A0) 2 , where Atj and A</> are the distances between the reconstructed jet and the trigger 
jet in pseudorapidity tj and azimuth <j>, respectively. To avoid incorrect matching for LI jets, a modified 
criterion is applied: the LI jet closest in energy to the reconstructed jet is chosen among the jets found 
within the AR = 0.5 distance around the reconstructed jet. The shape of the LI efficiency distribution 
for the 800 GeV threshold is due to the saturation of the LI trigger tower energies at 255 GeV. Events in 
which the transverse energy in one trigger tower exceeds 255 GeV are automatically accepted, as larger 
values fill up the memory of the LI trigger analog-to-digital converters. 

The efficiency at each trigger level is determined independently of the decisions at the other levels. 
Were the trigger chain to have the same threshold on all levels, the total efficiency would be the con- 
volution of the respective functions. The L2 algorithms are based on regions of interest provided by 
LI, hence it is not possible to determine their efficiency completely independently of the LI decisions. 
The L2 algorithms were run on all LI jet Rol stalling from the lowest LI jet /^threshold of 35 GeV. 
This is much lower than the thresholds studied, making the L2 decision (shown in Figure 8b) virtually 
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Trigger Threshold [GeV] 



Figure 9: Simulated jet trigger efficiencies for black hole events from the signal sample with n = 2 and 
m > 5 TeV as functions of the jet pj threshold for a) single-jet trigger, b) 3-jet trigger and c) 4-jet trigger. 
The efficiencies are determined for LI (black), L2 (red) and EF (blue). 

independent of the LI efficiency. 

The total trigger efficiencies are listed in Table 4 for three signal samples and demonstrated in Fig- 
ure 9 for the signal sample with n = 2 and m > 5 TeV. The highest efficiency is provided by the single -jet 
trigger, which we consider to be the master trigger for the black hole events. The presence of multiple 
high- pr jets per event, each of which is likely to pass the trigger, results in very high total efficiencies. 
Setting this trigger threshold at 400 GeV will provide greater than 99% efficiency at all trigger levels. 
The Standard Model process rate at this threshold is expected to be less than 0. 1 Hz at an instantaneous 
luminosity of 10 31 cm _2 s _1 , which should allow this trigger to run at this threshold without prescaling 
for the first few years of LHC data taking. The rate of black hole events is expected to be less than 5 mHz 
at the 10 31 cm _2 s _1 luminosity. For the start-up running at the luminosity of 10 31 cm _2 s _1 , it is planned 
to set the highest threshold for the single-jet trigger at 120 GeV, guaranteeing an efficiency of almost 
100% for black hole events. 

Alternatively, a trigger based on the scalar sum of transverse energies of all recorded decay products 
("sum-£V trigger") can be used. No simulation of this trigger is available in the samples used in this 
study. Looking at this sum in the offline reconstruction suggests that this trigger would collect nearly 
100% of black hole events for Planck scales above 1 TeV. It is foreseen to run this trigger in the start-up 
data taking, unprescaled at the threshold of 650 GeV. 

Based on experience from previous collider experiments, one may expect detector hardware problems 
at the beginning of data taking. In particular, noisy channels in the calorimeter or trigger electronics may 
cause high trigger rates for the single-jet trigger and for the sum-Er trigger, such that even the highest 
threshold triggers have to be prescaled. In such cases, a multijet (3- or 4-jet) trigger is considered for use 
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a) CHARYBDIS: n = 2,m > 5 TeV b) CHARYBDIS: n = 4,m > 5 TeV 



Trigger 


LI 


L2 


EF 


Trigger 


LI 


L2 


EF 


jlOO 


1 


1 


1 


jlOO 


1 


1 


1 


j400 


0.997 


0.997 


0.997 


j400 


0.997 


0.997 


0.996 


3jl00 


0.998 


0.998 


0.998 


3jl00 


0.952 


0.952 


0.952 


3j250 


0.972 


0.971 


0.971 


3j250 


0.886 


0.885 


0.885 


4jl00 


0.985 


0.985 


0.985 


4jl00 


0.807 


0.806 


0.806 


4j250 


0.865 


0.862 


0.862 


4j250 


0.612 


0.607 


0.607 


c) CHARYBDIS: n 


= 7,m>5TeV 










Trigger 


LI 


L2 


EF 










jlOO 


1 


1 


1 










j400 


0.990 


0.987 


0.985 










3jl00 


0.807 


0.806 


0.805 










3j250 


0.710 


0.704 


0.704 










4jl00 


0.525 


0.522 


0.522 










4j250 


0.343 


0.341 


0.341 











Table 4: Simulated jet trigger efficiencies for black hole events as functions of the jet-prthreshold for 
different simulation samples. 



until the detector problems are resolved. The efficiencies of such triggers are listed in Table 4. 

In the present study, the minimum mass of a black hole is set at 5 TeV or more in order to be safely 
above the Planck scale. At lower masses one may expect an increased rate of dijet events described by 
a contact interaction. The single-jet trigger or the sum-£"7 trigger at the thresholds considered above are 
well suited for detecting such signatures. Such events may not, however, be selected by multijet triggers. 

The trigger efficiencies, studied here in the simulation, have to be determined from data. An unbiased 
determination requires an "orthogonal" trigger, e.g. a trigger based on fully independent information 
from that used by the master trigger. A muon trigger which is based solely on signals in the muon 
detector should be well suited for such studies. 

6 Signal Selection and Background Rejection 

6.1 Event Selection 

Since all types of Standard Model particles are produced from black hole decay, we make full use of 
particle identification information (PID) from our detectors. First we select muons, electrons, photons 
and jets, which are called objects in this section. Table 5 shows the details of their selection criteria. 

The identification of objects is sometimes ambiguous: e.g., an electron could be simultaneously 
reconstructed as a jet. To resolve this, we apply PID to each object, selecting muons, electrons, photons 
and jets in that order of priority. Once an object passes the PID criteria in a given category, any remaining 
ambiguous assignments are removed if they match the chosen object within a AR of less than 0.1. 

Next we select black hole events using these objects as described below. Then we reconstruct a 
black hole from all the identified objects for the selected event. The mass of the black hole in an event 
is calculated from the four-momenta of the reconstructed final state objects and missing E T , which is 
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included in the calculation to improve the reconstructed mass resolution: 

PBH = £ Pi + ($T,$T x ,$T y ,0), (2) 



W B H = 




We present two methods to select black hole events. One is based on the scalar summation of pj 
and the other on the multiplicity of high-p^ objects. Both make use of the characteristic of a black hole 
having large mass. After that, we require a high- pr lepton to reject backgrounds further. 

Figure 10 shows the scalar summation of the pj of each object, £|jcr|» which demonstrates good 
background discrimination and high signal efficiency for all black hole samples. We require £ \pj\ to be 
larger than 2.5 TeV to reject backgrounds. This requirement is relatively unaffected by changes in the 
model, in particular by changes to the number of extra dimensions n. Figure 1 1 shows m-gu distributions 
after this requirement. The QCD dijet background is already well suppressed, but we also investigated 
the effect of a further selection, requiring a lepton with a pj > 50 GeV. This resulted in the QCD dijet 
background being rejected by a factor greater than 10 6 as shown in Table 6, which summarises the event 
numbers for an integrated luminosity of 1 fb~ 1 . Though the high statistics QCD samples used were 
generated with Pythia, a leading order generator, there were also /^-sliced small ALPGEN multijet 
samples available. When investigated using the £ \pt\ and lepton cut method, a very similar, marginally 
lower number of background events was predicted according to the very limited statistics available. 
Larger scale studies would be needed to conclude anything more concrete. Poisson confidence limits are 
used for samples where fewer than 20 events passed the requirements. Signal cross-section errors are 
statistical only; the theoretical uncertainties are large as discussed in Section 2. 7 

An alternative selection procedure was also used. Figure 12 shows the pj distributions of the leading, 
2nd-, 3rd- and 4th-leading objects out of all the selected objects. The 4th-leading object still has larger pj 
in the signal events than in the background events. We require the number of objects with pj > 200 GeV 
to be equal to or greater than four. Figure 14 (left) shows otbh distributions after this requirement. Since 
QCD processes still remain large, a lepton requirement is again used to decrease it. Figure 13 shows the 
distribution of the highest pj lepton (muon or electron). As expected, the number of leptons from QCD 
processes is small. Requiring the number of leptons (muons or electrons) with pj > 200 GeV to be equal 
to or greater than one results in the wtbh distributions shown in Figure 14(right). 

The shape of the background in the region of high w?bh was fitted with a gaussian plus an asymmetric 
gaussian (Figure 15) and that function is used to estimate the number of background events. 

CHARYBDIS does not include graviton emission. In practice this, and the energy lost in gravita- 
tional interactions during the balding phase, would be another source of fj. Consequently we expect 

7 ln the case of two hadronic subsamples (tt and dijets) where very few events passed the £|pr| requirement, the lepton 
requirement rejection factor was applied to the £|pr| requirement's Poisson bound to estimate the background distribution 



\T]\ <2.5, p T > 15 GeV 

Central track match (0 < % 2 < 100) 

Isolation £Y,cone0.2 < min(100,0.2p r + 20) GeV 

(c) photon 
|tj| <2.5, p T > 15 GeV 
tight selection [45] 

Isolation E T cone0 2 < 0.2p T + 20 GeV 

Table 5: 



(b) electron 

|tj| < 2.5 except for 1.00< |t)| <1.15, 1.37< |tj| <1.52 
p T > 15 GeV 
medium selection [45] 

(d)jet 

Cone algorithm (R = 0.4) based on calorimeter towers 
|tj| <2.5, Pt >20GeV 

tide selection 
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Figure 10: £ \pr\ distributions for (left) black hole samples and (right) backgrounds (QCD dijet, tt and 
vector boson plus jets), along with one signal sample for reference. They are normalised to an integrated 
luminosity of 1 fb -1 . 




Reconstructed BH Mass [GeV] Reconstructed BH Mass [GeV] 



Figure 11: Black hole mass distribution with a requirement £ \pj\ >2.5 TeV (left), and black hole mass 
distribution with an additional requirement on the lepton-/?7- of pj > 50 GeV (right). The signal sample 
with n = 2 and m > 5 TeV and backgrounds are shown. 



Dataset 


Before selection 


Y>r|>2.5TeV 


After requiring a lepton 


acceptance 




(fb) 


(fb) 


(fb) 




« = 2,m>5TeV 


40.7 ±0.1 x 10 3 


39.2±0.3xl0 3 


18.6±0.2x 10 3 


0.46 


n = 4,ra>5TeV 


24.3 ±0.1 x 10 3 


22.6 ± 0.2 x 10 3 


6668 ±83 


0.27 


« = 7,m>5TeV 


22.3 ±0.1 x 10 3 


20.1 ±0.2 x 10 3 


3574 ±60 


0.17 


n = 2,m> 8 TeV 


338.2±1 


338.1 ±2.5 


212±16 


0.63 


tt 


833 ± 100 x 10 3 


23.61^ 


o 9+2.43 


9.8 x 10~ 6 


QCD dijets 


12.8 ± 3.7 x 10 6 


589911™ 


; '- ,/ -2.02 


4.3 x 10~ 7 


Wi V + > 2 jets 


1.9 ± 0.04 x 10 6 


12.31?-° 


4 67+ 8 75 


2.4 x 10~ 6 


Zee + > 3 jets 


51.8±1 x 10 3 


O -7JT+2.02 
Z.I3_ 2m 


9 c 7 +0.95 
Z - ;,/ -0.64 


5.0 x 10~ 5 



Table 6: Acceptance for each signal and background dataset in fb after requiring £ \pr \ >2.5 TeV, and a 
lepton with p T > 50 GeV. 
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Figure 12: pj distributions of leading (top left), 2nd- (top right), 3rd- (bottom left) and 4th-leading 
(bottom right) objects out of all the selected objects for the signal sample with n = 2 and m > 5 TeV and 
backgrounds (see Table 7). 




Figure 13: pj distributions of the leading lepton (electron or muon) after requiring the number of ob- 
jects (electron, muon, photon or jet) with pj > 200 GeV to be larger than 3 for the signal sample with 
n = 2 and m > 5 TeV and backgrounds (see Table 7). 
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Figure 14: Black hole mass distribution for the signal sample with n = 2 and m > 5 TeV and backgrounds 
(see Table 7) after multiplicity requirement of at least 4 objects with pj > 200 GeV (left plot) and an 
additional requirement of a lepton (electron or muon) with pj > 200 GeV (right plot). 



Dataset 


Before selection 
(fb) 


After multi-object 
requirement (fb) 


After lepton requirement 
(fb) 


Acceptance 


n = 2,m> 5 TeV 


40.7 


x 10 3 


38.9 ± 0.4 x 10 3 


14.0±0.2xl0 3 


0.34 


n = 4, m > 5 TeV 


24.3 


x 10 3 


17.9 ± 0.3 x 10 3 


4521 ±126 


0.19 


« = 7,m>5TeV 


22.3 


x 10 3 


9953 ±185 


1956 ±82 


0.087 


n = 2,m> 8 TeV 


338 


338±4 


164 ±3 


0.49 


tt 


833 


x 10 3 


129 ±27 


36^ 2 


4.3 x 10~ 5 


QCD dijets 


12.8 


x 10 6 


38.9 ± 1.9 x 10 3 


6 + i, 07 


5.6 x 10- 7 


W+jets 


560 


x 10 3 






1 x 10~ 3 


Z+jets 


51.8 


x 10 3 


li) ■ r,r » 


19+9° 


4 x 10- 4 


y(7)+jets 


5.1 


< 10 6 


285^ 




< 10~ 5 



Table 7: Acceptance of the 4-object requirements for each dataset in fb. 90% confidence limits are used 
when no events passed the requirements. 
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M BH [GeV] 

Figure 15: The background shape after the 4-object and lepton requirements is shown (data points). The 
points were fitted by the sum (black line) of a gaussian (red line) and an asymmetric gaussian (green 
line). 

CHARYBDIS to underestimate this for black hole events. Nonetheless, each of the black hole samples 
studied in this analysis often have very wide distributions of fa, with tails extending out to several TeV. 
This property of models with black holes is most unusual and hard to reproduce in other new physics 
scenarios, and should make it possible to distinguish between Black Holes and the majority of SUSY 
models for example. 

A requirement on fa above ~ 500 — 600 GeV was studied as an alternative to a lepton require- 
ment for black hole signal selection. Figure 16 shows the potential of this method, and contrasts these 
models with three common supersymmetric models of different cross-section and mass scale. Despite 
the possibility of early evidence for the presence of black holes, there are disadvantages to relying on 
such a selection. Firstly, our ability to reconstruct the black hole's mass is aided by limiting fa to be 
under 100 GeV (Figure 22). Such a signal from high fa events would be dominated by those events 
reconstructed most poorly, limiting their use for cross-section measurement and discovery. The theo- 
retical uncertainties are large and difficult to quantify, and finally there are experimental difficulties in 
calibrating and accurately measuring this variable across a wide energy range. 

6.2 Discovery Reach 

Making a robust discovery potential for black hole events is difficult, because the semi classical assump- 
tions used to model them are only valid well above the Planck scale. Close to the Planck scale, events 
may occur due to gravitational effects with lower multiplicities, but without the signatures anticipated 
by our event selections. As the energy rises above the threshold needed for black hole creation, our 
requirements should become more efficient. Lack of theoretical understanding makes it impossible to 
model this threshold region. 

To account for this, we impose a lower requirement on the true mass of black holes created in our 
simulated samples, BH t h res h, normally set at 5 TeV, and we do not attempt to account for any additional 
signal from lower masses. In order to estimate the discovery potential, two methods have been consid- 
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Figure 16: The left hand plot shows the missing transverse energy distributions after a £ \pj\ > 2.5 TeV 
requirement. A requirement of #t > 500 GeV would leave negligible background and a large number of 
signal events for all samples. The right hand plot compares two black hole samples with three supersym- 
metric models with a range of mass scales; the two classes of models can easily be distinguished by their 
differing cross-sections and the extent of the #t tail. 

ered: 

1. we keep our signal selection requirements constant, and increase the value of BH t h res h- Since the 
analysis requirements are unchanged, the background remains constant, while the signal drops as 
the production of events occurs at higher mass. We then evaluate the luminosity required to detect 
a minimum of 10 signal events, with S/V~B > 5, assuming the production cross-section is as high 
as predicted. Such a study is shown in Figure 17, using the L\pr \ and lepton requirements. This 
method produces conservative limits, taking some account of the uncertainty in the production 
cross-section near the threshold. 

2. We keep the production model unchanged with BH 'thresh = 5 TeV, but apply an additional require- 
ment on the reconstructed black hole mass. This requirement reduces substantially background 
events, while allowing the higher mass signal to pass unchanged. This is less conservative, since it 
allows black hole signal events to be produced at low mass, but to migrate above the reconstructed 
mass requirement because of the detector mass resolution, hence increasing the signal. As be- 
fore, we use the nominal value of the production cross-section, and evaluate the luminosity needed 
to meet our discovery criteria, this time as a function of reconstructed mass. A study using this 
method is shown in Figure 18 using the 4-object and lepton requirements. 

The two approaches are complementary and illustrate the uncertainties in different ways. We observe 
that the search reach is limited eventually at high mass by the falling production cross-section, reflecting 
the falling parton luminosity and the limited energy of the LHC. We conclude that, if the semi classical 
cross-section estimates are valid, black holes can be discovered above a 5 TeV threshold with a few pb _1 
of data, while 1 fb _1 would allow a discovery to be made even if the production threshold was at 8 TeV. 



7 Systematic Uncertainties 
7.1 Signal uncertainties 

We have investigated the systematic uncertainties using fast simulation runs, having checked that the full 
and fast simulations agree well for this purpose. 
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Figure 17: Discovery potential using Y.\pt\ an d lepton selections: required luminosity as a function of 
black hole mass threshold. Error bars reflect statistical uncertainties only. 




Figure 18: Discovery potential for black holes using four-object and lepton requirements. The required 
luminosity is shown as a function of the requirement on the reconstructed black hole mass. The error 
bars correspond to experimental systematic uncertainties. (See text for constraints.) 
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There are a number of theoretical parameters associated with CHARYBDIS which can generate 
systematic errors in the estimates of the acceptance for signal events. These are: 

• The kinematic cutoff. This parameter is normally true, and causes the generator to end thermal 
emission if an unphysical emission is randomly selected. The generator moves immediately to the 
final remnant decay phase. This approximation deteriorates at high numbers of extra dimensions 
because of the high temperature and emitted particle energies. We have investigated the alternative, 
where a new emission is selected until a physical one is chosen. In this case, thermal emission will 
continue until the black hole mass falls below Mdl- 

• Temperature variation. The Hawking temperature of the black hole is normally allowed to increase 
as its mass decreases, as expected if the black hole has time to equilibrate between decays. We 
have investigated the alternative of keeping the temperature fixed at the initial value, as would be 
the case if the black hole decayed very quickly or "suddenly" (see Table 8). 

• Number of extra dimensions. In addition to our full simulation samples with n = 2, 4 and 7, we 
have simulated n = 3 and 5 with the fast simulation. As noted above, the events become more 
jet-like at high n and the particle multiplicity drops (see Figure 20), due to the increased Hawking 
temperature. Our signal selection remains robust, as shown in Table 8. 

• Planck scale. We have investigated changing the Planck scale from its default value of Mdl = 
1 TeV to 2 TeV. We note that, since the model is only valid for black hole masses much larger than 
the Planck scale, this scenario is not well modelled in the range of masses accessible at the LHC. 

• Remnant decay. We have investigated changing the remnant decay model from a two-body to a 
four-body mode (see Figure 20). 

Figure 19 shows the effect of changing the kinematic cutoff on the particle multiplicity and Y,\Pt\ 
distributions. Since the black hole is forced to decay thermally until it falls below Mdl, the multiplicity 
is higher, and the events have lower energy emissions. The total energy remains constant, however so the 
£ \pr \ distribution is relatively stable. 




Figure 19: Particle multiplicity and £ \pr \ distributions, showing the variation when the kinematic cut-off 
parameter is changed. 

The acceptance of the signal for various parameter choices are shown in Table 8. Compared to the 
standard full simulation, the largest effect is observed for high n, when the kinematic cut-off parameter 
is changed. This is as expected, since this parameter has a large effect on the evolution of the black hole 
during its decay. Similarly changing the remnant decay from 2 to 4 bodies has a large effect at high n; 
the effect of this, and of changing number of extra dimensions is shown in Figure 20. 
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Figure 20: Particle multiplicity distributions, showing the variation produced by a change in the number 
of extra dimensions, or the number of particles produced by remnant decay. 



n 


Full Sim 


Fast Sim 


Kin. Cut off 


T/z-variation off 


4-body remnant 


2 


45.8 


42.9 


47.2 


48.7 


47.9 


3 




33.2 








4 


27.4 


26.6 








5 




21.7 








7 


16.1 


15.9 


29.2 


16.6 


27.4 



Table 8: Signal acceptance (%) for different model assumptions. 



7.2 Uncertainties on detector performance 

Two kinds of systematic uncertainties on detector performance were studied. One is an uncertainty on 
lepton identification efficiency. To estimate this effect, we loosened and tightened particle identification 
selections, by changing the hadronic leakage for electrons and the isolation cut for muons. The changes 
in signal efficiencies are around 2%. 

The effect of a 5% error in the jet energy scale (JES) was also considered. The effect of these 
uncertainties on the discovery potential is shown in Figure 18. 

8 Results 

8.1 Search Reach for Black Hole Production at the LHC 

The studies presented show that the ATLAS detector is capable of discovering the production of black 
holes up to the kinematic limit of the LHC, assuming that the signal is correctly modelled. This con- 
clusion is largely based on the predicted huge production cross-section, and the small background from 
QCD dijets at very high-pr, especially when the presence of a high energy lepton is required. How- 
ever, both of these assumptions are suspect. The high production cross-section is subject to considerable 
discussion in the literature, as discussed in Section 2. Moreover, until the LHC has measured the QCD 
cross-section at 14TeV, we cannot be certain of the tails of the QCD distributions. The Monte Carlo 
simulations of these tails are working at the limit of their validity, given the high energies and large 
multiplicities involved. 

For these reasons, we prefer not to place too much weight on the detailed search reach limits. In- 
stead, we confine ourselves to the statement that, with current understanding, the black hole signature 
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considered should be clearly visible if it exists. 



8.2 Determination of Model Parameters 

We have considered the possibility of extracting model parameters from the data, should a signal be 
observed. There are two key parameters: the Planck scale Mo (or Mdl depending on the convention) 
and the number of extra dimensions n. In Ref. [46, 47] a method was proposed to extract Mp from the 
cross-section data, which fixes the Planck scale (within the model assumptions), and from events with 
high energy emissions. 

The Hawking temperature 7# of the black hole depends on n. If we detect events with emissions 
near w?bh/2, the energy of those emissions is a measure of the initial 7#. Hence, over the sample of 
black holes, the probability of such emissions is a measure of the characteristic temperature, and can 
be used to extract n. This method was first put forward in Ref. [46], and here is made compatible 
with the need for background rejection requirements. The requirements described in Section 6 are not 
appropriate: the lepton requirement biases the selected events in favour of final states with many particles, 
and hence against those events with a single high energy emission. A suitable requirement was found to 
be Y,\Pt\ > 3.5 TeV; this removes the background without biasing the signal events selected. 

Figure 21 shows the probability of a hard emission for two samples, compared to the predictions. 
The method requires accurate mass resolution, and so an additional requirement on $t< 100 GeV is 
applied. The addition of this requirement lowers the efficiency noticeably, but does improve the black 
hole mass reconstruction; details of the resolution and efficiency can be found in Figure 22. The data are 
consistent with the expected value of n, but due to this reduction in signal efficiency, more data would 
be required to make a definitive measurement. It should be noted that this measurement requires the 
Planck scale to be known. If this cannot be determined from the production cross-section, it is likely that 
the threshold behaviour near the Planck scale would provide an indication of its value. At present, no 
theoretical model exists to allow us to make predictions in this region. 




Figure 21: The probability of a hard emission near for n = 7 and n = 4. The bands show the ex- 

pected range for n = 7 and n = 6, and for n = 5 and n = 4, respectively, for a luminosity of approximately 
0.75 fb- 1 . 



9 Summary 

The search for black holes in the first 100 pb -1 of LHC data with the ATLAS detector and software 
framework was simulated. 
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Normalisation 


Mean (GeV) 


Resolution (GeV) 


Without 


Narrow 


1018±26 


-217 ±5 


276 ±9 


fir requirement 


Wide 


276 ±30 


-148±9 


722 ±13 


With 


Narrow 


318±12 


-116±8 


215±9 


^7- requirement 


Wide 


108 ±7 


118±18 


635 ±16 



£1600 

^1400 
U 

§1200 
1 1000 
Qj 800 

600 

400 

200 

4b00 -3000 -2000 -1000 0 1000 2000 3000 4000 
Reconstructed Mass - True Mass [GeV] 

Figure 22: Black hole mass resolution distributions and their fits to double Gaussian functions, using a 
Y.\Pt\ an d lepton requirement (1 fb -1 ). The upper curve is without a ^7-cut. The lower curve has an 
additional requirement on $j< 100 GeV. 

We summarised the current experimental limits on black hole production and studied, with the help 
of the black hole event generator CHARYBDIS and Standard Model Monte Carlo data sets, the basic 
event properties, trigger and selection efficiencies, theoretical and experimental uncertainties of black 
hole production at the LHC for a flat ADD extra dimension scenario with the Planck scale Mdl = 1 TeV. 
We have explored the uncertainties inherent in the theoretical modelling and our understanding of the 
detector. We conclude that, if the semi-classical cross section estimates are valid, black holes above a 
5 TeV threshold can be discovered with a few pb _1 of data, while 1 fb _1 would allow a discovery to be 
made even if the production threshold was 8 TeV. 
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